Text replacement: the pattern is a set list of strings [r]


I have a string variable in a large data set that I want to clean against a fixed list of strings. For example, with pattern set to c("dog","cat"), the cleaned result should be:

dog
dog
dog
cat
cat

My attempt, where a is the vector to clean:

new <- vector()

lapply(pattern, function(x){
  where <- grep(x, a, value = FALSE, ignore.case = TRUE)
  new[where] <- x
  })

We paste the "pattern" vector together to create a single string, then use that string to extract the words from "vec1" after converting it to lower case (tolower(vec1)):

library(stringr)
str_extract(tolower(vec1), paste(pattern, collapse='|'))
#[1] "dog" "dog" "dog" "cat" "cat"

Data

pattern <- c("dog","cat")
vec1 <- c('black Dog', 'white dOG', 'doggie','black CAT', 'thatdamcat')

Another way using base R is:

#data
vec <- c('black Dog', 'white dOG', 'doggie','black CAT','thatdamcat')

#regexpr finds the locations of cat and dog, ignoring case
a <- regexpr('dog|cat', vec, ignore.case=TRUE)

#regmatches returns the text at those locations from vec
#(tolower is used here to convert the result to lowercase)
regmatches(tolower(vec), a)
#[1] "dog" "dog" "dog" "cat" "cat"

I also tried this and it works, but it is not what I am looking for, because my data set also contains "bird" and I want an NA placeholder for it. My explanation was wrong. Thanks very much.
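The lapply() attempt in the question leaves new unchanged, because new[where] <- x assigns to a copy that is local to the anonymous function. Below is a minimal base R sketch of the same loop idea that does fill the result. The vector vec_in is a hypothetical input (the example strings plus one non-matching "bird" entry, which is not in the original data), and the patterns are assumed to contain no regex metacharacters.

```r
pattern <- c("dog", "cat")
# Hypothetical input: the example strings plus one entry that matches nothing
vec_in <- c("black Dog", "white dOG", "doggie", "black CAT", "thatdamcat", "bird")

# Pre-fill with NA so entries that match no pattern keep a placeholder
new <- rep(NA_character_, length(vec_in))

# A for loop assigns in the calling environment, unlike the function body in lapply()
for (p in pattern) {
  where <- grep(p, vec_in, ignore.case = TRUE)
  new[where] <- p
}
new
# [1] "dog" "dog" "dog" "cat" "cat" NA
```

If one string matches several patterns, the pattern that comes last in pattern wins in this sketch, so the order of the list matters.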

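regmatches() drops elements that have no match, so with a "bird" entry the base R result would be shorter than its input. The following sketch shows one possible way to keep an NA placeholder instead. It is an adaptation, not the answerer's code: vec_in and out are hypothetical names, and the alternation is built from pattern on the assumption that the patterns contain no regex metacharacters.

```r
pattern <- c("dog", "cat")
# Hypothetical input: the example strings plus one entry that matches nothing
vec_in <- c("black Dog", "white dOG", "doggie", "black CAT", "thatdamcat", "bird")

# Build the alternation "dog|cat" from the pattern vector instead of hard-coding it
rx <- paste(pattern, collapse = "|")

# regexpr() returns -1 for elements without a match
m <- regexpr(rx, vec_in, ignore.case = TRUE)
hit <- as.integer(m) != -1L

# Start from all-NA, then fill only the positions that matched
out <- rep(NA_character_, length(vec_in))
out[hit] <- regmatches(tolower(vec_in), m)
out
# [1] "dog" "dog" "dog" "cat" "cat" NA
```

The stringr call str_extract(tolower(vec_in), rx) should give the same NA placeholder, since str_extract() returns NA for elements with no match.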



