在R中将字符向量分解为单个单词
我有这样一个字符向量(vec):在R中将字符向量分解为单个单词,r,vector,character,R,Vector,Character,我有这样一个字符向量(vec): [1] "super good dental associates" "cheap dentist in bel air md" "dentures " "dentures " "in office teeth whitening" "in office teeth whitening" "dental gum surg
[1] "super good dental associates" "cheap dentist in bel air md"
"dentures " "dentures "
"in office teeth whitening" "in office teeth whitening"
"dental gum surgery bel air, md"
[8] "dental implants" "dental implants"
"veneer teeth pictures"
我需要把它分解成几个词。我试过这个:
singleWords <- strsplit(vec, ' ')[[1]]
如何获得所有单词作为单个元素的单个向量?您可以尝试:
strsplit(paste(vec, collapse = " "), ' ')[[1]]
你可以试试:
strsplit(paste(vec, collapse = " "), ' ')[[1]]
为了证实我的评论,既然你提到它不起作用,那就看看吧。由于有两个元素有额外的空格,我建议使用
\\s+
作为要拆分的正则表达式,而不是注释中的单个空格。干杯
> ( newVec <- unlist(sapply(vec, strsplit, "\\s+", USE.NAMES = FALSE)) )
# [1] "super" "good" "dental" "associates" "cheap" "dentist"
# [7] "in" "bel" "air" "md" "dentures" "dentures"
#[13] "in" "office" "teeth" "whitening" "in" "office"
#[19] "teeth" "whitening" "dental" "gum" "surgery" "bel"
#[25] "air," "md" "dental" "implants" "dental" "implants"
#[31] "veneer" "teeth" "pictures"
为了证实我的评论,既然你提到它不起作用,那就看看吧。由于有两个元素有额外的空格,我建议使用
\\s+
作为要拆分的正则表达式,而不是注释中的单个空格。干杯
> ( newVec <- unlist(sapply(vec, strsplit, "\\s+", USE.NAMES = FALSE)) )
# [1] "super" "good" "dental" "associates" "cheap" "dentist"
# [7] "in" "bel" "air" "md" "dentures" "dentures"
#[13] "in" "office" "teeth" "whitening" "in" "office"
#[19] "teeth" "whitening" "dental" "gum" "surgery" "bel"
#[25] "air," "md" "dental" "implants" "dental" "implants"
#[31] "veneer" "teeth" "pictures"
试试
sapply(vec,strsplit,”)
,如果你想把它们放在一个向量中,你可以用unlist
来包装它。这并不是拆分单个单词:(我用sapply
Trysapply(vec,strsplit,”)添加了一个例子)
,如果您想将它们放在一个向量中,可以使用取消列出
将其包装。这不是拆分单个单词:(我添加了一个例子,使用代码< SpIs< /Cord>)如果这个帖子解决了你的问题,请控制它,点击它的左下方的支票来接受它。@贡恩也谢谢你!如果这个帖子解决了你的问题,请控制它,点击它的左下方的支票来接受它。g也谢谢你!