Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/r/76.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
在R中将字符向量分解为单个单词_R_Vector_Character - Fatal编程技术网

在R中将字符向量分解为单个单词

在R中将字符向量分解为单个单词,r,vector,character,R,Vector,Character,我有这样一个字符向量(vec): [1] "super good dental associates" "cheap dentist in bel air md" "dentures " "dentures " "in office teeth whitening" "in office teeth whitening" "dental gum surg

我有这样一个字符向量(vec):

[1] "super good dental associates"   "cheap dentist in bel air md"    
    "dentures   "                    "dentures   "                    
    "in office teeth whitening"      "in office teeth whitening"      
    "dental gum surgery bel air, md"
[8] "dental implants"                "dental implants"                
    "veneer teeth pictures"
我需要把它分解成几个词。我试过这个:

singleWords <- strsplit(vec, ' ')[[1]]
如何获得所有单词作为单个元素的单个向量?

您可以尝试:

strsplit(paste(vec, collapse = " "), ' ')[[1]]
你可以试试:

strsplit(paste(vec, collapse = " "), ' ')[[1]]

为了证实我的评论,既然你提到它不起作用,那就看看吧。由于有两个元素有额外的空格,我建议使用
\\s+
作为要拆分的正则表达式,而不是注释中的单个空格。干杯

> ( newVec <- unlist(sapply(vec, strsplit, "\\s+", USE.NAMES = FALSE)) )
# [1] "super"      "good"       "dental"     "associates" "cheap"      "dentist"   
# [7] "in"         "bel"        "air"        "md"         "dentures"   "dentures"  
#[13] "in"         "office"     "teeth"      "whitening"  "in"         "office"    
#[19] "teeth"      "whitening"  "dental"     "gum"        "surgery"    "bel"       
#[25] "air,"       "md"         "dental"     "implants"   "dental"     "implants"  
#[31] "veneer"     "teeth"      "pictures" 

为了证实我的评论,既然你提到它不起作用,那就看看吧。由于有两个元素有额外的空格,我建议使用
\\s+
作为要拆分的正则表达式,而不是注释中的单个空格。干杯

> ( newVec <- unlist(sapply(vec, strsplit, "\\s+", USE.NAMES = FALSE)) )
# [1] "super"      "good"       "dental"     "associates" "cheap"      "dentist"   
# [7] "in"         "bel"        "air"        "md"         "dentures"   "dentures"  
#[13] "in"         "office"     "teeth"      "whitening"  "in"         "office"    
#[19] "teeth"      "whitening"  "dental"     "gum"        "surgery"    "bel"       
#[25] "air,"       "md"         "dental"     "implants"   "dental"     "implants"  
#[31] "veneer"     "teeth"      "pictures" 

试试
sapply(vec,strsplit,”)
,如果你想把它们放在一个向量中,你可以用
unlist
来包装它。这并不是拆分单个单词:(我用
sapply
Try
sapply(vec,strsplit,”)添加了一个例子)
,如果您想将它们放在一个向量中,可以使用
取消列出
将其包装。这不是拆分单个单词:(我添加了一个例子,使用代码< SpIs< /Cord>)如果这个帖子解决了你的问题,请控制它,点击它的左下方的支票来接受它。@贡恩也谢谢你!如果这个帖子解决了你的问题,请控制它,点击它的左下方的支票来接受它。g也谢谢你!