Regex 删除R中字符向量的引号?
我有一些文字:Regex 删除R中字符向量的引号?,regex,r,quotes,Regex,R,Quotes,我有一些文字: version of mackinnon’s “dominance approach,” 我已经读入了一个字符向量: > my.char.vector [1] "version" "of" "mackinnon’s" "“dominance" "approach,”" 如何删除双引号(和单引号),使my.char.vector [1] "version" "of" "mackinnons" "dominance" "approach," 另一个问题与这个确切的标题
version of mackinnon’s “dominance approach,”
我已经读入了一个字符向量:
> my.char.vector
[1] "version" "of" "mackinnon’s" "“dominance" "approach,”"
如何删除双引号(和单引号),使my.char.vector
[1] "version" "of" "mackinnons" "dominance" "approach,"
另一个问题与这个确切的标题不是,事实上,问同样的问题-它试图打印没有引用。我的字符向量中的元素确实包含引号,我正在尝试删除这些引号。这些是“奇特的”引号——我从屏幕上剪切并粘贴了它们(如果您只使用SHIFT-'[或键盘上的任何东西],您只会得到常规的“引号…”
这些是“奇特的”引号——我从屏幕上剪切并粘贴了它们(如果你只使用SHIFT-'[或键盘上的任何东西],你只会得到常规的“引号…”
试试这个
gsub("[^[:print:]]", "", my.char.vector)
试试这个
gsub("[^[:print:]]", "", my.char.vector)
您可以为此使用
stringi
。我们可以使用ICU元字符\\P
对匹配的值求反,并--
减去逗号的反
library(stringi)
mcv <- c("version", "of", "mackinnon’s", "“dominance", "approach,”")
stri_replace_all_regex(mcv, "[\\P{Ll}--,]", "")
# [1] "version" "of" "mackinnons" "dominance" "approach,"
库(stringi)
mcv您可以使用stringi
进行此操作。我们可以使用ICU元字符\\P
对匹配的值求反,并--
减去逗号的反
library(stringi)
mcv <- c("version", "of", "mackinnon’s", "“dominance", "approach,”")
stri_replace_all_regex(mcv, "[\\P{Ll}--,]", "")
# [1] "version" "of" "mackinnons" "dominance" "approach,"
库(stringi)
mcv另一个带有qdap的选项
library(qdap)
strip(mcv, char.keep=',')
#[1] "version" "of" "mackinnons" "dominance" "approach,"
或使用stringi
library(stringi)
stri_replace_all_regex(mcv, '[^[:alnum:],]+', '')
#[1] "version" "of" "mackinnons" "dominance" "approach,"
或base R
vapply(regmatches(mcv,gregexpr('[A-Za-z,]+', mcv)), paste,
collapse="", character(1L))
#[1] "version" "of" "mackinnons" "dominance" "approach,"
数据
mcv另一个带有qdap的选项
library(qdap)
strip(mcv, char.keep=',')
#[1] "version" "of" "mackinnons" "dominance" "approach,"
或使用stringi
library(stringi)
stri_replace_all_regex(mcv, '[^[:alnum:],]+', '')
#[1] "version" "of" "mackinnons" "dominance" "approach,"
或base R
vapply(regmatches(mcv,gregexpr('[A-Za-z,]+', mcv)), paste,
collapse="", character(1L))
#[1] "version" "of" "mackinnons" "dominance" "approach,"
数据
mcv@BenBolker使用Sys.setenv(LANG=“en_US.UTF-8”)
并尝试。也许它与系统区域设置有关。@BenBolker使用Sys.setenv(LANG=“en_US.UTF-8”)
并尝试。也许它与系统区域设置有关。