How to find all uppercase words in a corpus in R

Tags: r, text-mining

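I have a corpus of documents, and I need to find every word across all of them that is entirely uppercase (that is, every character in the word is a capital letter). I'm not sure how to do this. I have looked at the tm text-mining package in R, but it doesn't seem to have a function for it.

Input string:

"Russia is THE biggest country"

Desired output:

"THE"

How can I do this with the tm package?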

Try using a regular expression:

sub('.*(\\b[A-Z]+\\b).*','\\1',string)
#[1] "THE"
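
To see how this call behaves on concrete inputs, here is a minimal sketch. The name string comes from the answer; its value is assumed to be the input from the question, and the other two strings are made up for illustration.

```r
# The pattern from the answer: keep only the captured all-uppercase word.
pattern <- '.*(\\b[A-Z]+\\b).*'

# Assumed value: the input string from the question.
string <- "Russia is THE biggest country"
one_match <- sub(pattern, '\\1', string)
one_match   # "THE", as shown in the answer

# Hypothetical input with two all-uppercase words: sub() makes one
# replacement, so only a single word comes back, not both.
several <- "NASA and ESA launched a probe"
several_result <- sub(pattern, '\\1', several)
several_result

# Hypothetical input with no all-uppercase word: nothing matches,
# so the input is returned unchanged.
none <- "nothing to see here"
none_result <- sub(pattern, '\\1', none)
none_result
```

Because an unmatched input comes back untouched, the result of sub alone cannot distinguish "no uppercase word" from "the whole string is the word"; checking with grepl first is one way around that.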


You can use gregexpr and regmatches:

unlist(regmatches(abc, gregexpr('\\b[A-Z]+\\b', abc)))
[1] "THE"
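
The same idea extends to several documents at once. The sketch below is an illustration under assumptions: docs and find_upper are hypothetical names, the second and third documents are made up, and perl = TRUE is my own choice (not part of the answer) so that \\b and [A-Z] follow PCRE rules.

```r
# Hypothetical documents; the first is the input string from the question.
docs <- c(
  "Russia is THE biggest country",
  "NASA and ESA launched A probe",
  "nothing to see here"
)

# Return the all-uppercase words of each document as a list with one
# character vector per document (character(0) when there is none).
find_upper <- function(x, pattern = "\\b[A-Z]+\\b") {
  regmatches(x, gregexpr(pattern, x, perl = TRUE))
}

upper_by_doc <- find_upper(docs)

# Flatten to a single vector of all matches across documents.
all_upper <- unlist(upper_by_doc)
all_upper
# [1] "THE"  "NASA" "ESA"  "A"
```

Note that a one-letter word such as "A" also counts as all uppercase; passing pattern = "\\b[A-Z]{2,}\\b" to find_upper would require at least two letters.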

Data

abc holds the input string from the question.

stringr can also be used if you want all of the all-uppercase words as a vector rather than just the first one.

Note that the sub call finds only one word, as you can see by trying it on a string that contains several uppercase words.
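
The stringr code itself did not survive in this page, so the following is only a sketch of what such a call could look like, not the original answer. It assumes the stringr package is installed; find_upper_stringr is a hypothetical helper name, and the value of abc is assumed to be the question's input string.

```r
# Collect every all-uppercase word in x as one character vector.
# Assumption: the stringr package is installed.
find_upper_stringr <- function(x) {
  # str_extract_all() returns a list with one element per input string.
  unlist(stringr::str_extract_all(x, "\\b[A-Z]+\\b"))
}

# Only run the example when stringr is actually available.
if (requireNamespace("stringr", quietly = TRUE)) {
  abc <- "Russia is THE biggest country"   # assumed value of abc
  print(find_upper_stringr(abc))
}
```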




Copyright © 2024. All Rights Reserved by Fatal编程技术网