如何使用R将文件的相关部分转换为语料库_R_Tm - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/r/64.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
如何使用R将文件的相关部分转换为语料库_R_Tm - Fatal编程技术网

如何使用R将文件的相关部分转换为语料库

r

如何使用R将文件的相关部分转换为语料库,r,tm,R,Tm,我是一个使用R的初学者，目前正在处理一个包含多列的文件。我想专注于一列（csv文件中标记的文本），创建一个语料库，然后更改文本列中的文本，使其全部为小写，删除标点等等以下代码是我目前掌握的代码： # Import text data ALL_tweets_df <- read.csv("All_tweets.csv", stringsAsFactors = FALSE) library(tm) # View the structure of tweets str(ALL_twee

我是一个使用R的初学者，目前正在处理一个包含多列的文件。我想专注于一列（csv文件中标记的文本），创建一个语料库，然后更改文本列中的文本，使其全部为小写，删除标点等等

以下代码是我目前掌握的代码：

# Import text data

ALL_tweets_df <- read.csv("All_tweets.csv", stringsAsFactors = FALSE)

library(tm)

# View the structure of tweets

str(ALL_tweets_df)

# Print out the number of rows in tweets

nrow(ALL_tweets_df)

# Isolate text from tweets: All_tweets

ALL_tweets_df <- ALL_tweets_df$text

#converts the relevant part of your file into a corpus

mycorpus<-Corpus(VectorSource(ALL_tweets_df$text)) 

# change to lower case, remove stop words, remove punctuation

mycorpus2 = tm_map(mycorpus, tolower)

mycorpus3 = tm_map(mycorpus2, removeWords, stopwords("english"))

mycorpus4 = tm_map(mycorpus3, removePunctuation)

#导入文本数据
所有tweets\u df我想命令所有tweets\u df谢谢，这就解决了。我现在正试图用以下代码将其转换回数据帧：mycorpus5前面的答案可能会在这里有所帮助。。。




[grep]相关文章推荐



                                                        
grep-R是否仅在命令由*参数化时才起作用？
grep 
Grep 是否存在包含€；
grep 
查找grep排除dos2unix的某些文件名
grep 
GREP创建一个包含sting的单词列表
grep 
返回“字符”之间的字符串的grep
greplinuxcommand-line 
grep警告：递归目录循环
grep 
Grep 通过多个xarg传递-I模式
grep 
为什么pcregrep比grep快？
grepcentos 
使用grep搜索美元金额
grep 
如何通过AppleScript grep umlauts和其他重音文本字符
grepapplescript 
grep：查找所有包含“星”字的文件，但不包括“开始”字`
grep 
来自定位结果的grep字符串
grep 
如何在不改变输出顺序的情况下使用grep
grep 
grep从一组可能的值中搜索字符串中的字符
grep 
                                       





随机文章推荐



                                                        
Build &引用；致命错误U1087:同一目标不能有：和：：依赖项；
build 
Build 使用MacPorts 1.8在Mac OS X 10.6上安装sqlite3期间生成失败
buildmacossqlite 
Build symfony 1.4：条令构建模型警告
buildmodelsymfony1doctrine 
Build Jenkins（Hudson）插件将修订号转换为版本号
buildjenkins 
Build 是否可以在特定代理上运行TFS生成活动？
buildtfs 
Build 如何在llvm数据布局中抑制128位浮点？
buildllvm 
Build 为不同目录中的构建和源代码创建Sublime文本构建系统
buildsublimetext2 
Build 有人设法从源代码构建WSO2吗？
buildwso2 
Build 如何在Jenkins中使用Jelly显示所有构建历史或最近的5个构建
buildjenkins 
Build 用于在fedora-20中编译nfs ganesha的cmake
build 
Build 如何根据现有图层文件构建Dojo图层？
buildmoduledojo 
Build 如何从源代码构建核心库（libstd、libcore等），而不构建整个编译器工具链？
buildrust 
Build 添加uri.js bower组件会破坏我的Grunfile.js，它会认为uri.js目录是一个文件。如何修复？
buildgruntjs 
Build 齐柏林飞艇建造失败
build 
Build TFS 2015构建：是否可以在存储库映射中使用变量？
buildasp.net-core 
Build CMakeLists和属性文件
buildcmake 
Build TFS 2013生成失败，出现“找不到文件”异常，但该文件存在
build 
Build 离子2生成错误中的简介页
buildionic2 
Build 构建TL-MR3020仿真器
build 
Build 是thinto'；并行构建系统中的并发有用吗？
buildlinkerllvm


                                        

                                        
                                        


                                                
                                                        [r]相关推荐
                                                        
                                                        
                                                

                                                
                                                        Tags
                                                        
Here Api
Xslt
Cloud Foundry
Blackberry
Raspberry Pi
Verilog
Sms
Excel Formula
Parse Platform
Iis
Entity Framework 4
Couchdb
Google Cloud Firestore
Automated Tests
Install4j
Browser
Sencha Touch
Cors
Msbuild
Socket.io
Docker Compose
Centos
Ruby On Rails 3.1
Cypress
Telerik
Memory Management
Hbase
Apache2
Ssl
Linkedin
Heroku
Dynamics Crm 2011
Vim
Google App Engine
Python Sphinx
Mapbox
Compression
Google Bigquery
Pascal
Shopify
Twitter Bootstrap
Colors
Apache Storm
Single Sign On
Http
3d
Meteor
Xsd
Sublimetext2
Pentaho
Google Plus
Sql Server 2008 R2
Build
Virtual Machine
Grid
Xmpp
Sml
Apache Flink
Encoding
Snmp
Antlr4
Terminal
Joomla
Autocomplete
Ionic2
Layout
Silverlight 4.0
Synchronization
Dataframe
Asp.net Web Api
Sugarcrm
Sharepoint
Azure Service Fabric
Jasper Reports
Plsql
Function
Vuejs2
Sapui5
R
Lucene
Responsive Design
Interface
Neural Network
Asp.net
Spotify
Google Visualization
Angularjs
Domain Driven Design
C#
Editor
Xamarin.android
Bootstrap 4
Excel
Youtube
Jqgrid
Jmeter
Charts
Abap
Visual Studio Code
Mdx
Jwt
Sharepoint 2010
Hazelcast
Sql Server
Listview
Crystal Reports
Asterisk
Visual Studio 2010
Antlr
Javascript
Jhipster
Functional Programming
Wix
Mvvm
Modelica
Java Me
Xamarin.forms
Python 3.x
Jdbc
Mono
Jaxb
Windows
Download
Jquery Plugins
Servlets
Bison
Gwt
Content Management System
Post
Clang
Oracle10g
Asp.net Mvc 2
Machine Learning
Cucumber
Inheritance
Microsoft Graph Api
Architecture
Wicket
Playframework 2.0
Elm
Windows 7
Ag Grid
Winforms
Computer Science
Error Handling
Amazon Redshift
Orchardcms
Android Ndk
Dependency Injection
Language Agnostic
Sip
Notifications
Select
Imagemagick
Shiny
Xampp
Discord
Discord.py
Oracle11g
Google Sheets
Anaconda
Embedded
Oauth
Mapreduce
Numpy
Parallel Processing
Ubuntu
Windows Phone 7
Swiftui
Asp.net Mvc 3
Yii2
Vmware
Tomcat
Qt4
Azure Ad B2c
Signalr
Laravel
Llvm
Merge
Asynchronous
Qml
Zend Framework
Jersey
Jquery Mobile
Aws Lambda
Wordpress
Windows Installer
Jquery
Geolocation
Frameworks
Oauth 2.0
Sprite Kit
Grafana
Batch File
Filesystems
Docker
Ssrs 2008
Visual Studio 2008
Model View Controller
Oracle Apex
Hybris


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网