为什么wordcloud中缺少一些西里尔字母？_R - Fatal编程技术网

Warning: file_get_contents(/data/phpspider/zhask/data//catemap/4/r/73.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
为什么wordcloud中缺少一些西里尔字母？_R - Fatal编程技术网

为什么wordcloud中缺少一些西里尔字母？

r

为什么wordcloud中缺少一些西里尔字母？,r,R,我有大量的俄语文本。当我构建wordcloud时，我看到一些像“ч”这样的字符没有被渲染。代码如下所示： dat <- read.csv("news.csv",sep=";",header=TRUE,stringsAsFactors=FALSE) corpus <- Corpus(VectorSource(dat$Article), readerControl = list(reader=readPlain,language="ru")) corpus <- tm_map(co

我有大量的俄语文本。当我构建wordcloud时，我看到一些像“ч”这样的字符没有被渲染。代码如下所示：

dat <- read.csv("news.csv",sep=";",header=TRUE,stringsAsFactors=FALSE)
corpus <- Corpus(VectorSource(dat$Article),
readerControl = list(reader=readPlain,language="ru"))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, tolower)
corpus <- tm_map(corpus, removeNumbers)
corpus <- tm_map(corpus, removeWords,
stopwords("russian")))
dtm <- TermDocumentMatrix(corpus)
m <- as.matrix(dtm)
v <- sort(rowSums(m),decreasing=TRUE)
d <- data.frame(word = names(v),freq=v)
pal2 <- brewer.pal(8,"Dark2")
png("wordcloud.png", width=640,height=640)
wordcloud(d$word,d$freq, scale=c(8,.2), min.freq=5, max.words=200, 
random.order=FALSE, rot.per=0, colors=pal2)
dev.off()

dat[来自OP自己的编辑，但在此处重复以完成问题答案]

您需要添加，以及其他tm\u map（）
调用
语料库
corpus <- tm_map(corpus, iconv, 'cp1251', 'UTF-8')

corpus <- tm_map(corpus, iconv, 'cp1251', 'UTF-8')




[x86]相关文章推荐



                                                        
X86汇编-访问芯片
x86 
X86 cpu如何在实模式下计算20位地址
x86 
以x86或x64运行的应用程序？
x86 
x86内存模型中的互斥实现
x86 
X86 TLB invlpg指令具有较长的延迟
x86 
X86 一次添加2个、4个或更多短值
x86 
LLVM后端：替换x86后端的间接JMP
x86llvm 
X86 使用AVX2计算8个长整数的最小值
x86 
X86 将预编译的GRUB 2安装到原始映像
x86 
X86 将8个16位SSE寄存器转换为8位数据
x86 
X86 INT 10h未打印通过堆栈的字符
x86 
X86 PCIe枚举后BIOS卡滞
x86 
x86为什么我在移动变量时总是在al寄存器中获取CD
x86 
X86 如何以及何时在VMCS主机状态区域中保存主机CPU状态？
x86 
                                       





随机文章推荐



                                                        
Playframework 2.0 如何使用play 2.0渲染二进制文件？
playframework-2.0 
Playframework 2.0 playframework 2，getServletContext
playframework-2.0 
Playframework 2.0 Play 2.x中的约定优于配置
playframework-2.0 
Playframework 2.0 使用Ebean播放框架检查数据库中是否存在某些内容
playframework-2.0 
Playframework 2.0 游戏框架是否使用；“动态查找器”；
playframework-2.0 
如何连接游戏！从Ebean到ElasticSearch的框架
playframework-2.0


                                        

                                        
                                        


                                                
                                                        [r]相关推荐
                                                        
                                                        
                                                

                                                
                                                        Tags
                                                        
Amazon Web Services
Maven 2
Google Cloud Platform
Memory Management
Artificial Intelligence
Prestashop
Odoo
Url Rewriting
Login
Process
Coq
Gradle
Encryption
Exchange Server
Cmake
Oop
Perforce
Blazor
Data Structures
Gmail
Yii2
User Interface
Xpages
Sequelize.js
Vhdl
Dictionary
Swagger
Api
Dll
Responsive Design
Ibm Midrange
Jsf 2
Sbt
Xpath
Printing
Dependency Injection
Playframework 2.0
Netsuite
Laravel
Transactions
Webstorm
Mfc
Design Patterns
Sharepoint
Jira
Dialogflow Es
Grep
Caching
Google Chrome Extension
Permissions
Haskell
Php
Windows Phone
Angular
Lambda
Azure Devops
Image
Memory Leaks
Cobol
Cordova
Cuda
Rx Java
Tridion
Axapta
Ssrs 2008
Css
Ffmpeg
Ios6
Actionscript 3
Oauth 2.0
Linkedin
Android Fragments
Nativescript
Google Maps Api 3
Cocoa Touch
Macos
Selenium
Plone
Xmpp
System Verilog
Teradata
Zend Framework
Streaming
Ruby
Clojure
Cluster Computing
Rdf
Openlayers 3
Uiview
Google Compute Engine
Gruntjs
D
Facebook
Parse Platform
Ipad
Java
Asp.net Mvc 5
Windows 10
Testing
Binary
Gdb
Sencha Touch
Gitlab
Nsis
Computer Science
Kubernetes
Unix
Jwt
Swift2
Ecmascript 6
Jetty
Dependencies
Shopify
Compression
Gwt
Breeze
Wix
Doctrine Orm
Internet Explorer
Emacs
Error Handling
Binding
Floating Point
Validation
Material Ui
Pycharm
Arangodb
Identityserver4
Grails
Vim
Sharepoint 2010
Redirect
Glassfish
Testng
Debugging
Angularjs
Character Encoding
Aframe
Titanium
Editor
Express
Hyperledger Fabric
Tensorflow
Windows
Ms Office
Google Analytics
Nginx
Database Design
Bootstrap 4
Asp.net Mvc 3
Akka
Sas
Apache
Listview
Docker Compose
Templates
Tomcat
Matlab
Twig
Visual Studio 2013
Internet Explorer 8
Wicket
Io
Netty
Timer
Network Programming
Google Cloud Dataflow
Sorting
Primefaces
Leaflet
Raspberry Pi
Spring Integration
Time
Gis
Jdbc
Report
Autocomplete
Arduino
Amp Html
Typescript
Snowflake Cloud Data Platform
Openlayers
Latex
Sprite Kit
Karate
Mapbox
Command Line
Jqgrid
Directx
Smtp
Sencha Touch 2
Animation
Rabbitmq
Automation
Routes
Windbg
Content Management System
Indexing
Ftp
Ibm Cloud
Windows 8


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网