通过传递术语共现矩阵，使用TextmineR包按主题加载文档_R_Text Mining_Word Embedding - Fatal编程技术网

通过传递术语共现矩阵，使用TextmineR包按主题加载文档

r

通过传递术语共现矩阵，使用TextmineR包按主题加载文档,r,text-mining,word-embedding,R,Text Mining,Word Embedding,我使用包查找与给定文档列表最相似的文档。我使用以下代码生成tcm而不是dtm tcm <- CreateTcm(doc_vec = text_df$Description, skipgram_window = 20, verbose = FALSE, cpus = 2) tcm11个月大的问题。但无论如何，还是要尝试一下从技术上讲，带有LDA嵌入的theta给你p（主题|单词），而ph

我使用包查找与给定文档列表最相似的文档。我使用以下代码生成tcm而不是dtm

tcm <- CreateTcm(doc_vec = text_df$Description,
                 skipgram_window = 20,
                 verbose = FALSE,
                 cpus = 2)

tcm11个月大的问题。但无论如何，还是要尝试一下
从技术上讲，带有LDA嵌入的theta
给你p（主题|单词），而phi
仍然给你p（主题|单词）。如果我理解正确，您希望在此模型下嵌入整个文档吗？如果是这样的话，你可以这样做
library(textmineR)

# create a tcm
tcm <- CreateTcm(nih_sample$ABSTRACT_TEXT, skipgram_window = 10)

# fit an LDA model
m <- FitLdaModel(dtm = tcm, k = 100, iterations = 100, burnin = 75)

# pull your documents into a dtm
d <- nih_sample_dtm

# get them predicted under the model
# I recommend using the "dot" method for prediction with embeddings as sparsity may
# result in underflow and throw an error using the default "gibbs" method
p <- predict(object = m, newdata = d, method = "dot")

库（textmineR）
#创建一个tcm
中医药
library(textmineR)

# create a tcm
tcm <- CreateTcm(nih_sample$ABSTRACT_TEXT, skipgram_window = 10)

# fit an LDA model
m <- FitLdaModel(dtm = tcm, k = 100, iterations = 100, burnin = 75)

# pull your documents into a dtm
d <- nih_sample_dtm

# get them predicted under the model
# I recommend using the "dot" method for prediction with embeddings as sparsity may
# result in underflow and throw an error using the default "gibbs" method
p <- predict(object = m, newdata = d, method = "dot")




[discord.py]相关文章推荐



                                                        
分页-Discord.py重写
discord.py 
如何在discord.py中检查用户的创建日期？
discord.py 
Discord.py 获取踢机器人的作者ID
discord.py 
如何提及使用discord.py的人？
discord.py 
Discord.py discord 1.0.1的小问题让机器人离开
discord.py 
Discord.py WebSocket正在关闭垃圾邮件控制台？
discord.py 
如何使用带有discord.py的现有bot使用webhook
discord.py 
Discord.py正在从审核日志中删除多条消息
discord.py 
                                       





随机文章推荐



                                                        
Windows phone 7 Windows phone 7中的ProgressBar？
windows-phone-7 
Windows phone 7 WP7的LoopingSelector的源
windows-phone-7 
Windows phone 7 wp7播发-响应处理过程中出现意外错误ECN
windows-phone-7 
Windows phone 7 如何仅在一个全景页面（屏幕）上放置按钮？
windows-phone-7 
Windows phone 7 如何在Windows Phone上实现单点登录
windows-phone-7single-sign-on 
Windows phone 7 故事板动画中的Bug？
windows-phone-7 
Windows phone 7 是否可以安装或转换.cab而不是.xap文件？
windows-phone-7 
Windows phone 7 phonegap windows phone 7的后退按钮功能
windows-phone-7cordova 
Windows phone 7 在Windows Phone 7中转换时区
windows-phone-7 
Windows phone 7 ExpanderView不显示所有项目
windows-phone-7 
Windows phone 7 在WP7列表框控件中显示应用程序设置
windows-phone-7 
Windows phone 7 无法在windows phone 7中更改PhoneBackgroundColor
windows-phone-7 
Windows phone 7 Windows Phone:使用HTTPWebRequest加载HTML源
windows-phone-7 
Windows phone 7 WP7背景音频资源不再可用
windows-phone-7audio 
Windows phone 7 BitmapImage使用SetSource同步，但用于本地（项目）文件（或者，如何将本地项目文件获取为流）
windows-phone-7windows-phone-8 
Windows phone 7 Windows Phone Emulator 7.1未打开
windows-phone-7windows-phonewinapi 
Windows phone 7 “抓压”；“后退”；mvvmlight中的按钮
windows-phone-7 
Windows phone 7 我想在点击按钮后打开带有特定应用程序的市场
windows-phone-7 
Windows phone 7 如何在WP7中使用诺基亚地图
windows-phone-7mapshere-api 
Windows phone 7 如何使用PhoneGap for windows phone 7将网站内容转换为移动应用程序？
windows-phone-7jquery-mobilecordovarss


                                        

                                        
                                        


                                                
                                                        [r]相关推荐
                                                        
R：如何清除所有警告
									R
							 
R 将y轴变换为百分比ggplot
									R
							 
R 向量化函数：带衰减参数的累积和
									R
							 
R 预测样式函数：在字符串和符号之间转换
									R
							 
R 将data.table有条件地划分为子表以获得列值
									R
							 
R 更改临时目录
									R
							 
函数中的统计测试（R）
									R
							 									Statistics
							 
将R图与其他R输出相结合
									R
							 									Plot
							 
在dplyr mutate（）中返回列表
									R
							 
循环遍历日期，以按天获取R中数据库中的所有表
									R
							 									Sql
							 
用R语言将数据帧传递给函数
									R
							 									Function
							 
R 为什么要删去小数部分？
									R
							 									Dataframe
							 
R 如何在文件更改时更新UI
									R
							 
在R中创建数据帧时保留引号
									R
							 
R 在ggplot中添加第二个geom_瓷砖层
									R
							 
R 嵌套列表到数据框并返回到嵌套列表
									R
							 									Dataframe
							 
R grepl检查字符串是否包含所有单词
									R
							 									Regex
							 									Nlp
							 
R 如何在检查NaN后用日志替换数据帧中的所有值
									R
							 
R 在外部LaTeX文件中将YAML参数作为宏访问
									R
							 
R 如何选择在“分组依据”之后未汇总的列？
df%总结（新的a=总和（a），新的b=总和（b））%>%选择（新的a，新的b，c）
错误：`c`必须计算为列位置或名称，而不是函数
									R
							 
R 如何以分位数计算观测值的数量？
									R
							 									Statistics
							 
R 从每一行转置从json获得的数据帧
									R
							 									Json
							 									Dataframe
							 
R k模式聚类后为新数据分配聚类的简单方法
									R
							 
如何在R中编辑线图上的标签？
									R
							 
.R文件的MIME类型
									R
							 									Wordpress
							 
R 日期/时间数据作为因子出现？
									R
							 									Date
							 
R ggplot用于分组功能区的单个通用图例条目
									R
							 
将H M S转换为R中的H:mm
									R
							 
如何使用Lappy在R中多次运行来自不同数据帧的变量的模型
									R
							 									Loops
							 
对于R中的事件前窗口，如何将lm（）应用于特定事件？
									R
							 									Events
							 									Filter
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Racket
Soap
Jqgrid
Jaxb
Winforms
Chart.js
Cloud
Spotify
Clearcase
Cygwin
Sql Server
Zsh
Sap
Ignite
Openshift
Sed
Raspberry Pi
Mips
Jsf
Openlayers
Actionscript
Debugging
Telegram
Gnuplot
Grafana
Logstash
Keras
Crystal Reports
Triggers
Sharepoint 2007
Asp.net Mvc
Printing
Automation
Dictionary
Menu
Tfs
Utf 8
Ocaml
Keyboard
Breeze
Apache Camel
Vagrant
Testing
Arrays
Enums
Twig
Text
Scheme
Android Emulator
Rxjs
Html
Delphi
Symfony
Google Analytics
Merge
Machine Learning
Asp.net Core Mvc
Usb
Tcl
C++ Cli
Magento2
Centos
Yii2
Apache Flink
Apache Nifi
Routes
Asp.net Mvc 4
Curl
C++
Google Cloud Platform
Sdk
Safari
Liferay
Hazelcast
Installation
Amazon Redshift
Memory Management
Python
Templates
Single Sign On
Streaming
String
Netty
Jdbc
Cakephp
Coffeescript
Seo
Doctrine Orm
Parameters
Bootstrap 4
Xamarin.android
Email
Acumatica
Webpack
Junit
Azure Data Factory
Material Ui
Makefile
Jquery
Activemq
Pascal
Graphviz
Pandas
Google Drive Api
Migration
C# 3.0
D3.js
Image Processing
Jestjs
Xna
Svg
Jboss
Woocommerce
Dynamics Crm
Asp.net Mvc 2
C# 4.0
Apache Kafka
Database
Azure Functions
Karate
Passwords
Lotus Notes
Sparql
Ruby On Rails
Excel
Drupal 6
Clang
Report
Wolfram Mathematica
Gmail
Webgl
Atom Editor
Reactjs
Graphics
Lucene
Azure Sql Database
Cron
Google Colaboratory
Unicode
Encryption
Xamarin
Joomla
Ravendb
Couchdb
Localization
Wcf
Sprite Kit
Http
Meteor
Sql Server 2008
Codeigniter
Postgresql
Nestjs
System Verilog
Exception
Telerik
Glsl
Playframework
Rdf
Cuda
Swift
Rest
Smtp
R
Youtube Api
Shiny
Kdb
Mfc
Biztalk
Hbase
Jasper Reports
Aurelia
Xamarin.ios
Dataframe
Core Data
Silverlight 4.0
Firefox
Google Plus
Nsis
Arduino
Pdf
Random
Paypal
Couchbase
Applescript
Sonarqube
Gtk
Date
Oracle
Workflow
Exception Handling
Spring Boot
Responsive Design
Puppet
Azure Service Fabric
Ionic Framework
Intellij Idea
Perforce
Ansible
Backbone.js
Sorting


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网