在R中从internet下载数据时出错_R

在R中从internet下载数据时出错

在R中从internet下载数据时出错,r,R,我正试图从internet下载一些数据，以便在R中使用文本挖掘，但运行该命令失败这些命令是： url <- 'http://www.gutenberg.org/cache/epub/100/pg100.txt' arquivo <- read.csv(url) 我尝试了几个read.csv（）函数的参数，但没有成功这是古腾堡项目的文本（.txt）文档。使用读线 url <- 'http://www.gutenberg.org/cache/epub/100/pg100.t

我正试图从internet下载一些数据，以便在

中使用

文本挖掘

，但运行该命令失败

这些命令是：

url <- 'http://www.gutenberg.org/cache/epub/100/pg100.txt' 
arquivo <- read.csv(url)

我尝试了几个

read.csv（）

函数的参数，但没有成功

这是古腾堡项目的文本（.txt）文档。使用读线

url <- 'http://www.gutenberg.org/cache/epub/100/pg100.txt' 
arquivo <- readLines(url)

urltidyverse

package

readr

是一个选项：

arquivo <- readr::read_file(url)

arquivo这：
告诉您流中有非文本数据。经检查，这似乎是一个GZ编码的流，web浏览器将动态解码以呈现纯文本。R可能不想这么做。您可以从此URL获取纯文本版本
> txt = readLines("http://www.gutenberg.org/files/100/100-0.txt")
> txt[14532]
[1] "ADRIANA. To fetch my poor distracted husband hence."
> txt[143532]
[1] "    He looks like sooth. He says he loves my daughter;"

这也给了我关于嵌入式nul字符的警告。正在进行的在线课程讲师可以毫无问题地下载数据。我使用的是Linux-Manjaro，是这样吗？
Error in make.names(col.names, unique = TRUE) : 
  invalid multibyte string 1
In addition: Warning message:
In read.table(file = file, header = header, sep = sep, quote = quote,  :
  line 1 appears to contain embedded nulls

> txt = readLines("http://www.gutenberg.org/files/100/100-0.txt")
> txt[14532]
[1] "ADRIANA. To fetch my poor distracted husband hence."
> txt[143532]
[1] "    He looks like sooth. He says he loves my daughter;"