
Regex: reading into a data.frame

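I need to read BYU's tuition data into a data.frame using the readHTMLTable function. I also need to clean the data and name the three variables "year", "lds", and "nonlds".

I have the following code: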

library("XML")
download.file("http://yfacts.byu.edu/Article?id=85",
          destfile = "tuitiondata.html")

BYUtuition <- readHTMLTable("tuitiondata.html",
             header=T, skip.rows=4,
             colClasses=c("character","FormattedNumber","FormattedNumber"))
names(BYUtuition)<-c("year","lds","nonlds")

Your BYUtuition is a list. Use [[1]] to extract the data.frame inside it. You can then do the formatting yourself instead of relying on FormattedNumber:
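To see why the list matters, here is a minimal sketch in base R that does not touch the network. The `tables` object is a hypothetical stand-in for what readHTMLTable returns, and the values in it are made up for illustration, not taken from the BYU page.

```r
# Hypothetical stand-in for the return value of readHTMLTable():
# a list whose first element is the table we want. Values are made up.
tables <- list(
  data.frame(V1 = c("2010", "2011"),
             V2 = c("$4,290", "$4,420"),
             V3 = c("$8,580", "$8,840"),
             stringsAsFactors = FALSE)
)

# The outer object is a list, not a data.frame, so names() on it
# refers to the list elements rather than to the table's columns.
is.data.frame(tables)

# Extract the data.frame first, then rename its three columns.
tuition <- tables[[1]]
names(tuition) <- c("year", "lds", "nonlds")
str(tuition)
```

Once the data.frame has been extracted, names() applies to its columns, which is what the question intended.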

library("XML")

# readHTMLTable() returns a list of tables; [[1]] extracts the first data.frame
BYUtuition <- readHTMLTable("tuitiondata.html", header = TRUE, skip.rows = 4)[[1]]

# remove rows with any NA
BYUtuition <- na.omit(BYUtuition)

# set names
names(BYUtuition) <- c("year", "lds", "nonlds")

# convert tuition to numeric: strip "$", "," and anything else that is not a digit or "."
BYUtuition$lds <- as.numeric(gsub("[^0-9.]", "", BYUtuition$lds))
BYUtuition$nonlds <- as.numeric(gsub("[^0-9.]", "", BYUtuition$nonlds))

# show final table
BYUtuition
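The gsub step can be tried on its own. The sketch below assumes the tuition cells look like currency strings such as "$4,290"; the sample values and the helper name `parse_money` are hypothetical, chosen only to illustrate the pattern.

```r
# Hypothetical helper: strip everything except digits and the decimal
# point, then convert the remaining text to numeric.
parse_money <- function(x) {
  as.numeric(gsub("[^0-9.]", "", as.character(x)))
}

# Made-up sample values, not taken from the BYU page.
raw <- c("$4,290", "$8,580", "$10,120.50")
clean <- parse_money(raw)
clean
```

The as.character() call is there so the helper also works if the column was read in as a factor.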