R: matching Wikipedia page views to trading days


I have Wikipedia page view data like this:

library(wikipediatrend)

views <-wp_trend(page = "European debt crisis",from = "2010-01-01",to = "2014-12-31",lang = "en",friendly = TRUE,requestFrom = "wp.trend.tester at wptt.wptt",userAgent = TRUE)

date        count
2010-01-01    128
2010-01-02    142

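Now I want to extract the page views only for days on which trading took place, that is, excluding weekends, holidays, and unscheduled closures such as the one caused by Hurricane Sandy. What is the simplest way to extract these values?

This is essentially a subsetting (or filtering) problem: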

So we want to subset the views data frame by its date column so that it only contains the dates found in index(GSPC):

wiki_data <- views[views$date %in% index(GSPC), ]

Yes, I understood this later and solved it as follows:

gspcdf<-data.frame(date = index(GSPC), GSPC, row.names=NULL)


CombDF<-merge(views,gspcdf, by.x='date', by.y='date')
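To see why the merge approach also drops non-trading days, here is a minimal sketch on toy data. The data frames views_toy and prices_toy and all the numbers in them are invented for illustration; they only stand in for views and gspcdf. By default merge() keeps only rows whose key appears in both inputs, so dates without a price row disappear.

```r
# Toy stand-ins for `views` and `gspcdf`; all values are made up.
views_toy <- data.frame(
  date  = as.Date("2010-01-01") + 0:6,  # Fri 2010-01-01 through Thu 2010-01-07
  count = c(128, 142, 150, 160, 170, 180, 190)
)

# Pretend these are the trading days: Jan 1 (holiday) and the
# weekend of Jan 2-3 are absent, as they would be in index(GSPC).
prices_toy <- data.frame(
  date  = as.Date(c("2010-01-04", "2010-01-05", "2010-01-06", "2010-01-07")),
  close = c(100, 101, 102, 103)
)

# merge() performs an inner join by default (all = FALSE),
# so only dates present in both data frames survive.
comb_toy <- merge(views_toy, prices_toy, by = "date")
print(comb_toy)
```

If you would rather keep every page view row and get NA for the price on non-trading days, merge(..., all.x = TRUE) does that instead.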

It is good style to use spaces in code such as the gspcdf assignment.

Can you share a bit more of the source code?
# get the wikipedia views data
library(wikipediatrend)
views <-wp_trend(page = "European debt crisis",from = "2010-01-01",to = "2014-12-31",lang = "en",friendly = TRUE,requestFrom = "wp.trend.tester at wptt.wptt",userAgent = TRUE)

# get the stock trading data
library(quantmod)
startDate = as.Date("2010-01-01")
endDate = as.Date("2014-12-31") 
getSymbols("^GSPC", src = "yahoo", from = startDate, to = endDate) 
# where are the trading dates in the stock data?
index(GSPC)

# where are the dates in the wikipedia data?
views$date
# subset wikipedia data by stock data
# pattern is:
# table_to_subset[rule_to_subset_rows, rule_to_subset_columns]
# so to subset the wikipedia view data by the dates of the stock trading
# data we can do this:

wiki_data <- views[views$date %in% index(GSPC), ]
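The %in% subsetting can be tried without any downloads. The sketch below uses invented toy data (views_toy and trading_days are hypothetical stand-ins for views and index(GSPC)) and also lists which dates were dropped, which is a quick sanity check. The as.Date() call is a precaution and an assumption on my part: if the date column were a date-time class rather than Date, matching might silently return no rows, so check class(views$date) on the real data.

```r
# Toy stand-in for `views`; the counts are made up.
views_toy <- data.frame(
  date  = as.Date("2010-01-01") + 0:6,  # Fri 2010-01-01 through Thu 2010-01-07
  count = c(128, 142, 150, 160, 170, 180, 190)
)

# Toy stand-in for index(GSPC): only the days the market was open.
trading_days <- as.Date(c("2010-01-04", "2010-01-05", "2010-01-06", "2010-01-07"))

# Logical vector: TRUE where the page view date is a trading day.
keep <- as.Date(views_toy$date) %in% trading_days

# Row subsetting: table[rule_for_rows, rule_for_columns]
wiki_toy <- views_toy[keep, ]

# Dates that were filtered out, with their weekday names for inspection.
dropped <- views_toy$date[!keep]
print(wiki_toy)
print(data.frame(date = dropped, weekday = weekdays(dropped)))
```

Compared with merge(), this keeps only the columns of the page view table, which is convenient when the prices are not needed in the same data frame.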