Scraping data over time in R with an API limit

r web-scraping

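I am trying to scrape messages from the Nordpool website.

Link to the messages:

API:

Unfortunately, the API only allows 1000 messages to be retrieved per request. However, I want to include all messages about nuclear power in SE from 1 January 2012 until today (definitely more than 1000).

This is the code I have for scraping the most recent 1000 messages: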

library(jsonlite)

url <- "https://ummapi.nordpoolgroup.com/messages?fuelTypes=14&IncludeOutdated=true&publicationStopDate=2019-10-11&areas=10Y1001A1001A46L&limit=1000"

data <- as.data.frame(fromJSON(url))

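Looking at the API documentation, there appears to be a provision to skip X records, so you can access the next 1000 records by setting the skip value to 1000, the 1000 after that by setting it to 2000, and so on.
So you could write a loop that keeps requesting pages until every record has been read, for example: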
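library(jsonlite)

# Base URL; the skip offset is appended separately for each request
base_url <- "https://ummapi.nordpoolgroup.com/messages?fuelTypes=14&IncludeOutdated=true&publicationStopDate=2019-10-11&areas=10Y1001A1001A46L&limit=1000&skip="
total <- 0     # total number of matching records, as reported by the API
skiprec <- 0   # number of records to skip in the current request
df1 <- c()     # collected reasonCode values

repeat {
  # Build the URL from the base every time; overwriting `url` with
  # paste0(url, skiprec) would pile up offsets (skip=0, skip=01000, ...)
  page_url <- paste0(base_url, skiprec)
  req <- fromJSON(page_url)

  total <- req$total

  article <- req$items$reasonCode
  df1 <- append(df1, article)

  # Advance to the next page and stop once every record has been requested
  skiprec <- skiprec + 1000
  if (skiprec >= total) {
    break
  }
}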
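
If you want the full records rather than only reasonCode, the same skip-based loop can be wrapped in a small function. The following is a minimal sketch, not a tested client for this API: it assumes the response carries the `total` and `items` fields used above and that `items` parses to a data frame. The names `fetch_all_items`, `fetch` and `pause` are made up for illustration; `fetch` is only there so the loop can be tried without network access.

```r
library(jsonlite)

# Hypothetical helper: request page after page with limit/skip and
# combine the `items` data frames into one.
fetch_all_items <- function(base_url, page_size = 1000,
                            fetch = fromJSON, pause = 0.5) {
  pages <- list()
  skip <- 0
  repeat {
    res <- fetch(paste0(base_url, "&limit=", page_size, "&skip=", skip))
    items <- res$items
    # An empty page ends the loop even if `total` is missing or wrong
    if (is.null(items) || NROW(items) == 0) break
    pages[[length(pages) + 1]] <- items
    skip <- skip + page_size
    # Stop as soon as the reported total has been covered
    if (!is.null(res$total) && skip >= res$total) break
    Sys.sleep(pause)  # short delay between requests
  }
  if (length(pages) == 0) return(data.frame())
  rbind_pages(pages)  # jsonlite helper for combining paged data frames
}

# Possible usage with the URL from the question (limit and skip left out):
# base_url <- "https://ummapi.nordpoolgroup.com/messages?fuelTypes=14&IncludeOutdated=true&publicationStopDate=2019-10-11&areas=10Y1001A1001A46L"
# messages <- fetch_all_items(base_url)
```

Keeping the pages in a list and binding them once at the end avoids growing a data frame inside the loop.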

