Ruby Nokogiri cannot access Yahoo Finance news publication dates

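I can access the Yahoo Finance news titles, but I'm having trouble parsing pubDate. I need it so that I can look only at news from, say, the last week and ignore anything older.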

require 'nokogiri'
require 'open-uri'

sym = "1313.HK"
url = "https://feeds.finance.yahoo.com/rss/2.0/headline?s=#{sym}&region=US&lang=en-US"
doc = Nokogiri::HTML(URI.open(url))

titles = doc.css("title")
puts titles.length # works, returns between 0 and 20
puts titles.text # works

pubDates = doc.css("pubDate")
puts pubDates.length # does NOT work, always 0
puts pubDates.text # does NOT work, always blank

keywordregex = /bad news/
nodes = doc.search('title') # search title tags only, for keywords
puts found_title = nodes.select { |n| n.name == 'title' && n.text =~ keywordregex } # TODO: also filter on pubDate (last 7 days only)

Try Nokogiri::XML instead; RSS is actually XML.

doc = Nokogiri::XML(URI.open(url))

Use pubdate
The node name is lowercase in the parsed document (with Nokogiri::HTML, element names are lowercased):

 > doc.css("pubdate").length
 => 7

Thanks for the tip, @pguardiario. Switching from HTML to XML helped.