用python 2.7解析facebook html
我使用的是python 2.7,我试图提取喜欢我的facebook页面照片的人的个人ID。我的代码是:用python 2.7解析facebook html,facebook,python-2.7,facebook-graph-api,beautifulsoup,html-parsing,Facebook,Python 2.7,Facebook Graph Api,Beautifulsoup,Html Parsing,我使用的是python 2.7,我试图提取喜欢我的facebook页面照片的人的个人ID。我的代码是: import urllib2 from bs4 import BeautifulSoup import mechanize br = mechanize.Browser() htmltext = br.open("url").read() soup = BeautifulSoup(htmltext) search = soup.findAll('div',attrs={'class'
import urllib2
from bs4 import BeautifulSoup
import mechanize
br = mechanize.Browser()
htmltext = br.open("url").read()
soup = BeautifulSoup(htmltext)
search = soup.findAll('div',attrs={'class':'_5j0e fsl fwb fcb'})
print search
但是当运行这段代码时,我得到了空括号[]。此外,当我运行相同的代码,但“打印汤”而不是“打印搜索”时,我得到了HTML,但ID不在那里,我甚至按住Ctrl+F键查找它们,但它们不在那里,因此我的代码似乎根本没有提取这些部分
import urllib2
from bs4 import BeautifulSoup
import mechanize
br = mechanize.Browser()
htmltext = br.open("url").read()
soup = BeautifulSoup(htmltext)
search = soup.findAll('div',attrs={'class':'_5j0e fsl fwb fcb'})
print search
谢谢大家! Facebook上不允许进行抓取,您必须使用Graph API获取数据。这是一个非常简单的API调用:
/post id/likes
import urllib2
from bs4 import BeautifulSoup
import mechanize
br = mechanize.Browser()
htmltext = br.open("url").read()
soup = BeautifulSoup(htmltext)
search = soup.findAll('div',attrs={'class':'_5j0e fsl fwb fcb'})
print search
更多信息:
import urllib2
from bs4 import BeautifulSoup
import mechanize
br = mechanize.Browser()
htmltext = br.open("url").read()
soup = BeautifulSoup(htmltext)
search = soup.findAll('div',attrs={'class':'_5j0e fsl fwb fcb'})
print search