Python mechanize找不到链接

Python mechanize找不到链接,python,mechanize,Python,Mechanize,我正在使用此代码打印网页中的链接 wb = mechanize.Browser() wb.open("https://example.com") print wb.response().read() for link in wb.links(): print "aa->" 答复如下: ... <a href=/Router/moneyReload.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02&g

我正在使用此代码打印网页中的链接

wb = mechanize.Browser()
wb.open("https://example.com")

print wb.response().read()
for link in wb.links():
     print "aa->"
答复如下:

...
    <a href=/Router/moneyReload.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Recarga De Saldo</a><br><a href=/Router/roamingMoneyReload.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Recarga en Roaming</a><br><a href=/Router/lastOperatorSale.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Ultima Recarga</a><br><a href=/Router/LastSubscriberTopUpPerform.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Consulta de Recarga</a><br><a href=/Router/stockReportCreditLimit.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Informe de Ventas</a><br><a href=/Router/stockReport.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Informe de Saldo Disponible</a><br><a href=/Router/dealerCloseBatch.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Cierre de Turno</a><br><a href=/Router/ChangePointOfSalePin.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Cambiar PIN</a><br><a href=/Router/activePortfolioQuery.web;JSESSIONID=2C832E9B3DB0EE7162D7434FC352B85E.coreAR02>Consulta Portfolio</a><br></body></html>
。。。









正如你所看到的,这里有一些链接,但我在“for”中没有得到任何链接

可能是因为他们在href属性中没有任何引号。但在这种情况下,我如何阅读它们


谢谢

您在迭代链接时不会打印链接

这项工作:

wb = mechanize.Browser()
wb.open("https://example.com")

for link in wb.links():
    print link

使用
urlopen
和一些
regex
怎么样?不,我知道,我只是打印“aa>”来检查它是否进入了for。不管怎样,我试过你的解决方案,但还是不起作用。我能够通过使用下一个构造函数来解决这个问题:
wb=mechanize.Browser(factory=mechanize.RobustFactory())
我认为这允许mechanize在格式错误的html中搜索。我也尝试过BeautifulSoup,并取得了成功