Python 3.x 如何跳过<;a>;使用selenium抓取数据时标记
HTML: 输出:Python 3.x 如何跳过<;a>;使用selenium抓取数据时标记,python-3.x,selenium,selenium-webdriver,Python 3.x,Selenium,Selenium Webdriver,HTML: 输出: applicationData = [td.text for td in webBrowser.find_elements_by_xpath('//td[@class="wpsTableNrmRow"]')] record = {'Designation': applicationData[0], 'Designation': applicationData[1],'Designation': applicationData[2]} Designat
applicationData = [td.text for td in webBrowser.find_elements_by_xpath('//td[@class="wpsTableNrmRow"]')]
record = {'Designation': applicationData[0],
'Designation': applicationData[1],'Designation': applicationData[2]}
Designation: Apple CEO all CEOs // Not required 'all CEOs'
Designation: Google CEO
Designation: Not, DEFINED
我正在从表和
试试上面的代码,这里我们使用TextCOntent,它在不同的行中返回不同的文本节点,这样您就可以使用“\n”来拆分它,如何跳过空文本节点?您可以用示例htmlYour sol不打印未定义的值作为单独的问题提问吗。我更新了代码。看一看,支持我。
Designation: Apple CEO all CEOs // Not required 'all CEOs'
Designation: Google CEO
Designation: Not, DEFINED
Designation: Apple CEO
Designation: Google CEO
Designation: // should have value 'NOT, DEFINED'
applicationData = [td.get_attribute("textContent").split("\n")[0] for td in webBrowser.find_elements_by_xpath('//td[@class="wpsTableNrmRow"]')]
record = {'Designation1': applicationData[0], 'Designation2': applicationData[1]}