Python 如何获取特定的<；td>；在a<；tr>；与美联_Python_Html_Beautifulsoup - Fatal编程技术网

Python 如何获取特定的<；td>；在a<；tr>；与美联

python html

Python 如何获取特定的<；td>；在a<；tr>；与美联,python,html,beautifulsoup,Python,Html,Beautifulsoup,试图从纽约市高中列表的wiki页面中获取所有高中的名字我已经写了足够多的脚本来获取包含高中、学术领域和入学标准列表的标签中包含的所有信息——但我如何才能将其缩小到我认为应该在td[0]中的范围（这会吐出关键错误）-只是学校的名字到目前为止，我编写的代码是： from bs4 import BeautifulSoup from urllib2 import urlopen NYC = 'https://en.wikipedia.org/wiki/List_of_high_schools_in

试图从纽约市高中列表的wiki页面中获取所有高中的名字

我已经写了足够多的脚本来获取包含高中、学术领域和入学标准列表的

标签中包含的所有信息——但我如何才能将其缩小到我认为应该在

td[0]

中的范围（这会吐出

关键错误）-只是学校的名字
到目前为止，我编写的代码是：
from bs4 import BeautifulSoup
from urllib2 import urlopen

NYC = 'https://en.wikipedia.org/wiki/List_of_high_schools_in_New_York_City'

html = urlopen(NYC)
soup = BeautifulSoup(html.read(), 'lxml')
schooltable = soup.find('table')
for td in schooltable:
    print(td)

我收到的输出：
<tr>
    <td><a href="/wiki/The_Beacon_School" title="The Beacon School">The Beacon School</a></td>
    <td>Humanities &amp; interdisciplinary</td>
    <td>Academic record, interview</td>
</tr>

如何获取页面上的第一个表
，迭代除第一个标题行以外的所有行，并获取每行的第一个td
元素。为我工作：
for row in soup.table.find_all('tr')[1:]:
    print(row.td.text)

我还通过查找
中的所有锚固件，然后查找标题来实现这一点：
titles = next(
    i.get('title') for i in [
        td.find('a') for td in soup.findAll('td') if td.find('a') is not None
        ]

titles = next(
    i.get('title') for i in [
        td.find('a') for td in soup.findAll('td') if td.find('a') is not None
        ]




[html]相关文章推荐



                                                        
Html 时代，铬和狩猎。不支持IE。有谁愿意说这个答案有什么问题吗？Align是一个不推荐使用的属性，所以Firefox没有理由支持它。嗯，是的，但是它不推荐的原因是什么？它似乎很有用。如果不是align属性，那么使用CSS设置样式将非常有用。我知道这可能会导致一
htmlcss 
设置`<；html>；`CSS中的元素？
htmlcss 
Html 如何渲染<；李>；肩并肩？
htmlcss 
Html 我想在下一个jsp中获取禁用文本框的值，但得到的是空值
htmljsp 
Html 文本区域溢出时打印
htmlprinting 
Html 如何防止浏览器询问favicon？
htmlbrowser 
Html 如何删除div标记之间的不可见空格
htmlcss 
HTML/CSS：总页面宽度比内容宽
htmlcss 
Html 内联级元素vs短语级元素vs块级元素
html 
Html 当我点击一个按钮时，它会一直导航到页面的顶部
htmlcss 
Html 错误：指定的强制转换无效。
htmlasp.netc#-4.0 
HTML CSS中的水平滚动
htmlcssscroll 
HTML：如何获得开发页面/站点的学分（作者身份）
html 
Html 字体族的含义是什么？
htmlcssfonts 
Html 如何设置身体背景的图像不透明度
htmlcss 
Html 如果未选中，则不会在HTTPRequest中发送复选框值
htmljspcheckbox 
Html 使用Firefox无法正确渲染TD边框
htmlcsswindowsfirefox 
Html 无法使用z索引将一个div与另一个div重叠
htmlcss 
通过CSS修改具有特定HTML属性的内联元素
htmlcss 
Html 我应该使用arial标签还是视觉隐藏的标签元素
html 
                                       





随机文章推荐



                                                        
File 在同一父目录中具有相同名称的文件和目录-Solaris 8，ufs
filedirectoryfilesystems 
File 如何将文件夹中的文件名提取为文本？
filedirectory 
File 将文件内容显示为二进制文件
filebinary 
File PowerShell:脚本一结束，成功写入的配置文件就消失了？
filepowershell 
FileInputFormat，其中文件名为键，文本内容为值
fileinputmaphadoop 
File SSIS 2008在具有不同布局的平面文件上循环
fileloopsssis 
File Bazaar：提交时自动修改文件并提交修改
file 
File 大文件通知
filehttp 
File Bash脚本+；逐行阅读+；含糊不清？
filebash 
File 如何向不同的用户发送桌面快捷方式（到他们机器上的本地html文件），并使其工作并保留自定义图标？
file 
File 用C++创建和写入UTF-8文件
我必须在我的HTTP流式C++服务器代码中创建M3U8播放列表。m3u8只不过是UTF-8 m3u文件
filec++utf-8 
File 带有空格的批处理文件循环
fileloopsbatch-filedirectory 
File Python-从文件读取
fileinput 
File 逐行比较：unix中的两个文件
fileunix 
File 如何将同一字符串附加到多个pdf名称文件
fileshellbatch-file 
File Delphi I/O错误103
filedelphi 
File 如何使用golang检查web文件是否存在？
filehttpgo 
File 批处理文件&x201C；错误”；
filebatch-file 
File 如果目标目录中不存在文件，则移动Powershell文件结构
filepowershell 
File 如何从文件中读取字节
filestreamprolog


                                        

                                        
                                        


                                                
                                                        [python]相关推荐
                                                        
用python下载大zip文件
									Python
							 									Download
							 
Python 是否可以与South保持全球（所有应用程序）迁移历史记录？
									Python
							 									Django
							 
Python “结果对象”的文档；队列“声明”；在皮卡
									Python
							 									Rabbitmq
							 
Python 在我的算法中实现原始输入
									Python
							 									Python 2.7
							 
python unittests中的Neo4j无常数据库
									Python
							 									Unit Testing
							 									Neo4j
							 
在Python3.x中从文件读取令牌
									Python
							 									Python 3.x
							 									Input
							 
Python 使用pyparsing从ovs转储流提取数据
									Python
							 									Python 2.7
							 
Python 如何使用boto在路由53中的DNS记录上设置地理位置
									Python
							 									Dns
							 
在python中使用生成器函数实现长除法
									Python
							 									Python 2.7
							 
Python urllib2代码返回；HTTP错误503“；在一台机器上，而不是在另一台机器上
									Python
							 									Http
							 									Google Compute Engine
							 
Python Flask Security'的返回值是多少；使用什么上下文处理器？
									Python
							 									Flask
							 
Python 正则表达式字符串替换：如果backref为空，则省略逗号
									Python
							 									Regex
							 									Sed
							 									Ansible
							 
内存密集型Python程序
									Python
							 									Mongodb
							 									Python 2.7
							 
Python Django管理和ImageField维度限制
									Python
							 									Django
							 
Can tensorflow'；s mod运算符匹配python'；s模实现？
									Python
							 									Tensorflow
							 
Python 我们是否必须将所有传入的参数放入其自身中。变量，然后在类中的另一个表达式中使用它们？
									Python
							 									Class
							 									Variables
							 
Python Selenium不在'；对于'；环
									Python
							 									Selenium
							 									Parsing
							 									Selenium Webdriver
							 
Python 为什么我的PyGame应用程序根本没有运行？
									Python
							 									Macos
							 
Python 从“保存已安装的模型”；statsmodels“；
									Python
							 
Python 找不到模块错误没有名为'；app&x27；希罗库：
									Python
							 									Flask
							 									Heroku
							 
Python 即使使用close（）函数，我也无法访问此文件
									Python
							 									Php
							 									Python 3.x
							 
Python ValueError:Layer MLP_模型需要1个输入，但收到460个输入张量。（在致密层）
									Python
							 									Keras
							 
如何使用代理python连接到网站
									Python
							 									Proxy
							 
使用Python从BAM文件和vcf文件中提取具有不同位置的读取和匹配（也可以使用PYSAM和PYSAM、bamnostics）
									Python
							 
Python 内容类型26用于<；类别''&燃气轮机#2不指向子类
									Python
							 									Django
							 
Python 即使在安装TeX软件包之后，通过LaTeX将.ipynb转换为pdf也会出现此错误
									Python
							 									Macos
							 									Jupyter Notebook
							 									Latex
							 
Python 以标签范围为中心的回归LSTM输出值
									Python
							 									C++
							 									Neural Network
							 
PythonKivymd-从一个屏幕输入，在另一个屏幕显示输出
									Python
							 
Python 选择jupyter中的所有引用
									Python
							 									Jupyter Notebook
							 									Pycharm
							 
Python 如何在创建数据库时在allauth中传递电子邮件验证帐户
									Python
							 									Django
							 
                                                        
                                                

                                                
                                                        Tags
                                                        
Installation
Stripe Payments
Oracle10g
Web Services
Google Drive Api
Math
Firebase
Pytorch
Download
Google Compute Engine
Applescript
Office365
Bots
Android Layout
Webpack
Gatsby
Angular6
Hibernate
Xampp
Db2
Python
Sql
Websocket
Gwt
Activemq
Salesforce
Struct
Inheritance
Silverlight 4.0
Iis 7
Biztalk
Material Ui
Aem
Grafana
Centos
Ecmascript 6
Asp.net
Path
Playframework
Crystal Reports
Cakephp
Pentaho
Blazor
Polymer
Sphinx
Vb.net
Arangodb
Zend Framework
Embedded
Log4net
Xsd
Module
Maven
Webrtc
Tensorflow
Dynamic
Web Crawler
Joomla
Rdf
Emacs
Formatting
Date
Single Sign On
Ember.js
Umbraco
Gmail
Cryptography
Antlr4
Google App Maker
Calendar
Plone
Ionic2
Class
Sprite Kit
Asp.net Core
Inno Setup
Nhibernate
Linq
Netty
Extjs4
Notifications
Graphviz
Netbeans
Active Directory
Libgdx
Plugins
Ada
Fortran
Logic
Vue.js
Matlab
Artificial Intelligence
Teamcity
Winforms
Html5 Canvas
Openlayers
Checkbox
Sbt
Eclipse Plugin
Sap
Solr
Apache Zookeeper
Encoding
Sonarqube
Virtualbox
Google Chrome Devtools
Flutter
Asterisk
Java Me
Kdb
Apache Spark
Cuda
Puppet
Compiler Errors
.net
Multithreading
Mysql
Junit
Google Cloud Firestore
Notepad++
Windows Installer
Grep
Windows 8
Winapi
Silverlight
Qt
Opengl
Msbuild
Sugarcrm
Wso2
Unicode
Asp.net Mvc 3
Ruby On Rails 3
Apache Nifi
Big O
Silverstripe
Mongodb
Tree
Extjs
Dataframe
Network Programming
Computer Science
Actionscript
Windows Mobile
Nestjs
Apache Pig
Node.js
Doxygen
Drupal 6
Apache2
Fonts
Shopify
Gstreamer
Terminal
Signalr
Model
Ipad
Python Sphinx
Appium
Qt4
Couchdb
Rss
Bash
Flask
Google Chrome Extension
Apache Flex
Concurrency
Memory
Mqtt
Generics
Dask
Kendo Ui
Statistics
C++
Uml
Powershell
Twig
Geometry
Anaconda
Project Management
Doctrine Orm
Architecture
Github
Drop Down Menu
Elixir
Entity Framework Core
Excel
Ruby On Rails
Jquery Mobile
Vim
Import
Omnet++
Cors
View
Markdown
Lisp
Ffmpeg
Memory Leaks
Bison
Titanium
Apache Storm


                

                        
						
                        
                                
                                        
                                                
                                                        
                                                                Copyright © 2024. All Rights Reserved by  - Fatal编程技术网