C# HtmlAlityPack从html页面读取数据
我试图使用HTMLAgilityPack从html表中获取数据,但一直只获取第一个表行中的数据 我正在阅读的html代码如下所示:C# HtmlAlityPack从html页面读取数据,c#,html-agility-pack,C#,Html Agility Pack,我试图使用HTMLAgilityPack从html表中获取数据,但一直只获取第一个表行中的数据 我正在阅读的html代码如下所示: <div id="mainDiv"> <table id="tbl"> <thead> <tr> <th class="tbl_col1">UserName</th> <th c
<div id="mainDiv">
<table id="tbl">
<thead>
<tr>
<th class="tbl_col1">UserName</th>
<th class="tbl_col2">Points</th>
</tr>
</thead>
<tbody>
<tr data-source="provider1">
<td class="tbl_col1">
<a href="/Users/1090" id="UserLink" target="_blank">UserName1</a>
</td>
<td class="tbl_col2">
<a href="/UserPoints/1090" id="PointLink" target="_blank">1892 <span class="up_arrow"> </span></a>
</td>
</tr>
<tr data-source="provider2">
<td class="tbl_col1">
<a href="/Users/1090" id="UserLink" target="_blank">UserName2</a>
</td>
<td class="tbl_col2">
<a href="/UserPoints/1090" id="PointLink" target="_blank">3217 <span class="down_arrow"> </span></a>
</td>
</tr>
</tbody>
</table>
</div>
但我一直得到这样的结果:
provider1 UserName1 1892
provider2 UserName1 1892
您做出了错误的假设:
//a[@id='UserLink']/text()
和//a[@id='PointLink']/text()
在整个文档中搜索。这就是为什么会得到第一个tr
节点。只需使用:
string UserName = row.SelectSingleNode("td[@class='tbl_col1']/a[@id='UserLink']/text()").InnerText;
string Points = row.SelectSingleNode("td[@class='tbl_col2']/a[@id='PointLink']/text()").InnerText;
此外,您还可以真正简化代码的其余部分:
var UserTable = doc.DocumentNode.SelectNodes("//div[@id='mainDiv']/table[@id='tbl']/tbody/tr");
foreach (var row in UserTable)
{
if (row.Attributes["data-source"] != null)
{
string Source = row.Attributes["data-source"].Value;
string UserName = row.SelectSingleNode("td[@class='tbl_col1']/a[@id='UserLink']/text()").InnerText;
string Points = row.SelectSingleNode("td[@class='tbl_col2']/a[@id='PointLink']/text()").InnerText;
Console.WriteLine(Source + "\t" + UserName + "\t" + Points);
}
}
var UserTable = doc.DocumentNode.SelectNodes("//div[@id='mainDiv']/table[@id='tbl']/tbody/tr");
foreach (var row in UserTable)
{
if (row.Attributes["data-source"] != null)
{
string Source = row.Attributes["data-source"].Value;
string UserName = row.SelectSingleNode("td[@class='tbl_col1']/a[@id='UserLink']/text()").InnerText;
string Points = row.SelectSingleNode("td[@class='tbl_col2']/a[@id='PointLink']/text()").InnerText;
Console.WriteLine(Source + "\t" + UserName + "\t" + Points);
}
}