Javascript 使用YQL提取HTML内容?
假设我想从具有以下标记的网页中提取数据:Javascript 使用YQL提取HTML内容?,javascript,json,yahoo,yql,Javascript,Json,Yahoo,Yql,假设我想从具有以下标记的网页中提取数据: <table> <tr> <td><a href="Link 1">Column 1 Text</a></td> <td>Column 2 Text</td> <td>Column 3 Text</td> </tr> <tr> <td><a href=
<table>
<tr>
<td><a href="Link 1">Column 1 Text</a></td>
<td>Column 2 Text</td>
<td>Column 3 Text</td>
</tr>
<tr>
<td><a href="Link 2">Column 1 Text</a></td>
<td>Column 2 Text</td>
<td>Column 3 Text</td>
</tr>
...
</table>
我们可以用YQL吗?如果是,请给我一个查询示例
任何帮助都将不胜感激 这是一个很好的起点,使用HTML表和一些XPath查询(有关此技术的更多详细信息,请参阅): 这会产生如下JSON结果:
{
"query": {
"count": 2,
"created": "2012-01-06T20:16:46Z",
"lang": "en-US",
"results": {
"tr": [
{
"td": [
{
"a": {
"href": "Link%201",
"content": "Column 1 Text"
}
},
{
"p": "Column 2 Text"
},
{
"p": "Column 3 Text"
}
]
},
{
"td": [
{
"a": {
"href": "Link%202",
"content": "Column 1 Text"
}
},
{
"p": "Column 2 Text"
},
{
"p": "Column 3 Text"
}
]
}
]
}
}
}
下面是一个很好的起点,使用HTML表和一些XPath查询(有关此技术的更多详细信息,请参阅): 这会产生如下JSON结果:
{
"query": {
"count": 2,
"created": "2012-01-06T20:16:46Z",
"lang": "en-US",
"results": {
"tr": [
{
"td": [
{
"a": {
"href": "Link%201",
"content": "Column 1 Text"
}
},
{
"p": "Column 2 Text"
},
{
"p": "Column 3 Text"
}
]
},
{
"td": [
{
"a": {
"href": "Link%202",
"content": "Column 1 Text"
}
},
{
"p": "Column 2 Text"
},
{
"p": "Column 3 Text"
}
]
}
]
}
}
}