Java 如何提取论坛帖子';谁是jsoup的作者?
Java 如何提取论坛帖子';谁是jsoup的作者?,java,html,tags,jsoup,Java,Html,Tags,Jsoup,<div class="username_container"> <div id="yui-gen11" class="popupmenu memberaction"> <a id="yui-gen13" class="username offline popupctrl" href="member.php?u=276113" title="PilotPhill is offline"><strong>**
<div class="username_container">
<div id="yui-gen11" class="popupmenu memberaction">
<a id="yui-gen13" class="username offline popupctrl" href="member.php?u=276113" title="PilotPhill is offline"><strong>**PilotPhill**</strong></a>
<ul id="yui-gen12" class="popupbody memberaction_body">
<li class="left">
-
我正在尝试使用jsoup提取论坛线程的作者名称。它是强标记中的名称。我试过接近所有东西,但就是没能得到它。有什么建议吗?我一直在使用jSoup.select(),但如果还有其他方法,我很乐意尝试。您只需使用
文档。选择,正如您已经发现的那样
<div class="username_container">
<div id="yui-gen11" class="popupmenu memberaction">
<a id="yui-gen13" class="username offline popupctrl" href="member.php?u=276113" title="PilotPhill is offline"><strong>**PilotPhill**</strong></a>
<ul id="yui-gen12" class="popupbody memberaction_body">
<li class="left">
Document doc = Jsoup.connect("http://www.foo.com").get();
Elements usernames = doc.select("a.username strong");
for (Element username: usernames) {
System.out.println("found username: " + username.text();
}
a.username strong
表示:标签strong
内部标签a
具有属性class
的值username
,如:
<div class="username_container">
<div id="yui-gen11" class="popupmenu memberaction">
<a id="yui-gen13" class="username offline popupctrl" href="member.php?u=276113" title="PilotPhill is offline"><strong>**PilotPhill**</strong></a>
<ul id="yui-gen12" class="popupbody memberaction_body">
<li class="left">
<a id="yui-gen13" class="username offline popupctrl"
href="member.php?u=276113" title="PilotPhill is offline">
<strong>**PilotPhill**</strong>
</a>