Java 我如何使用JSoup(开放供选择)解析它

Java 我如何使用JSoup(开放供选择)解析它,java,web-scraping,jsoup,Java,Web Scraping,Jsoup,我对JSoup比较陌生,我正试图解析从一个网站上刮下来的html,这就是我的思路 ..... <FONT COLOR=#2D8F26 FACE="Arial"><B>Claim:</B></FONT> &nbsp; Photograph shows a Chicago Bears fan holding a crude sign at the <NOBR>2006-07</NOBR> <NOBR>

我对JSoup比较陌生,我正试图解析从一个网站上刮下来的html,这就是我的思路

.....
    <FONT COLOR=#2D8F26 FACE="Arial"><B>Claim:</B></FONT> &nbsp; Photograph shows a Chicago Bears fan holding a crude sign at the <NOBR>2006-07</NOBR> <NOBR>NFC championship</NOBR> game.
    <BR><BR>
    <NOINDEX>
    <FONT COLOR=#2D8F26 FACE="Arial"><B>Status:</B></FONT> &nbsp; <FONT COLOR=#FF0000 FACE="Arial"><B><I>True.</I></B></FONT>
    </NOINDEX>
    <BR><BR>
    <FONT COLOR=#2D8F26 FACE="Arial"><B>Example:</B></FONT> &nbsp; <FONT COLOR=#2D8F26 FACE="Trebuchet MS,Bookman Old Style,Arial"><I>[Collected via e-mail, January 2007]</I></FONT>
    <BR><BR>
    <TABLE WIDTH=400 ALIGN=CENTER BORDER=0 BGCOLOR=#000000><TR><TD BGCOLOR=#EAF2E5>
    <FONT FACE="Verdana" SIZE=2">
    <DIV STYLE="text-align: justify; margin-top: 10px; margin-bottom: 10px; margin-left: 15px; margin-right: 15px">
    The attached photo has been circulating around the Gulf Coast region for a couple of days now (since Saturday's Bears-Saints game). Do you have any word on whether it is authentic or doctored? Was this individual really that tasteless and crude?
    <BR><BR>
    <CENTER>
......

查看JSoup文档后,它显示了基于标记获取信息的方法。但是如何使用JSoup获得所需的输出呢?任何样品或样品替代品将不胜感激

我认为您只想通过剥离HTML实体来获得文本部分。下面应该可以

Jsoup.parse("yoursInputString").text();

告诉我们你试过什么?为什么投反对票?这个问题问错了什么吗?@Pureferret:我遵循了下面的提示。我只是想看看是否有更方便的方法来满足我的需要。在问题上的投票不是为了“发表错误的东西”,而是为了进行糟糕的研究,以阻止人们过早地转向。我认为它被称为“gimmee代码”。不管怎样,这不是我的反对票,但我被诱惑了。
Jsoup.parse("yoursInputString").text();