Java Jsoup.parse().body().GetAllegements()将标记加倍

Java Jsoup.parse().body().GetAllegements()将标记加倍,java,html,outlook,jsoup,Java,Html,Outlook,Jsoup,这个JSoup将body标记的内容加倍有什么原因吗 public static void main(String[] args) { Jsoup.parse(myHtmlString).body().getAllElements() } 这种情况只发生在以下html代码中: <html> <head> <style> p{margin-bottom:0px;margin-top:0px;} body{font-family:Arial;fon

这个JSoup将body标记的内容加倍有什么原因吗

public static void main(String[] args) {
    Jsoup.parse(myHtmlString).body().getAllElements()
}
这种情况只发生在以下html代码中:

<html>
 <head> 
  <style> p{margin-bottom:0px;margin-top:0px;} body{font-family:Arial;font-size:10pt;} </style> 
 </head> 
 <body> 
  <div class="wordsection1"> 
   <p class="msonormal"> <span style="color:#1F497D;">&nbsp;</span></p> 
   <p class="msonormal"> <span style="color:#1F497D;">&nbsp;</span></p> 
   <div> 
    <div style="padding-top:3.0pt;padding-left:0cm;padding-right:0cm;padding-bottom:0cm;border-left-style:none;border-top-width:1.0pt;border-bottom-style:none;border-right-style:none;border-top-color:#B5C4DF;border-top-style:solid;"> 
     <p class="msonormal"> <b> <span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;">Von:</span></b><span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;"> Uasdsaa Aasdsaa [mailto:bullet1@left.de] <br><b>Gesendet:</b> Dienstag, 6. August 2013 08:59<br><b>An:</b> Helmut Grashoff (dsfaasas@gmbh.de)<br><b>Betreff:</b> erster Test f&uuml;r die 2.2.1 (mit HTML)</span></p> 
    </div> 
   </div> 
   <p class="msonormal">&nbsp;</p> 
   <p class="msonormal">Erst schauen wir mal, ob die Mail &uuml;berhaupt ankommt.</p> 
   <p class="msonormal">&nbsp;</p> 
   <p class="msonormal">Und <i> <u>gleichzeitig</u></i> <span style="font-size:18.0pt;">spiele</span> ich noch ein <span style="color:#31859C;">wenig </span> <span style="font-family:Algerian;">mit dieser Zeile</span></p> 
   <p class="msonormal">&nbsp;</p> 
   <p class="msonormal"> <span mso-fareast-language="DE">Mit freundlichen Gruessen</span></p> 
   <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
   <p class="msonormal"> <span mso-fareast-language="DE" style="color:#31849B;">Uasdsaa Aasdsaa</span></p> 
   <p class="msonormal"> <span mso-fareast-language="DE" style="font-size:9.0pt;">Iasdsaa-Sasdsaa</span></p> 
   <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
   <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span></p> 
   <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span></p> 
   <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">56218 M&uuml;lheim-K&auml;rlich</span></p> 
   <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
   <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
   <p class="msonormal">&nbsp;</p> 
  </div> 
 </body>
</html>

p{页边距底部:0px;页边距顶部:0px;}正文{字体系列:Arial;字体大小:10pt;}

Von:Uasdsaa Aasdsaa[mailto:bullet1@left.de]
Gesendet:Dienstag,6。2013年8月08:59
An:Helmut Grashoff(dsfaasas@gmbh.de)
Betreff:erster测试fü;r die 2.2.1(mit HTML)

过去的工作、任务、邮件和uuml;贝豪普特·安科姆特。

和gleichzeitig spiele我不知道该怎么做

Mit freundlichen-Gruessen

Uasdsaa Aasdsaa

Iasdsaa-Sasdsaa

asdsaa

asdsaa

56218 Mü;lheim-Kä;rlich

上述解析方法的结果并不是说这里只是主体部分的内容:

<body> 
 <div class="wordsection1"> 
  <p class="msonormal"> <span style="color:#1F497D;">&nbsp;</span></p> 
  <p class="msonormal"> <span style="color:#1F497D;">&nbsp;</span></p> 
  <div> 
   <div style="padding-top:3.0pt;padding-left:0cm;padding-right:0cm;padding-bottom:0cm;border-left-style:none;border-top-width:1.0pt;border-bottom-style:none;border-right-style:none;border-top-color:#B5C4DF;border-top-style:solid;"> 
    <p class="msonormal"> <b> <span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;">Von:</span></b><span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;"> Uasdsaa Aasdsaa [mailto:bullet1@left.de] Br2nL<b>Gesendet:</b> Dienstag, 6. August 2013 08:59Br2nL<b>An:</b> Helmut Grashoff (asdafsdf@gmbh.de)Br2nL<b>Betreff:</b> erster Test f&uuml;r die 2.2.1 (mit HTML)</span></p> 
   </div> 
  </div> 
  <p class="msonormal">&nbsp;</p> 
  <p class="msonormal">Erst schauen wir mal, ob die Mail &uuml;berhaupt ankommt.</p> 
  <p class="msonormal">&nbsp;</p> 
  <p class="msonormal">Und <i> <u>gleichzeitig</u></i> <span style="font-size:18.0pt;">spiele</span> ich noch ein <span style="color:#31859C;">wenig </span> <span style="font-family:Algerian;">mit dieser Zeile</span></p> 
  <p class="msonormal">&nbsp;</p> 
  <p class="msonormal"> <span mso-fareast-language="DE">Mit freundlichen Gruessen</span></p> 
  <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
  <p class="msonormal"> <span mso-fareast-language="DE" style="color:#31849B;">Uasdsaa Aasdsaa</span></p> 
  <p class="msonormal"> <span mso-fareast-language="DE" style="font-size:9.0pt;">Iasdsaa-Sasdsaa</span></p> 
  <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
  <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span></p> 
  <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span></p> 
  <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">56218 M&uuml;lheim-K&auml;rlich</span></p> 
  <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
  <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
  <p class="msonormal">&nbsp;</p> 
 </div>  
</body>
<div class="wordsection1"> 
 <p class="msonormal"> <span style="color:#1F497D;">&nbsp;</span></p> 
 <p class="msonormal"> <span style="color:#1F497D;">&nbsp;</span></p> 
 <div> 
  <div style="padding-top:3.0pt;padding-left:0cm;padding-right:0cm;padding-bottom:0cm;border-left-style:none;border-top-width:1.0pt;border-bottom-style:none;border-right-style:none;border-top-color:#B5C4DF;border-top-style:solid;"> 
   <p class="msonormal"> <b> <span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;">Von:</span></b><span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;"> Uasdsaa Aasdsaa [mailto:bullet1@left.de] Br2nL<b>Gesendet:</b> Dienstag, 6. August 2013 08:59Br2nL<b>An:</b> Helmut Grashoff (asfasd@gmbh.de)Br2nL<b>Betreff:</b> erster Test f&uuml;r die 2.2.1 (mit HTML)</span></p> 
  </div> 
 </div> 
 <p class="msonormal">&nbsp;</p> 
 <p class="msonormal">Erst schauen wir mal, ob die Mail &uuml;berhaupt ankommt.</p> 
 <p class="msonormal">&nbsp;</p> 
 <p class="msonormal">Und <i> <u>gleichzeitig</u></i> <span style="font-size:18.0pt;">spiele</span> ich noch ein <span style="color:#31859C;">wenig </span> <span style="font-family:Algerian;">mit dieser Zeile</span></p> 
 <p class="msonormal">&nbsp;</p> 
 <p class="msonormal"> <span mso-fareast-language="DE">Mit freundlichen Gruessen</span></p> 
 <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
 <p class="msonormal"> <span mso-fareast-language="DE" style="color:#31849B;">Uasdsaa Aasdsaa</span></p> 
 <p class="msonormal"> <span mso-fareast-language="DE" style="font-size:9.0pt;">Iasdsaa-Sasdsaa</span></p> 
 <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
 <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span></p> 
 <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span></p> 
 <p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">56218 M&uuml;lheim-K&auml;rlich</span></p> 
 <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
 <p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p> 
 <p class="msonormal">&nbsp;</p> 
</div>
<p class="msonormal"> <span style="color:#1F497D;">&nbsp;</span></p>
<span style="color:#1F497D;">&nbsp;</span>
<p class="msonormal"> <span style="color:#1F497D;">&nbsp;</span></p>
<span style="color:#1F497D;">&nbsp;</span>
<div> 
 <div style="padding-top:3.0pt;padding-left:0cm;padding-right:0cm;padding-bottom:0cm;border-left-style:none;border-top-width:1.0pt;border-bottom-style:none;border-right-style:none;border-top-color:#B5C4DF;border-top-style:solid;"> 
  <p class="msonormal"> <b> <span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;">Von:</span></b><span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;"> Uasdsaa Aasdsaa [mailto:bullet1@left.de] Br2nL<b>Gesendet:</b> Dienstag, 6. August 2013 08:59Br2nL<b>An:</b> Helmut Grashoff (asdffasd@gmbh.de)Br2nL<b>Betreff:</b> erster Test f&uuml;r die 2.2.1 (mit HTML)</span></p> 
 </div> 
</div>
<div style="padding-top:3.0pt;padding-left:0cm;padding-right:0cm;padding-bottom:0cm;border-left-style:none;border-top-width:1.0pt;border-bottom-style:none;border-right-style:none;border-top-color:#B5C4DF;border-top-style:solid;"> 
 <p class="msonormal"> <b> <span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;">Von:</span></b><span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;"> Uasdsaa Aasdsaa [mailto:bullet1@left.de] Br2nL<b>Gesendet:</b> Dienstag, 6. August 2013 08:59Br2nL<b>An:</b> Helmut Grashoff (asdfsad@gmbh.de)Br2nL<b>Betreff:</b> erster Test f&uuml;r die 2.2.1 (mit HTML)</span></p> 
</div>
<p class="msonormal"> <b> <span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;">Von:</span></b><span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;"> Uasdsaa Aasdsaa [mailto:bullet1@left.de] Br2nL<b>Gesendet:</b> Dienstag, 6. August 2013 08:59Br2nL<b>An:</b> Helmut Grashoff (asdfsdfa@gmbh.de)Br2nL<b>Betreff:</b> erster Test f&uuml;r die 2.2.1 (mit HTML)</span></p>
<b> <span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;">Von:</span></b>
<span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;">Von:</span>
<span tahoma","sans-serif";mso-fareast-language:de"="" style="font-size:10.0pt;"> Uasdsaa Aasdsaa [mailto:bullet1@left.de] Br2nL<b>Gesendet:</b> Dienstag, 6. August 2013 08:59Br2nL<b>An:</b> Helmut Grashoff (asdfsdf@gmbh.de)Br2nL<b>Betreff:</b> erster Test f&uuml;r die 2.2.1 (mit HTML)</span>
<b>Gesendet:</b>
<b>An:</b>
<b>Betreff:</b>
<p class="msonormal">&nbsp;</p>
<p class="msonormal">Erst schauen wir mal, ob die Mail &uuml;berhaupt ankommt.</p>
<p class="msonormal">&nbsp;</p>
<p class="msonormal">Und <i> <u>gleichzeitig</u></i> <span style="font-size:18.0pt;">spiele</span> ich noch ein <span style="color:#31859C;">wenig </span> <span style="font-family:Algerian;">mit dieser Zeile</span></p>
<i> <u>gleichzeitig</u></i>
<u>gleichzeitig</u>
<span style="font-size:18.0pt;">spiele</span>
<span style="color:#31859C;">wenig </span>
<span style="font-family:Algerian;">mit dieser Zeile</span>
<p class="msonormal">&nbsp;</p>
<p class="msonormal"> <span mso-fareast-language="DE">Mit freundlichen Gruessen</span></p>
<span mso-fareast-language="DE">Mit freundlichen Gruessen</span>
<p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p>
<span mso-fareast-language="DE">&nbsp;</span>
<p class="msonormal"> <span mso-fareast-language="DE" style="color:#31849B;">Uasdsaa Aasdsaa</span></p>
<span mso-fareast-language="DE" style="color:#31849B;">Uasdsaa Aasdsaa</span>
<p class="msonormal"> <span mso-fareast-language="DE" style="font-size:9.0pt;">Iasdsaa-Sasdsaa</span></p>
<span mso-fareast-language="DE" style="font-size:9.0pt;">Iasdsaa-Sasdsaa</span>
<p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p>
<span mso-fareast-language="DE">&nbsp;</span>
<p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span></p>
<span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span>
<p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span></p>
<span verdana","sans-serif";mso-fareast-language:de"="">asdsaa</span>
<p class="msonormal"> <span verdana","sans-serif";mso-fareast-language:de"="">56218 M&uuml;lheim-K&auml;rlich</span></p>
<span verdana","sans-serif";mso-fareast-language:de"="">56218 M&uuml;lheim-K&auml;rlich</span>
<p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p>
<span mso-fareast-language="DE">&nbsp;</span>
<p class="msonormal"> <span mso-fareast-language="DE">&nbsp;</span></p>
<span mso-fareast-language="DE">&nbsp;</span>
<p class="msonormal">&nbsp;</p>

Von:Uasdsaa Aasdsaa[mailto:bullet1@left.de]Br2nLGesendet:Dienstag,6。2013年8月08:59Br2nLAn:Helmut Grashoff(asdafsdf@gmbh.de)Br2nLBetreff:erster测试fü;r die 2.2.1(mit HTML)

过去的工作、任务、邮件和uuml;贝豪普特·安科姆特。

和gleichzeitig spiele我不知道该怎么做

Mit freundlichen-Gruessen

Uasdsaa Aasdsaa

Iasdsaa-Sasdsaa

asdsaa

asdsaa

56218 Mü;lheim-Kä;rlich

Von:Uasdsaa Aasdsaa[mailto:bullet1@left.de]Br2nLGesendet:Dienstag,6。2013年8月08:59Br2nLAn:Helmut Grashoff(asfasd@gmbh.de)Br2nLBetreff:erster测试fü;r die 2.2.1(mit HTML)

过去的工作、任务、邮件和uuml;贝豪普特·安科姆特。

和gleichzeitig spiele我不知道该怎么做

Mit freundlichen-Gruessen

Uasdsaa Aasdsaa

Iasdsaa-Sasdsaa

asdsaa

asdsaa

56218 Mü;lheim-Kä;rlich

Von:Uasdsaa Aasdsaa[mailto:bullet1@left.de]Br2nLGesendet:Dienstag,6。2013年8月08:59Br2nLAn:Helmut Grashoff(asdffasd@gmbh.de)Br2nLBetreff:erster测试fü;r die 2.2.1(mit HTML)

Von:Uasdsaa Aasdsaa[mailto:bullet1@left.de]Br2nLGesendet:Dienstag,6。2013年8月08:59Br2nLAn:Helmut Grashoff(asdfsad@gmbh.de)Br2nLBetreff:erster测试fü;r die 2.2.1(mit HTML)

Von:Uasdsaa Aasdsaa[mailto:bullet1@left.de]Br2nLGesendet:Dienstag,6。2013年8月08:59Br2nLAn:Helmut Grashoff(asdfsdfa@gmbh.de)Br2nLBetreff:erster测试fü;r.2.1(麻省理工学院HTML)

冯: 冯: Uasdsaa Aasdsaa[邮寄:bullet1@left.de]Br2nLGesendet:Dienstag,6。2013年8月08:59Br2nLAn:Helmut Grashoff(asdfsdf@gmbh.de)Br2nLBetreff:erster测试fü;r.2.1(麻省理工学院HTML) Gesendet: 安: Betreff:

过去的工作、任务、邮件和uuml;贝豪普特安科姆特

我不知道该怎么做

格雷希特格 格雷希特格 斯皮尔 维尼格 麻省理工学院迪泽尔学院

Mit freundlichen-Gruessen

麻省理工学院弗伦德里希-格鲁森分校

Uasdsaa Aasdsaa

Uasdsaa Aasdsaa

Iasdsaa Sasdsaa

IASDSASASDSAA

asdsaa

asdsaa asdsaa

asdsaa 56218 Mü;lheim-Kä;利希

56218 Mü;lheim-Kä;利希


我正在使用Java6和JSoup1.7.2。

它显然是完美的

事实上,它不会加倍,而是重复
元素
到引用
节点的深度w.r.t的次数

正文
下的每个
节点
都被视为
元素
,包括
正文
,直到它到达叶子
节点

如果你考虑文本<代码> Betreff:< /C> >重复7次,因为它是根以下的7个级别(<代码> <代码>),而且它是树中更深的孩子。

body>div>div>div>p>span>b