Warning: file_get_contents(/data/phpspider/zhask/data//catemap/3/xpath/2.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
带有XPath和TagSoup的Java_Java_Xpath_Tag Soup - Fatal编程技术网

带有XPath和TagSoup的Java

带有XPath和TagSoup的Java,java,xpath,tag-soup,Java,Xpath,Tag Soup,我使用TagSoup和java来提取一些数据,但某些XPATH不起作用,我只得到空结果 FileReader frInHtml = new FileReader("doc.html"); BufferedReader brInHtml = new BufferedReader(frInHtml); SAXBuilder saxBuilder = new SAXBuilder("org.ccil.cowan.tagsoup.Parser"); org.jdom.Document

我使用TagSoup和java来提取一些数据,但某些XPATH不起作用,我只得到空结果

  FileReader frInHtml = new FileReader("doc.html");
  BufferedReader brInHtml = new BufferedReader(frInHtml);

  SAXBuilder saxBuilder = new SAXBuilder("org.ccil.cowan.tagsoup.Parser");
   org.jdom.Document jdomDocument = saxBuilder.build(brInHtml);

 // This is working                                
 XPath xpath = XPath.newInstance("/ns:html[1]/ns:body/ns:div[@class='content']/ns:table/ns:tr/ns:td/ns:h1");

// All 3 lines below didn't work , tried them 1 at a time 
  XPath xpath = XPath.newInstance("/ns:html/ns:body/ns:div[7]/ns:table/ns:tbody/ns:tr/ns:td/ns:h1");
  XPath xpath = XPath.newInstance("//html//body//div[7]//table//tbody//tr//td//h1");
  XPath xpath = XPath.newInstance("/html/body/div[7]/table/tbody/tr/td/h1");                               

   xpath.addNamespace("ns", "http://www.w3.org/1999/xhtml");

要调试它,您需要查看TagSoup生成的“等效XML”。为了让我们帮助您,您需要向我们展示等效的XML。

没有XML很难说。我注意到,在一个工作中,您不使用
tbody
标记,而它总是出现在其他3个工作中。