Java: How do I write the contents of a Document object to a string in NekoHTML?
I am using NekoHTML to parse the contents of some HTML files. Everything works fine except extracting the contents of the Document object into a string. I tried:
TransformerFactory transformerFactory = TransformerFactory.newInstance();
Transformer transformer = transformerFactory.newTransformer();
DOMSource source = new DOMSource(doc);
StreamResult result = new StreamResult(writer);
transformer.transform(source, result);
But nothing is returned.

There is a problem in Oracle App server 10.3.1.4. A possible solution:
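import java.io.ByteArrayOutputStream;
import org.apache.xml.serialize.OutputFormat;
import org.apache.xml.serialize.XMLSerializer;
import org.cyberneko.html.parsers.DOMParser;

// NekoHTML: parse the HTML input (archivo is the file to parse)
DOMParser parser = new DOMParser();
parser.parse(archivo);
// Xerces: configure the output format from the parsed document
OutputFormat format = new OutputFormat(parser.getDocument());
format.setIndenting(true);
// To print the XML to the console instead:
//XMLSerializer serializer = new XMLSerializer(System.out, format);
// Collect the serialized XML in memory so it can be read back as a String
ByteArrayOutputStream outputStream = new ByteArrayOutputStream();
XMLSerializer serializer = new XMLSerializer(outputStream, format);
// Serialize the document into the stream
serializer.serialize(parser.getDocument());
String xmlText = outputStream.toString();
System.out.println(xmlText);
// To write to a file, use a FileOutputStream instead of System.out:
//XMLSerializer serializer = new XMLSerializer(new FileOutputStream(new File("book.xml")), format);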
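URL:
See e) Serialize the DOM to a FileOutputStream to generate the XML file "book.xml".

Please define what "writer" is. A FileWriter can be used to write to a file, and a StringWriter can be used to write to a string buffer.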
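To illustrate the StringWriter suggestion, here is a minimal sketch of the Transformer approach from the question with "writer" defined as a StringWriter. The class name DomToString and the method name serialize are made up for this example, and the Document is built with the JDK's DocumentBuilderFactory as a stand-in for a document parsed by NekoHTML, so the sketch runs without extra libraries. It has not been tested on the Oracle application server mentioned above.

```java
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical helper class, not part of NekoHTML or the original answer.
public class DomToString {

    // Serializes an org.w3c.dom.Document into a String via a StringWriter.
    public static String serialize(Document doc) throws Exception {
        Transformer transformer = TransformerFactory.newInstance().newTransformer();
        // Fix the output method explicitly so the result does not depend on the root element name.
        transformer.setOutputProperty(OutputKeys.METHOD, "xml");
        transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");

        StringWriter writer = new StringWriter();
        transformer.transform(new DOMSource(doc), new StreamResult(writer));
        return writer.toString();
    }

    public static void main(String[] args) throws Exception {
        // Stand-in document; with NekoHTML this would be parser.getDocument().
        Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
        Element html = doc.createElement("html");
        Element body = doc.createElement("body");
        body.setTextContent("hello");
        html.appendChild(body);
        doc.appendChild(html);

        System.out.println(serialize(doc));
    }
}
```

If the string still comes back empty with a NekoHTML document, it may be worth checking that doc is not null and that the writer passed to StreamResult is the same one read afterwards.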