Java 使用ApachePOI将Word转换为HTML
我看到有一个名为Java 使用ApachePOI将Word转换为HTML,java,apache-poi,Java,Apache Poi,我看到有一个名为WordToHtmlConverter的转换器,但进程方法没有公开。如何传递文档文件并获取HTML文件(或OutputStream)?此代码现在适用于我 HWPFDocumentCore wordDocument = WordToHtmlUtils.loadDoc(new FileInputStream("D:\\temp\\seo\\1.doc")); WordToHtmlConverter wordToHtmlConverter = new WordToHt
WordToHtmlConverter
的转换器,但进程方法没有公开。如何传递文档文件并获取HTML文件(或OutputStream
)?此代码现在适用于我
HWPFDocumentCore wordDocument = WordToHtmlUtils.loadDoc(new FileInputStream("D:\\temp\\seo\\1.doc"));
WordToHtmlConverter wordToHtmlConverter = new WordToHtmlConverter(
DocumentBuilderFactory.newInstance().newDocumentBuilder()
.newDocument());
wordToHtmlConverter.processDocument(wordDocument);
Document htmlDocument = wordToHtmlConverter.getDocument();
ByteArrayOutputStream out = new ByteArrayOutputStream();
DOMSource domSource = new DOMSource(htmlDocument);
StreamResult streamResult = new StreamResult(out);
TransformerFactory tf = TransformerFactory.newInstance();
Transformer serializer = tf.newTransformer();
serializer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
serializer.setOutputProperty(OutputKeys.INDENT, "yes");
serializer.setOutputProperty(OutputKeys.METHOD, "html");
serializer.transform(domSource, streamResult);
out.close();
String result = new String(out.toByteArray());
System.out.println(result);
这是你问的吗?这不是…在ApachePOI中,他们在org.Apache.POI.hwpf.converter包中有一个新类来处理这个问题…但是找不到任何关于如何使用它们的教程。