使用Java将字符串转换为XML
我从PDF中提取了如下字符串格式的数据(请注意间距不均和新行字符) 如何将此字符串转换为XML,如下所示使用Java将字符串转换为XML,java,xml,string,Java,Xml,String,我从PDF中提取了如下字符串格式的数据(请注意间距不均和新行字符) 如何将此字符串转换为XML,如下所示 public void testMethod() { String extractedTestFromPDF= " Virtual Salary 25,100.00 EIS EE Contr.
public void testMethod()
{
String extractedTestFromPDF=
" Virtual Salary 25,100.00 EIS EE Contr. 7.90\n"+
"\t Virtual Car Allowance 1,600.00 EPF Employee Contr. 2,937.00\n"+
" Payment Received(Oversea) 4,265.01 SOCSO Employee Contr. 19.75\n";
}
Desire XML:
<xml>
<Data>
<Allowance>Virtual Salary</Allowance>
<Allowance_Amount>25,100.00</Allowance_Amount>
</Data>
<Data>
<Allowance>EIS EE Contr.</Allowance>
<Allowance_Amount>7.90</Allowance_Amount>
</Data>
<Data>
<Allowance>Virtual Car Allowance</Allowance>
<Allowance_Amount>1,600.00</Allowance_Amount>
</Data>
...
</xml>
虚拟工资
25,100.00
环境影响报告书。
7.90
虚拟汽车津贴
1,600.00
...
谢谢你,艾根。如果您能在Java7中发布上述方法,我们将非常感谢,因为我们仍然使用旧版本。
<xml>
<Data>
<Allowance>Virtual Salary</Allowance>
<Allowance_Amount>25,100.00</Allowance_Amount>
</Data>
<Data>
<Allowance>EIS EE Contr.</Allowance>
<Allowance_Amount>7.90</Allowance_Amount>
</Data>
<Data>
<Allowance>Virtual Car Allowance</Allowance>
<Allowance_Amount>1,600.00</Allowance_Amount>
</Data>
...
</xml>
String fixedSizetoXML(String extractedTestFromPDF) {
String[] lines = extractedTestFromPDF.split("\\R");
Pattern pattern = Pattern.compile("^\\s*(\\S.{20})\\s\\s+([-\\d,\\.]+)\\s+.*$");
// (--------) (-----------)
return "<?xml verion="1.0">\n<Xml>\n"
+ Stream.of(lines)
.map(pattern::matcher)
.filter(m::find)
.map(m -> String.format("<Data>\n"
+ " <Allowance>%s</Allowance>\n"
+ " <Allowance_Amount>%s</Allowance_Amount>\n"
+ "</Data>\n",
m.group(1).trim(), m.group(2)))
.collect(Collectors.joining(""))
+ "<Xml>\n";
}
String fixedSizetoXML(String extractedTestFromPDF) {
String[] lines = extractedTestFromPDF.split("\\R");
Pattern pattern = Pattern.compile("^\\s*(\\S.{20})\\s\\s+([-\\d,\\.]+)\\s+.*$");
// (--------) (-----------)
StringBuilder sb = new StringBuilder(lines.length * 64);
sb.append("<?xml verion="1.0">\n<Xml>\n");
for (String line : lines) {
Matcher m = pattern.matcher(line);
if (m.find()) {
String data = String.format("<Data>\n"
+ " <Allowance>%s</Allowance>\n"
+ " <Allowance_Amount>%s</Allowance_Amount>\n"
+ "</Data>\n",
m.group(1).trim(), m.group(2));
sb.append(data);
}
}
sb.append("<Xml>\n");
return sb.toString();
}