Java 删除除<；之外的所有HTML标记；br>；从一个文本？_Java_Regex

Java 删除除<；之外的所有HTML标记；br>；从一个文本？

java regex

Java 删除除<；之外的所有HTML标记；br>；从一个文本？,java,regex,Java,Regex,大家好我有一个java字符串，我想 1-从中删除除新行标记和之外的所有html标记，如果有文本，则将文本保留在标记内。 2-解析后，文本结果彼此连接，如：text1和text2，文本之间没有空格分隔，我也想这样做以下是我正在做的： String html = "<div dir=\"ltr\">hello my friend<span>ECHO</span><br>how are you ?<br><br><div

大家好我有一个java字符串，我想 1-从中删除除新行标记

和

之外的所有html标记，如果有文本，则将文本保留在标记内。 2-解析后，文本结果彼此连接，如：text1和text2，文本之间没有空格分隔，我也想这样做

以下是我正在做的：

String html = "<div dir=\"ltr\">hello my friend<span>ECHO</span><br>how are you ?<br><br><div class=\"gmail_quote\">On Mon, Feb 14, 2011 at 10:45 AM, My Friend <span dir=\"ltr\">&lt;<a href=\"mailto:notifications@mydomain.com\">notifications@mydomain.com</a>&gt;</span> wrote:<br> "
            + "<blockquote class=\"gmail_quote\" style=\"margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;\"> ";
    String parsedText = html.replaceAll("\\<.*?\\>", "");
    System.out.println(parsedText);

期望输出：

hello my friend ECHO <br> how are you ? <br> <br> On Mon, Feb 14, 2011 at 10:45 AM, My Friend &`lt;notifications@mydomain.com&gt; wrote:`

你好，我的朋友ECHO
你好吗<2011年2月14日星期一上午10:45，我的朋友<；notifications@mydomain.com写道：`

我会的

用换行符或其他特殊字符替换所有
删除所有标签
将特殊字符替换为

用换行符或其他特殊字符替换所有
删除所有标签
将特殊字符替换为

final String html =
    "<div dir=\"ltr\">hello my friend<span>ECHO</span><br>how are you ?" +
    "<br><br><div class=\"gmail_quote\">On Mon, Feb 14, 2011 at 10:45 AM," +
    " My Friend <span dir=\"ltr\">&lt;<a href=\"mailto:notifications@mydo" +
    "main.com\">notifications@mydomain.com</a>&gt;</span> wrote:<br><bloc" +
    "kquote class=\"gmail_quote\" style=\"margin: 0pt 0pt 0pt 0.8ex; bord" +
    "er-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;\"> ";
final Pattern tagPattern = Pattern.compile("<([^\\s>/]+).*?>");
final Matcher matcher = tagPattern.matcher(html);
final StringBuffer sb = new StringBuffer(html.length());
while(matcher.find()){
    matcher
        .appendReplacement(sb, matcher.group(1).equalsIgnoreCase("br")
            ? matcher.group()
            : " ");
}
matcher.appendTail(sb);

final String parsedText = sb.toString();
System.out.println(parsedText);

最终字符串html=
“你好，我的朋友回声
你好吗？”+
“
2011年2月14日星期一上午10:45，
”+
“我的朋友写道：
”；
最终模式tagPattern=Pattern.compile（“/]+）.*？>”；
final Matcher Matcher=tagPattern.Matcher（html）；
final StringBuffer sb=新的StringBuffer（html.length（））；
while（matcher.find（））{
匹配器
.附录替换（sb，匹配器组（1）.等信号情况（“br”）
？matcher.group（）
: " ");
}
（某人）；
最后一个字符串parsedText=sb.toString（）；
System.out.println（解析文本）；

输出：

hello my friendECHO<br>how are you ?<br><br>On Mon, Feb 14, 2011 at 10:45 AM,
My Friend &lt;notifications@mydomain.com&gt; wrote:<br>

你好，我的朋友回声
你好吗？
2011年2月14日星期一上午10:45，
我的朋友notifications@mydomain.com写道：

final String html =
    "<div dir=\"ltr\">hello my friend<span>ECHO</span><br>how are you ?" +
    "<br><br><div class=\"gmail_quote\">On Mon, Feb 14, 2011 at 10:45 AM," +
    " My Friend <span dir=\"ltr\">&lt;<a href=\"mailto:notifications@mydo" +
    "main.com\">notifications@mydomain.com</a>&gt;</span> wrote:<br><bloc" +
    "kquote class=\"gmail_quote\" style=\"margin: 0pt 0pt 0pt 0.8ex; bord" +
    "er-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;\"> ";
final Pattern tagPattern = Pattern.compile("<([^\\s>/]+).*?>");
final Matcher matcher = tagPattern.matcher(html);
final StringBuffer sb = new StringBuffer(html.length());
while(matcher.find()){
    matcher
        .appendReplacement(sb, matcher.group(1).equalsIgnoreCase("br")
            ? matcher.group()
            : " ");
}
matcher.appendTail(sb);

final String parsedText = sb.toString();
System.out.println(parsedText);

最终字符串html=
“你好，我的朋友回声
你好吗？”+
“
2011年2月14日星期一上午10:45，
”+
“我的朋友写道：
”；
最终模式tagPattern=Pattern.compile（“/]+）.*？>”；
final Matcher Matcher=tagPattern.Matcher（html）；
final StringBuffer sb=新的StringBuffer（html.length（））；
while（matcher.find（））{
匹配器
.附录替换（sb，匹配器组（1）.等信号情况（“br”）
？matcher.group（）
: " ");
}
（某人）；
最后一个字符串parsedText=sb.toString（）；
System.out.println（解析文本）；

输出：

hello my friendECHO<br>how are you ?<br><br>On Mon, Feb 14, 2011 at 10:45 AM,
My Friend &lt;notifications@mydomain.com&gt; wrote:<br>

你好，我的朋友回声
你好吗？
2011年2月14日星期一上午10:45，
我的朋友notifications@mydomain.com写道：