Warning: file_get_contents(/data/phpspider/zhask/data//catemap/9/java/392.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
如何创建正则表达式来检索Java中的HTML代码?_Java_Regex - Fatal编程技术网

如何创建正则表达式来检索Java中的HTML代码?

如何创建正则表达式来检索Java中的HTML代码?,java,regex,Java,Regex,一段html代码: <a class="context_link" href="/thuc-don/41-Thit-vit-ram-sa-gung.html"> <img src="http://monngonmoingay.com/uploads/monan/201205170430310000000_thit" + "-vit-ram-sa-gung-48aq570.png" alt="Thịt vịt ram sả gừng

一段html代码:

<a class="context_link" href="/thuc-don/41-Thit-vit-ram-sa-gung.html">
        <img src="http://monngonmoingay.com/uploads/monan/201205170430310000000_thit" +
                "-vit-ram-sa-gung-48aq570.png" alt="Thịt vịt ram sả gừng " />

所以我使用正则表达式从代码中获取链接:

String pat = "<a\\s+class=\"context_link\"\\s+href=\"(.+)\"";       
   Pattern pattern = Pattern.compile(pat,Pattern.DOTALL | Pattern.UNIX_LINES);
   Matcher math = pattern.matcher(source);
   while(math.find()){Log.i("Value",math.group(1));}

String pat=“如果您试图提取HREF,则应使用
Jsoup

现在工作示例:

import java.io.IOException;

import org.jsoup.Jsoup;

public class Test {

    public static void main(String args[]) throws IOException {

        String source = "<a class=\"context_link\" href=\"/thuc-don/41-Thit-vit-ram-sa-gung.html\">        <img src=\"http://monngonmoingay.com/uploads/monan/201205170430310000000_thit\" +                \"-vit-ram-sa-gung-48aq570.png\" alt=\"Th?t v?t ram s? g?ng \" />";
        String link = Jsoup.parse(source).select("a").first().attr("href");
        System.out.println("Your link  :" + link);

    }

}
import java.io.IOException;
导入org.jsoup.jsoup;
公开课考试{
公共静态void main(字符串args[])引发IOException{
字符串源=”;
String link=Jsoup.parse(source)。选择(“a”).first().attr(“href”);
System.out.println(“您的链接:+link”);
}
}

您是否正在尝试提取HREF?如果输入是这样的,您会怎么做:
解析HTML在.Net中使用Regex非常简单,使用Java很糟糕。