Java: Trying to extract usernames from Reddit using Jsoup
I'm trying to use Jsoup to extract usernames from Reddit's page source so that I can then DM those users, but I can't figure out how to extract only the links that contain /user. Sorry that this is so messy.
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;
import java.io.IOException;

public class Main {
    public static void main(String[] args) {
        try {
            // Fetch and parse the page; the URL needs the http(s) protocol
            Document doc = Jsoup.connect("https://www.reddit.com/new/").get();

            // get page title
            String title = doc.title();
            System.out.println("title : " + title);

            // get all links
            // Open question: doc.select("https://www.reddit.com/user") is not a
            // valid CSS selector, so this selects every link instead of only
            // the /user ones.
            Elements certainLinks = doc.select("a[href]");
            for (Element link : certainLinks) {
                // get the value from href attribute
                System.out.println("\nlink : " + link.attr("href"));
                System.out.println("text : " + link.text());
            }
        } catch (IOException e) {
            e.printStackTrace();
        }
    }
}
Something like this?
Document doc = Jsoup.connect("https://www.reddit.com/new/").get();
Elements certainLinks = doc.select("a[href*=https://www.reddit.com/user/]");
certainLinks.forEach(l -> System.out.println(l.text()));
This prints:
_serial_chiller
dracorian
ImagesOfNetwork
...
a[href*=https://www.reddit.com/user/]
matches a elements that have an href attribute whose value contains the string https://www.reddit.com/user/.

What is your question? This sounds like spam.
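
The answer above reads the username from the link text. If the link text turns out not to be the bare username, the name could also be taken from the href value itself. The following is only a sketch of that idea using java.util.regex from the standard library, not part of the original answer: the class name UsernameFromHref, the method extractUsernames, the sample hrefs, and the assumption that usernames consist of letters, digits, underscores and hyphens are all illustrative. In the Jsoup code, the input would be the values returned by link.attr("href").

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class UsernameFromHref {
    // Captures <name> from ".../user/<name>". Assumption: usernames contain
    // only letters, digits, underscores and hyphens.
    private static final Pattern USER_LINK =
            Pattern.compile("/user/([A-Za-z0-9_-]+)");

    // Returns the usernames found in the given href values,
    // in first-seen order and without duplicates.
    static List<String> extractUsernames(List<String> hrefs) {
        List<String> names = new ArrayList<>();
        for (String href : hrefs) {
            Matcher m = USER_LINK.matcher(href);
            if (m.find() && !names.contains(m.group(1))) {
                names.add(m.group(1));
            }
        }
        return names;
    }

    public static void main(String[] args) {
        // Illustrative href values; in practice these would come from Jsoup.
        List<String> hrefs = Arrays.asList(
                "https://www.reddit.com/user/dracorian/",
                "https://www.reddit.com/r/java/",
                "https://www.reddit.com/user/ImagesOfNetwork",
                "https://www.reddit.com/user/dracorian/");
        System.out.println(extractUsernames(hrefs));
    }
}
```

Links that do not contain /user/ are skipped, so this also works on an unfiltered list of hrefs such as the result of selecting a[href].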