Java: setting up a search form that fetches data from a database, then parsing the HTML with jsoup
After I submit the form, it gives me an HTML page: "File or directory not found", "The resource you are looking for might have been removed, had its name changed, or is temporarily unavailable." The response data shows a different URL from the one I requested. How do I handle this?
Tags: java, spring-boot, web-scraping, web-crawler, jsoup
// Load the page that contains the search form
response = Jsoup.connect(internalConstant.getURL())
        .userAgent(USER_AGENT)
        .sslSocketFactory(utilService.socketFactory())
        .cookies(coky)
        .method(Method.GET)
        .execute();
// Fill in the search form with yesterday's date
String yestarday = utilService.getYesterdayDateString();
logger.info("yesterday's date: " + yestarday);
FormElement searchForm = (FormElement) response.parse().select("form[class=form-horizontal]").first();
checkElement("search form", searchForm);
Element searchField = searchForm.select("input[name=DateFrom]").first();
checkElement("date from: ", searchField);
searchField.val(yestarday);
Element searchField1 = searchForm.select("input[name=DateTo]").first();
checkElement("Date to: ", searchField1);
searchField1.val(yestarday);
// Submit the form; submit() takes the target URL and the field values from the form itself
response = searchForm.submit()
        .cookies(coky)
        .userAgent(USER_AGENT)
        .method(Method.POST)
        .sslSocketFactory(utilService.socketFactory())
        .timeout(5000)
        .ignoreHttpErrors(true)
        .followRedirects(true)
        .execute();
logger.info("target html: " + response.parse());
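
Since the error page comes back from a different URL than the one requested, two things may be worth checking before changing anything else: where the form's action actually points once it is resolved against the page URL, and whether cookies set by the first response are being sent along with the submit. The following is only a minimal, standard-library sketch for testing those two assumptions. The class `FormSubmitDebug`, its methods, and the `example.com` URLs are hypothetical and are not part of the original code or of jsoup.

```java
import java.net.URI;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical debugging helper; names and URLs are illustrative assumptions.
final class FormSubmitDebug {

    // Resolve a form's (possibly relative) action attribute against the URL
    // of the page the form was loaded from.
    static String resolveAction(String pageUrl, String action) {
        if (action == null || action.isEmpty()) {
            // An empty action means the form submits back to the page itself.
            return pageUrl;
        }
        return URI.create(pageUrl).resolve(action).toString();
    }

    // Merge two cookie maps; values from the latest response override
    // the ones that were sent with the earlier request.
    static Map<String, String> mergeCookies(Map<String, String> sent,
                                            Map<String, String> received) {
        Map<String, String> merged = new LinkedHashMap<>(sent);
        merged.putAll(received);
        return merged;
    }

    public static void main(String[] args) {
        // Assumed page URL and action value, for illustration only.
        String target = resolveAction("https://example.com/app/search", "../Report/List");
        System.out.println("form would be submitted to: " + target);

        Map<String, String> sent = new LinkedHashMap<>();
        sent.put("SESSION", "old");
        Map<String, String> received = new LinkedHashMap<>();
        received.put("SESSION", "new");
        received.put("TOKEN", "abc");
        System.out.println("cookies for the submit: " + mergeCookies(sent, received));
    }
}
```

In the code from the question, the resolved target could be compared with `searchForm.absUrl("action")` and with `response.url()` after the submit, and the merged map (built from `coky` and `response.cookies()`) could be passed to `.cookies(...)`. It may also be worth logging the form's `method` attribute: `submit()` already picks GET or POST from the form, so forcing `.method(Method.POST)` could send the request in a way the server does not expect.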