Web scraping: How do I extract the text of an h1 tag with Puppeteer Sharp?
My question: I am trying to read text with Puppeteer Sharp, and I want to print the text of the page's h1 tag to the console. The text is "Example Domain". I have the following code:
await new BrowserFetcher().DownloadAsync(BrowserFetcher.DefaultRevision);
var browser = await Puppeteer.LaunchAsync(new LaunchOptions
{
Headless = true
});
using (var page = await browser.NewPageAsync())
{
await page.GoToAsync("http://example.com");
await page.WaitForSelectorAsync("h1");
var texto = await page.QuerySelectorAsync("h1");
Console.WriteLine(texto.ToString());
}
await browser.CloseAsync();
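But the console shows "JSHandle@node". How do I get the value of the h1 from that JSHandle@node?
Thank you very much.
You're almost there.
You need a call to EvaluateFunctionAsync() after QuerySelectorAsync() to return the inner text of the h1. QuerySelectorAsync() only returns a handle to the DOM element, and calling ToString() on that handle prints its description rather than its content.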
var texto = await page.QuerySelectorAsync("h1").EvaluateFunctionAsync<string>("_ => _.innerText");
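For context, here is a minimal sketch of how the fix could fit into the full program from the question. It assumes the same PuppeteerSharp version as the question (for example, BrowserFetcher.DefaultRevision may be named differently or be absent in newer releases), and it splits the call into two steps so the handle can be checked for null. The Program class, the handle variable, and the null check are illustrative additions, not part of the original answer.

```csharp
using System;
using System.Threading.Tasks;
using PuppeteerSharp;

class Program
{
    static async Task Main()
    {
        // Same setup as in the question; API names are taken from it
        // and may differ in newer PuppeteerSharp releases (assumption).
        await new BrowserFetcher().DownloadAsync(BrowserFetcher.DefaultRevision);
        var browser = await Puppeteer.LaunchAsync(new LaunchOptions
        {
            Headless = true
        });

        using (var page = await browser.NewPageAsync())
        {
            await page.GoToAsync("http://example.com");
            await page.WaitForSelectorAsync("h1");

            // QuerySelectorAsync returns a handle to the element, not its text.
            var handle = await page.QuerySelectorAsync("h1");
            if (handle == null)
            {
                Console.WriteLine("No h1 element found.");
            }
            else
            {
                // Run a small JavaScript function in the page against the element
                // and bring its innerText back as a .NET string.
                var texto = await handle.EvaluateFunctionAsync<string>("el => el.innerText");
                Console.WriteLine(texto);
            }
        }

        await browser.CloseAsync();
    }
}
```

The one-line form above does the same thing; the two-step form is just easier to debug when the selector might not match anything.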