Php 从div中存在的标记内部提取特定内容(刮取)
我需要提取以下信息:Php 从div中存在的标记内部提取特定内容(刮取),php,html,screen-scraping,Php,Html,Screen Scraping,我需要提取以下信息: 标题,在这种情况下,这将是“为人类感到羞耻” 图像URL: 内容URL: 我将如何在PHP中实现这一点。我写过这样的东西 <div class="post_list_medium_widget"> <div> <div class="feature-two-column medium-two-columns appear_animation left_post_align animate_css_stlye anim
- 标题,在这种情况下,这将是“为人类感到羞耻”
- 图像URL:
- 内容URL:
<div class="post_list_medium_widget">
<div>
<div class="feature-two-column medium-two-columns appear_animation left_post_align animate_css_stlye animate_start">
<div id="post-22346"
class="post-22346 post type-post status-publish format-standard has-post-thumbnail hentry category-national">
<div class="image_post feature-item">
<a href="http://www.news.in" class="feature-link" title="Ashamed to be human">
<img width="400" height="260" src="http://www.image.in"
class="attachment-medium-feature"
onload="pagespeed.CriticalImages.checkImageForCriticality(this);">
</a>
</div>
</div>
</div>
</div>
</div>
loadHTML($html);
foreach($doc->getElementsByTagName('div')作为$node){
$xpath=新的DOMXpath($doc);
$xpath->query('//div[contains(@class,“post\u list\u medium\u widget”)));
echo var_dump($node);
}
这段代码有什么好处?你想从中解析数据的数据是重复的还是url中唯一的一段代码?这只是一段数据@VermaJr@Jab那么,你想从这段代码中解析数据吗?我想问的是,如果$html只包含这段代码,或者以重复的方式包含这段代码?是的。我想知道如何从div中提取数据
<?php
$url = 'http://www.url.in/';
$html = file_get_contents( $url);
libxml_use_internal_errors( true);
$doc = new DOMDocument;
$doc->loadHTML($html);
foreach ($doc->getElementsByTagName('div') as $node) {
$xpath = new DOMXpath($doc);
$xpath->query('//div[contains(@class, "post_list_medium_widget")]');
echo var_dump($node);
}