PHP: Extract specific content from inside the tags within a div (scraping)

Tags: php, html, screen-scraping

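I need to extract the following information:

  • The title, which in this case would be "Ashamed to be human"
  • The image URL:
  • The content URL:
How would I do this in PHP? Here is the markup, followed by what I have written so far: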

<div class="post_list_medium_widget">
    <div>
        <div class="feature-two-column medium-two-columns appear_animation left_post_align animate_css_stlye animate_start">
            <div id="post-22346"
                 class="post-22346 post type-post status-publish format-standard has-post-thumbnail hentry category-national">

                <div class="image_post feature-item">
                    <a href="http://www.news.in" class="feature-link" title="Ashamed to be human">
                        <img width="400" height="260" src="http://www.image.in"
                             class="attachment-medium-feature"
                             onload="pagespeed.CriticalImages.checkImageForCriticality(this);">
                    </a>
                </div>
            </div>
        </div>
    </div>
</div>

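What is this code meant to do? Is the block you want to parse repeated on the page, or is it the only such block at that URL?

This is just one piece of the data @VermaJr

@Jab So you want to parse the data out of this block? What I am asking is whether $html contains only this block, or contains it repeated several times.

Yes. I want to know how to extract the data from the div.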
<?php

$url = 'http://www.url.in/';
$html = file_get_contents( $url);
libxml_use_internal_errors( true);

$doc = new DOMDocument;
$doc->loadHTML($html);

foreach ($doc->getElementsByTagName('div') as $node) {
    $xpath = new DOMXpath($doc);
    $xpath->query('//div[contains(@class, "post_list_medium_widget")]');
    echo var_dump($node);
}
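
The attempt above builds a new DOMXpath on every iteration, throws away the result of query(), and dumps every div in the page. A minimal sketch of one way to get the three values is shown here, assuming the title is the `title` attribute of the `a.feature-link` element, the content URL is its `href`, and the image URL is the `src` of the `img` inside it. The function name `extractPosts` and the array keys are made up for illustration, and the inline `$html` is a trimmed copy of the markup from the question standing in for the result of file_get_contents($url).

```php
<?php
// Trimmed copy of the markup from the question. In practice $html would
// come from file_get_contents($url), as in the attempt above.
$html = <<<'HTML'
<div class="post_list_medium_widget">
    <div class="image_post feature-item">
        <a href="http://www.news.in" class="feature-link" title="Ashamed to be human">
            <img width="400" height="260" src="http://www.image.in"
                 class="attachment-medium-feature">
        </a>
    </div>
</div>
HTML;

libxml_use_internal_errors(true);   // collect parser warnings instead of printing them
$doc = new DOMDocument();
$doc->loadHTML($html);
libxml_clear_errors();

$xpath = new DOMXPath($doc);        // built once, not inside a loop

// Hypothetical helper: returns one array per matching link in the widget.
function extractPosts(DOMXPath $xpath): array
{
    $posts = [];
    $links = $xpath->query(
        '//div[contains(@class, "post_list_medium_widget")]'
        . '//a[contains(@class, "feature-link")]'
    );

    foreach ($links as $a) {
        // Search for the image relative to the current link only.
        $img = $xpath->query('.//img', $a)->item(0);

        $posts[] = [
            'title'       => $a->getAttribute('title'),
            'content_url' => $a->getAttribute('href'),
            'image_url'   => $img ? $img->getAttribute('src') : null,
        ];
    }

    return $posts;
}

$posts = extractPosts($xpath);
print_r($posts);
```

Because the query matches every `feature-link` inside the widget, the same loop should also cover the case raised in the comments where the block is repeated on the page. Note that contains(@class, ...) is a substring match, so it would also match a longer class name that happens to include the same text.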