Warning: file_get_contents(/data/phpspider/zhask/data//catemap/1/php/281.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
PHP curl。遍历搜索结果_Php_Curl_Web Scraping - Fatal编程技术网

PHP curl。遍历搜索结果

PHP curl。遍历搜索结果,php,curl,web-scraping,Php,Curl,Web Scraping,我正在开发一个网站,允许人们搜索“x”产品,并以表格格式显示结果 我计划使用php curl从另一个网站上抓取搜索数据。(被删除网站的所有者知道并允许删除,因此没有法律问题) 我已经有了一个php curl代码,可以登录到该网站,并根据用户输入进行搜索。我不知道如何通过搜索结果和输出,然后在我的网站一个接一个 PHP curl代码: $username = '********'; $password = '********'; $loginUrl = 'http://www.a-website.

我正在开发一个网站,允许人们搜索“x”产品,并以表格格式显示结果

我计划使用php curl从另一个网站上抓取搜索数据。(被删除网站的所有者知道并允许删除,因此没有法律问题)

我已经有了一个php curl代码,可以登录到该网站,并根据用户输入进行搜索。我不知道如何通过搜索结果和输出,然后在我的网站一个接一个

PHP curl代码:

$username = '********';
$password = '********';
$loginUrl = 'http://www.a-website.com/login.asp';

//init curl
$ch = curl_init();

//Set the URL to work with
curl_setopt($ch, CURLOPT_URL, $loginUrl);

// ENABLE HTTP POST
curl_setopt($ch, CURLOPT_POST, 1);

//Set the post parameters
curl_setopt($ch, CURLOPT_POSTFIELDS, 'username=' . $username . '&password=' . $password . '&submit1=' . 'Login');

//Handle cookies for the login
curl_setopt($ch, CURLOPT_COOKIEJAR, 'cookie stuff hure');

//Setting CURLOPT_RETURNTRANSFER variable to 1 will force cURL
//not to print out the results of its query.
//Instead, it will return the results as a string return value
//from curl_exec() instead of the usual true/false.
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);

//execute the request (the login)
$store = curl_exec($ch);

/*         * *****************SEARCH HERE****************** */
curl_setopt($ch, CURLOPT_URL, 'http://www.a-website.com/Index.asp');
//execute the request
$content = curl_exec($ch);


//Set the post parameters

curl_setopt($ch, CURLOPT_POSTFIELDS, 'search_txt_vs=' . '' . '&search_txt_UPC=' . '' . '&search_txt_Name=' . $searchString .
        '&search_txt_Manufacturer=' . '' . '&submit=' . 'Search');
//execute the request (the search)
$Search = curl_exec($ch);

print CJSON::encode($Search);
print $Search;

//save the data to disk
print $content;
这是来自Im报废网站的html代码(顺便说一句,它是旧的学校表格格式)


分类
|    
/    
|    
类别
1.

00169347718 诺富林70/30U/ML CRT 5X3 ML 70-30 U/ML 对 诺和诺德制药公司 $ 0.01 / $ 0.01 000000000000 5X3ML 胰岛素 2.
00169347418 诺富林N 100 UN/ML CRT 5X3 ML 100 U/ML 对 国民生产净值 $ 0 / $ 0.01 000000000000 5X3ML 胰岛素 3.
00169231721 诺富林INNO 70/30 PFS 5X3毫升 70-30 U/ML 对 诺和诺德制药公司 $ 0 / $ 0.01 000000000000 5X3ML 胰岛素 4.
00169183311 诺富林R 100 UN/ML VL 10 ML 100 U/ML 对 诺和诺德制药公司 $ 99 / $ 82.09 000169183311 10毫升 胰岛素 5.
00169183711 诺富林70/30U/ML VL 10 ML 70-30 U/ML 对 诺和诺德制药公司 $ 99 / $ 82.09 000169183711 10毫升 胰岛素 6.
00169183411 诺富林N 100 UN/ML VL 10 ML 100 U/ML 对 诺和诺德制药公司 $ 99 / $ 82.09 000000000000 10毫升 胰岛素
您可以尝试将字符串添加到DOMDocument中,并使用getElementsByTagName,然后将其写入数组或其他您可以使用的内容。更多信息请点击此处:


同样,这里也回答了类似的问题,考虑到您返回的是HTML:

您可以尝试将字符串添加到DOMDocument中,并使用getElementsByTagName,然后将它们写入数组或其他您可以使用的内容。更多信息请点击此处:


另外,考虑到您返回的是HTML,这里也回答了类似的问题:

嘿,首先感谢您的回答。但是你能更具体一点吗?也许是一个例子。我的主要问题是输出没有任何标记名。只是桌子里面的桌子里面的桌子。那么,我如何区分网站的其余部分和我想要显示的实际项目呢?首先,谢谢你的回答。但是你能更具体一点吗?也许是一个例子。我的主要问题是输出没有任何标记名。只是桌子里面的桌子里面的桌子。那么,我如何区分网站的其余部分和我想要显示的实际项目呢?
<td colspan="3" height="100%" valign="top">
    <table width="100%" border="0" cellpadding="2" cellspacing="0" bordercolor="#99CCCC" class="text">
        <tbody>
            <tr bgcolor="#9999CC">
                <td align="right" class="calendar">Sort &gt;</td>
                <td align="center"> <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=1">NDC</a>
                </td>
                <td align="left">   <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=2">Brand Name</a>
                </td>
                <td align="center" colspan="2"> <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=3">Strength</a>
&nbsp;|&nbsp;   <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=4">UD</a>
                </td>
                <td align="left">   <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=5">Stock</a>
                </td>
                <td align="center"> <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=6">Manufacturer</a>
                </td>
                <td align="center" bgcolor="cccccc">    <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=7">AWP</a>
&nbsp;/&nbsp;   <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=8">Your Price</a>
                </td>
            </tr>
            <tr bgcolor="#9999CC">
                <td align="right" class="calendar">&nbsp;</td>
                <td align="center"> <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=9">UPC</a>
                </td>
                <td align="left">   <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=10">Generic Alt/Name</a>
                </td>
                <td align="center" colspan="2"> <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=11">Size</a>
&nbsp;|&nbsp;   <a href="Index.asp?search_txt_UPC=&amp;search_txt_Name=novolin&amp;search_txt_Manufacturer=&amp;orderby=12">Form</a>
                </td>
                <td align="left" colspan="3" class="selected">Category</td>
            </tr>
            <tr bgcolor="eeeeee">
                <td align="center" valign="top" rowspan="2">1
                    <br>    <a href="#" onclick="return openCart(19112,0.01021);"><span class="smallNorm_red">[add]</span></a>

                </td>
                <td align="center"><span class="smallNorm">00169347718</span>
                </td>
                <td align="left"><span class="smallNorm_red">NOVOLIN 70/ 30U/ML CRT 5X3 ML</span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 70-30 U/ML</span>
                </td>
                <td align="left"><span class="smallNorm">YES</span>
                </td>
                <td align="center"><span class="smallNorm">NOVO NORDISK PHARM</span>
                </td>
                <td align="center"><span class="smallNorm">$

    0.01&nbsp;

    &nbsp;/&nbsp;$

    0.01

    </span>
                </td>
            </tr>
            <tr bgcolor="eeeeee">
                <td align="center"><span class="smallNorm">000000000000</span>
                </td>
                <td align="left"><span class="smallNorm"><a href="#" onclick="return openGeneric('50101');">HUM INSULIN NPH/REG INSULIN HM</a></span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 5X3ML </span>
                </td>
                <td align="left" colspan="3"><span class="smallNorm">&nbsp;

    <a href="#" onclick="return openreturn(19112,0.01021);"><span class="smallNorm_red">[return]</span>
                    </a>INSULIN</span>
                </td>
            </tr>
            <tr bgcolor="#99CCCC">
                <td align="center" valign="top" rowspan="2">2
                    <br>    <a href="#" onclick="return openCart(19116,0.012);"><span class="smallNorm_red">[add]</span></a>

                </td>
                <td align="center"><span class="smallNorm">00169347418</span>
                </td>
                <td align="left"><span class="smallNorm_red">NOVOLIN N 100 UN/ML CRT 5X3 ML</span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 100 U/ML</span>
                </td>
                <td align="left"><span class="smallNorm">YES</span>
                </td>
                <td align="center"><span class="smallNorm">NNP</span>
                </td>
                <td align="center"><span class="smallNorm">$

        0.00&nbsp;

    &nbsp;/&nbsp;$

    0.01

    </span>
                </td>
            </tr>
            <tr bgcolor="#99CCCC">
                <td align="center"><span class="smallNorm">000000000000</span>
                </td>
                <td align="left"><span class="smallNorm"><a href="#" onclick="return openGeneric('05331');">NPH HUMAN INSULIN ISOPHANE</a></span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 5X3ML </span>
                </td>
                <td align="left" colspan="3"><span class="smallNorm">&nbsp;

    <a href="#" onclick="return openreturn(19116,0.012);"><span class="smallNorm_red">[return]</span>
                    </a>INSULIN</span>
                </td>
            </tr>
            <tr bgcolor="eeeeee">
                <td align="center" valign="top" rowspan="2">3
                    <br>    <a href="#" onclick="return openCart(45211,0.012);"><span class="smallNorm_red">[add]</span></a>

                </td>
                <td align="center"><span class="smallNorm">00169231721</span>
                </td>
                <td align="left"><span class="smallNorm_red">NOVOLIN INNO 70/30 PFS 5X3 ML</span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 70-30 U/ML</span>
                </td>
                <td align="left"><span class="smallNorm">YES</span>
                </td>
                <td align="center"><span class="smallNorm">NOVO NORDISK PHARM</span>
                </td>
                <td align="center"><span class="smallNorm">$


        0.00&nbsp;

    &nbsp;/&nbsp;$

    0.01

    </span>
                </td>
            </tr>
            <tr bgcolor="eeeeee">
                <td align="center"><span class="smallNorm">000000000000</span>
                </td>
                <td align="left"><span class="smallNorm"><a href="#" onclick="return openGeneric('24486');">HUM INSULIN NPH/REG INSULIN HM</a></span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 5X3ML </span>
                </td>
                <td align="left" colspan="3"><span class="smallNorm">&nbsp;

    <a href="#" onclick="return openreturn(45211,0.012);"><span class="smallNorm_red">[return]</span>
                    </a>INSULIN</span>
                </td>
            </tr>
            <tr bgcolor="#99CCCC">
                <td align="center" valign="top" rowspan="2">4
                    <br>    <a href="#" onclick="return openCart(19117,82.0884);"><span class="smallNorm_red">[add]</span></a>

                </td>
                <td align="center"><span class="smallNorm">00169183311</span>
                </td>
                <td align="left"><span class="smallNorm_red">NOVOLIN R 100 UN/ML VL 10 ML</span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 100 U/ML</span>
                </td>
                <td align="left"><span class="smallNorm">YES</span>
                </td>
                <td align="center"><span class="smallNorm">NOVO NORDISK PHARM</span>
                </td>
                <td align="center"><span class="smallNorm">$

    99.00&nbsp;

    &nbsp;/&nbsp;$

    82.09

    </span>
                </td>
            </tr>
            <tr bgcolor="#99CCCC">
                <td align="center"><span class="smallNorm">000169183311</span>
                </td>
                <td align="left"><span class="smallNorm"><a href="#" onclick="return openGeneric('11642');">INSULIN REGULAR HUMAN</a></span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 10ML </span>
                </td>
                <td align="left" colspan="3"><span class="smallNorm">&nbsp;

    <a href="#" onclick="return openreturn(19117,82.0884);"><span class="smallNorm_red">[return]</span>
                    </a>INSULIN</span>
                </td>
            </tr>
            <tr bgcolor="eeeeee">
                <td align="center" valign="top" rowspan="2">5
                    <br>    <a href="#" onclick="return openCart(19110,82.0884);"><span class="smallNorm_red">[add]</span></a>

                </td>
                <td align="center"><span class="smallNorm">00169183711</span>
                </td>
                <td align="left"><span class="smallNorm_red">NOVOLIN 70/ 30U/ML VL 10 ML</span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 70-30 U/ML</span>
                </td>
                <td align="left"><span class="smallNorm">YES</span>
                </td>
                <td align="center"><span class="smallNorm">NOVO NORDISK PHARM</span>
                </td>
                <td align="center"><span class="smallNorm">$

    99.00&nbsp;

    &nbsp;/&nbsp;$

    82.09

    </span>
                </td>
            </tr>
            <tr bgcolor="eeeeee">
                <td align="center"><span class="smallNorm">000169183711</span>
                </td>
                <td align="left"><span class="smallNorm"><a href="#" onclick="return openGeneric('50001');">HUM INSULIN NPH/REG INSULIN HM</a></span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 10ML </span>
                </td>
                <td align="left" colspan="3"><span class="smallNorm">&nbsp;

    <a href="#" onclick="return openreturn(19110,82.0884);"><span class="smallNorm_red">[return]</span>
                    </a>INSULIN</span>
                </td>
            </tr>
            <tr bgcolor="#99CCCC">
                <td align="center" valign="top" rowspan="2">6
                    <br>    <a href="#" onclick="return openCart(19114,82.0884);"><span class="smallNorm_red">[add]</span></a>

                </td>
                <td align="center"><span class="smallNorm">00169183411</span>
                </td>
                <td align="left"><span class="smallNorm_red">NOVOLIN N 100 UN/ML VL 10 ML</span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 100 U/ML</span>
                </td>
                <td align="left"><span class="smallNorm">YES</span>
                </td>
                <td align="center"><span class="smallNorm">NOVO NORDISK PHARM</span>
                </td>
                <td align="center"><span class="smallNorm">$

    99.00&nbsp;

    &nbsp;/&nbsp;$

    82.09

    </span>
                </td>
            </tr>
            <tr bgcolor="#99CCCC">
                <td align="center"><span class="smallNorm">000000000000</span>
                </td>
                <td align="left"><span class="smallNorm"><a href="#" onclick="return openGeneric('11660');">NPH HUMAN INSULIN ISOPHANE</a></span>
                </td>
                <td align="center" colspan="2"><span class="smallNorm"> 10ML </span>
                </td>
                <td align="left" colspan="3"><span class="smallNorm">&nbsp;

    <a href="#" onclick="return openreturn(19114,82.0884);"><span class="smallNorm_red">[return]</span>
                    </a>INSULIN</span>
                </td>
            </tr>
        </tbody>
    </table>
</td>