PHP网页解析到MYSQL数据库
我这里有个问题。 我正在解析网页,以下是我当前的代码:PHP网页解析到MYSQL数据库,php,html,mysql,Php,Html,Mysql,我这里有个问题。 我正在解析网页,以下是我当前的代码: <?php // Include the library include('simple_html_dom.php'); // Retrieve the DOM from a given URL $html = file_get_html('siteone.htm'); // //Dates echo 'Dates:<br />'; foreach($html->find('div.collectionLog t
<?php
// Include the library
include('simple_html_dom.php');
// Retrieve the DOM from a given URL
$html = file_get_html('siteone.htm');
// //Dates
echo 'Dates:<br />';
foreach($html->find('div.collectionLog td') as $e) {
$text = $e->innertext;
$string = preg_replace("/\([^)]+\)/","",$text);
echo $string . '<br>';
}
?>
以下是HTML代码:
<div class="data-container collectionLog">
<h3>Collection Log</h3>
<div id="lcLoanPerf2">
<table id="lcLoanPerfTable2" class="plain-table">
<tbody>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
</tbody>
</table>
</div>
收集日志
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
**/**/**
***********
现在我要做的是将信息插入数据库。我知道如何轻松做到这一点,但我对如何做到这一点感到困惑,因此它会在同一行中插入td时间和没有类/id的td。所以基本上我想要:
<td class="time">**/**/**</td>
<td>***********</td>
**/**/**
***********
一行,基本上是它自己的MYSQL行中的
。对不起,如果我第一次描述得不够详细,很难解释。
如果你不明白,请告诉我。只是想让这成为可能 谢谢,
Gamemann请使用类似或(with)的内容来解析网站,而不是使用正则表达式 使用Symfony DOM爬虫,请参见以下工作示例:
<?php
include 'vendor/autoload.php';
use Symfony\Component\DomCrawler\Crawler;
$crawler = new Crawler('<div class="data-container collectionLog">
<h3>Collection Log</h3>
<div id="lcLoanPerf2">
<table id="lcLoanPerfTable2" class="plain-table">
<tbody>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="odd">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
<tr class="">
<td class="time">**/**/**</td>
<td>***********</td>
</tr>
</tbody>
</table>
</div>');
$tfList = $crawler->filter('#lcLoanPerfTable2 tr td');
foreach($tfList AS $list) {
foreach($list->childNodes as $node) {
var_dump($node->wholeText);
}
}