Php 浏览某些html标记集

Php 浏览某些html标记集,php,curl,Php,Curl,我需要一点帮助。我想做的是拉b节点中的所有内容 <P><B>Credit Weighting: </B>5<BR><BR> <B>Teaching Period(s): </B>Teaching Periods 1 and 2.<BR><BR> <B>No. of Students: </B>-.<BR><BR> <B>Pre-r

我需要一点帮助。我想做的是拉b节点中的所有内容

<P><B>Credit Weighting: </B>5<BR><BR>
<B>Teaching Period(s): </B>Teaching Periods 1 and 2.<BR><BR>
<B>No. of Students: </B>-.<BR><BR>
<B>Pre-requisite(s): </B>None<BR><BR>
<P><A HREF="#top" class="toppage">[Top of page]</A></P>

<P><B>Credit Weighting: </B>20<BR><BR>
<B>Teaching Period(s): </B>Teaching Periods 1 and 2.<BR><BR>
<B>No. of Students: </B>-.<BR><BR>
<B>Pre-requisite(s): </B>None<BR><BR>
<P><A HREF="#top" class="toppage">[Top of page]</A></P>

<P><B>Credit Weighting: </B>10<BR><BR>
<B>Teaching Period(s): </B>Teaching Periods 1 and 2.<BR><BR>
<B>No. of Students: </B>-.<BR><BR>
<B>Pre-requisite(s): </B>None<BR><BR>
<P><A HREF="#top" class="toppage">[Top of page]</A></P>

信用权重:5

教学时段:教学时段1和2。

学生人数:-.

先决条件:无

信用权重:20

教学时段:教学时段1和2。

学生人数:-.

先决条件:无

信用权重:10

教学时段:教学时段1和2。

学生人数:-.

先决条件:无

我能够从第一组数据中提取数据

    // GETTING ALL THE B NODE STUFFS AND PRINTING IT'S CONTENTS
    $result = array();
    foreach($document->getElementsByTagName('b') as $node){
    $result[preg_replace('/:\s+$/','',$node->textContent)] = trim($node->nextSibling->textContent);
    } 
    var_dump($result);
    echo '<br /><br />'; 
//获取所有B节点内容并打印其内容
$result=array();
foreach($document->getElementsByTagName('b')作为$node){
$result[preg_replace('/:\s+$/,'$node->textContent)]=trim($node->nextSibling->textContent);
} 
var_dump($结果);
回音“

”;
现在我要做的是循环使用三组html代码来获取所有的b节点和内容。我该怎么做呢

preg_match_all("/\<B\>(.*)\<\/B>([^\<]+)/", $text, $regs);
你的意思是这样的吗

$result = array();
$id= -1;
foreach($document->getElementsByTagName('b') as $node){
    $field= preg_replace('/:\s+$/','',$node->textContent);
    if ( $field == "Credit Weighting" ) $id++;
    $result[$id][$field]= trim($node->nextSibling->textContent);
}.
var_dump($result);
这让你:

array(3) {
  [0] =>
  array(4) {
    'Credit Weighting' =>
    string(1) "5"
    'Teaching Period(s)' =>
    string(25) "Teaching Periods 1 and 2."
    'No. of Students' =>
    string(2) "-."
    'Pre-requisite(s)' =>
    string(4) "None"
  }
  [1] =>
  array(4) {
    'Credit Weighting' =>
    string(2) "20"
    'Teaching Period(s)' =>
    string(25) "Teaching Periods 1 and 2."
    'No. of Students' =>
    string(2) "-."
    'Pre-requisite(s)' =>
    string(4) "None"
  }
  [2] =>
  array(4) {
    'Credit Weighting' =>
    string(2) "10"
    'Teaching Period(s)' =>
    string(25) "Teaching Periods 1 and 2."
    'No. of Students' =>
    string(2) "-."
    'Pre-requisite(s)' =>
    string(4) "None"
  }
}

很抱歉,在上面的代码中,我应该在哪里安装这个正则表达式?上面所有的内容都在html文件的主体中。我不明白,这就是代码的作用,对吗?或者你是说一个平面数组?