How to get all files (HTML only) in all subdirectories with PHP and index each HTML page

php html directory indexing

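For a homework assignment, I have to get all the .htm and .html files in the current directory and all of its subdirectories, and index each one by counting every word that appears in that file.

Here is how I count the words in a file once I have found an HTML file in a directory: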

$file = '.html'; // placeholder: path of the HTML file to index
$index = indexer($file);
echo '<pre>'.print_r($index,true).'</pre>';

function indexer($file) {
    $index = array();
    // str_replace() matches literal strings, not regexes, so the whitespace
    // characters are written as double-quoted escapes rather than '/\r/' etc.
    $find = array("\r", "\n", "\t", '!', ',', '.', '"', ';', ':');
    $string = file_get_contents($file);
    $string = strip_tags($string);
    $string = strtolower($string);
    $string = str_replace($find, ' ', $string);
    $string = trim($string);
    $string = explode(' ', $string);
    // Sorting puts equal words next to each other so one pass can count them.
    natcasesort($string);
    $i = 0;
    foreach($string as $word) {
        $word = trim($word);
        // Skip anything that is not purely alphabetic.
        if(preg_match('/[^a-zA-Z]/', $word) == 1) {
            $word = '';
        }
        if($word !== '') {
            if(!isset($index[$i]['word'])) {
                $index[$i]['word'] = $word;
                $index[$i]['count'] = 1;
            } elseif( $index[$i]['word'] == $word ) {
                $index[$i]['count'] += 1;
            } else {
                $i++;
                $index[$i]['word'] = $word;
                $index[$i]['count'] = 1;
            }
        }
    }
    unset($word);
    return($index);
}

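First I just need to figure out how to find all the htm or html files in the directories, and then run the code above on each htm/html file. Any help would be appreciated, thanks!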

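Well, since this is homework, I won't give you the code. But I can point you in the right direction. For this kind of thing, people usually use a recursive function, that is, a function that calls itself.

This function should do the following:

Count all the lines of all htm and html files in the current directory
Add these numbers up and add the sum to a global variable outside the function (just use global; you could instead return the line count from each call and add those up, but that is more awkward)
Call this function again for each folder in the current directory (just loop over them)
Once you are back at the very start, reset the global variable and return its value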
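
The steps above can be sketched roughly as follows. This is only an illustration, not the answerer's code: `countHtmlLines` is a hypothetical name, and it uses the return-value variant (each call returns its own total) rather than the global variable the answer suggests. It assumes PHP 7 or later for the type declarations.

```php
<?php
// Hypothetical helper: recursively counts the lines of every .htm/.html
// file under $dir and returns the total (no global variable involved).
function countHtmlLines(string $dir): int
{
    $total = 0;
    $entries = is_dir($dir) ? scandir($dir) : false;
    if ($entries === false) {
        return 0; // missing or unreadable directory contributes nothing
    }
    foreach ($entries as $entry) {
        if ($entry === '.' || $entry === '..') {
            continue; // skip the self and parent entries
        }
        $path = $dir . DIRECTORY_SEPARATOR . $entry;
        if (is_dir($path) && !is_link($path)) {
            // Recursive step: the function calls itself for each subfolder.
            // Symbolic links are skipped to avoid looping forever.
            $total += countHtmlLines($path);
        } elseif (is_file($path) && preg_match('/\.html?$/i', $entry)) {
            $lines = file($path); // one array element per line
            $total += ($lines === false) ? 0 : count($lines);
        }
    }
    return $total;
}
```

Calling `countHtmlLines('.')` would then return the combined line count for the current directory and everything below it.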

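// Renamed from readDir(): PHP function names are case-insensitive, so that
// name collides with the built-in readdir().
function findHtmlFiles($path) {
  $html_files = array();
  // '*' rather than '*.*', so subdirectories without a dot in their name match too.
  $files = glob($path . '*');
  if ($files === false) {
    return $html_files;
  }

  foreach ($files as $file) {
    if (is_dir($file)) {
      // Recurse into the subdirectory and collect whatever it finds.
      $html_files = array_merge($html_files, findHtmlFiles($file . '/'));
    } elseif (in_array(strtolower(pathinfo($file, PATHINFO_EXTENSION)), array('html', 'htm'))) {
      $html_files[] = $file;
    }
  }

  return $html_files;
}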

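<?php

// Directory to scan. '/' is the filesystem root; use '.' for the current directory.
$dir = '/';

// SKIP_DOTS leaves out the '.' and '..' entries of every directory.
$iterator = new RecursiveIteratorIterator(
  new RecursiveDirectoryIterator($dir, FilesystemIterator::SKIP_DOTS),
  RecursiveIteratorIterator::CHILD_FIRST
);

foreach ( $iterator as $path ) {
  // Match an "htm" or "html" extension, case-insensitively.
  if ( $path->isFile() && preg_match('/^html?$/i', pathinfo($path->getFilename(), PATHINFO_EXTENSION)) ) {
    echo $path->getPathname() . PHP_EOL;
  }
}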
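
To tie the file listing back to the question, here is a minimal sketch that builds a word index for every HTML file the iterator finds. `countWords` and `indexHtmlFiles` are hypothetical names, not part of the original posts. The word counting is a compact stand-in for the question's `indexer()` and returns a `word => count` map rather than the question's list of word/count pairs. It assumes PHP 7 or later.

```php
<?php
// Hypothetical helper: returns word => count for one HTML file, keeping
// only purely alphabetic words, as the question's indexer does.
function countWords(string $file): array
{
    $html = file_get_contents($file);
    if ($html === false) {
        return array();
    }
    $text = strtolower(strip_tags($html));
    // Split on whitespace and on the punctuation the question strips out.
    $words = preg_split('/[\s!,.";:]+/', $text, -1, PREG_SPLIT_NO_EMPTY);
    $words = array_filter($words, function ($w) {
        return preg_match('/^[a-z]+$/', $w) === 1;
    });
    $counts = array_count_values($words);
    ksort($counts); // alphabetical order
    return $counts;
}

// Hypothetical helper: returns path => (word => count) for every
// .htm/.html file under $dir, including all subdirectories.
function indexHtmlFiles(string $dir): array
{
    $index = array();
    $iterator = new RecursiveIteratorIterator(
        new RecursiveDirectoryIterator($dir, FilesystemIterator::SKIP_DOTS)
    );
    foreach ($iterator as $path) {
        if ($path->isFile() && preg_match('/^html?$/i', $path->getExtension())) {
            $index[$path->getPathname()] = countWords($path->getPathname());
        }
    }
    return $index;
}
```

With these in place, `print_r(indexHtmlFiles('.'))` would print one word-count table per HTML file, keyed by the file's path.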
