使用php和pdf解析器库搜索大文件夹中的pdf文件内容_Php_File_Pdf_Search_Pdfparser

使用php和pdf解析器库搜索大文件夹中的pdf文件内容

php file pdf search

使用php和pdf解析器库搜索大文件夹中的pdf文件内容,php,file,pdf,search,pdfparser,Php,File,Pdf,Search,Pdfparser,我正在使用pdf解析器在包含大量pdf文件的文件夹中搜索pdf文件内容。该代码仅适用于最多3个小文件： $keyword = "Calibri"; //the keyword is dynamic $dir = new DirectoryIterator('C:\wamp\www\pdfdemos\cv'); $parser = new \Smalot\PdfParser\Parser(); foreach ($dir as $file) { if($file->isFile

我正在使用pdf解析器在包含大量pdf文件的文件夹中搜索pdf文件内容。该代码仅适用于最多3个小文件：

$keyword = "Calibri";  //the keyword is dynamic

$dir = new DirectoryIterator('C:\wamp\www\pdfdemos\cv');
$parser = new \Smalot\PdfParser\Parser();
foreach ($dir as $file) {
    if($file->isFile() && $file->getExtension() =='pdf'){
        if ($file->getFilename() =="." || $file->getFilename() =="..") {
            continue;
        }else{
            echo "File name: ".$file->getFilename()."<br />";
            $pdf= $parser->parseFile('C:\wamp\www\pdfdemos/cv/'.$file->getFilename());
            $pages  = $pdf->getPages();
            $text = $pdf->getText();

            if(stripos(strtolower($text), $keyword)) {
                         echo "Keyword Matches";
                         echo "<br/><hr />";
                      }else{
                           echo "Keyword Not Matches";
                           unset($text);
                     }
        }
    }                                               

}

$keyword=“Calibri”//关键字是动态的
$dir=newdirectoryIterator（'C:\wamp\www\pdfdemos\cv'）；
$parser=new\Smalot\PdfParser\parser（）；
foreach（$dir作为$file）{
如果（$file->isFile（）&&&$file->getExtension（）=='pdf'）{
如果（$file->getFilename（）==“||$file->getFilename（）==”））{
继续；
}否则{
echo“文件名：”.$File->getFilename（）.“
”；
$pdf=$parser->parseFile（'C:\wamp\www\pdfdemos/cv/'。$file->getFilename（））；
$pages=$pdf->getPages（）；
$text=$pdf->getText（）；
if（stripos（strtolower（$text），$keyword））{
回显“关键字匹配”；
回声“
”；
}否则{
echo“关键字不匹配”；
取消设置（$文本）；
}
}
}                                               
}

该代码最多使用三个文件，否则我会收到消息“无法访问此站点”。我在本地工作，使用wamp服务器 -我正在使用pdf解析器库：

请帮助

请提供任何帮助？？较大的文件有什么错误？内存不足？然后简单地使用ini_set（'memory_limit'，'512M'）；（或者你需要多少…）我尝试了这个，但我也得到了消息“消息无法到达”。你需要从日志中得到确切的错误。我之前检查过它，但我不知道它的意思：PHP堆栈跟踪：[19-Jul-2018 09:19:09 UTC]PHP 1。{main}（）…\test2.php:0