PHP Goutte returns the wrong URL when scraping multiple URLs


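I am using Goutte. When I click the pagination links inside a while loop, I always end up with the wrong URL.

Calling selectLink on the crawler object returns the correct URL on the first pass through the while loop. On the second pass, selectLink appears to return the wrong link.
Here is the code: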

    use Symfony\Component\DomCrawler\Crawler; // needed for the Crawler type hint in getPaginationLinks()

    public function __construct(Goutte\Client $client){
        $this->client = $client;
    }

    public function parse(){
        $url = "https://www.nextag.com/Arts-Entertainment--zz2702147z0z1zB6c4z5---html";
        // crawl through first page
        $crawler = $this->client->request('GET', $url);
        // first page pagination links
        $links = $this->paginationCrawler($crawler);
        $linkBatch = array();
        // get all pagination links and check if the next 10 links are available
        list($linkBatch[], $_nextPage) = $this->getPaginationLinks($links);
        // if $_nextPage == '11+/21+/etc' then crawl through all links
        while($_nextPage != 'false'){
            // look up the "next batch" link by its visible text, e.g. '11+'
            $link = $links->selectLink($_nextPage)->link();
            $crawler = $this->client->click($link);
            $links = $this->paginationCrawler($crawler);
            list($linkBatch[], $_nextPage) = $this->getPaginationLinks($links);
        }
        dd($linkBatch);
    }

    public function paginationCrawler($crawler){
        return $crawler->filter('#pagination');
    }

    public function getPaginationLinks($links){
        $allLinks = $links->filter('#numbers a');
        // collect the href of every pagination link on the current page
        $linkNodes = $allLinks->each(function(Crawler $a) {
            return $a->attr('href');
        });
        // the last entry reads '11+', '21+', ... while more pages remain
        $lastPage = trim($links->filter('#numbers :last-child')->text());
        if (strpos($lastPage,'+') === false) {
            $lastPage = 'false';
        }
        return array($linkNodes, $lastPage);
    }
Here is the output:

Solved, but only with a workaround. Instead of passing the text 11+, I now pass the link object for the URL itself. I still don't understand what went wrong.
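The workaround described above might look roughly like the following sketch. It is not the poster's actual fix, which was not shown. `nextBatchLink` is a hypothetical helper, the sample markup and `example.com` URLs are made up, and the sketch assumes that the last `a` inside `#numbers` is the "11+" style entry and that `symfony/dom-crawler` and `symfony/css-selector` (both installed alongside Goutte) are available through Composer.

```php
<?php
// Assumption: Composer autoloader with symfony/dom-crawler and symfony/css-selector.
require 'vendor/autoload.php';

use Symfony\Component\DomCrawler\Crawler;
use Symfony\Component\DomCrawler\Link;

/**
 * Hypothetical helper: return the Link object for the next batch of pages,
 * or null when the last pagination entry carries no "+" marker.
 */
function nextBatchLink(Crawler $pagination): ?Link
{
    $anchors = $pagination->filter('#numbers a');
    if ($anchors->count() === 0) {
        return null;
    }

    // Take the last anchor node itself instead of searching for it by text.
    $last = $anchors->last();
    if (strpos($last->text(), '+') === false) {
        return null;
    }

    return $last->link();
}

// Made-up markup standing in for one results page.
$html = '<div id="pagination"><span id="numbers">'
      . '<a href="/p1">1</a><a href="/p2">2</a><a href="/p11">11+</a>'
      . '</span></div>';

$crawler = new Crawler($html, 'https://example.com/list');
$link = nextBatchLink($crawler->filter('#pagination'));

echo $link ? $link->getUri() : 'no more pages', PHP_EOL;

// In the question's parse() method the loop could then become:
//
//     while ($link = nextBatchLink($links)) {
//         $crawler = $this->client->click($link);
//         $links = $this->paginationCrawler($crawler);
//         list($linkBatch[], $_nextPage) = $this->getPaginationLinks($links);
//     }
```

Because the link comes from the node rather than from a text search, the loop no longer depends on how selectLink matches the label on later pages.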
