Scrapy 我的错误在哪里-异地请求-刮擦
我正在尝试解决异地请求错误,我的错误在哪里Scrapy 我的错误在哪里-异地请求-刮擦,scrapy,Scrapy,我正在尝试解决异地请求错误,我的错误在哪里 # -*- coding: utf-8 -*- import scrapy from scrapy.linkextractors import LinkExtractor from scrapy.spiders import CrawlSpider, Rule class BestMoviesSpider(CrawlSpider): name = 'best_movies' allowed_domains = ['imbd.com']
# -*- coding: utf-8 -*-
import scrapy
from scrapy.linkextractors import LinkExtractor
from scrapy.spiders import CrawlSpider, Rule
class BestMoviesSpider(CrawlSpider):
name = 'best_movies'
allowed_domains = ['imbd.com']
start_urls = ['https://www.imdb.com/search/title/?groups=top_250&sort=user_rating']
rules = (
Rule(LinkExtractor(restrict_xpaths='//h3[@class="lister-item-header"]/a'), callback='parse_item', follow=True),
)
def parse_item(self, response):
print(response.url)
在VSCode中,我看到:
[scrapy.spidermiddlewares.offsite] DEBUG: Filtered offsite request to 'www.imdb.com': <GET https://www.imdb.com/title/tt0111161/>
[scrapy.spidermiddleware.offsite]调试:筛选到“www.imdb.com”的异地请求:
不是链接列表。错误的域,而是:
allowed_domains = ['imbd.com']
但应该像网站域:
allowed_domains = ['imdb.com']
您有imbd,需要更改为imdb