Python 3.x: Trying to scrape email addresses
I was trying to scrape this website: https://www.united-church.ca/search/locator/all?keyw=&mission_units_ucc_ministry_type_advanced=10&locll=
I did manage to scrape it, but I could not scrape the email addresses. Can you help me scrape them? I am using Scrapy.
Tags: python-3.x, web-scraping, scrapy
    # -*- coding: utf-8 -*-
    import scrapy
    from ..items import ChurchItem


    class ChurchSpiderSpider(scrapy.Spider):
        name = 'church_spider'
        page_number = 1
        start_urls = ['https://www.united-church.ca/search/locator/all?keyw=&mission_units_ucc_ministry_type_advanced=10&locll=']

        def parse(self, response):
            items = ChurchItem()
            container = response.css(".icon-ministry")
            for t in container:
                church_name = t.css(".field-name-locator-ministry-title a::text").extract()
                church_phone = t.css(".field-name-field-phone::text").extract()
                church_address = t.css(".thoroughfare::text").extract()
                church_email = t.css(".field-name-field-mu-email span::text").extract()
                items["church_name"] = church_name
                items["church_phone"] = church_phone
                items["church_address"] = church_address
                items["church_email"] = church_email
                yield items
            # next_page = 'https://www.united-church.ca/search/locator/all?keyw=&mission_units_ucc_ministry_type_advanced=10&locll=&page=' + str(ChurchSpiderSpider.page_number)
            # if ChurchSpiderSpider.page_number <= 110:
            #     ChurchSpiderSpider.page_number += 1
            #     yield response.follow(next_page, callback=self.parse)
Can you help me replace [at] with @ and combine the result into a single string?
Join the list elements and replace:
email = ''.join(church_email).replace(" [at] ","@")
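If the spacing around the placeholder is not always exactly " [at] ", a slightly more tolerant version of the same join-and-replace idea might look like the sketch below. The helper name `normalize_email` and the sample fragments are hypothetical and only for illustration; the actual fragments returned by `.extract()` on the site may be split differently.

```python
import re

# Matches "[at]" with optional surrounding whitespace, case-insensitively.
AT_PATTERN = re.compile(r"\s*\[at\]\s*", re.IGNORECASE)


def normalize_email(parts):
    """Join extracted text fragments and turn the "[at]" placeholder into "@"."""
    joined = "".join(parts)
    return AT_PATTERN.sub("@", joined).strip()


# Hypothetical fragments, shaped like the list that .extract() might return.
sample = ["office", " [at] ", "example.org"]
print(normalize_email(sample))  # prints: office@example.org
```

Inside the spider, this would be used as `items["church_email"] = normalize_email(church_email)`, assuming the helper is defined in or imported into the spider module.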
Do not post duplicate questions. Edit the original question to include all the details.