Add Scrapy middleware from the current script

I have a Python script that sets up and runs my Scrapy crawler:
import sys
import csv
import scrapy
from scrapy.http import FormRequest
from scrapy.crawler import CrawlerProcess


class MyItem(scrapy.Item):
    test = scrapy.Field()


class Spider(scrapy.Spider):
    start_urls = [
        "blah.com",
    ]

    def parse(self, response):
        blahblah = MyItem()
        # Some Code
        yield blahblah


class crawler:
    def start(self):
        process = CrawlerProcess({
            'USER_AGENT': 'Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1)',
            'LOG_LEVEL': 'INFO',
            'FEED_FORMAT': 'csv',
            'FEED_URI': 'Output.csv'
        })
        process.crawl(Spider)
        process.start()


app = crawler()
app.start()
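This works fine. Now, how do I add a Scrapy middleware, for example one that implements process_spider_exception(response, exception, spider), to this script, and enable it by adding it to the CrawlerProcess settings?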
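
One possible approach, offered as a minimal sketch rather than a definitive answer: define the middleware class in the same script and register its import path under the SPIDER_MIDDLEWARES setting. The names ErrorLoggingMiddleware, MIDDLEWARE_PATH, SETTINGS and run are hypothetical, the priority 543 is an arbitrary choice, and the sketch assumes a Scrapy version whose spider middleware hook still takes the (response, exception, spider) signature quoted in the question.

```python
import logging

logger = logging.getLogger(__name__)


class ErrorLoggingMiddleware:
    """Hypothetical spider middleware: logs callback exceptions, then drops them."""

    def process_spider_exception(self, response, exception, spider):
        # Called when a spider callback (or an earlier middleware) raises.
        logger.warning(
            "Callback failed for %s: %r", getattr(response, "url", None), exception
        )
        # Returning an iterable marks the exception as handled;
        # returning None would let it propagate to the next middleware.
        return []


# Build the import path from the class itself. When the file is run as a
# script this evaluates to "__main__.ErrorLoggingMiddleware".
MIDDLEWARE_PATH = (
    f"{ErrorLoggingMiddleware.__module__}.{ErrorLoggingMiddleware.__qualname__}"
)

# Settings copied from the question, plus the middleware registration.
SETTINGS = {
    "USER_AGENT": "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1)",
    "LOG_LEVEL": "INFO",
    "FEED_FORMAT": "csv",
    "FEED_URI": "Output.csv",
    "SPIDER_MIDDLEWARES": {MIDDLEWARE_PATH: 543},  # 543 is an assumed priority
}


def run(spider_cls):
    """Start a crawl for spider_cls with the middleware enabled."""
    # Imported here so the middleware itself can be tested without Scrapy.
    from scrapy.crawler import CrawlerProcess

    process = CrawlerProcess(SETTINGS)
    process.crawl(spider_cls)
    process.start()
```

With this in the script, calling run(Spider) at the bottom would take the place of app.start(). A downloader middleware could presumably be registered the same way under DOWNLOADER_MIDDLEWARES.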