2016-03-04 64 views
2

添加scrapy中间件我有一个Python脚本设置我scrapy履带:从当前脚本

import sys 
import csv 
import scrapy 
from scrapy.http import FormRequest 
from scrapy.crawler import CrawlerProcess 


class MyItem(scrapy.Item): 
    test = scrapy.Field() 


class Spider(scrapy.Spider): 
    start_urls = [ 
     "blah.com", 
    ] 

    def parse(self, response): 
     blahblah = MyItem() 
     # Some Code 
     yield blahblah 


class crawler: 
    def start(self): 
     process = CrawlerProcess({ 
      'USER_AGENT': 'Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1)', 
      'LOG_LEVEL': 'INFO', 
      'FEED_FORMAT': 'csv', 
      'FEED_URI': 'Output.csv' 
     }) 
     process.crawl(Spider) 
     process.start() 

app = crawler() 
app.start() 

,这是可以正常使用。 现在如何添加scrapy中间件如 process_spider_exception(response, exception, spider) 在此脚本,并通过添加到CrawlerProcess设置使用它呢?

回答