2014-11-24 54 views

回答

2

你想覆盖start_requests方法和使用自定义方法与链接提取:

def start_requests(self): 
    yield Request('http://a.com', self.parse_a) 
    yield Request('http://b.com', self.parse_b) 
    yield Request('http://c.com', self.parse_data) 

def parse_a(self, response): 
    links = LinkExtractor(
     # ... extract links from http://a.com 
    ).extract_links(response) 
    return (Request(url=link.url, callback=self.parse_data) for link in links) 

def parse_b(self, response): 
    links = LinkExtractor(
     # ... extract links from http://b.com 
    ).extract_links(response) 
    return (Request(url=link.url, callback=self.parse_data) for link in links) 
+0

谢谢它帮了我很多.. – ram 2014-11-28 07:24:31

+0

@ram不客气:) – elias 2014-11-28 14:36:40

相关问题