2016-02-12 59 views

回答

4

该页面的数据似乎已经用JavaScript加载。如果您检查页面(如萤火网络选项卡上),你会看到,一旦基地页面加载的产品正在由JavaScript,它发送POST请求http://www.walgreens.com/svc/products/search与内容加载:

{"p":"1", # seems to be page number 
"s":"15", # page size 
"sort":"relevance", 
"view":"allView", 
"geoTargetEnabled":false, 
"q":"bounty paper towel", # search query 
"requestType":"search", 
"deviceType":"desktop"} 

你可以使用scrapy发送此请求为:

yield Request('http://www.walgreens.com/svc/products/search', 
       method='POST', 
       body=<the json from above>) 

而且您应该收到一个完整的产品数据的json对象。

实际上,你可以查看,甚至通过该链接在浏览器的响应: http://www.walgreens.com/svc/products/search?p=1&s=15&sort=relevance&view=allView&geoTargetEnabled=false&q=bounty%20paper%20towel&requestType=search&deviceType=desktop