我对python和Python使用了新的3.6.2,并试图使用特定关键字从第2页中抓取数据。到目前为止,我可以将数据导入Python IDLE窗口,但是我很难将数据导出到CSV.I已尝试使用BeautifulSoup 4和pandas,但无法导出。这是迄今为止我所做的。任何帮助将非常感激。从前两页抓取网页内容并使用python和BS4将抓取的数据导出到csv
import csv
import requests
from bs4 import BeautifulSoup
import pandas as pd
url = "http://www.amazon.in/s/ref=nb_sb_noss?url=search-
alias%3Dautomotive&field-
keywords=helmets+for+men&rh=n%3A4772060031%2Ck%3Ahelmets+for+men&ajr=0"
request = requests.get(url)
soup = BeautifulSoup(request.content, "lxml")
#filename = auto.csv
#with open(str(auto.csv,"r+","\n")) as csvfile:
#headers = "Count , Asin \n"
#fo.writer(headers)
for url in soup.find_all('li'):
Nand = url.get('data-asin')
#print(Nand)
Result = url.get('id')
#print(Result)
#d=(str(Nand), str(Result))
df=pd.Index(url.get_attribute('url'))
#with open("auto.txt", "w",newline='') as dumpfile:
#dumpfilewriter = csv.writer(dumpfile)
#for Nand in soup:
#value = Nand.__gt__
#if value:
#dumpfilewriter.writerows([value])
df.to_csv(dumpfile)
dumpfile.close()
csvfile.csv.writer("auto.csv," , ',' ,'|' , "\n")
有人可以帮我这个。试图将结果导出到csv。我需要csv中的“数据asin”和“id”。 – Sunny
我忘了使用python 3.6.2版本添加我使用的版本 – Sunny
修复你的缩进,代码无法执行 –