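Python web crawler (multiple pages)

I am using requests and bs4. Inside a loop, I fetch and print the "soup" for each page, and I find that only the last "soup" is correct. The other ones differ from the page's HTML source. Please help. Thanks.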
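import requests
import bs4

# `files` is the iterable of neuron names (its definition is not shown in the post)
for eachLine in files:
    # build the page URL from the current line
    addr = 'http://neuromorpho.org/neuron_info.jsp?neuron_name=' + eachLine
    print(addr)
    st = []
    st1 = []
    # download the page and parse it with the lxml parser
    r2 = requests.get(addr)
    soup2 = bs4.BeautifulSoup(r2.text, "lxml")
    print(soup2)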
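
One possible explanation, assuming `files` is an open text file (the post does not show how it is created): iterating over a file yields each line with its trailing newline, so every URL except the one built from the last line ends in `\n`. That would match the symptom that only the last "soup" is right. Below is a minimal sketch of stripping each line before building the URL. The helper names `build_urls` and `fetch_soups` and the sample neuron names are hypothetical, chosen only for illustration.

```python
BASE_URL = 'http://neuromorpho.org/neuron_info.jsp?neuron_name='


def build_urls(lines):
    """Return one URL per non-empty line, with surrounding whitespace removed."""
    urls = []
    for line in lines:
        name = line.strip()  # drops the trailing '\n' that file iteration keeps
        if name:             # skip blank lines
            urls.append(BASE_URL + name)
    return urls


def fetch_soups(lines):
    """Fetch and parse each page; needs network access, requests, bs4 and lxml."""
    import requests  # imported here so build_urls works without these packages
    import bs4
    soups = []
    for url in build_urls(lines):
        response = requests.get(url)
        soups.append(bs4.BeautifulSoup(response.text, "lxml"))
    return soups


# Hypothetical names; mimics a file where every line but the last ends in '\n'.
sample_lines = ['neuron_a\n', 'neuron_b\n', 'neuron_c']
for url in build_urls(sample_lines):
    print(repr(url))  # repr makes any stray whitespace visible
```

Printing `repr(addr)` in the original loop is a quick way to check whether the newline is really there before changing anything else.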