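Python BeautifulSoup: iterating over a table
I want to convert table data to a CSV file. Unfortunately, I've hit a roadblock: the code below simply repeats the TDs from the first TR for every subsequent TR.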
import urllib.request
from bs4 import BeautifulSoup
f = open('out.txt','w')
url = "http://www.international.gc.ca/about-a_propos/atip-aiprp/reports-rapports/2012/02-atip_aiprp.aspx"
page = urllib.request.urlopen(url)
soup = BeautifulSoup(page)
soup.unicode
table1 = soup.find("table", border=1)
table2 = soup.find('tbody')
table3 = soup.find_all('tr')
for td in table3:
    rn = soup.find_all("td")[0].get_text()
    sr = soup.find_all("td")[1].get_text()
    d = soup.find_all("td")[2].get_text()
    n = soup.find_all("td")[3].get_text()
    print(rn + "," + sr + "," + d + ",", file=f)
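This is my first Python script, so any help would be greatly appreciated! I have looked at the answers to other questions, but I can't figure out what I'm doing wrong here.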
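
A likely cause, as far as I can tell: inside the loop, soup.find_all("td") searches the whole document on every pass, so indices 0 to 3 always point at the first row's cells. Calling find_all on the loop variable (the current tr) should return that row's own cells instead. Below is a minimal sketch of that idea, not a definitive fix. Its assumptions: the sample HTML and its values are made up to stand in for the real page, the names table_to_rows and rows_to_csv are hypothetical helpers, and the standard csv module is swapped in for manual string concatenation so that commas inside a cell get quoted.

```python
import csv
import io

from bs4 import BeautifulSoup

# Made-up sample markup standing in for the real page (an assumption).
SAMPLE_HTML = """
<table border="1">
  <tbody>
    <tr><th>Request</th><th>Summary</th><th>Disposition</th><th>Pages</th></tr>
    <tr><td>R-001</td><td>Briefing notes, 2011</td><td>Disclosed in part</td><td>12</td></tr>
    <tr><td>R-002</td><td>Contracts</td><td>All disclosed</td><td>3</td></tr>
  </tbody>
</table>
"""


def table_to_rows(html, columns=4):
    """Return the text of the first `columns` cells of each data row."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", attrs={"border": "1"})
    rows = []
    for tr in table.find_all("tr"):
        # Search inside the current row, not the whole document.
        cells = tr.find_all("td")
        if len(cells) < columns:
            # Skip header rows (th only) and rows that are too short.
            continue
        rows.append([td.get_text(strip=True) for td in cells[:columns]])
    return rows


def rows_to_csv(rows):
    """Serialize rows to CSV text; csv.writer quotes cells containing commas."""
    buffer = io.StringIO()
    csv.writer(buffer).writerows(rows)
    return buffer.getvalue()


rows = table_to_rows(SAMPLE_HTML)
csv_text = rows_to_csv(rows)
print(csv_text, end="")
```

To write straight to a file instead of a string, open it with open("out.csv", "w", newline="") and pass the file object to csv.writer. Note that the sketch would raise an AttributeError if no table with border="1" is found, so the real page may need a different lookup.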