我解析表使用此模式代码:分析表的列与BeautifulSoup
soup = BeautifulSoup(open("out.html"), 'html.parser')
tab = soup.findAll('table')[3]
rows = tab.find_all('tr')
for sing_row in rows:
col = sing_row.find_all('td')[1]
print col
打印的结果是:
<td class="col-md-3">5.67.43.158<br/><span style="font-size: 0.9em; color: #eee;"></span></td>
<td class="col-md-3">32.54.44.155<br/><span style="font-size: 0.9em; color: #eee;">ns2.asdf.it</span></td>
<td class="col-md-3">53.64.21.154<br/><span style="font-size: 0.9em; color: #eee;">server1.adb.it</span></td>
<td class="col-md-3">23.62.53.22<br/><span style="font-size: 0.9em; color: #eee;">server1.xcvf.it</span></td>
我的目标是从表中只获取IP地址没有跨度内的域的列。我该如何继续?
尝试'sing_row.find_all( 'TD')[1] .contents' –
马库斯如果提供的答案符合您的要求,那么您应该将其标记为“已接受”。 –
仅供参考:https://stackoverflow.com/help/someone-answers –