2017-08-01 114 views
2

很基本的问题:如何在美丽的汤中返回标签的元素?

如何would.I返回001,002 ... 101来自:

<us-applicant sequence="001" app-type="applicant" designation="us-only"> 
... 
<us-applicant sequence="101" app-type="applicant" designation="us-only"> 

用美丽的汤吗?我知道返回两个标记之间的东西基本外观,但我不知道这是什么元素实际上是

回答

1

你可以做这样的事情,使用列表理解,你使用['sequence']属性:

from bs4 import BeautifulSoup 

data = ''' 
<us-applicant sequence="001" app-type="applicant" designation="us-only"> 
<us-applicant sequence="100" app-type="applicant" designation="us-only"> 
<us-applicant sequence="101" app-type="applicant" designation="us-only"> 
''' 

soup = BeautifulSoup(data, 'html.parser') 
>>> [tag['sequence'] for tag in soup.findAll('us-applicant')] 
['001', '100', '101']