beautifulsoup

0热度

2回答

我有一些问题网站刮美丽的汤一些数据，我想知道如果你们任何刮板专业人士可以给我一些指导。这是确切的网页，我想凑： https://coinmarketcap.com/currencies/bitcoin/historical-data/?start=20130428&end=20171013 具体来说，我想抓住历史价格的表格并以某种方式提取信息到数据帧。但首先我需要在原始html中实际找到它。 i

0热度

2回答

python beautifulsoup .text无类型

我正在刮美丽的汤的网站。我正在查找表中的“prtype”文本。我的问题是，这个专栏并不总是存在。如果列存在以下代码工作正常： prtyp = soup.find("dd", attrs={"class":"is_type g"}).text.strip() 但是，如果没有与这个类，我得到以下错误无柱： 'NoneType' object has no attribute 'text' 那

0热度

1回答

如何：在浏览器中打开请求会话

如何在浏览器中打开Python请求会话？我一直在使用GETS和POSTS浏览网站，并且在完成后，我想打开包含我发送给已发送网站的所有信息的URL。

0热度

3回答

遇到问题从里面提取文本刮html标签使用美丽的汤

我使用刮内容此方法返回的条目与此类似 <li class="title"><h4><a href="/addons/wow/world-quest-tracker">World Quest Tracker</a></h4></li> 我的列表中的代码试图提取中间的href标签中的文字，在这种情况下， World Quest Tracker 我怎么能完成这个？

1热度

2回答

网络与美丽的汤

刮我有以下代码以提取最新的MS Office版本的Mac： import urllib2 from bs4 import BeautifulSoup quote_page = 'https://support.office.com/en-us/article/Update-history- for-Office-2016-for-Mac-700cab62-0d67-4f23-947b-36

0热度

1回答

Python beautifulsoup得到2行文字

我是python的新手。试图从零开始学习......但需要做一些事情......这意味着我还没有完成我的阅读。我有下面的代码 import requests from bs4 import BeautifulSoup url="https://www.xxx.co.uk" page=requests.get(url) soup = BeautifulSoup(page.content,

0热度

1回答

UnicodeEncodeError在Python 3和BeautifulSoup4

当运行我的代码，我得到这个错误 UnicodeEncodeError: 'ascii' codec can't encode character '\u0303' in position 71: ordinal not in range(128) 这是我的全部代码， from urllib.request import urlopen as uReq from urllib.request im

1热度

2回答

如何为python3中的循环创建的每一行添加一个静态值？

district_name= [[li.getText() for li in data_rows[i].findAll('li')] for i in range(len(data_rows))] 上面的代码给出了一个州比哈尔邦的地区名称列表。像下面的表一样。 [['1', 'Nalanda'], ['2', 'Patna'], ['3', 'Gaya'], ['4',

1热度

2回答

访问与beautifulsoup

嵌套元素我有下面的HTML： <div id="contentDiv">  <div style="margin: 15px 0 10px 0; padding: 3px; overflow: hidden; background-color: #BCD6F8;"> <div class="mailer">Mailing

0热度

1回答

在python中获取下一页网址

现在我试图从网页上刮掉所有的url。它共有5个类别，每个类别都有不同的页面（每页有10篇文章）。例如： Categories Pages Banana 5 Apple 14 Cherry 7 Melon 6 Berry 2 代码： import requests from bs4 import BeautifulSoup import re from ur