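lxml unicode output issue
I'm new to Python and lxml, so please bear with me. I'm stuck on what appears to be a Unicode problem. I have tried .encode and Beautiful Soup's UnicodeDammit, with no luck. I have searched forums and the web, but with my limited Python skills I could not apply the suggested solutions to my particular code. Any help is appreciated, thanks.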
Code:
import requests
import lxml.html
sourceUrl = "http://www.hkex.com.hk/eng/market/sec_tradinfo/stockcode/eisdeqty.htm"
sourceHtml = requests.get(sourceUrl)
htmlTree = lxml.html.fromstring(sourceHtml.text)
for stockCodes in htmlTree.xpath('''/html/body/printfriendly/table/tr/td/table/tr/td/table/tr/table/tr/td'''):
    string = stockCodes.text
    print string
Error:
UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 0: ordinal not in range(128)
Can you provide more details about the error? Or add a line 'print type(string)' before 'print string' to see what is going on. – iceout 2013-04-07 14:46:04
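The character in the error message, u'\xa0', is a non-breaking space (U+00A0), which the ASCII codec cannot represent. A likely cause, assuming Python 2 and a console or pipe whose default encoding is ASCII, is that print tries to encode the unicode cell text implicitly. Below is a minimal sketch of two ways around that, not a confirmed fix for this exact page. The helper clean_cell_text and the sample_cells values are hypothetical stand-ins for what stockCodes.text might return.

```python
# -*- coding: utf-8 -*-
from __future__ import print_function


def clean_cell_text(text):
    """Return cell text with non-breaking spaces turned into plain spaces.

    `text` may be None, because lxml's .text is None for an element
    that has no leading text. (Hypothetical helper, not part of lxml.)
    """
    if text is None:
        return u""
    return text.replace(u"\xa0", u" ").strip()


# Hypothetical stand-ins for the values of stockCodes.text in the loop.
sample_cells = [u"\xa0", u"00001\xa0", None, u"SAMPLE NAME"]

# Reproduce the failure: the ASCII codec cannot represent U+00A0.
try:
    sample_cells[0].encode("ascii")
except UnicodeEncodeError as exc:
    print("ascii encoding failed:", exc.reason)

# Option 1: normalise the text so that only plain spaces remain.
cleaned = [clean_cell_text(cell) for cell in sample_cells]
print(cleaned)

# Option 2: keep the raw text and encode it explicitly as UTF-8
# instead of relying on the default codec.
utf8_bytes = sample_cells[1].encode("utf-8")
```

In the loop from the question this would correspond to something like print clean_cell_text(stockCodes.text), or print stockCodes.text.encode('utf-8') when the terminal accepts UTF-8. Note that the second form still needs a None check for empty cells.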