12
我试图与特定HTML文件美丽的汤的Unicode编码错误
from BeautifulSoup import BeautifulSoup
import re
import codecs
import sys
f = open('test1.html')
html = f.read()
soup = BeautifulSoup(html)
body = soup.body.contents
para = soup.findAll('p')
print str(para).encode('utf-8')
我收到以下错误以下代码:
UnicodeEncodeError: 'ascii' codec can't encode character u'\u2019' in position 9: ordinal not in range(128)
如何调试呢?
我在取消打印功能的调用时没有出现任何错误。