Unicode UTF-8/UTF-16 encoding in Python

In Python:

u'\u3053\n'

Is it UTF-16?

I don't really understand all the Unicode/encoding stuff, but this kind of thing keeps showing up in my dataset, for example if I have a=u'\u3053\n'.

print raises an exception, and decoding raises an exception.

a.encode("utf-16") > '\xff\xfeS0\n\x00' 
a.encode("utf-8") > '\xe3\x81\x93\n' 

print a.encode("utf-8") > πüô 
print a.encode("utf-16") >  ■S0

What's going on?

Source

2009-08-04 8steve8

http://www.fileformat.info/info/unicode/char/3053/index.htm – 8steve8 2009-08-04 19:32:02

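It's a Unicode character that apparently can't be represented in your terminal's encoding. print tries to encode the unicode object in your terminal's encoding, and if that can't be done you get an exception.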
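As a rough illustration of that encoding step, the sketch below encodes the string from the question by hand. The helper name `encode_for_terminal` is made up for this example, and `cp437` is only an assumed stand-in for a legacy console code page; your terminal may use something else. It also shows that `\u3053` names a single code point (U+3053), so the string itself is neither UTF-8 nor UTF-16 until it is encoded.

```python
import sys

text = u'\u3053\n'  # the string from the question

# \u3053 is an escape for one code point (U+3053), not for UTF-16 bytes.
code_point = ord(text[0])

def encode_for_terminal(s, encoding, errors='strict'):
    """Hypothetical helper: encode s the way it must be encoded before it
    can be written to a terminal that uses `encoding`.

    Returns the bytes, or None when the codec cannot represent s.
    """
    try:
        return s.encode(encoding, errors)
    except UnicodeEncodeError:
        return None

# Assumption: 'cp437' stands in for a legacy console code page.
strict_result = encode_for_terminal(text, 'cp437')                      # fails
replaced_result = encode_for_terminal(text, 'cp437', errors='replace')  # lossy
utf8_result = encode_for_terminal(text, 'utf-8')                        # works

# The encoding Python detected for this session (may be None when piped).
detected = getattr(sys.stdout, 'encoding', None)
```

With `errors='replace'` the character is swapped for a question mark instead of raising, which can be enough for debugging output but loses the original text.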

On a terminal that can display UTF-8 you get:

>>> print u'\u3053' 
こ

Your terminal doesn't seem to be able to display UTF-8; otherwise at least the print a.encode("utf-8") line should produce the correct character.
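The garbled output in the question is consistent with the console interpreting the encoded bytes as code page 437. That is an inference from the characters shown, not something the question confirms. A minimal sketch that reproduces the effect by decoding the same bytes as `cp437`:

```python
import codecs

text = u'\u3053\n'

# The bytes that the two encode() calls in the question produce.
utf8_bytes = text.encode('utf-8')
# Little-endian with an explicit BOM, matching the output in the question.
utf16_bytes = codecs.BOM_UTF16_LE + text.encode('utf-16-le')

# Assumption: the console reads raw bytes as cp437.
seen_utf8 = utf8_bytes.decode('cp437')    # pi, u-umlaut, o-circumflex
seen_utf16 = utf16_bytes.decode('cp437')  # no-break space, black square, 'S0'
```

Under that assumption, the three UTF-8 bytes come out as three unrelated characters, and the UTF-16 byte order mark comes out as a blank followed by a black square, which matches what was printed.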

Source

2009-08-04 19:35:04 sth

Thanks. Yes, PowerShell, and even PowerShell ISE, don't seem "compatible" (for lack of a better understanding) with Unicode in Python. http://stackoverflow.com/questions/2105022/unicode-in-powershell-with-python-alternative-shells-in-windows – 8steve8 2010-02-05 17:21:03