2013-04-05 94 views
-2

我有这样的代码:如何将此字符串解码为utf-8字符串?

string URL = "http://translate.google.com/m?hl=vi&sl=en&tl=vi&ie=UTF-8&prev=_m&q=song"; 
string s = ""; 
WebClient client = new WebClient(); 
client.Headers.Add("user-agent", "Mozilla/5.0 (SymbianOS/9.4; U; Series60/5.0 Nokia5800d-1/21.0.025; Profile/MIDP-2.1 Configuration/CLDC-1.1) AppleWebKit/413 (KHTML, like Gecko) Safari/413"); 
s = client.DownloadString(URL); 

在运行时我收到了字符串格式字符串:

danh tÆ°Ì€ buổi há»p để ca hát 

这个字符串:

danh từ buổi họp để ca hát 

非常感谢你:)

+3

哪里原从何而来?它编码为什么? – Oded 2013-04-05 14:31:11

+0

对。页面的编码是什么,为什么你不把'client.Encoding'设置为开始? – Oded 2013-04-05 14:37:37

+0

@Oded:谢谢,它的工作,我是新手:) – Sakura 2013-04-05 14:39:41

回答

1

当得到byte[]代表一个stri ng,你需要知道在获取之前使用了什么编码。

在这种情况下,它是UTF-8,因此设置的WebClient,以导致正确的字符串的Encoding

string URL = "http://translate.google.com/m?hl=vi&sl=en&tl=vi&ie=UTF-8&prev=_m&q=song"; 
string s = ""; 
WebClient client = new WebClient(); 
client.Encoding = Encoding.UTF8; 
client.Headers.Add("user-agent", "Mozilla/5.0 (SymbianOS/9.4; U; Series60/5.0 Nokia5800d-1/21.0.025; Profile/MIDP-2.1 Configuration/CLDC-1.1) AppleWebKit/413 (KHTML, like Gecko) Safari/413"); 
s = client.DownloadString(URL);