2011-11-17 44 views
0

我试图将文本转换成xml格式。我正在使用LXML库。但是我收到错误消息。请帮帮我。谢谢文本XML在python

import re 
from lxml import etree 
import urllib,urllib2 
def get_movie_info(movie_id): 
    URL = "http://www.raaga.com/a/rss.asp?%s"%(movie_id) 
    f = urllib.urlopen(URL) 
    movie_info = f.read() 
    rss = "".join([ line.strip() for line in movie_info ]) 
    mi_tree = etree.fromstring(rss) 
    #mi_title = self._parse_movie_title(mi_tree.xpath("/rss/channel/title/text()")[0]) 
    #mi_tracks = mi_tree.xpath("/rss/channel/item") 
    return mi_tree 

get_movie_info('A0000102') 

这是我回溯

Traceback (most recent call last): 
    File "py1.py", line 14, in <module> 
    get_movie_info('A0000102') 
    File "py1.py", line 9, in get_movie_info 
    mi_tree = etree.fromstring(rss) 
    File "lxml.etree.pyx", line 2743, in lxml.etree.fromstring (src/lxml\lxml.etre 
e.c:52665) 
    File "parser.pxi", line 1573, in lxml.etree._parseMemoryDocument (src/lxml\lxm 
l.etree.c:79932) 
    File "parser.pxi", line 1452, in lxml.etree._parseDoc (src/lxml\lxml.etree.c:7 
8774) 
    File "parser.pxi", line 960, in lxml.etree._BaseParser._parseDoc (src/lxml\lxm 
l.etree.c:75389) 
    File "parser.pxi", line 564, in lxml.etree._ParserContext._handleParseResultDo 
c (src/lxml\lxml.etree.c:71739) 
    File "parser.pxi", line 645, in lxml.etree._handleParseResult (src/lxml\lxml.e 
tree.c:72614) 
    File "parser.pxi", line 585, in lxml.etree._raiseParseError (src/lxml\lxml.etr 
ee.c:71955) 
lxml.etree.XMLSyntaxError: xmlParsePITarget: invalid name prefix 'xml', line 1, 
column 13 

回答

0

它工作正常,我没有这个字符串:

rss = "".join([ line.strip() for line in movie_info ]) 

事情是这样的:

mi_tree = etree.fromstring(movie_info) 
+0

它的工作。谢谢:-) – Kulbir

+0

它显示这个''。我如何打印所有内容?谢谢 – Kulbir

+0

[This](http://lxml.de/tutorial.html#tree-iteration)应该有帮助=) –