2012-04-26 97 views
0

这是我从中提取数据的RSS XML文件。我用SAX解析器完成了所有工作,所有工作都很好。但是我解析了整个RSS文件并获取了每个单元素数据。Rss解析问题(XML)

但我的问题是我得到整个文件元素的数据,但我只想要只在元素。

<rss version="2.0"> 
<channel> 
<title>RSS Feed</title> 
<link>http://www.xyz.com</link> 
<description>Calendar RSS Feeds</description> 
<language>en-us</language> 
<ttl>60</ttl> 
<item> 
<title> 
title 1 
</title> 
<description>description 1</description> 
<link> 
http://www.xyz.com 
</link> 
<guid isPermaLink="false">[email protected]://www.xyz.com</guid> 
</item> 
<item> 
<title> 
title 2 
</title> 
<description>description 2</description> 
<link> 
http://www.xyz.com 
</link> 
<guid isPermaLink="false">[email protected]://www.xyz.com</guid> 
</item> 
<item> 
</channel> 
</rss> 

我已经尝试了所有的组合来设置此我会告诉我的代码在这里我提出的startElement和 的endElement方法: -

public void startElement(String uri, String localName, String qName, 
      Attributes attributes) throws SAXException { 
     // TODO Auto-generated method stub 

     current = true; 

     if (localName.equals("channel")) 
     { 
      itemList = new ItemList(); 
     } 
    } 

    @Override 
    public void endElement(String uri, String localName, String qName) 
      throws SAXException { 
     // TODO Auto-generated method stub 
     current = false; 

     if(localName.equals("title")) 
     { 
      itemList.setTitle(currentValue); 
     } 

    } 

    @Override 
    public void characters(char[] ch, int start, int length) 
      throws SAXException { 
     // TODO Auto-generated method stub 

     if(current) 
     { 
      currentValue = new String(ch, start, length); 
      current=false; 
     } 
    } 

任何一个请建议我如何只能retrive的仅在元素下的元素。

我也试过,但这个给了我错误

public void startElement(String uri, String localName, String qName, 
      Attributes attributes) throws SAXException { 
     // TODO Auto-generated method stub 

     current = true; 

     if (localName.equals("channel")) 
     { 
      if (localName.equals("item")) 
      { 
      itemList = new ItemList(); 
      } 
     } 
    } 

在此先感谢。

回答

0

试试这个,希望它西港岛线帮助....

url = new URL("http://www.abcd.com/rss.xml");//....... 
    HttpURLConnection conn = (HttpURLConnection) url.openConnection(); 

    if(conn.getResponseCode() == HttpURLConnection.HTTP_OK){ 
      DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance(); 
      DocumentBuilder db = dbf.newDocumentBuilder(); 
      Document doc; 
      doc = db.parse(url.openStream()); 
      doc.getDocumentElement().normalize(); 
      NodeList itemLst = doc.getElementsByTagName("item"); 



      Array.Description = new String[itemLst.getLength()];//........ 
      Array.Title = new String[itemLst.getLength()]; 
      Array.image = new String[itemLst.getLength()]; 
      Array.Date = new String[itemLst.getLength()]; 

      for(int i=0; i < itemLst.getLength(); i++){ 

       Node item = itemLst.item(i); 
       if(item.getNodeType() == Node.ELEMENT_NODE){ 
         Element ielem = (Element)item; 
         NodeList title = ielem.getElementsByTagName("title"); 
         NodeList date = ielem.getElementsByTagName("pubDate"); 

         NodeList description = ielem.getElementsByTagName("description"); 

        ////......... 
         Array.Title[i] = title.item(0).getChildNodes().item(0).getNodeValue(); 
         Array.Description[i] = description.item(0).getChildNodes().item(0).getNodeValue(); 
         Array.Date[i] = date.item(0).getChildNodes().item(0).getNodeValue(); 
+0

我已经尝试过这样的DocumentBuilder我得到这个结果,但在这我不知道如何实现StringBuffer的,因为我不在DocumentBuider中获取完整的数据。你可以建议我如何使用DocumentBuilder实现StringBuffer.Thanks ... – user755278 2012-04-26 10:17:04

+0

k ...我已经构建了一个RSS阅读器。我将发布我用于解析的整个文件。只需检查它... – 2012-04-26 10:19:49

0
public class headlinesparser { 


public static void parse(){ 
URL url; 
try { 
    url = new URL("http://www.abcd.com/taxonomy/term/19/rss.xml");//....... 
    HttpURLConnection conn = (HttpURLConnection) url.openConnection(); 

    if(conn.getResponseCode() == HttpURLConnection.HTTP_OK){ 
      DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance(); 
      DocumentBuilder db = dbf.newDocumentBuilder(); 
      Document doc; 
      doc = db.parse(url.openStream()); 
      doc.getDocumentElement().normalize(); 
      NodeList itemLst = doc.getElementsByTagName("item"); 



      Headlines.Description = new String[itemLst.getLength()];//........ 
      Headlines.Title = new String[itemLst.getLength()]; 
      Headlines.image = new String[itemLst.getLength()]; 
      Headlines.Date = new String[itemLst.getLength()]; 

      for(int i=0; i < itemLst.getLength(); i++){ 

       Node item = itemLst.item(i); 
       if(item.getNodeType() == Node.ELEMENT_NODE){ 
         Element ielem = (Element)item; 
         NodeList title = ielem.getElementsByTagName("title"); 
         NodeList date = ielem.getElementsByTagName("pubDate"); 

         NodeList description = ielem.getElementsByTagName("description"); 

        ////......... 
         Headlines.Title[i] = title.item(0).getChildNodes().item(0).getNodeValue(); 
         Headlines.Description[i] = description.item(0).getChildNodes().item(0).getNodeValue(); 
         Headlines.Date[i] = date.item(0).getChildNodes().item(0).getNodeValue(); 

        if (Headlines.Description[i].contains("<img ")){ 

        String img = Headlines.Description[i].substring(Headlines.Description[i].indexOf("<img ")); 
        String cleanUp = img.substring(0, img.indexOf(">")+1); 
        img = img.substring(img.indexOf("src=") + 5); 
        int indexOf = img.indexOf("'"); 
        if (indexOf==-1){ 
         indexOf = img.indexOf("\""); 
        } 
        img = img.substring(0, indexOf); 

        //setImgLink(img); 

        Headlines.image[i]=img; 

         } 




       } 

      } 

    } 

} catch (MalformedURLException e) { 
    // TODO Auto-generated catch block 
    e.printStackTrace(); 
} catch (DOMException e) { 
    // TODO Auto-generated catch block 
    e.printStackTrace(); 
} catch (IOException e) { 
    // TODO Auto-generated catch block 
    e.printStackTrace(); 
} catch (ParserConfigurationException e) { 
    // TODO Auto-generated catch block 
    e.printStackTrace(); 
} catch (SAXException e) { 
    // TODO Auto-generated catch block 
    e.printStackTrace(); 
} 

} 


}