2012-04-25

Web scraping: getting content from XML

How can I get content from an XML page using PHP? The content looks like this:

<entry> 
    <title>News</title> 
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/> 
    <id>tag:www.website.com,2012-04-25:2688327:BlogPost:1569917</id> 
    <updated>2012-04-25T08:30:00.000Z</updated> 
    <author> 
    <name>Username</name> 
    <uri>http://www.website.com/profile/username</uri> 
    </author> 
     <summary type="html"> 
     Hi this is the latest news 
     </summary> 
</entry> 

<entry> 
    <title>News2</title> 
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/> 
    <id>tag:www.website.com,2012-04-25:2688327:BlogPost:1569917</id> 
    <updated>2012-04-25T08:30:00.000Z</updated> 
    <author> 
    <name>Username2</name> 
    <uri>http://www.website.com/profile/username</uri> 
    </author> 
     <summary type="html"> 
     Hi this is the latest news 
     </summary> 
</entry> 

<entry> 
    <title>News3</title> 
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/> 
    <id>tag:www.website.com,2012-04-25:2688327:BlogPost:1569917</id> 
    <updated>2012-04-25T08:30:00.000Z</updated> 
    <author> 
    <name>Username3</name> 
    <uri>http://www.website.com/profile/username</uri> 
    </author> 
     <summary type="html"> 
     Hi this is the latest news 
     </summary> 
</entry> 

<entry> 
    <title>News4</title> 
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/> 
    <id>tag:www.website.com,2012-04-25:2688327:BlogPost:1569917</id> 
    <updated>2012-04-25T08:30:00.000Z</updated> 
    <author> 
    <name>Username4</name> 
    <uri>http://www.website.com/profile/username</uri> 
    </author> 
     <summary type="html"> 
     Hi this is the latest news 
     </summary> 
</entry> 

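How can I use PHP to get an array containing each entry's title, blog link (the href of <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/>), author details such as name and URI (the profile link), and summary?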

Answers

Check out SimpleXML and its XPath support: http://php.net/manual/en/book.simplexml.php

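    $file = 'url or file name';
    // Pass the variable itself: '$file' in single quotes is the literal string "$file".
    $xml = simplexml_load_file($file);
    // "//entry" matches every <entry> at any depth; "/entry" only matches a root element named entry.
    $list = $xml->xpath('//entry');
    print $list[0]->id; // id of the first entry
    #var_dump($list);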
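
To get the array the question asks for (title, link, author name, author URI, and summary per entry), the same SimpleXML and XPath approach can be extended roughly as follows. This is only a sketch under some assumptions: the `<feed>` root element and the shortened two-entry sample are made up for illustration, the sample declares no XML namespace, and `extractEntries()` is a hypothetical helper name, not part of SimpleXML.

```php
<?php
// Hypothetical sample: two of the question's entries wrapped in an assumed
// <feed> root, with no namespace declared.
$sample = <<<XML
<feed>
  <entry>
    <title>News</title>
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/>
    <author>
      <name>Username</name>
      <uri>http://www.website.com/profile/username</uri>
    </author>
    <summary type="html">
      Hi this is the latest news
    </summary>
  </entry>
  <entry>
    <title>News2</title>
    <link rel="alternate" href="http://www.website.com/detail/2688327:BlogPost:1569917"/>
    <author>
      <name>Username2</name>
      <uri>http://www.website.com/profile/username</uri>
    </author>
    <summary type="html">
      Hi this is the latest news
    </summary>
  </entry>
</feed>
XML;

// Hypothetical helper: collects the requested fields of every <entry>
// into a plain PHP array of associative arrays.
function extractEntries(SimpleXMLElement $xml): array
{
    $entries = [];
    foreach ($xml->xpath('//entry') as $entry) {
        $entries[] = [
            'title'   => (string) $entry->title,
            // Attributes are read with array syntax on the element.
            'link'    => (string) $entry->link['href'],
            'name'    => (string) $entry->author->name,
            'uri'     => (string) $entry->author->uri,
            // trim() drops the whitespace around the summary text.
            'summary' => trim((string) $entry->summary),
        ];
    }
    return $entries;
}

$entries = extractEntries(simplexml_load_string($sample));
print_r($entries);
```

For a real URL or file, `simplexml_load_file($file)` would replace `simplexml_load_string($sample)`. If the actual feed declares a default namespace (Atom feeds usually do), `//entry` will probably match nothing; in that case the XPath query would need a prefix registered with `registerXPathNamespace()`.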