2013-03-02 469 views
0

我正在尝试使用spinn3r的数据。数据作为protobuf返回。在蟒蛇,当我打印的protobuf的对象,我得到这个:在Python中读取protobuf。提取数据

print data 
source { 
    link { 
    href: "" 
    resource: "" 
    } 
    canonical_link { 
    href: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    resource: "" 
    } 
    title: "" 
    hashcode: "" 
    lang { 
    code: "en" 
    probability: -1.0 
    } 
    generator: "" 
    description: "" 
    last_posted: "" 
    last_published: "" 
    date_found: "" 
    publisher_type: "MICROBLOG" 
} 
feed { 
    link { 
    href: "" 
    resource: "" 
    } 
    canonical_link { 
    href: "" 
    resource: "" 
    } 
    title: "" 
    hashcode: "" 
    lang { 
    code: "en" 
    probability: -1.0 
    } 
    generator: "" 
    description: "" 
    last_posted: "" 
    last_published: "" 
    date_found: "" 
    etag: "" 
    channel_link { 
    href: "" 
    resource: "" 
    } 
} 
feed_entry { 
    link { 
    href: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    resource: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    } 
    canonical_link { 
    href: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    resource: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    } 
    title: "The value of a man resides in what he gives and not in what he is capable of receiving. ~ Albert Einstein" 
    hashcode: "8WhKLK9Lyng" 
    lang { 
    code: "en" 
    probability: -1.0 
    } 
    author { 
    name: "_PattiShaw (Patti Shaw)" 
    email: "" 
    link { 
     href: "http://twitter.com/_PattiShaw" 
    } 
    } 
    spam_probability: 0.0 
    last_published: "2011-01-20T19:08:49Z" 
    date_found: "2011-01-20T19:08:49Z" 
    identifier: 1295550574016007548 
    content { 
    mime_type: "text/html" 
    data: "x\332M\214\301\r\2000\014\304V\271\t`\201\n\211\007\033\[email protected]\003\215TR\324\226\362cv\020/\276\266\3459\010\032\305S\220V\020v2d)\352\[email protected]\rW\240\212\267\330\264\275\300\[email protected]\346]\317\003,\325\277\327\202\205\016\342\370m\262,\242Mm\353pc\214,\271bR+U\324\036\200\236&\363" 
    encoding: "zlib" 
    } 
} 
permalink_entry { 
    link { 
    href: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    resource: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    } 
    canonical_link { 
    href: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    resource: "http://twitter.com/_PattiShaw/statuses/28167079857225728" 
    } 
    title: "The value of a man resides in what he gives and not in what he is capable of receiving. ~ Albert Einstein" 
    hashcode: "8WhKLK9Lyng" 
    lang { 
    code: "en" 
    probability: -1.0 
    } 
    author { 
    name: "_PattiShaw (Patti Shaw)" 
    email: "" 
    link { 
     href: "http://twitter.com/_PattiShaw" 
    } 
    } 
    spam_probability: 0.0 
    last_published: "2011-01-20T19:08:49Z" 
    date_found: "2011-01-20T19:09:34Z" 
    identifier: 1295550574016007548 
    content { 
    mime_type: "text/html" 
    data: "" 
    } 
    content_extract { 
    mime_type: "text/html" 
    data: "" 
    } 
    generator: "" 
} 

我想从“feed_entry”对象中提取的“作者姓名”。我尝试这样做:

print data.feed_entry.author.name 

我得到的错误:

AttributeError: 'RepeatedCompositeFieldContainer' object has no attribute 'name' 

我尝试了打印author对象,看看会发生什么。这是我得到的:

print u.feed_entry.author 
[<spinn3rApi_pb2.Author object at 0x362e6d0>] 

我该如何提取此作者的名字?

+0

你'目录(数据)'得到什么? – Salem 2013-03-02 18:59:33

回答

2

它看起来像u.feed_entry.author是一个列表。请注意括号:

[<spinn3rApi_pb2.Author object at 0x362e6d0>] 

这应该解决您的问题(假设你有至少一个作者):

print data.feed_entry.author[0].name