2015-10-05 76 views
0

在Twisted的网站上,他们有一个名为“Receiving Responses”的例子,它显示了如何获取响应头文件和响应代码,而不是响应主体(请求返回到网站的实际HTML主体)。如何进行GET请求,然后使用Python打印响应正文(扭曲)

在def cbrequest(response)中,我将如何打印出由GET请求返回到example.com的HTML文本?他们显示了用于获取标题的response.headers之类的方法,但我还没有看到用于返回前端正文的方法。

我试图打印response.deliverBody(BeginningPrinter(完))来获取响应文本,但无济于事

http://twistedmatrix.com/documents/12.1.0/web/howto/client.html#auto9

from pprint import pformat 

from twisted.internet import reactor 
from twisted.internet.defer import Deferred 
from twisted.internet.protocol import Protocol 
from twisted.web.client import Agent 
from twisted.web.http_headers import Headers 

class BeginningPrinter(Protocol): 
    def __init__(self, finished): 
     self.finished = finished 
     self.remaining = 1024 * 10 

    def dataReceived(self, bytes): 
     if self.remaining: 
      display = bytes[:self.remaining] 
      print 'Some data received:' 
     print display 
     self.remaining -= len(display) 

def connectionLost(self, reason): 
    print 'Finished receiving body:', reason.getErrorMessage() 
    self.finished.callback(None) 

agent = Agent(reactor) 
d = agent.request(
    'GET', 
    'http://example.com/', 
    Headers({'User-Agent': ['Twisted Web Client Example']}), 
    None) 

def cbRequest(response): 
    print 'Response version:', response.version 
    print 'Response code:', response.code 
    print 'Response phrase:', response.phrase 
    print 'Response headers:' 
    print pformat(list(response.headers.getAllRawHeaders())) 
    finished = Deferred() 
    response.deliverBody(BeginningPrinter(finished)) 
    return finished 
d.addCallback(cbRequest) 

def cbShutdown(ignored): 
    reactor.stop() 
d.addBoth(cbShutdown) 

reactor.run() 

回答

1

你必须保持数据和事后归还

class BeginningPrinter(Protocol): 
    def __init__(self, finished): 
     self.finished = finished 
     self.remaining = 1024 * 10 
     self.buff = None 

    def dataReceived(self, bytes): 
     if self.buff is None: 
      self.buff = bytes 
     else: 
      self.buff += bytes 
     if self.remaining: 
      display = bytes[:self.remaining] 
      print 'Some data received:' 
     print display 
     self.remaining -= len(display) 

    def connectionLost(self, reason): 
     print 'Finished receiving body:', reason.getErrorMessage() 
     self.finished.callback(self.buff) 

请注意,您应该删除变量remaining,因为此代码示例期望服务器以1024 * 10字节。

class BeginningPrinter(Protocol): 
    def __init__(self, finished): 
     self.finished = finished 
     self.buff = None 

    def dataReceived(self, bytes): 
     if self.buff is None: 
      self.buff = bytes 
     else: 
      self.buff += bytes 

    def connectionLost(self, reason): 
     print 'Finished receiving body:', reason.getErrorMessage() 
     self.finished.callback(self.buff) 
+0

“请注意,您应该删除剩余的变量,因为此代码示例期望服务器以1024 * 10字节进行响应。”这个问题和答案所基于的[“检查响应”](http://twistedmatrix.com/documents/12.1.0/web/howto/client.html#auto4)代码似乎不同意这种说法。读错了吗? – msw

+0

我再次读过这个代码,它被认为只打印了1024 * 10的第一个字节,这就是为什么这个类被命名为“开始打印机”的原因,我想你只需要所有的响应。使用我给出的最后一个片段,但将类重命名为'ResponseReceiver' –