2015-09-20

Slate in Python is stumbling at the first hurdle. I am trying to parse a PDF into text and wanted to start with Slate.

However, just following the simple example that is posted everywhere, I get the following:

>>> import slate 
>>> with open('pytest.PDF') as fp: 
...  doc = slate.PDF(fp) 
... 
Traceback (most recent call last):
  File "<stdin>", line 2, in <module>
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/slate/slate.py", line 52, in __init__
    self.append(self.interpreter.process_page(page))
  File "/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/slate/slate.py", line 36, in process_page
    self.device.outfp.buf = ''
AttributeError: 'cStringIO.StringO' object has no attribute 'buf'

Any ideas?

Answer

This can be fixed by changing line 36 of slate.py, where the error occurs, to:

self.device.outfp.truncate(0)
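
The traceback shows that the output buffer has no `buf` attribute to assign to, so the fix empties it through its public file-like API instead. Here is a minimal sketch of that idea, not slate's actual code: `reset_buffer` is a hypothetical helper, and `io.StringIO` stands in for the `cStringIO` object from the traceback. The extra `seek(0)` is an assumption added for portability, since `io.StringIO.truncate` does not move the stream position.

```python
import io


def reset_buffer(buf):
    """Empty an in-memory file object using only its public methods.

    Hypothetical helper for illustration; not part of slate.
    """
    buf.seek(0)      # move the write position back to the start
    buf.truncate(0)  # discard everything that was written so far
    return buf


# Simulate collecting text for two pages with one reusable buffer.
buf = io.StringIO()

buf.write(u"text from page 1")
page1 = buf.getvalue()

reset_buffer(buf)

buf.write(u"page 2")
page2 = buf.getvalue()

print(repr(page1))
print(repr(page2))
```

Without the `seek(0)`, an `io.StringIO` buffer would keep its old write position after `truncate(0)`, and the next write would start past the beginning of the now-empty buffer.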