2017-04-05 96 views

回答

0

我发现最简单的方式,以OCR输出保存到文本文件

def ocr(file_to_ocr): 
    im = Image.open(file_to_ocr) 
    txt=pytesseract.image_to_string(im) 
    return txt 

directory = os.path.join("Your_path") 
for root,dirs,files in os.walk(directory): 
for file in files: 
    if file.endswith(".jpg"): 
     pre_fix=file[:-4] 
     txt=ocr(file) 
     with open(directory+"\\"+pre_fix+".txt",'w') as f: f.write(str(txt)) 
1

你可以创建一个使用uuid唯一的文件夹名称,然后写output.txt中,以它像这样?:

from uuid import uuid4 
import os 

folder_name = str(uuid4()) 
os.makedirs(folder_name) 
with open('./{fn}/output.txt'.format(fn=folder_name),'wb') as f: 
    f.write(image_to_string(img1))