Which library is used to extract text from an image?

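// Requires a COM reference to the Microsoft Office Document Imaging (MODI) type library.
Document modiDocument = new Document();
modiDocument.Create(filePath);
// Run OCR on the whole document, assuming English text.
modiDocument.OCR(MiLANGUAGES.miLANG_ENGLISH);
// Read the recognized text from the first page only.
MODI.Image modiImage = (modiDocument.Images[0] as MODI.Image);
string extractedText = modiImage.Layout.Text;
modiDocument.Close();
return extractedText;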

Source

2016-08-19 19:08:10 user6736260

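To extract text from images, I use Tesseract, the most accurate open-source OCR engine I know of. It is available here or directly as a NuGet package in your project.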

Here is the C# code I use to extract the words from the image at sourceFilePath. Set EngineMode to TesseractAndCube; it detects more words than the other options.

var path = "YourSolutionDirectoryPath";
using (var engine = new TesseractEngine(path + Path.DirectorySeparatorChar + "tessdata", "fra", EngineMode.TesseractAndCube))
{
    using (var img = Pix.LoadFromFile(sourceFilePath))
    {
        using (var page = engine.Process(img))
        {
            var text = page.GetText();
            // text variable contains a string with all words found
        }
    }
}

I hope this helps.

Source

2017-02-28 10:50:21

Here is some useful C# sample code:

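Using Tesseract: a free, open-source OCR application for the Windows desktop, a modern GUI front end for the Tesseract OCR engine. The application also includes support for reading and OCR'ing PDF files: https://github.com/A9T9/Free-Ocr-Windows-Desktop
Using Microsoft OCR: a free, open-source OCR application for the Windows Store, a modern GUI front end for the Microsoft OCR library. The application also includes support for reading and OCR'ing PDF files: https://github.com/A9T9/Free-OCR-Software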

Source

2017-02-28 11:16:44 Tienkamp
