How to OCR using SDK in C#


hi all,

 

below requirement in detail.

i have pdf contains scanned documents.

i want convert pdf content (xml).

can me out achieve using sdk c#.

 

manoj k singh

this isn't useful believe it'll end being 2-stage process may need use 2+ libraries perform various steps.

 

don't me wrong, there's ocr libs c# read pdfs full of images, , no doubt saw price on $4k developer license haha. there's few. i'm assuming want avoid that.

 

there's tools xpdf , others should find , try can read pdf images themselves. after images, might need convert them different image format, , them feed them ocr library. google manages project tesseract ocr may want at. believe compiles c++ know there's ways use c++ library c#.

 

a lot of work do, that's why made direct pdf image -> ocr text plugins expensive.



More discussions in Coding Corner


adobe

Comments

Popular posts from this blog

Soustraire une selection

After Effects: could not find dvaeve_dialogs.txt

Illustrator cs6 "Invalid Serial Number"