programming-languages, jpeg, ocr

Read text from a scanned image of a document


Is there any way to get the text from a scanned document in JPG, JPEG, or any other image format? I am using Ruby as my programming language, but I guess that if I can get the text with help from another programming language, it will not be much of a problem to integrate.

Thanks.


Solution

  • Yes, you can use an OCR library. There are additional details at https://stackoverflow.com/questions/1085/free-ocr-library.

    In brief, consider tessnet (http://www.pixel-technology.com/freeware/tessnet2/), a .NET wrapper around the Tesseract OCR engine.
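
    Since the question mentions Ruby, here is a minimal sketch of one way to integrate: instead of tessnet itself, it shells out to the Tesseract command-line tool, which is a different route from the one the answer names. It assumes the `tesseract` executable is installed and on the PATH; the helper names `ocr_command` and `extract_text` are made up for illustration.

```ruby
require "open3"

# Build the argument list for the Tesseract command-line tool.
# Passing "stdout" as the output base makes tesseract print the
# recognized text instead of writing it to a .txt file.
def ocr_command(image_path, lang: "eng")
  ["tesseract", image_path, "stdout", "-l", lang]
end

# Run OCR on an image and return the recognized text.
# Assumption: the `tesseract` executable is installed and on the PATH.
def extract_text(image_path, lang: "eng")
  stdout, stderr, status = Open3.capture3(*ocr_command(image_path, lang: lang))
  raise "tesseract failed: #{stderr.strip}" unless status.success?

  stdout.strip
end
```

    With a scanned page saved as, say, `scan.jpg` (a hypothetical file name), `puts extract_text("scan.jpg")` would print whatever text Tesseract recognizes. Recognition quality depends heavily on the scan resolution and contrast, so the result may need cleanup.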