Search code examples
centosinstallationocrtesseract

Installing Tesseract-OCR on CentOS 6


I'm trying to install Tesseract-OCR on my server however when I install all what I believe to be the correct repos. When I try to install it the package is not found

I tried adding rpmforge but to no avail. Any ideas from somebody that has done before or is familiar with adding and searching through repos?


Solution

  • I used these instructions which worked correctly in Centos

    Install Tesseract OCR libs from sources in Centos

    Download Leptonica and Teseract sources:

    $ wget http://www.leptonica.org/source/leptonica-1.69.tar.gz
    $ wget https://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.02.tar.gz
    

    Configure, compile, install libs:

     $ tar xzvf leptonica-1.69.tar.gz      
     $ cd leptonica-1.69      
     $ ./configure
     $ make
     $ sudo make install
    
     $ tar xzf tesseract-ocr-3.02.02.tar.gz
     $ cd tesseract-3.01
     $ ./autogen.sh
     $ ./configure
     $ make
     $ sudo make install
     $ sudo ldconfig
    

    Download languages (english) and copy to tessdata folder:

    $ wget http://tesseract-ocr.googlecode.com/files/tesseract-ocr-3.02.eng.tar.gz       
    $ tar xzf tesseract-ocr-3.02.eng.tar.gz       
    $ sudo cp tesseract-ocr/tessdata/* /usr/local/share/tessdata
    

    and enjoy it ;)