
Tesseract not detecting letters


I'm trying to OCR this picture:

[image: this picture]

using Python Tesseract (pytesseract), but I get no output except a blank line, which I think means that it did not detect any letters.

The code I am using is this:

name = pytesseract.image_to_string(imo, lang='eng', config='--psm 13')

I am using Python 3.6.5 and Tesseract 4.0.0-beta.1 on Ubuntu 18.04.1 LTS.


Solution

  • The steps should be:

    step 1 - Add an extra white border around the image

    step 2 - Binarize it using Otsu's method (or another method) to get the best possible binarization

    step 3 - Then send it to Tesseract with psm 10 (treat the image as a single character)
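
A rough sketch of those three steps might look like the following. It is not the answerer's code: the function names, the 20-pixel border width, and the use of plain NumPy for padding and Otsu thresholding are assumptions made for illustration (OpenCV's `cv2.copyMakeBorder` and `cv2.threshold` with `cv2.THRESH_OTSU` would do the same job). It also assumes dark text on a light background and a Tesseract install that pytesseract can find.

```python
import numpy as np


def add_white_border(gray, border=20):
    """Step 1: pad a 2-D grayscale uint8 image with white (255) on every side."""
    return np.pad(gray, border, mode="constant", constant_values=255)


def otsu_threshold(gray):
    """Return the gray level that maximizes the between-class variance (Otsu)."""
    hist = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    prob = hist / hist.sum()
    omega = np.cumsum(prob)                  # weight of the "dark" class up to each level
    mu = np.cumsum(prob * np.arange(256))    # cumulative mean up to each level
    mu_total = mu[-1]
    denom = omega * (1.0 - omega)
    with np.errstate(divide="ignore", invalid="ignore"):
        sigma_b = (mu_total * omega - mu) ** 2 / denom
    sigma_b[denom == 0] = 0.0                # levels where one class is empty
    return int(np.argmax(sigma_b))


def binarize(gray):
    """Step 2: pixels above the Otsu threshold become white, the rest black."""
    t = otsu_threshold(gray)
    return np.where(gray > t, 255, 0).astype(np.uint8)


def ocr_single_char(path, border=20):
    """Step 3: run Tesseract in single-character mode (--psm 10)."""
    # Imported here so the preprocessing helpers work without Tesseract installed.
    from PIL import Image
    import pytesseract

    gray = np.array(Image.open(path).convert("L"))
    prepared = binarize(add_white_border(gray, border))
    return pytesseract.image_to_string(
        Image.fromarray(prepared), lang="eng", config="--psm 10"
    )
```

Calling `ocr_single_char("letter.png")` (the file name is a placeholder) would return whatever Tesseract recognizes. If the image holds more than one character, a different `--psm` value is probably a better fit than 10.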