Tags: python, image, ocr, tesseract, image-segmentation

How to process an image for Tesseract?


I am trying to read characters from a low resolution image using Tesseract:

[image: low-resolution input]

Following the suggestions from here, I tried adaptive thresholding, but the results are not convincing.

[image: adaptive thresholding result]
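
For readers unfamiliar with the technique, here is a minimal sketch of what mean-based adaptive thresholding does. It is not the code I ran: the function name, the `block_size` and `c` parameters, and the list-of-rows image format are assumptions made for illustration, and borders are handled by simple clipping. In practice one would more likely call OpenCV's `cv2.adaptiveThreshold`, which treats borders differently.

```python
def adaptive_mean_threshold(gray, block_size=3, c=2):
    """Binarize a grayscale image given as a list of rows of 0-255 ints.

    Each pixel is compared with the mean of its block_size x block_size
    neighbourhood (clipped at the image borders) minus the constant c.
    Illustrative sketch only, not an optimized implementation.
    """
    if block_size < 1 or block_size % 2 == 0:
        raise ValueError("block_size must be a positive odd integer")
    height, width = len(gray), len(gray[0])
    radius = block_size // 2
    result = []
    for y in range(height):
        row = []
        for x in range(width):
            # Collect the neighbourhood, clipping at the image borders.
            window = [
                gray[j][i]
                for j in range(max(0, y - radius), min(height, y + radius + 1))
                for i in range(max(0, x - radius), min(width, x + radius + 1))
            ]
            local_mean = sum(window) / len(window)
            # Pixels brighter than the local threshold become white.
            row.append(255 if gray[y][x] > local_mean - c else 0)
        result.append(row)
    return result


if __name__ == "__main__":
    sample = [
        [100, 100, 100],
        [100, 20, 100],
        [100, 100, 100],
    ]
    for line in adaptive_mean_threshold(sample):
        print(line)
```

Because the threshold is computed per neighbourhood, uneven lighting matters less than with a single global threshold, but on very small characters the neighbourhood can be as large as the strokes themselves, which may be part of why the output above looks poor.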

I also tried adapting code from here, with no more success.

Do you have any ideas for improving the pre-processing?


Solution

  • In the end, I implemented an OCR model using Keras with a CTC loss function. The results are great!

    You can find details of the OCR model on the Keras GitHub and an explanation of the method here.
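
    To give an idea of how a CTC-trained model turns its per-frame output into text, here is a minimal sketch of greedy CTC decoding in plain Python. It is not the Keras example itself: the function name, the `alphabet` argument, and the choice of the last index as the blank label are assumptions made for illustration.

```python
def ctc_greedy_decode(frame_scores, alphabet, blank_index=None):
    """Greedy CTC decoding sketch.

    frame_scores: one list of class scores per time step (frame).
    alphabet: string whose i-th character is the label for class i.
    blank_index: index of the CTC blank; assumed here to be the class
                 right after the last alphabet character.
    """
    if blank_index is None:
        blank_index = len(alphabet)
    # Pick the highest-scoring class at every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    decoded = []
    previous = None
    for index in best_path:
        # Collapse consecutive repeats, then drop blanks.
        if index != previous and index != blank_index:
            decoded.append(alphabet[index])
        previous = index
    return "".join(decoded)


if __name__ == "__main__":
    scores = [
        [0.90, 0.05, 0.05],  # a
        [0.80, 0.10, 0.10],  # a (repeat, collapsed)
        [0.10, 0.10, 0.80],  # blank
        [0.70, 0.20, 0.10],  # a (new character after the blank)
        [0.10, 0.80, 0.10],  # b
        [0.20, 0.70, 0.10],  # b (repeat, collapsed)
    ]
    print(ctc_greedy_decode(scores, "ab"))
```

    The blank label is what lets the network emit the same character twice in a row, and it is also why no per-character segmentation of the image is needed before recognition.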