Search code examples
ocrtesseract

tesseract: Recognize plain multi-digit number


for some strange reason, tesseract is not able to recognize the following image. I tried various config options such as:

  • --psm 13: "Treat the image as a single text line"
  • tessedit_char_whitelist=012345678iI': Only allow numbers (and i's that can be replaced later).

This is the image:

plain number

Maybe it's my preprocessing, but to me the picture looks good (I also tried increasing the borders around the number). Any advice would be highly appreciated! Couldn't find anything helpfull neither Google or SO.

Thanks!


Solution

  • Figured it out: pytesseract.image_to_string(img, config='digits')