Tags: ascii, ocr, xxd

Best OCR For Hexdumps


I am looking for a reliable way to extract the left side of the image below (the hex columns, not the ASCII characters) using OCR. I have a number of images in a similar format, also showing hex data, that I would like to extract.

Would anyone be able to recommend a way to extract the text from these images?

enter image description here

Ideally, the output for the first line would be:

0005C850 00 00 00 00 etc.

and the output for the fourth line would be something like:

0005C8AA ED 93 A7 8E etc.
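To make the target format concrete, here is a minimal sketch of a filter that keeps only OCR output lines matching that shape. It assumes an 8-digit hex offset followed by space-separated two-digit hex bytes, as in the two examples above; `filter_hex_lines` is a hypothetical helper name, and real OCR output may need a looser pattern.

```shell
#!/bin/sh
# Keep only lines that look like "OFFSET BYTE BYTE ...":
# an 8-digit hex offset followed by one or more two-digit hex bytes.
# The offset width is an assumption taken from the examples above.
filter_hex_lines() {
    grep -E '^[0-9A-Fa-f]{8}( [0-9A-Fa-f]{2})+[[:space:]]*$'
}
```

It could be used as `filter_hex_lines < ocr_output.txt`, where `ocr_output.txt` is a placeholder for whatever the OCR tool writes. Lines containing a misread character, such as the letter `O` in place of `0`, are dropped, which makes them easy to spot by comparing line counts.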


Solution

  • A program that does exactly this already exists: https://github.com/eighttails/ProgramListOCR. It is for Windows systems, so I ran it in a VM.

    It was hard to find.

    First, scale your image up to 400% of its original size:

    convert source.png -resize 400% source.png

    Make it grayscale:

    convert source.png -colorspace Gray destination.png

    Convert it to a TIFF file:

    convert destination.png destination.tiff

    Then process the TIFF file with ProgramListOCR.
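The three ImageMagick steps above can probably be folded into a single call, which is handy when there are many images to prepare. The sketch below assumes ImageMagick's `convert` is on the PATH (on ImageMagick 7 the command is `magick`); `prep_for_ocr` is a hypothetical helper name and is not part of ProgramListOCR.

```shell
#!/bin/sh
# Scale up to 400%, convert to grayscale, and write a TIFF in one call.
# The output name is the input name with its extension replaced by .tiff.
prep_for_ocr() {
    src=$1
    dst=${src%.*}.tiff
    convert "$src" -resize 400% -colorspace Gray "$dst"
}
```

To preprocess every PNG in the current directory, something like `for f in *.png; do prep_for_ocr "$f"; done` should work. The original images are left untouched, since the result is written to a separate `.tiff` file.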