I'm developing for UWP, Windows has an OCR engine: Windows.Media.Ocr
My question is: someone knows if the Windows OCR can be trained to recognize new characters or use a custom font? if yes, how i can do this?
what i want to achieve is to recognize non alphabetical symbols. I want to recognize per example the character ⌰ (unicode: U+2330) or ⌖ (U+2316).
The characters that i want to recognize are symbols not for any language.
I used Windows.Media.Ocr
library in my WUP application and here some test result with different font
Font - Arial
Test Words - Hello @ World
Expected Result - Hello @ World
Original Result - Hello @ World
Accuracy - 100%
Font - Agency FB
Test Words - Hello @ World
Expected Result - Hello @ World
Original Result - Hello World
Accuracy - 84.6% (Missed - @ symbol and one space)
Font - Modern
Test Words - Hello @ World
Expected Result - Hello @ World
Original Result - Hello @ world
Accuracy - 92.3% (W recognised as w)
Font - Lucida Handwriting
Test Words - Hello @ World
Expected Result - Hello @ World
Original Result - HeUe@ worw
Accuracy - 46.1%
Font - Arial Unicode MS
Test Symbols - ⌰ ⌖
Expected Result - ⌰ ⌖
Original Result - (Unable to Recognize)
Accuracy - 0%
Hope this helpful to you.