Search code examples
c#uwpwindows-phonewin-universal-appocr

Can Windows ocr recognize custom symbols/font?


I'm developing for UWP, Windows has an OCR engine: Windows.Media.Ocr

My question is: someone knows if the Windows OCR can be trained to recognize new characters or use a custom font? if yes, how i can do this?

what i want to achieve is to recognize non alphabetical symbols. I want to recognize per example the character ⌰ (unicode: U+2330) or ⌖ (U+2316).

The characters that i want to recognize are symbols not for any language.


Solution

  • I used Windows.Media.Ocr library in my WUP application and here some test result with different font


    Arial

    Font - Arial
    Test Words - Hello @ World
    Expected Result - Hello @ World
    Original Result - Hello @ World
    Accuracy - 100%

    enter image description here


    Agency FB

    Font - Agency FB
    Test Words - Hello @ World
    Expected Result - Hello @ World
    Original Result - Hello World
    Accuracy - 84.6% (Missed - @ symbol and one space)

    enter image description here


    Modern

    Font - Modern
    Test Words - Hello @ World
    Expected Result - Hello @ World
    Original Result - Hello @ world
    Accuracy - 92.3% (W recognised as w)

    enter image description here


    Lucida Handwriting

    Font - Lucida Handwriting Test Words - Hello @ World
    Expected Result - Hello @ World
    Original Result - HeUe@ worw
    Accuracy - 46.1%
    enter image description here


    Update [1]

    Arial Unicode MS

    Font - Arial Unicode MS
    Test Symbols - ⌰ ⌖
    Expected Result - ⌰ ⌖
    Original Result - (Unable to Recognize)
    Accuracy - 0%

    enter image description here


    Update 2

    enter image description here

    Hope this helpful to you.