I have two problems.
...
private TesseractEngine tesseract = new TesseractEngine(path, "eng", EngineMode.LstmOnly);
....
using (var page = tesseract.Process(image, rec, PageSegMode.Auto))
{
text= page.GetText(); // returns the ocr text of the whole rectangle
confidence = page.GetMeanConfidence(); // returns the confidence for the whole word.
List<System.Drawing.Rectangle> rectangles = page.GetSegmentedRegions(PageIteratorLevel.Symbol); //returns each character reactangle of the word.
}
Thanks for your help! :)
You'd need to obtain ResultIterator
object (via page.GetIterator()
method) and then operate on it at PageIteratorLevel.Symbol
level. Check PageSerializer class for example.