Im about to start working with tesseract, tess4j to be exact, and im going through the api docs. I havent come across any way to read from a webpage.
Basically the program opens a webpage that is just an image. I want tess4j to read the image from the page and convert it to words. If tess4j cant do this is there any other java ocrs that would be helpful, preferably without downloading the image?
Appreciate the help.
Reading and downloading are synonymous. If you are looking into reading an image from a web-page without opening it in a web-page, I would suggest looking into the "curl" command and it's equivalent in Java. After getting the image with the aforementioned command, it can then be parsed with Tesseract.