I was wondering how the google cloud vision works behind the scenes. What kind of algorithms are used for processing the images? Is there some texts explaining this?
Thanks to all
This might be a late answer, but adding it for future reference. As mentioned above the OCR is done through deep learning, especially the Convolutional Neural Networks. Do go though this blog by drop box on how they implemented their own OCR engine with similar workings.