computer-vision image-recognition object-detection

How to get the position of a custom object in an image using a vision recognition API

I know there are many vision recognition APIs, such as Clarifai, Watson, Google Cloud Vision, and Microsoft Cognitive Services, that recognize image content. The response from these services is simple JSON that maps tags to confidence scores, for example:

{
   "man": 0.9969295263290405,
   "portrait": 0.9949591159820557,
   "face": 0.9261120557785034
}
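
To make the shape of such a response concrete, here is a minimal sketch of reading it in Python. It assumes the payload is a flat tag-to-score mapping as in the example above; the function name `confident_tags` and the 0.95 threshold are illustrative choices, not part of any of the listed APIs.

```python
import json

# Tag-to-confidence mapping in the shape shown above
# (keys are quoted here so that the text is valid JSON).
response_text = (
    '{"man": 0.9969295263290405, '
    '"portrait": 0.9949591159820557, '
    '"face": 0.9261120557785034}'
)

def confident_tags(text, threshold=0.95):
    """Return the tags scoring at least `threshold`, highest score first."""
    scores = json.loads(text)
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [tag for tag, score in ranked if score >= threshold]

print(confident_tags(response_text))
```

Note that nothing in this kind of response says where in the image each tag was found.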

The problem is that I need to know not only what is in the image but also where that object is. Some of these APIs have such a feature, but only for face detection.
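
To illustrate what I am after, here is a rough sketch of the kind of result I would like to get back: a label, a score, and a bounding box. The `Detection` class, its field names, and the normalized-coordinate convention (fractions of image width and height, origin at the top-left corner) are all assumptions made up for illustration; they do not describe the response format of any particular service.

```python
from dataclasses import dataclass

@dataclass
class Detection:
    """Hypothetical detection result; field names are illustrative only."""
    label: str
    score: float
    # Assumed convention: box corners as fractions of the image size,
    # with the origin at the top-left corner of the image.
    x_min: float
    y_min: float
    x_max: float
    y_max: float

def to_pixel_box(det, image_width, image_height):
    """Convert the normalized box to (left, top, width, height) in pixels."""
    left = round(det.x_min * image_width)
    top = round(det.y_min * image_height)
    right = round(det.x_max * image_width)
    bottom = round(det.y_max * image_height)
    return left, top, right - left, bottom - top

# Example values invented for this sketch.
det = Detection("face", 0.93, 0.25, 0.10, 0.75, 0.60)
print(to_pixel_box(det, 640, 480))
```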

So, does anyone know of such an API, or do I need to train my own Haar cascades in OpenCV for every object?

I would be very grateful for any information.

Solution

You could take a look at Wolfram Cloud/Mathematica.

It has the ability to detect object locations in a picture.

Some examples:

Detecting road signs.
Finding Waldo.
Object tracking in video.
