I am trying to use Vision API to extract the caption for an image
Looks like the Vision API does not provide the caption about the image. It provides the list of objects in the Image.
Looks like IBM Caption Generator provides the Caption for the image.
As of today there is no captioning option in Google Cloud Vision API
Request is taken into consideration by google team to (may) develop this as a feature