Google’s computer vision technology is now so good it’s able to find specific objects within a video or group of videos.
During the company’s Cloud Next event in San Francisco Wednesday, Google unveiled its new Cloud Video Intelligence API. Until now, most image recognition APIs available in the cloud have focused only on still images, but with this new API, developers will be able to build applications that let users search for and discover information in videos. That means you can search for “dog” or “flower,” for example.
Besides extracting metadata, the API allows you to tag scene changes in a video.
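To make the search idea concrete, here is a minimal Python sketch of how an application might look up a label such as “dog” across a group of videos once labels have been extracted. It does not call Google’s API. The `LabelSegment` record, the `search_label` helper, the file names, and the sample values are all hypothetical and stand in for whatever the real service returns.

```python
from dataclasses import dataclass


@dataclass
class LabelSegment:
    """Hypothetical record: one label detected in one stretch of a video."""
    video: str         # file name of the video (made up for this sketch)
    label: str         # detected entity, e.g. "dog"
    start_s: float     # segment start, in seconds
    end_s: float       # segment end, in seconds
    confidence: float  # detection confidence between 0 and 1


def search_label(segments, query, min_confidence=0.5):
    """Return segments whose label matches the query, ordered by video and start time."""
    wanted = query.strip().lower()
    hits = [
        s for s in segments
        if s.label.lower() == wanted and s.confidence >= min_confidence
    ]
    return sorted(hits, key=lambda s: (s.video, s.start_s))


# Illustrative data only; not real output from any service.
SAMPLE = [
    LabelSegment("park.mp4", "dog", 12.0, 19.5, 0.92),
    LabelSegment("park.mp4", "flower", 3.0, 8.0, 0.81),
    LabelSegment("garden.mp4", "flower", 0.0, 15.0, 0.88),
    LabelSegment("garden.mp4", "dog", 40.0, 42.0, 0.31),
]

for hit in search_label(SAMPLE, "dog"):
    print(f"{hit.video}: {hit.label} from {hit.start_s}s to {hit.end_s}s")
```

The confidence threshold is a design choice for this sketch: it drops the low-confidence “dog” match in `garden.mp4`, so only the segment in `park.mp4` is reported.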
As Fei-Fei Li, chief scientist of AI and Machine Learning at Google Cloud, noted in today’s keynote, the world of pixels goes beyond images. Videos have long been a challenge for machine learning researchers. This new service, though, makes extracting information from videos as easy as doing the same for images.
In addition, the Cloud Machine Learning Engine, the company’s tool for building custom machine learning models using its TensorFlow framework, is now generally available.
Li also said in today’s keynote that the company wants to democratize the machine learning technologies it has developed in-house. The Vision API is another example of this.