New AI Framework Can Translate Multimodal Inputs, Including Video, To Natural Language

An AI model that can understand real-world situations and communicate with humans in natural language is the holy grail of Artificial Intelligence. Building such agents would require the development of multimodal systems that can understand audio-visual input.

Source : https://analyticsindiamag.com/new-ai-framework-can-translate-multimodal-inputs-including-video-to-natural-language/

Date : February 8, 2021 at 10:54AM

Tag(s) : #AI ENG