From the course: AI and Digital Marketing Trends

Unlock the full course today

Join today to access over 24,900 courses taught by industry experts.

The many modes of multimodal AI

The many modes of multimodal AI

- You've probably been hearing the term multimodal AI, and you may be wondering what it does. Well, multimodal AI refers to generative AI tools that can process or understand more than one type of data. So what are some examples? Well, the one you're probably most familiar with is text to image. When you use words to describe a visual in the head and the AI produces a picture. But there are many other applications too. There are image to image tools where you upload a visual you already have, and using text or voice commands, ask the AI to change it or create something new. Or there are image to video tools where an AI turns a still photo into a moving picture. There's also text to video, where you describe camera movements, lighting, the characters and style of a scene, and boom, there it is. Right now, text to video tools are still in the early stages, so individual scenes are short and can have mistakes like a person…

Contents