Date: Wednesday, May 21
Start Time: 4:50 pm
End Time: 5:20 pm
As artificial intelligence is making rapid strides in use of large language models, the need for multimodality arises in multiple application scenarios. Similar to the way humans use multiple sensory systems to solve problems and arrive at decisions, in many applications AI problem-solving is enriched by using multimodal inputs. In this presentation, we will explore the process of building multimodal applications at scale, focusing on the core aspects of quality dataset creation, multimodal data fusion techniques and model pipelines for enterprise applications. We will also examine the challenges that arise in bringing these applications to production and techniques for addressing these challenges.