In this talk, we’ll explore the evolution and implementation of multimodal generative AI at the edge targeting physical AI applications, including autonomous vehicles and humanoid robots. We’ll outline the challenges and trade-offs for implementing multimodal generative AI on resource-constrained systems and discuss the performance and bandwidth requirements NPUs must meet to support them. Finally, we’ll demonstrate how these advanced multimodal and action-capable models are efficiently accelerated on the Synopsys ARC enhanced NPX6 NPU IP, enabling the next wave of intelligent, adaptive edge devices.

