Traditional computer vision excels at answering “what is in the image,” but many real systems need to know “what is happening” and “what will happen next.” World models address this by maintaining an internal, temporally consistent representation of the environment and using it to predict future states. This talk introduces world models at a conceptual level (state, memory, dynamics and rollout), then grounds the discussion in practical computer vision use cases, including autonomous driving, robotic manipulation, activity understanding and simulation-based planning. We’ll cover common architecture patterns (perception backbones, latent state representations, temporal modeling with RNNs, transformers and state-space models) and explain why world models are typically systems rather than single monolithic networks. We’ll also show how multi-sensor fusion (camera, LiDAR, radar, IMU) enables more robust world representations and why current deployment realities force hybrid edge/cloud designs. Attendees will leave with a clear conceptual picture of what a world model is and a sense of emerging directions for building and using them.
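
To make the state / memory / dynamics / rollout vocabulary concrete, here is a minimal sketch in PyTorch of the pattern most latent world models share: a perception backbone encodes each frame into a latent, a recurrent cell carries the state forward in time, and a rollout loop predicts future latents from actions alone. The module names, dimensions, and the GRU-based, action-conditioned dynamics are illustrative assumptions for this sketch, not the architecture of any specific system covered in the talk.

```python
# Minimal sketch of the state / memory / dynamics / rollout pattern.
# All sizes and design choices here are assumptions for illustration only.
import torch
import torch.nn as nn


class TinyWorldModel(nn.Module):
    def __init__(self, latent_dim=128, action_dim=4):
        super().__init__()
        # Perception backbone: encodes each RGB frame into a latent observation.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, latent_dim),
        )
        # Recurrent dynamics: the hidden state is the model's memory of the scene,
        # updated from the current latent observation and the action taken.
        self.dynamics = nn.GRUCell(latent_dim + action_dim, latent_dim)
        # Prediction head: decodes the state into the next expected latent.
        self.predict_next = nn.Linear(latent_dim, latent_dim)

    def observe(self, frames, actions, state=None):
        """Update the internal state from real observations."""
        B, T = frames.shape[:2]
        if state is None:
            state = torch.zeros(B, self.dynamics.hidden_size, device=frames.device)
        for t in range(T):
            z = self.encoder(frames[:, t])          # "what is in the image"
            state = self.dynamics(torch.cat([z, actions[:, t]], dim=-1), state)
        return state

    def rollout(self, state, future_actions):
        """Predict future latents from actions alone (no new frames)."""
        predictions = []
        for t in range(future_actions.shape[1]):
            z_hat = self.predict_next(state)        # "what will happen next"
            state = self.dynamics(torch.cat([z_hat, future_actions[:, t]], dim=-1), state)
            predictions.append(z_hat)
        return torch.stack(predictions, dim=1)


if __name__ == "__main__":
    model = TinyWorldModel()
    frames = torch.randn(2, 5, 3, 64, 64)           # batch of 5-frame clips
    actions = torch.randn(2, 5, 4)                  # e.g. steering / velocity commands
    state = model.observe(frames, actions)          # build a temporally consistent state
    future = model.rollout(state, torch.randn(2, 10, 4))
    print(future.shape)                             # torch.Size([2, 10, 128])
```

Swapping the GRU cell for a transformer or state-space block changes the memory mechanism but not the overall loop, which is one reason world models in practice are systems of interchangeable components rather than a single monolithic network.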

