Successful deployment of AI on edge platforms requires system-level strategies that account for latency bounds, memory behavior and the orchestration of heterogeneous hardware cores. In this talk, we present techniques for running deep learning pipelines on edge computing platforms, emphasizing the architectural decisions that determine real-time behavior and scalability in production systems. We compare monolithic models with decomposed, multi-stage pipelines, analyzing how partitioning computation across CPUs, GPUs, NPUs and DSPs affects synchronization, memory traffic and worst-case latency. We also explore how SoC topology and memory hierarchy constrain deployment options, and how these constraints influence model structure, compiler behavior and runtime scheduling. Finally, we share concrete deployment patterns for hardware-aware optimization, multicore scheduling and cross-platform execution, prioritizing deterministic behavior over peak throughput.
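
For concreteness, the sketch below illustrates the decomposed, multi-stage pattern referred to above, assuming a simple preprocess → inference → postprocess split. The stage bodies and queue depths are illustrative placeholders standing in for work that a real deployment would dispatch to a DSP, NPU or CPU core; they are not the production scheduling mechanism discussed in the talk.

```python
# Illustrative sketch only: a three-stage pipeline where each stage runs on its
# own worker and passes results downstream through a bounded queue. The bounded
# queues model the synchronization and back-pressure that decomposition
# introduces, which is what makes worst-case latency easier to reason about.
import queue
import threading
import time

def preprocess(frame):
    # Placeholder for e.g. resize/normalize work offloaded to a DSP.
    return frame

def infer(tensor):
    # Placeholder for model execution on an NPU or GPU.
    time.sleep(0.005)  # stand-in for inference latency
    return tensor

def postprocess(result):
    # Placeholder for CPU-side decoding of model outputs.
    return result

def stage(worker, inbox, outbox):
    while True:
        item = inbox.get()
        if item is None:           # sentinel: propagate shutdown downstream
            if outbox is not None:
                outbox.put(None)
            break
        out = worker(item)
        if outbox is not None:
            outbox.put(out)        # blocks when the next stage falls behind

# Bounded queues keep buffering (and therefore memory traffic) fixed and expose
# stalls as back-pressure instead of unbounded growth.
q_pre, q_inf, q_post = (queue.Queue(maxsize=2) for _ in range(3))

workers = [
    threading.Thread(target=stage, args=(preprocess, q_pre, q_inf)),
    threading.Thread(target=stage, args=(infer, q_inf, q_post)),
    threading.Thread(target=stage, args=(postprocess, q_post, None)),
]
for w in workers:
    w.start()

for frame_id in range(10):         # feed a few frames, then shut down
    q_pre.put(frame_id)
q_pre.put(None)
for w in workers:
    w.join()
```

The design choice the sketch highlights is deliberate: small, fixed queue capacities trade peak throughput for bounded buffering and predictable stall behavior, which mirrors the talk's emphasis on deterministic behavior over raw throughput.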

