We’re all trying to move LLM-class intelligence onto edge cameras and sensors so video can be understood—and acted on—in real time. In this talk, we’ll explain why this is hard by contrasting the compute and memory behavior of CNNs versus LLMs/VLMs. CNNs exploit locality, weight sharing and predictable memory reuse, so compute scales roughly linearly with pixel count and maps cleanly onto today’s NPUs. Transformers shift the bottleneck to quadratic attention (O(N²) compute and memory), high-bandwidth KV-cache traffic and large, dense matmuls—turning memory pipelines, not TOPS, into the limiting factor for latency, power and cost. We’ll then survey practical paths forward: quantized small models, hybrid CNN–transformer pipelines, hardware-aware attention (e.g., FlashAttention and sparsity) and constrained task-specific models that reduce sequence length or operate in compressed domains. We’ll close with survey results on real-world use cases, chipset readiness and deployment pain points.
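The linear-versus-quadratic contrast above can be made concrete with a back-of-envelope FLOP sketch. All dimensions below (channel counts, sequence lengths, head size) are illustrative assumptions chosen for round numbers, not measurements of any particular model:

```python
# Back-of-envelope compute estimates: CNN layers scale linearly with
# pixel count, while self-attention scales quadratically with sequence
# length. Dimensions here are illustrative assumptions only.

def cnn_layer_flops(h, w, c_in, c_out, k=3):
    """Conv layer: one k x k MAC stencil per output pixel, so cost is
    linear in the number of pixels (h * w)."""
    return 2 * h * w * c_in * c_out * k * k  # 2 FLOPs per MAC

def attention_flops(n, d):
    """Self-attention: the QK^T and (softmax)V matmuls are each
    O(n^2 * d), so cost is quadratic in sequence length n."""
    return 2 * 2 * n * n * d  # two n x n x d matmuls, 2 FLOPs per MAC

# Doubling image height doubles conv cost (linear scaling)...
print(cnn_layer_flops(512, 512, 64, 64) / cnn_layer_flops(256, 512, 64, 64))
# ...but doubling sequence length quadruples attention cost.
print(attention_flops(2048, 128) / attention_flops(1024, 128))
```

The same quadratic term applies to attention's memory footprint: the n × n score matrix is what FlashAttention avoids materializing, which is why it appears in the talk's list of hardware-aware mitigations.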

