Vision-language models (VLMs) enable “world understanding” tasks such as natural-language queries, zero-shot reasoning, and richer scene context, going beyond traditional CNN pipelines. But bringing VLMs to the edge is primarily a systems problem: selecting a model you can govern (license and data provenance), adapting it efficiently (prompting, quantization, LoRA-style fine-tuning, preference optimization), and fitting it into real device constraints, where bandwidth and memory, not TOPS, determine feasibility. This talk frames VLM deployment as an accuracy-latency-cost trade-off, then presents a practical decision framework for matching VLM architectures (vision encoder + projector + language model) to edge hardware. Attendees will leave with a selection matrix for NPUs vs. GPUs vs. hybrid pipelines, plus concrete rules of thumb for memory budgeting and identifying throughput bottlenecks in real deployments.
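
To illustrate the kind of memory-budgeting and bandwidth arithmetic the abstract refers to, here is a minimal sketch; the model sizes, quantization bits, and bandwidth figures are illustrative assumptions, not numbers from the talk.

```python
# Rough edge-VLM memory and throughput budget (illustrative numbers only).
# Rule of thumb: autoregressive decode is memory-bandwidth-bound, so the
# tokens/s ceiling is roughly bandwidth / bytes streamed per token
# (weights + KV cache), largely independent of the accelerator's TOPS.

def weight_bytes(params_billion: float, bits_per_weight: int) -> float:
    """Memory to hold the weights, e.g. a 7B LM at 4-bit is ~3.5 GB."""
    return params_billion * 1e9 * bits_per_weight / 8

def kv_cache_bytes(layers: int, kv_heads: int, head_dim: int,
                   seq_len: int, bytes_per_elem: int = 2) -> float:
    """KV cache for one sequence: 2 (K and V) * layers * kv_heads * head_dim * seq_len."""
    return 2 * layers * kv_heads * head_dim * seq_len * bytes_per_elem

def decode_tokens_per_sec(bandwidth_gb_s: float, bytes_per_token: float) -> float:
    """Upper bound on decode speed when each token must stream weights + KV cache."""
    return bandwidth_gb_s * 1e9 / bytes_per_token

# Hypothetical 7B language model + small vision encoder, 4-bit weights, 2K context.
lm = weight_bytes(7, 4)                # ~3.5 GB
vision = weight_bytes(0.4, 8)          # ~0.4 GB encoder kept at 8-bit
kv = kv_cache_bytes(layers=32, kv_heads=8, head_dim=128, seq_len=2048)  # ~0.27 GB
print(f"Resident memory: {(lm + vision + kv) / 1e9:.1f} GB")

# Assumed ~60 GB/s LPDDR on an embedded SoC vs ~900 GB/s on a discrete GPU:
for bw in (60, 900):
    print(f"{bw} GB/s -> ~{decode_tokens_per_sec(bw, lm + kv):.0f} tokens/s ceiling")
```

Under these assumptions the model fits in roughly 4 GB but decodes at only a few tens of tokens per second on embedded memory bandwidth, which is why the abstract stresses bandwidth and memory over raw compute.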

