Natural language video search—“find the man in the green hoodie riding a bike”—can turn a camera network from a passive recorder into a proactive investigation tool. But brute-force approaches fail: cloud processing of streaming video is cost- and bandwidth-prohibitive at scale, while running a vision-language model (VLM) on every frame on-device blows through power and latency budgets. This talk presents an implementation case study of a hierarchical pipeline that makes natural language video search practical across multiple edge platforms. We combine a 30-fps detector (for gatekeeping and attribute filtering) with distributed semantic scoring that promotes only relevant regions, then selectively run a VLM “reasoner” on cropped regions of interest to produce high-precision descriptions. We’ll cover the overall solution architecture, model choices and trade-offs, and implementation pitfalls and resource management techniques (model residency, zero-copy, ring buffers). Attendees will leave with a reference architecture and concrete optimization tactics for deployable, real-time VLM-based edge systems.
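
To make the gating concrete, here is a minimal sketch of the hierarchy described above, assuming three stages: a cheap per-frame detector, a lightweight semantic scorer on promoted detections, and a VLM reasoner invoked only on crops that pass both gates. All names, stage implementations, and thresholds (`run_detector`, `semantic_score`, `vlm_describe`, `det_thresh`, `sem_thresh`) are hypothetical placeholders for illustration, not the talk’s actual models.

```python
# Sketch of a three-stage hierarchical gating pipeline (hypothetical stubs).
import random
from dataclasses import dataclass


@dataclass
class Detection:
    box: tuple[int, int, int, int]  # (x, y, w, h) in pixels
    label: str                      # coarse detector class, e.g. "person"
    score: float                    # detector confidence


def run_detector(frame) -> list[Detection]:
    # Stage 1 (every frame, ~30 fps): cheap detector acts as gatekeeper
    # and attribute filter. Placeholder returning a dummy detection.
    return [Detection((10, 20, 64, 128), "person", random.random())]


def semantic_score(frame, det: Detection, query: str) -> float:
    # Stage 2 (promoted detections only): lightweight similarity between
    # the query and the cropped region. Placeholder value here.
    return random.random()


def vlm_describe(crop, query: str) -> str:
    # Stage 3 (rare): full VLM reasoner on a cropped region of interest.
    return "a man in a green hoodie riding a bike"  # placeholder output


def process_frame(frame, query: str,
                  det_thresh: float = 0.5,
                  sem_thresh: float = 0.7) -> list[str]:
    """Most detections exit at gate 1 or 2; only the few crops that pass
    both gates pay the VLM's latency and power cost."""
    results = []
    for det in run_detector(frame):
        if det.score < det_thresh:                          # gate 1
            continue
        if semantic_score(frame, det, query) < sem_thresh:  # gate 2
            continue
        crop = frame  # stand-in for cropping frame to det.box
        results.append(vlm_describe(crop, query))
    return results


if __name__ == "__main__":
    print(process_frame(frame=None, query="man in a green hoodie on a bike"))
```

The point of the structure is amortization: the expensive VLM call runs on a small fraction of regions, so the per-camera budget is dominated by the cheap detector.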
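
As one illustration of the resource-management techniques named above (not the talk’s code), a pre-allocated frame ring buffer avoids per-frame heap allocation and hands consumers NumPy views without copying; `FrameRingBuffer` and its dimensions are assumptions for the sketch.

```python
# Fixed-size frame ring buffer with zero-copy reads (illustrative sketch).
import numpy as np


class FrameRingBuffer:
    def __init__(self, capacity: int, height: int, width: int):
        # One contiguous pre-allocated block; slots are reused in place.
        self.frames = np.zeros((capacity, height, width, 3), dtype=np.uint8)
        self.capacity = capacity
        self.head = 0   # next slot to write
        self.count = 0  # frames currently held (<= capacity)

    def push(self, frame: np.ndarray) -> None:
        # Copy into the pre-allocated slot; the oldest frame is overwritten.
        self.frames[self.head][...] = frame
        self.head = (self.head + 1) % self.capacity
        self.count = min(self.count + 1, self.capacity)

    def latest(self) -> np.ndarray:
        # Return a view (no copy) of the most recently written frame.
        assert self.count > 0, "buffer is empty"
        return self.frames[(self.head - 1) % self.capacity]


# Usage: the capture loop pushes; the detector stage reads the latest view.
buf = FrameRingBuffer(capacity=8, height=720, width=1280)
buf.push(np.zeros((720, 1280, 3), dtype=np.uint8))
frame_view = buf.latest()  # zero-copy view into the ring
```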

