Multimodal LLMs promise to bring exciting new abilities to devices! As foundation models become more capable, their compute requirements grow as well. It is not uncommon to see LLMs grow to tens of billions of parameters, at […]
Many computer vision projects reach proof of concept but stall before production due to high costs, deployment challenges and infrastructure complexity. This session explores the path from prototype to production, focusing on how to reduce the cost of vision workloads […]
Large language models (LLMs) often demand hand-coded conversion scripts for deployment on each distinct processor-specific software stack—a process that’s time-consuming and prone to error. In this session, we introduce a model-agnostic approach designed to streamline LLM deployment, especially for the […]
At the embedded edge, choices of language model architectures have profound implications on the ability to meet demanding performance, latency and energy efficiency requirements. In this presentation, we contrast state-space models (SSMs) with transformers for use in this constrained regime. […]
Quadric’s Chimera general-purpose neural processor executes complete AI/ML graphs—all layers, including pre- and post-processing functions traditionally run on separate DSP processors. To enable this, the Chimera Graph Compiler processes and optimizes a combination of NN graphs, Python code and C++ […]
As humans, when we look at a scene our first impressions are sometimes wrong; we need to take a second look, to squint and reassess. Squinting enables us to focus our attention on the subject we are investigating and often […]
In this talk we’ll present Moonshine, a speech-to-text model that is five times faster than OpenAI’s Whisper. Leveraging this efficiency, we’ll show how to build a voice interface on a low-cost, resource-constrained Cortex-A SoC using […]
As large language models (LLMs) and vision-language models (VLMs) have quickly become important for edge applications from smartphones to automobiles, chipmakers and IP providers have struggled with how to adapt processor software stacks. In this talk, Expedera’s Ram Tadishetti examines […]
True innovation in tiny machine learning (tinyML) emerges from a synergy between software ingenuity, real-world application insights and leading-edge processor IP. In this presentation, we will explore the process of integrating these elements to shape the design of our latest […]
The deployment of neural networks near sensors brings well-known advantages such as lower latency, privacy and reduced overall system cost—but also brings significant challenges that complicate development. These challenges can be addressed effectively by choosing the right solution and design […]