Machine learning models are increasingly being deployed at the edge, and these models are growing larger. As a result, we are running into the constraints of edge devices: bandwidth, performance, and power. One way to reduce ML computation demands and increase power efficiency is quantization, a set of techniques that reduce the number of bits needed to represent model parameters and activations, and hence reduce bandwidth, computation, and storage requirements. Qualcomm® Snapdragon™ SoCs provide a robust hardware platform for deploying ML applications in embedded and mobile devices. Many Snapdragon SoCs incorporate the Qualcomm Artificial Intelligence Engine, which combines hardware and software components to accelerate on-device ML. In this talk, we will explore the performance and accuracy offered by the accelerator cores within the AI Engine. We will also highlight the tools and techniques Qualcomm offers for developers targeting these cores, using intelligent quantization to deliver high performance with low power consumption while maintaining algorithm accuracy.
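To make the core idea concrete, here is a minimal sketch of affine (asymmetric) 8-bit quantization, the basic building block behind many of these techniques. This is an illustrative NumPy example, not Qualcomm's toolchain: the function names and the uint8 range are assumptions chosen for the demo.

```python
import numpy as np

def quantize_uint8(x):
    """Affine quantization: map float values to uint8 via a scale and zero point.
    Illustrative sketch only; production tools choose scale/zero-point per layer
    or per channel, often from calibration data."""
    x_min, x_max = float(x.min()), float(x.max())
    scale = (x_max - x_min) / 255.0
    if scale == 0.0:           # constant input: any scale works
        scale = 1.0
    zero_point = int(round(-x_min / scale))
    q = np.clip(np.round(x / scale) + zero_point, 0, 255).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate float values from the quantized representation."""
    return (q.astype(np.float32) - zero_point) * scale

# 8-bit storage uses 4x less memory and bandwidth than float32,
# at the cost of a small, bounded rounding error.
weights = np.array([-1.0, -0.5, 0.0, 0.5, 1.0], dtype=np.float32)
q, scale, zp = quantize_uint8(weights)
restored = dequantize(q, scale, zp)
```

Each dequantized value differs from the original by at most about one quantization step (the scale), which is the accuracy/efficiency trade-off that intelligent quantization tooling manages automatically.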