In this presentation, we will describe methods and tools for developing, profiling and optimizing neural network solutions for deployment on Arm MCUs, CPUs and GPUs using Au-Zone’s DeepView ML Toolkit. We’ll introduce the need for optimization to enable efficient deployment of deep learning models, and will highlight the specific challenges of profiling and optimizing models for deployment in cost- and energy-constrained systems. We’ll show how Au-Zone’s DeepView tools can be used in conjunction with Arm’s Streamline tools to gain detailed insights into the performance of neural networks on ARM-based SoCs. Using a facial recognition solution as an example, we will explore how to evaluate, profile and optimize deep learning models on a Cortex-M7 MCU, a Cortex-A73/A53 big.LITTLE CPU and a MALI G-71 GPU.