Where: Room 203/204
Date: Day 3
Start Time: 11:20 am
End Time: 11:50 am
Embedding real-time, large-scale deep learning vision applications at the edge is challenging because of their enormous computational, memory, and bandwidth requirements. System architects can mitigate these demands by modifying deep neural networks (DNNs) to make them more energy-efficient and less demanding of embedded processing hardware. In this talk we'll provide an introduction to today's established techniques for efficient implementation of DNNs: advanced quantization, network decomposition, weight pruning and sharing, and sparsity-based compression. We'll also preview up-and-coming techniques such as trained quantization and correlation-based compression.
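For readers unfamiliar with two of the techniques named above, the sketch below illustrates the basic idea behind uniform weight quantization and magnitude-based weight pruning. It is an illustrative example under simple assumptions (per-tensor symmetric scaling, a fixed sparsity target), not material from the talk itself; the function names and parameters are hypothetical.

```python
import numpy as np

def quantize_uniform(weights, num_bits=8):
    """Uniformly quantize a float weight tensor to signed integers.

    Returns the integer codes plus the scale needed to reconstruct
    approximate float values (dequantization).
    """
    qmax = 2 ** (num_bits - 1) - 1            # e.g. 127 for 8-bit signed
    scale = np.max(np.abs(weights)) / qmax    # map the largest |w| to qmax
    codes = np.round(weights / scale).astype(np.int8 if num_bits <= 8 else np.int32)
    return codes, scale

def dequantize(codes, scale):
    """Recover approximate float weights from integer codes and scale."""
    return codes.astype(np.float32) * scale

def prune_by_magnitude(weights, sparsity=0.5):
    """Zero out the smallest-magnitude weights so that `sparsity`
    fraction of the tensor becomes zero (a simple pruning criterion)."""
    threshold = np.quantile(np.abs(weights), sparsity)
    mask = np.abs(weights) >= threshold
    return weights * mask, mask

# Example: apply both techniques to a random layer and check the error introduced.
w = np.random.randn(256, 128).astype(np.float32)
codes, scale = quantize_uniform(w, num_bits=8)
w_hat = dequantize(codes, scale)
print("quantization max abs error:", np.max(np.abs(w - w_hat)))

w_pruned, mask = prune_by_magnitude(w, sparsity=0.5)
print("fraction of weights zeroed:", 1.0 - mask.mean())
```

In practice such transformations are typically followed by fine-tuning to recover accuracy, and the resulting sparse, low-precision weights are what enable the memory and bandwidth savings discussed in the talk.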