In this session we’ll explain two neural network quantization techniques, quantization-aware training (QAT) and post-training quantization (PTQ), and describe when to use each. We’ll discuss what an efficient implementation of each requires: for example, QAT calls for preparing the model through layer fusion and graph optimization, while PTQ needs a representative calibration dataset. We will highlight the advantages and limitations of each approach and explore model architectures that benefit from QAT and PTQ. We will also present strategies for combining the two techniques and introduce tools such as Brevitas that enable quantization, demonstrating how to optimize neural networks for improved performance and efficiency.
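
To give a flavour of what QAT looks like in practice, below is a minimal sketch using Brevitas quantized layers inside a small PyTorch module. The model, layer sizes, bit widths, and names are illustrative assumptions rather than the session's actual material; the point is only that Brevitas layers drop into an ordinary PyTorch training loop, so weights and activations are fake-quantized while they are trained.

```python
import torch
import torch.nn as nn
import brevitas.nn as qnn  # Brevitas quantized drop-in layers for PyTorch

class TinyQuantNet(nn.Module):
    """Small CNN whose weights and activations are fake-quantized during training (QAT)."""
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.quant_inp = qnn.QuantIdentity(bit_width=8)            # quantize input activations
        self.conv = qnn.QuantConv2d(3, 16, kernel_size=3,
                                    weight_bit_width=4)            # 4-bit weights
        self.relu = qnn.QuantReLU(bit_width=4)                     # 4-bit activations
        self.fc = qnn.QuantLinear(16 * 30 * 30, num_classes,
                                  bias=True, weight_bit_width=4)

    def forward(self, x):
        x = self.quant_inp(x)
        x = self.relu(self.conv(x))
        return self.fc(x.flatten(1))

model = TinyQuantNet()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

# One illustrative training step on random data; a real QAT run iterates over a
# labeled dataset so the quantization scales adapt alongside the weights.
images = torch.randn(8, 3, 32, 32)
labels = torch.randint(0, 10, (8,))
loss = criterion(model(images), labels)
loss.backward()
optimizer.step()
```

A PTQ workflow, by contrast, would start from an already trained floating-point model, fuse adjacent layers where possible, and run a small calibration set through the network to choose the quantization parameters, with no further training.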