Self-compression is a quantization-aware training technique that reduces neural network size and optimizes performance for edge inference. By learning optimal bit depths for weights and activations during training, self-compression achieves significant reductions in memory footprint and bandwidth consumption while maintaining accuracy. The method combines high sparsity with low-bit representations, enabling efficient deployment on CPUs, GPUs, DSPs and NPUs without specialized hardware. Unlike traditional compression approaches, self-compression both removes redundant weights and minimizes the bits required for the remaining parameters. Experiments demonstrate accuracy matching floating-point baselines across applications, including perception CNNs (retaining as few as 3% of the original bits and 18% of the weights) and transformer-based language models (outperforming ternary compression). In this presentation, we explain how self-compression works, its practical implementation and its real-world benefits for embedded systems, offering a simple yet powerful solution for reducing inference costs (execution time, power consumption, bandwidth and memory usage).
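The core mechanism, learning bit depths during training, can be sketched as follows. This is an illustrative sketch, not the presenters' exact implementation: it assumes a standard power-of-two uniform quantizer with a learnable bit depth and exponent, a straight-through estimator for the rounding gradient (noted in comments, not implemented here), and a hypothetical `size_penalty` term that adds the network's size in bits to the training loss.

```python
def quantize(x, bits, exponent):
    # Uniform quantizer with learnable bit depth b and exponent e:
    #   q(x) = 2^e * clamp(round(x / 2^e), -2^(b-1), 2^(b-1) - 1)
    # During training, round() would use a straight-through gradient
    # so that b and e remain learnable alongside the weights.
    step = 2.0 ** exponent
    lo, hi = -(2.0 ** (bits - 1)), 2.0 ** (bits - 1) - 1
    return step * min(max(round(x / step), lo), hi)

def size_penalty(bit_params, counts, gamma=1e-3):
    # Hypothetical regularizer: add the network size in bits to the
    # loss, so training trades accuracy against compression. Tensors
    # whose learned bit depth reaches zero can be removed outright,
    # which is where the sparsity comes from.
    return gamma * sum(b * n for b, n in zip(bit_params, counts))
```

Because the bit-depth parameters are optimized jointly with the task loss, the network itself decides where precision is needed, rather than a fixed bit width being imposed uniformly after training.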

