When converting floating-point networks to low-precision equivalents for high-performance inference, the primary objective is to compress the network as far as possible whilst maintaining fidelity to the original floating-point model. This task becomes particularly challenging when only a reduced or unlabelled dataset is available.
Data may be limited for commercial or legal reasons: for example, companies may be unwilling to share valuable data and labels that represent a substantial investment of resources; or the collector of the original dataset may not be permitted to share it for data privacy reasons. We present a method based on distillation that allows high-fidelity, low-precision networks to be produced for a wide range of network types, using the original trained network in place of a labelled dataset. Our proposed approach is directly applicable across multiple domains (e.g. classification, segmentation and style transfer) and can be adapted to numerous network compression techniques.
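To make the idea concrete, the following is a minimal sketch (not the paper's exact procedure) of distillation-based training for a quantized network in a classification setting, assuming PyTorch: the full-precision teacher supplies soft targets on unlabelled inputs, and the low-precision student is trained to match them. The names `teacher`, `student`, and `distill_step`, and the KL-divergence loss with temperature, are illustrative assumptions rather than the method described above.

```python
# Hypothetical sketch: distilling a full-precision "teacher" into a
# quantized "student" using unlabelled inputs only.
import torch
import torch.nn.functional as F

def distill_step(teacher, student, optimizer, inputs, temperature=1.0):
    """One distillation step: the student matches the teacher's soft outputs."""
    teacher.eval()
    with torch.no_grad():
        teacher_logits = teacher(inputs)   # full-precision outputs act as targets
    student_logits = student(inputs)       # forward pass through the low-precision network
    # KL divergence between softened student and teacher distributions,
    # scaled by T^2 as is standard for temperature-based distillation.
    loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * temperature ** 2
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

Because the loss depends only on the teacher's outputs, the loop above runs over unlabelled (or even synthetic) inputs, which is what allows the original labelled dataset to be dispensed with.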