Date: Wednesday, May 22
Start Time: 2:05 pm
End Time: 2:35 pm
Transformers are a class of neural network models originally designed for natural language processing. They have since proven powerful for visual perception as well, thanks to their exceptional ability to model long-range dependencies in images and to process multimodal data.

Resource constraints are the central challenge when deploying transformers on embedded platforms. The self-attention mechanism demands substantial memory for parameters and intermediate activations, and its intricate computations impose heavy compute requirements. Energy efficiency adds yet another layer of complexity.

Mitigating these challenges requires a multifaceted approach. Quantization eases memory constraints by representing weights and activations at lower precision. Pruning and sparsity techniques remove less critical connections to reduce computation. Knowledge distillation and related techniques transfer knowledge from larger models into compact yet accurate ones. In this talk, Shang-Hung will also discuss hardware accelerators such as NPUs customized for transformer workloads, and software techniques for efficiently mapping transformer models onto such accelerators.
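As a rough illustration of two of the techniques mentioned above, the toy sketch below (not taken from the talk; all names and parameters are hypothetical) applies symmetric int8 quantization and 50% magnitude pruning to a random weight tensor:

```python
import random

random.seed(0)
# Toy stand-in for a transformer weight tensor, flattened to a list.
w = [random.gauss(0.0, 1.0) for _ in range(4096)]

# --- Quantization: map float weights to int8, shrinking storage ~4x ---
max_abs = max(abs(x) for x in w)
scale = max_abs / 127.0                       # one scale for the whole tensor
w_q = [max(-127, min(127, round(x / scale))) for x in w]
w_deq = [q * scale for q in w_q]              # dequantize to check the error
max_err = max(abs(a - b) for a, b in zip(w, w_deq))

# --- Pruning: zero out the smallest-magnitude half of the weights ---
threshold = sorted(abs(x) for x in w)[len(w) // 2]
w_pruned = [x if abs(x) >= threshold else 0.0 for x in w]
sparsity = w_pruned.count(0.0) / len(w)

print(f"max quantization error: {max_err:.4f} (bound: {scale / 2:.4f})")
print(f"sparsity after pruning: {sparsity:.2f}")
```

Per-tensor symmetric quantization bounds the round-trip error by half the scale step; real deployments typically use per-channel scales and calibration data, and pruning is usually followed by fine-tuning to recover accuracy.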