Date: Thursday, September 24, 2020
Start Time: 1:00 pm
End Time: 1:30 pm
With the rapid growth in the size of deep neural networks (DNNs), there has been extensive research on network model compression to improve deployment efficiency. In this talk, we present our work to advance compression beyond weights to neuron activations. We propose a joint regularization technique that simultaneously regularizes the distributions of weights and activations. By distinguishing and leveraging the significant differences among neuron responses and connections during learning, the jointly pruned network (JPnet) optimizes the sparsity of both activations and weights. The resulting deep sparsification exposes more optimization space for existing DNN accelerators that exploit sparse matrix operations. We evaluate the effectiveness of joint regularization on a variety of network models with different activation functions and datasets. Under a 0.4% constraint on inference accuracy degradation, a JPnet can save 72% to 99% of the computation cost, with up to 5.2x and 12.3x reductions in activation and weight counts, respectively.
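For intuition, below is a minimal sketch of a training step with joint weight/activation regularization. It assumes an L1-style penalty on both terms; the model, the lambda_w/lambda_a strengths, and the forward-hook collection of activations are illustrative and not necessarily the exact formulation used in the work.

# Minimal sketch: joint regularization of weights and activations (assumed L1 penalties).
# lambda_w, lambda_a, the toy model, and the hook-based activation capture are illustrative.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))
lambda_w, lambda_a = 1e-5, 1e-5  # regularization strengths (hypothetical values)

activations = []  # filled by forward hooks on the ReLU layers
for m in model.modules():
    if isinstance(m, nn.ReLU):
        m.register_forward_hook(lambda _m, _inp, out: activations.append(out))

optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()

x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))  # dummy batch
activations.clear()
logits = model(x)

# Joint objective: task loss + L1 on weights + L1 on activations,
# pushing both connections and neuron responses toward zero (i.e., prunable).
loss = criterion(logits, y)
loss = loss + lambda_w * sum(p.abs().sum() for p in model.parameters())
loss = loss + lambda_a * sum(a.abs().sum() for a in activations)

optimizer.zero_grad()
loss.backward()
optimizer.step()

After training with such a joint penalty, weights and activations that have been driven to (near) zero can be pruned or skipped at inference time, which is the sparsity that sparse-matrix DNN accelerators can exploit.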