Date: Wednesday, May 24
Start Time: 12:00 pm
End Time: 12:30 pm
One of the main challenges when deploying computer vision models to the edge is optimizing them for speed, memory footprint and energy consumption. In this talk, we’ll provide a comprehensive survey of model compression approaches, which are crucial for harnessing the full potential of deep learning models on edge devices. We’ll explore pruning, weight clustering and knowledge distillation, explaining how these techniques work and how to use them effectively. We’ll also examine inference frameworks, including ONNX Runtime, TFLite and OpenVINO, discussing how these frameworks support model compression and how hardware considerations shape the choice of framework. We’ll conclude with a comparison of the techniques presented, considering implementation complexity and typical efficiency gains.
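As a taste of the pruning material, the following is a minimal sketch of magnitude-based (L1) unstructured pruning using PyTorch’s torch.nn.utils.prune utilities; the tiny stand-in model and the 30% sparsity target are illustrative assumptions, not figures from the talk.

```python
# Minimal sketch: magnitude-based (L1) unstructured pruning in PyTorch.
# The stand-in model and 30% sparsity level are illustrative assumptions.
import torch.nn as nn
import torch.nn.utils.prune as prune

# A small stand-in for a vision backbone.
model = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.Conv2d(16, 32, kernel_size=3, padding=1),
)

# Zero out the 30% of weights with the smallest absolute value in each conv layer.
for module in model.modules():
    if isinstance(module, nn.Conv2d):
        prune.l1_unstructured(module, name="weight", amount=0.3)
        prune.remove(module, "weight")  # bake the pruning mask into the weight tensor

# Report the achieved sparsity per layer.
for name, module in model.named_modules():
    if isinstance(module, nn.Conv2d):
        sparsity = (module.weight == 0).float().mean().item()
        print(f"{name}: {sparsity:.0%} of weights pruned")
```

In practice, pruning is typically interleaved with fine-tuning to recover accuracy, and the zeroed weights only translate into real speed or memory savings when the deployment runtime and hardware can exploit the sparsity, which is exactly the framework and hardware interplay the talk examines.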