An Overview of Deep Neural Network Model Compression

Conference: CAIBDA 2022 - 2nd International Conference on Artificial Intelligence, Big Data and Algorithms
06/17/2022 - 06/19/2022 at Nanjing, China

Proceedings: CAIBDA 2022

Pages: 6Language: englishTyp: PDF

Authors:
Zhang, Xue; Ding, Rui; Mi, Lingchao (Capital Normal University, Beijing, China)

Abstract:
In recent years, with the rapid development of machine learning, deep neural networks have achieved great success in the fields of computer vision and natural language processing. However, accompany with the remarkable performance, came the huge amount of parameters, the high cost of storage and computing. It is a great challenge for the embedded devices, such as autonomous vehicles, robotics and so on. Therefore, model compression and model pruning has become a research hotspot. This paper summarizes the achievements and progress in model compression from the aspects of model pruning, quantization, and lightweight network design. The future research directions in the field of model compression and acceleration are also prospected.