ABSTRACT
The rapid development of deep learning has demonstrated its potential for deployment in many intelligent service systems. However, optimisation issues, such as reducing deployment resource costs and improving inference speed, remain challenging to address, especially in scenarios where only limited resources are available. In this paper, we delve into the principles of deep neural networks, focusing on the importance of individual network neurons. The goal is to identify the neurons that exert minimal impact on model performance, thereby aiding the model pruning process. We consider pruning both with and without a fine-tuning step, ensuring consistent model performance in either setting. To achieve this, we propose a methodology that employs adversarial attack methods to probe deep neural network parameters, combined with a novel attribution algorithm to analyse the degree of each neuron's involvement. Our experiments show that this approach effectively quantifies the importance of network neurons. We extend the evaluation through comprehensive experiments on a range of datasets, including CIFAR-10, CIFAR-100, and Caltech101. The results demonstrate that our method consistently outperforms many existing approaches, achieving state-of-the-art performance. We anticipate that this work will help reduce the heavy training and inference costs of deep neural network models, making lightweight deep-learning-enhanced services and systems possible. The source code is available at https://github.com/LMBTough/FVW.
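The pipeline the abstract describes, perturbing inputs adversarially and then scoring parameters with an attribution signal, can be sketched with a simple first-order (gradient-times-weight) importance proxy. The following is a hypothetical NumPy illustration of the general idea only, not the paper's FVW algorithm: the toy MLP, the FGSM-style perturbation step, and the 50% pruning ratio are all assumptions made for the example.

```python
import numpy as np

# Toy setup: a 2-layer MLP whose weights are scored with a
# first-order |w * dL/dw| importance proxy, then pruned.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 4))          # batch of inputs
y = rng.normal(size=(8, 2))          # regression targets
W1 = rng.normal(size=(4, 16))
W2 = rng.normal(size=(16, 2))

def forward(x, W1, W2):
    h = np.maximum(x @ W1, 0.0)      # ReLU hidden layer
    return h, h @ W2

def grads(x, y, W1, W2):
    """Backprop an MSE loss through the toy MLP by hand."""
    h, out = forward(x, W1, W2)
    d_out = 2.0 * (out - y) / len(x)
    gW2 = h.T @ d_out
    d_h = (d_out @ W2.T) * (h > 0)
    gW1 = x.T @ d_h
    return gW1, gW2

# FGSM-style step: nudge the input along the sign of the loss
# gradient so importance is measured on harder (perturbed) inputs.
h, out = forward(x, W1, W2)
d_out = 2.0 * (out - y) / len(x)
d_x = ((d_out @ W2.T) * (h > 0)) @ W1.T
x_adv = x + 0.05 * np.sign(d_x)

gW1, gW2 = grads(x_adv, y, W1, W2)

def prune(W, g, ratio=0.5):
    """Zero the `ratio` fraction of weights with smallest |w * grad|."""
    score = np.abs(W * g).ravel()
    idx = np.argsort(score)[: int(ratio * score.size)]
    Wp = W.copy().ravel()
    Wp[idx] = 0.0
    return Wp.reshape(W.shape)

W1p = prune(W1, gW1)
W2p = prune(W2, gW2)
print((W1p == 0).mean(), (W2p == 0).mean())  # → 0.5 0.5
```

In a fine-tuning-free setting like the one the abstract mentions, the surviving weights would be used as-is; with fine-tuning, the pruned network would be retrained briefly with the zeroed entries masked out.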
Index Terms
- FVW: Finding Valuable Weight on Deep Neural Network for Model Pruning