TY - GEN
T1 - PLSR: Unstructured Pruning with Layer-wise Sparsity Ratio
AU - Zhao, Haocheng
AU - Yu, Limin
AU - Guan, Runwei
AU - Jia, Liye
AU - Zhang, Junqing
AU - Yue, Yutao
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023/12
Y1 - 2023/12
N2 - In the current era, as multi-modal and large models gradually reveal their potential, neural network pruning has emerged as a crucial means of model compression. It is widely recognized that models tend to be over-parameterized, and pruning removes unimportant weights, improving inference speed while preserving accuracy. From early gradient-based and magnitude-based pruning to modern algorithms such as iterative magnitude pruning, the lottery ticket hypothesis, and pruning at initialization (PaI), researchers have strived to increase the compression ratio of model parameters while maintaining high accuracy. Mainstream algorithms currently prune neural networks globally using various scoring functions, followed by different pruning strategies to enhance the accuracy of the sparse model. Recent studies have shown that random pruning with varying layer-wise sparsity ratios achieves robust results for large models and out-of-distribution data. Based on this discovery, we propose a new score called FeatIO, which is based on module input and output feature map sizes. As a PaI score function, FeatIO surpasses the performance of other PaI score functions. Additionally, we propose a novel pruning strategy called Pruning with Layer-wise Sparsity Ratio (PLSR), which combines layer-wise sparsity ratios with a magnitude-based score function, yielding the best evaluation performance. Almost all algorithms exhibit improved performance when using our novel pruning strategy. The combination of PLSR and FeatIO consistently outperforms the other algorithms in testing, demonstrating the significant potential of our proposed approach. Our code will be available here.
AB - In the current era, as multi-modal and large models gradually reveal their potential, neural network pruning has emerged as a crucial means of model compression. It is widely recognized that models tend to be over-parameterized, and pruning removes unimportant weights, improving inference speed while preserving accuracy. From early gradient-based and magnitude-based pruning to modern algorithms such as iterative magnitude pruning, the lottery ticket hypothesis, and pruning at initialization (PaI), researchers have strived to increase the compression ratio of model parameters while maintaining high accuracy. Mainstream algorithms currently prune neural networks globally using various scoring functions, followed by different pruning strategies to enhance the accuracy of the sparse model. Recent studies have shown that random pruning with varying layer-wise sparsity ratios achieves robust results for large models and out-of-distribution data. Based on this discovery, we propose a new score called FeatIO, which is based on module input and output feature map sizes. As a PaI score function, FeatIO surpasses the performance of other PaI score functions. Additionally, we propose a novel pruning strategy called Pruning with Layer-wise Sparsity Ratio (PLSR), which combines layer-wise sparsity ratios with a magnitude-based score function, yielding the best evaluation performance. Almost all algorithms exhibit improved performance when using our novel pruning strategy. The combination of PLSR and FeatIO consistently outperforms the other algorithms in testing, demonstrating the significant potential of our proposed approach. Our code will be available here.
KW - Layer-wise Sparsity
KW - Model Compression
KW - Pruning
KW - Unstructured Pruning
UR - http://www.scopus.com/inward/record.url?scp=85190113835&partnerID=8YFLogxK
U2 - 10.1109/ICMLA58977.2023.00009
DO - 10.1109/ICMLA58977.2023.00009
M3 - Conference Proceeding
AN - SCOPUS:85190113835
T3 - International Conference on Machine Learning and Applications (ICMLA)
SP - 1
EP - 8
BT - Proceedings - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023
A2 - Arif Wani, M.
A2 - Boicu, Mihai
A2 - Sayed-Mouchaweh, Moamar
A2 - Abreu, Pedro Henriques
A2 - Gama, Joao
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023
Y2 - 15 December 2023 through 17 December 2023
ER -