TY - GEN
T1 - Causal Unstructured Pruning in Linear Networks Using Effective Information
AU - Zeng, Changyu
AU - Liu, Li
AU - Zhao, Haocheng
AU - Zhang, Yu
AU - Wang, Wei
AU - Cai, Ning
AU - Yue, Yutao
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - The excessive number of parameters in today's (deep) neural networks demands tremendous computational resources and slows down training. It also makes these models difficult to deploy on capability-constrained devices such as mobile devices. To address this challenge, we propose an unstructured pruning method that measures the causal structure of neural networks based on effective information (EI). It introduces an intervention on the input and computes the mutual information between the intervention and its corresponding output within a single linear layer, measuring the importance of each weight. In our experiments, we found that the sparsity achieved by EI pruning can exceed 90%: compared with unpruned benchmark methods, only 10% of the non-zero parameters in the linear layers were needed, while maintaining a similar level of accuracy and stable training performance under iterative pruning. In addition, because the invariance of the network's causal structure is exploited, a network pruned using EI is more generalizable and interpretable than those produced by other methods.
AB - The excessive number of parameters in today's (deep) neural networks demands tremendous computational resources and slows down training. It also makes these models difficult to deploy on capability-constrained devices such as mobile devices. To address this challenge, we propose an unstructured pruning method that measures the causal structure of neural networks based on effective information (EI). It introduces an intervention on the input and computes the mutual information between the intervention and its corresponding output within a single linear layer, measuring the importance of each weight. In our experiments, we found that the sparsity achieved by EI pruning can exceed 90%: compared with unpruned benchmark methods, only 10% of the non-zero parameters in the linear layers were needed, while maintaining a similar level of accuracy and stable training performance under iterative pruning. In addition, because the invariance of the network's causal structure is exploited, a network pruned using EI is more generalizable and interpretable than those produced by other methods.
KW - Causal Inference
KW - Deep Learning
KW - Effective Information
KW - Unstructured Pruning
UR - http://www.scopus.com/inward/record.url?scp=85153686412&partnerID=8YFLogxK
U2 - 10.1109/CyberC55534.2022.00056
DO - 10.1109/CyberC55534.2022.00056
M3 - Conference Proceeding
AN - SCOPUS:85153686412
T3 - Proceedings - 2022 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2022
SP - 294
EP - 302
BT - Proceedings - 2022 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 12th International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, CyberC 2022
Y2 - 15 December 2022 through 16 December 2022
ER -