A unified gradient regularization family for adversarial examples

Chunchuan Lyu; Kaizhu Huang; Hai Ning Liang

doi:10.1109/ICDM.2015.84

A unified gradient regularization family for adversarial examples

Chunchuan Lyu, Kaizhu Huang^*, Hai Ning Liang

^*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

141 Citations (Scopus)

Abstract

Adversarial examples are augmented data points generated by imperceptible perturbation of input samples. They have recently drawn much attention with the machine learning and data mining community. Being difficult to distinguish from real examples, such adversarial examples could change the prediction of many of the best learning models including the state-of-the-art deep learning models. Recent attempts have been made to build robust models that take into account adversarial examples. However, these methods can either lead to performance drops or lack mathematical motivations. In this paper, we propose a unified framework to build robust machine learning models against adversarial examples. More specifically, using the unified framework, we develop a family of gradient regularization methods that effectively penalize the gradient of loss function w.r.t. inputs. Our proposed framework is appealing in that it offers a unified view to deal with adversarial examples. It incorporates another recently-proposed perturbation based approach as a special case. In addition, we present some visual effects that reveals semantic meaning in those perturbations, and thus support our regularization method and provide another explanation for generalizability of adversarial examples. By applying this technique to Maxout networks, we conduct a series of experiments and achieve encouraging results on two benchmark datasets. In particular, we attain the best accuracy on MNIST data (without data augmentation) and competitive performance on CIFAR-10 data.

Original language	English
Title of host publication	Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015
Editors	Charu Aggarwal, Zhi-Hua Zhou, Alexander Tuzhilin, Hui Xiong, Xindong Wu
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	301-309
Number of pages	9
ISBN (Electronic)	9781467395038
DOIs	https://doi.org/10.1109/ICDM.2015.84
Publication status	Published - 5 Jan 2016
Event	15th IEEE International Conference on Data Mining, ICDM 2015 - Atlantic City, United States Duration: 14 Nov 2015 → 17 Nov 2015

Publication series

Name	Proceedings - IEEE International Conference on Data Mining, ICDM
Volume	2016-January
ISSN (Print)	1550-4786

Conference

Conference	15th IEEE International Conference on Data Mining, ICDM 2015
Country/Territory	United States
City	Atlantic City
Period	14/11/15 → 17/11/15

Keywords

Adversarial examples
Deep learning
Regularization
Robust classification

Access to Document

10.1109/ICDM.2015.84

Cite this

Lyu, C., Huang, K., & Liang, H. N. (2016). A unified gradient regularization family for adversarial examples. In C. Aggarwal, Z.-H. Zhou, A. Tuzhilin, H. Xiong, & X. Wu (Eds.), Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015 (pp. 301-309). Article 7373334 (Proceedings - IEEE International Conference on Data Mining, ICDM; Vol. 2016-January). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICDM.2015.84

Lyu, Chunchuan ; Huang, Kaizhu ; Liang, Hai Ning. / A unified gradient regularization family for adversarial examples. Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015. editor / Charu Aggarwal ; Zhi-Hua Zhou ; Alexander Tuzhilin ; Hui Xiong ; Xindong Wu. Institute of Electrical and Electronics Engineers Inc., 2016. pp. 301-309 (Proceedings - IEEE International Conference on Data Mining, ICDM).

@inproceedings{94de7dcd1fe7436da17a0dbc60497904,

title = "A unified gradient regularization family for adversarial examples",

abstract = "Adversarial examples are augmented data points generated by imperceptible perturbation of input samples. They have recently drawn much attention with the machine learning and data mining community. Being difficult to distinguish from real examples, such adversarial examples could change the prediction of many of the best learning models including the state-of-the-art deep learning models. Recent attempts have been made to build robust models that take into account adversarial examples. However, these methods can either lead to performance drops or lack mathematical motivations. In this paper, we propose a unified framework to build robust machine learning models against adversarial examples. More specifically, using the unified framework, we develop a family of gradient regularization methods that effectively penalize the gradient of loss function w.r.t. inputs. Our proposed framework is appealing in that it offers a unified view to deal with adversarial examples. It incorporates another recently-proposed perturbation based approach as a special case. In addition, we present some visual effects that reveals semantic meaning in those perturbations, and thus support our regularization method and provide another explanation for generalizability of adversarial examples. By applying this technique to Maxout networks, we conduct a series of experiments and achieve encouraging results on two benchmark datasets. In particular, we attain the best accuracy on MNIST data (without data augmentation) and competitive performance on CIFAR-10 data.",

keywords = "Adversarial examples, Deep learning, Regularization, Robust classification",

author = "Chunchuan Lyu and Kaizhu Huang and Liang, {Hai Ning}",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.; 15th IEEE International Conference on Data Mining, ICDM 2015 ; Conference date: 14-11-2015 Through 17-11-2015",

year = "2016",

month = jan,

day = "5",

doi = "10.1109/ICDM.2015.84",

language = "English",

series = "Proceedings - IEEE International Conference on Data Mining, ICDM",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "301--309",

editor = "Charu Aggarwal and Zhi-Hua Zhou and Alexander Tuzhilin and Hui Xiong and Xindong Wu",

booktitle = "Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015",

}

Lyu, C, Huang, K & Liang, HN 2016, A unified gradient regularization family for adversarial examples. in C Aggarwal, Z-H Zhou, A Tuzhilin, H Xiong & X Wu (eds), Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015., 7373334, Proceedings - IEEE International Conference on Data Mining, ICDM, vol. 2016-January, Institute of Electrical and Electronics Engineers Inc., pp. 301-309, 15th IEEE International Conference on Data Mining, ICDM 2015, Atlantic City, United States, 14/11/15. https://doi.org/10.1109/ICDM.2015.84

A unified gradient regularization family for adversarial examples. / Lyu, Chunchuan; Huang, Kaizhu; Liang, Hai Ning.
Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015. ed. / Charu Aggarwal; Zhi-Hua Zhou; Alexander Tuzhilin; Hui Xiong; Xindong Wu. Institute of Electrical and Electronics Engineers Inc., 2016. p. 301-309 7373334 (Proceedings - IEEE International Conference on Data Mining, ICDM; Vol. 2016-January).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - A unified gradient regularization family for adversarial examples

AU - Lyu, Chunchuan

AU - Huang, Kaizhu

AU - Liang, Hai Ning

PY - 2016/1/5

Y1 - 2016/1/5

N2 - Adversarial examples are augmented data points generated by imperceptible perturbation of input samples. They have recently drawn much attention with the machine learning and data mining community. Being difficult to distinguish from real examples, such adversarial examples could change the prediction of many of the best learning models including the state-of-the-art deep learning models. Recent attempts have been made to build robust models that take into account adversarial examples. However, these methods can either lead to performance drops or lack mathematical motivations. In this paper, we propose a unified framework to build robust machine learning models against adversarial examples. More specifically, using the unified framework, we develop a family of gradient regularization methods that effectively penalize the gradient of loss function w.r.t. inputs. Our proposed framework is appealing in that it offers a unified view to deal with adversarial examples. It incorporates another recently-proposed perturbation based approach as a special case. In addition, we present some visual effects that reveals semantic meaning in those perturbations, and thus support our regularization method and provide another explanation for generalizability of adversarial examples. By applying this technique to Maxout networks, we conduct a series of experiments and achieve encouraging results on two benchmark datasets. In particular, we attain the best accuracy on MNIST data (without data augmentation) and competitive performance on CIFAR-10 data.

AB - Adversarial examples are augmented data points generated by imperceptible perturbation of input samples. They have recently drawn much attention with the machine learning and data mining community. Being difficult to distinguish from real examples, such adversarial examples could change the prediction of many of the best learning models including the state-of-the-art deep learning models. Recent attempts have been made to build robust models that take into account adversarial examples. However, these methods can either lead to performance drops or lack mathematical motivations. In this paper, we propose a unified framework to build robust machine learning models against adversarial examples. More specifically, using the unified framework, we develop a family of gradient regularization methods that effectively penalize the gradient of loss function w.r.t. inputs. Our proposed framework is appealing in that it offers a unified view to deal with adversarial examples. It incorporates another recently-proposed perturbation based approach as a special case. In addition, we present some visual effects that reveals semantic meaning in those perturbations, and thus support our regularization method and provide another explanation for generalizability of adversarial examples. By applying this technique to Maxout networks, we conduct a series of experiments and achieve encouraging results on two benchmark datasets. In particular, we attain the best accuracy on MNIST data (without data augmentation) and competitive performance on CIFAR-10 data.

KW - Adversarial examples

KW - Deep learning

KW - Regularization

KW - Robust classification

UR - http://www.scopus.com/inward/record.url?scp=84963570113&partnerID=8YFLogxK

U2 - 10.1109/ICDM.2015.84

DO - 10.1109/ICDM.2015.84

M3 - Conference Proceeding

AN - SCOPUS:84963570113

T3 - Proceedings - IEEE International Conference on Data Mining, ICDM

SP - 301

EP - 309

BT - Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015

A2 - Aggarwal, Charu

A2 - Zhou, Zhi-Hua

A2 - Tuzhilin, Alexander

A2 - Xiong, Hui

A2 - Wu, Xindong

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 15th IEEE International Conference on Data Mining, ICDM 2015

Y2 - 14 November 2015 through 17 November 2015

ER -

Lyu C, Huang K, Liang HN. A unified gradient regularization family for adversarial examples. In Aggarwal C, Zhou ZH, Tuzhilin A, Xiong H, Wu X, editors, Proceedings - 15th IEEE International Conference on Data Mining, ICDM 2015. Institute of Electrical and Electronics Engineers Inc. 2016. p. 301-309. 7373334. (Proceedings - IEEE International Conference on Data Mining, ICDM). doi: 10.1109/ICDM.2015.84

A unified gradient regularization family for adversarial examples

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint