TY - GEN
T1 - Image Ordinal Classification and Understanding
T2 - 2018 IEEE International Conference on Multimedia and Expo, ICME 2018
AU - Zhang, Chao
AU - Zhu, Ce
AU - Xiao, Jimin
AU - Xu, Xun
AU - Liu, Yipeng
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/10/8
Y1 - 2018/10/8
N2 - Image ordinal classification refers to predicting a discrete target value which carries ordering correlation among image categories. The limited size of labeled ordinal data renders modern deep learning approaches easy to overfit. To tackle this issue, neuron dropout and data augmentation were proposed which, however, still suffer from over-parameterization and breaking spatial structure, respectively. To address the issues, we first propose a grid dropout method that randomly dropout/blackout some areas of the training image. Then we combine the objective of predicting the blackout patches with classification to take advantage of the spatial information. Finally we demonstrate the effectiveness of both approaches by visualizing the Class Activation Map (CAM) and discover that grid dropout is more aware of the whole facial areas and more robust than neuron dropout for small training dataset. Experiments are conducted on a challenging age estimation dataset-Adience dataset with very competitive results compared with state-of-the-art methods.
AB - Image ordinal classification refers to predicting a discrete target value which carries ordering correlation among image categories. The limited size of labeled ordinal data renders modern deep learning approaches easy to overfit. To tackle this issue, neuron dropout and data augmentation were proposed which, however, still suffer from over-parameterization and breaking spatial structure, respectively. To address the issues, we first propose a grid dropout method that randomly dropout/blackout some areas of the training image. Then we combine the objective of predicting the blackout patches with classification to take advantage of the spatial information. Finally we demonstrate the effectiveness of both approaches by visualizing the Class Activation Map (CAM) and discover that grid dropout is more aware of the whole facial areas and more robust than neuron dropout for small training dataset. Experiments are conducted on a challenging age estimation dataset-Adience dataset with very competitive results compared with state-of-the-art methods.
KW - Data Augmentation
KW - Dropout
KW - Ordinal Classification
KW - Overfitting
KW - Visualization
UR - http://www.scopus.com/inward/record.url?scp=85055843998&partnerID=8YFLogxK
U2 - 10.1109/ICME.2018.8486469
DO - 10.1109/ICME.2018.8486469
M3 - Conference Proceeding
AN - SCOPUS:85055843998
T3 - Proceedings - IEEE International Conference on Multimedia and Expo
BT - 2018 IEEE International Conference on Multimedia and Expo, ICME 2018
PB - IEEE Computer Society
Y2 - 23 July 2018 through 27 July 2018
ER -