Abstract
The well-known feature transformation model of Fisher linear discriminant analysis (FDA) can be decomposed into an equivalent two-step approach: whitening followed by principal component analysis (PCA) in the whitened space. By proving that whitening is the optimal linear transformation to Euclidean space in the sense of minimum log-determinant divergence, we propose a transformation model called class conditional decorrelation (CCD). The objective of CCD is to diagonalize the covariance matrices of different classes simultaneously, which is efficiently optimized using a modified Jacobi method. CCD is effective in finding the common principal components among multiple classes. After CCD, the variables become class conditionally uncorrelated, which benefits subsequent classification tasks. Combining CCD with the nearest class mean (NCM) classification model can significantly improve classification accuracy. Experiments on 15 small-scale datasets and one large-scale dataset (with 3755 classes) demonstrate the scalability of CCD across different applications. We also discuss potential applications of CCD to other problems such as Gaussian mixture models and classifier ensemble learning.
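The paper's exact "modified Jacobi method" is not reproduced in this record. As a rough illustration of the simultaneous-diagonalization objective the abstract describes, the sketch below implements a standard Jacobi-style orthogonal joint approximate diagonalization (in the spirit of Cardoso and Souloumiac) and pairs it with an NCM rule in the transformed space. The function names, toy data, and the choice of an orthogonal transform are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def joint_diagonalize(covs, n_sweeps=100, tol=1e-12):
    """Orthogonal joint approximate diagonalization of symmetric matrices
    via Jacobi rotations (Cardoso-Souloumiac style). Returns V such that
    V.T @ C @ V is as diagonal as possible for every C in `covs`.
    NOTE: a sketch of one standard approach, not the paper's exact method."""
    C = np.stack(covs).astype(float)          # (K, d, d)
    d = C.shape[1]
    V = np.eye(d)
    for _ in range(n_sweeps):
        rotated = False
        for i in range(d - 1):
            for j in range(i + 1, d):
                # 2x2 subproblem pooled over all K matrices
                g = np.stack([C[:, i, i] - C[:, j, j], 2.0 * C[:, i, j]])
                G = g @ g.T                   # 2x2 summary matrix
                _, U = np.linalg.eigh(G)
                x, y = U[:, -1]               # principal eigenvector
                if x < 0.0:
                    x, y = -x, -y
                r = np.hypot(x, y)
                if r < tol or abs(y) < tol * max(r, 1.0):
                    continue                  # this pair is already diagonal
                c = np.sqrt((x + r) / (2.0 * r))
                s = y / np.sqrt(2.0 * r * (x + r))
                # Givens rotation in the (i, j) plane, applied to all matrices
                R = np.eye(d)
                R[i, i] = R[j, j] = c
                R[i, j], R[j, i] = -s, s
                C = R.T @ C @ R               # broadcasts over the K matrices
                V = V @ R
                rotated = True
        if not rotated:
            break
    return V

# Toy usage: decorrelate per-class covariances, then classify by NCM.
rng = np.random.default_rng(0)
d, n = 5, 200
means = {k: rng.normal(size=d) for k in (0, 1)}
X = {k: rng.normal(size=(n, d)) @ rng.normal(size=(d, d)) + means[k]
     for k in means}

covs = [np.cov(X[k], rowvar=False) for k in sorted(X)]
V = joint_diagonalize(covs)                   # shared decorrelating rotation
Z = {k: X[k] @ V for k in X}                  # class-conditionally decorrelated
centers = np.stack([Z[k].mean(axis=0) for k in sorted(Z)])

def ncm_predict(x_new):
    """Nearest class mean in the transformed space."""
    return np.argmin(np.linalg.norm(centers - x_new @ V, axis=1))
```

Since an orthogonal matrix can only diagonalize several covariance matrices exactly when they share eigenvectors, the Jacobi sweeps above minimize the residual off-diagonal mass, which matches the common-principal-components reading of the abstract; the paper's CCD objective and optimizer may differ in detail.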
| Original language | English |
| --- | --- |
| Article number | 6729573 |
| Pages (from-to) | 887-896 |
| Number of pages | 10 |
| Journal | Proceedings - IEEE International Conference on Data Mining, ICDM |
| DOIs | |
| Publication status | Published - 2013 |
| Event | 13th IEEE International Conference on Data Mining, ICDM 2013 - Dallas, TX, United States. Duration: 7 Dec 2013 → 10 Dec 2013 |
Keywords
- class conditional decorrelation
- feature transformation
- simultaneous diagonalization