Abstract
We propose a new sparse principal component analysis (SPCA) method in which the solutions are obtained by projecting the full cardinality principal components onto subsets of variables. The resulting components are guaranteed to explain a given proportion of variance. The computation of these solutions is very efficient. The proposed method compares well with the optimal least squares sparse components. We show that other SPCA methods fail to identify the best sparse approximations of the principal components and explain less variance than our solutions. We illustrate and compare our method with others with extensive simulations and with the analysis of the computational results for nine datasets of increasing dimensions up to 16,000 variables.
Original language | English |
---|---|
Pages (from-to) | 366-382 |
Number of pages | 17 |
Journal | Journal of Multivariate Analysis |
Volume | 173 |
DOIs | |
Publication status | Published - Sept 2019 |
Keywords
- Dimension reduction
- Power method
- SPCA
- Variable selection