TY - GEN
T1 - Subcellular phenotype images classification by MLP ensembles with random linear oracle
AU - Zhang, Bai Ling
AU - Han, Guoxia
PY - 2011
Y1 - 2011
N2 - Subcellular localization is a key functional characteristic of proteins. An automatic, reliable and efficient prediction system for protein subcellular localization can be used for establishing knowledge of the spatial distribution of proteins within living cells and permits to screen systems for drug discovery or for early diagnosis of a disease. In this paper, we investigate an approach based on augmented image features by incorporating curvelet transform and neural network (MLP) ensemble for classification. A simple Random Subspace (RS) ensemble offers satisfactory performance, which contains a set of base MLP classifiers trained with subsets of attributes randomly drawn from the combined features of curvelet coefficients and original Subcellular Location Features (SLF). An MLP ensemble with Random Linear Oracle (RLO) can further improve the performance by replacing a base classifier with a "miniensemble", which consists of a pair of base classifiers and a fixed, randomly created oracle that selects between them. With the benchmarking 2D HeLa images, our experiments show the effectiveness of the proposed approach. The RS-MLP ensemble offers the classification rate 95% while the RS-RLO ensemble gives 95.7% accuracy, which compares sharply with the previously published benchmarking result 84%.
AB - Subcellular localization is a key functional characteristic of proteins. An automatic, reliable and efficient prediction system for protein subcellular localization can be used for establishing knowledge of the spatial distribution of proteins within living cells and permits to screen systems for drug discovery or for early diagnosis of a disease. In this paper, we investigate an approach based on augmented image features by incorporating curvelet transform and neural network (MLP) ensemble for classification. A simple Random Subspace (RS) ensemble offers satisfactory performance, which contains a set of base MLP classifiers trained with subsets of attributes randomly drawn from the combined features of curvelet coefficients and original Subcellular Location Features (SLF). An MLP ensemble with Random Linear Oracle (RLO) can further improve the performance by replacing a base classifier with a "miniensemble", which consists of a pair of base classifiers and a fixed, randomly created oracle that selects between them. With the benchmarking 2D HeLa images, our experiments show the effectiveness of the proposed approach. The RS-MLP ensemble offers the classification rate 95% while the RS-RLO ensemble gives 95.7% accuracy, which compares sharply with the previously published benchmarking result 84%.
UR - http://www.scopus.com/inward/record.url?scp=79960120714&partnerID=8YFLogxK
U2 - 10.1109/icbbe.2011.5780000
DO - 10.1109/icbbe.2011.5780000
M3 - Conference Proceeding
AN - SCOPUS:79960120714
SN - 9781424450893
T3 - 5th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2011
BT - 5th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2011
T2 - 5th International Conference on Bioinformatics and Biomedical Engineering, iCBBE 2011
Y2 - 10 May 2011 through 12 May 2011
ER -