TY - JOUR
T1 - Ovarian cancer detection using optimized machine learning models with adaptive differential evolution
AU - Juwono, Filbert H.
AU - Wong, W. K.
AU - Pek, Hui Ting
AU - Sivakumar, Saaveethya
AU - Acula, Donata D.
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2022/8
Y1 - 2022/8
N2 - Ovarian cancer is one of the leading causes of mortality among women. The most common detection approach is by tracking related biomarkers to see whether patient has ovarian cancer. Despite emphasis on biomarkers, feature subsets may include physiological features. In this current research, we propose an optimal simultaneous feature weighting/parameter optimization for detecting ovarian cancer occurrence. The weights are optimized using adaptive differential evolution (ADE) with cross validation error and least absolute shrinkage and selection operator (LASSO) regularization as the fitness function. Furthermore, to be in line with the Occam's razor principle, we apply LASSO regularization in the form of weighted ℓ1 norm to reduce features being selected. The regularization constant, λ, is integral to balance the feature weights and the cross validation error. Using ADE method and, in particular LASSO regularization, we obtain the results showing clear reduction and elimination of features from feature subset. Using K-nearest neighbors (KNN) with λ=0.01, 97.24% accuracy (mean) is obtained. Meanwhile, when using support vector machines (SVM) with λ=0.01, an accuracy of 96.48% (mean) is achieved. From the viewpoint of machine learning, this, at least, suggests that the proposed approach may be a good solution for optimizing feature weights/parameters at the same time. From the perspective of cancer research, it demonstrates that a plethora of features may be valuable in building a model and should not be discarded for having weak discriminative capability. As a result, the combination of feature subset and non linear mapping plays a significant role.
AB - Ovarian cancer is one of the leading causes of mortality among women. The most common detection approach is by tracking related biomarkers to see whether patient has ovarian cancer. Despite emphasis on biomarkers, feature subsets may include physiological features. In this current research, we propose an optimal simultaneous feature weighting/parameter optimization for detecting ovarian cancer occurrence. The weights are optimized using adaptive differential evolution (ADE) with cross validation error and least absolute shrinkage and selection operator (LASSO) regularization as the fitness function. Furthermore, to be in line with the Occam's razor principle, we apply LASSO regularization in the form of weighted ℓ1 norm to reduce features being selected. The regularization constant, λ, is integral to balance the feature weights and the cross validation error. Using ADE method and, in particular LASSO regularization, we obtain the results showing clear reduction and elimination of features from feature subset. Using K-nearest neighbors (KNN) with λ=0.01, 97.24% accuracy (mean) is obtained. Meanwhile, when using support vector machines (SVM) with λ=0.01, an accuracy of 96.48% (mean) is achieved. From the viewpoint of machine learning, this, at least, suggests that the proposed approach may be a good solution for optimizing feature weights/parameters at the same time. From the perspective of cancer research, it demonstrates that a plethora of features may be valuable in building a model and should not be discarded for having weak discriminative capability. As a result, the combination of feature subset and non linear mapping plays a significant role.
KW - Adaptive differential evolution
KW - Feature reduction
KW - KNN
KW - Ovarian cancer
KW - SVM
UR - http://www.scopus.com/inward/record.url?scp=85129586461&partnerID=8YFLogxK
U2 - 10.1016/j.bspc.2022.103785
DO - 10.1016/j.bspc.2022.103785
M3 - Article
AN - SCOPUS:85129586461
SN - 1746-8094
VL - 77
JO - Biomedical Signal Processing and Control
JF - Biomedical Signal Processing and Control
M1 - 103785
ER -