TY - GEN
T1 - Forecasting Clinical Expenditure of Child Patients Using Binary and Multi-Classification Methods
AU - Wang, Chenguang
AU - Pan, Xinyi
AU - Ye, Lishan
AU - Zhuang, Weifen
AU - Ma, Fei
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/11/26
Y1 - 2018/11/26
N2 - In this paper, random forest algorithm (RF) and error-correction output code model (ECOC) were employed to predict the clinic expenditure of child patients with data consisting of records extracted from a hospital information system. Throughout the modelling, the training set utilized 80% of the records selected from original data set in random and the rest of data were used in the test set. The RF received better predictive accuracy than ECOC, with RMSE being 0.152, R 2 being 0.924, |R| being 0.869, and Acc ±1 being 82.6%. Additionally, RF obtained good performances on different types of charges, achieving over 80% accuracy in average. Besides, among different types of information, clinic features gave better results, with RMSE being 0.215, R 2 being 0.844, |R| being 0.709 and Acc being 60.5%. In comparison, the random forest generally performed better than ECOC models in most fields. To summarize, the random forest could obtain best accuracy on charge7 (Treatment Fee), with accuracy of 90.5%, and clinic features could provide models with higher accuracy among all fields of information.
AB - In this paper, random forest algorithm (RF) and error-correction output code model (ECOC) were employed to predict the clinic expenditure of child patients with data consisting of records extracted from a hospital information system. Throughout the modelling, the training set utilized 80% of the records selected from original data set in random and the rest of data were used in the test set. The RF received better predictive accuracy than ECOC, with RMSE being 0.152, R 2 being 0.924, |R| being 0.869, and Acc ±1 being 82.6%. Additionally, RF obtained good performances on different types of charges, achieving over 80% accuracy in average. Besides, among different types of information, clinic features gave better results, with RMSE being 0.215, R 2 being 0.844, |R| being 0.709 and Acc being 60.5%. In comparison, the random forest generally performed better than ECOC models in most fields. To summarize, the random forest could obtain best accuracy on charge7 (Treatment Fee), with accuracy of 90.5%, and clinic features could provide models with higher accuracy among all fields of information.
KW - Clinical expenditure
KW - error-correction output code
KW - forecasts
KW - multiple classification
KW - random forest
UR - http://www.scopus.com/inward/record.url?scp=85059939737&partnerID=8YFLogxK
U2 - 10.1109/BDAI.2018.8546678
DO - 10.1109/BDAI.2018.8546678
M3 - Conference Proceeding
AN - SCOPUS:85059939737
T3 - International Conference on Big Data and Artificial Intelligence, BDAI 2018
SP - 111
EP - 115
BT - International Conference on Big Data and Artificial Intelligence, BDAI 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 International Conference on Big Data and Artificial Intelligence, BDAI 2018
Y2 - 22 June 2018 through 24 June 2018
ER -