TY - JOUR
T1 - Length-of-Stay Prediction for Pediatric Patients with Respiratory Diseases Using Decision Tree Methods
AU - Ma, Fei
AU - Yu, Limin
AU - Ye, Lishan
AU - Yao, David D.
AU - Zhuang, Weifen
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2020/9
Y1 - 2020/9
N2 - Accurate prediction of a patient's length-of-stay (LOS) in the hospital enables an efficient and effective management of hospital beds. This paper studies LOS prediction for pediatric patients with respiratory diseases using three decision tree methods: Bagging, Adaboost, and Random forest. A data set of 11,206 records retrieved from the hospital information system is used for analysis after preprocessing and transformation through a computation and an expansion method. Two tests, namely bisection test and periodic test, are designed to assess the performance of the prediction methods. Bagging shows the best result on the bisection test (0.296 RMSE, 0.831 R^2, and 0.723 Acc\;\pm\ 1) for the testing set of the whole data test. The performances of the three methods are similar on the periodic test, whereas Adaboost performs slightly better than the other two methods. Results indicate that the three methods are all effective for the LOS prediction. This study also investigates the importance of different data fields to the LOS prediction, and finds that hospital treatment-related data fields contribute more to the LOS prediction than other categories of fields.
AB - Accurate prediction of a patient's length-of-stay (LOS) in the hospital enables an efficient and effective management of hospital beds. This paper studies LOS prediction for pediatric patients with respiratory diseases using three decision tree methods: Bagging, Adaboost, and Random forest. A data set of 11,206 records retrieved from the hospital information system is used for analysis after preprocessing and transformation through a computation and an expansion method. Two tests, namely bisection test and periodic test, are designed to assess the performance of the prediction methods. Bagging shows the best result on the bisection test (0.296 RMSE, 0.831 R^2, and 0.723 Acc\;\pm\ 1) for the testing set of the whole data test. The performances of the three methods are similar on the periodic test, whereas Adaboost performs slightly better than the other two methods. Results indicate that the three methods are all effective for the LOS prediction. This study also investigates the importance of different data fields to the LOS prediction, and finds that hospital treatment-related data fields contribute more to the LOS prediction than other categories of fields.
KW - Machine learning
KW - decision tree
KW - length-of-stay prediction
UR - http://www.scopus.com/inward/record.url?scp=85090491488&partnerID=8YFLogxK
U2 - 10.1109/JBHI.2020.2973285
DO - 10.1109/JBHI.2020.2973285
M3 - Article
C2 - 32092020
AN - SCOPUS:85090491488
SN - 2168-2194
VL - 24
SP - 2651
EP - 2662
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
IS - 9
M1 - 9007437
ER -