Length-of-Stay Prediction for Pediatric Patients with Respiratory Diseases Using Decision Tree Methods

Fei Ma*, Limin Yu, Lishan Ye, David D. Yao, Weifen Zhuang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

26 Citations (Scopus)

Abstract

Accurate prediction of a patient's length-of-stay (LOS) in the hospital enables an efficient and effective management of hospital beds. This paper studies LOS prediction for pediatric patients with respiratory diseases using three decision tree methods: Bagging, Adaboost, and Random forest. A data set of 11,206 records retrieved from the hospital information system is used for analysis after preprocessing and transformation through a computation and an expansion method. Two tests, namely bisection test and periodic test, are designed to assess the performance of the prediction methods. Bagging shows the best result on the bisection test (0.296 RMSE, 0.831 R^2, and 0.723 Acc\;\pm\ 1) for the testing set of the whole data test. The performances of the three methods are similar on the periodic test, whereas Adaboost performs slightly better than the other two methods. Results indicate that the three methods are all effective for the LOS prediction. This study also investigates the importance of different data fields to the LOS prediction, and finds that hospital treatment-related data fields contribute more to the LOS prediction than other categories of fields.

Original languageEnglish
Article number9007437
Pages (from-to)2651-2662
Number of pages12
JournalIEEE Journal of Biomedical and Health Informatics
Volume24
Issue number9
DOIs
Publication statusPublished - Sept 2020

Keywords

  • Machine learning
  • decision tree
  • length-of-stay prediction

Cite this