Length-of-Stay Prediction for Pediatric Patients with Respiratory Diseases Using Decision Tree Methods

Fei Ma; Limin Yu; Lishan Ye; David D. Yao; Weifen Zhuang

doi:10.1109/JBHI.2020.2973285

Length-of-Stay Prediction for Pediatric Patients with Respiratory Diseases Using Decision Tree Methods

Fei Ma^*, Limin Yu, Lishan Ye, David D. Yao, Weifen Zhuang

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

29 Citations (Scopus)

Abstract

Accurate prediction of a patient's length-of-stay (LOS) in the hospital enables an efficient and effective management of hospital beds. This paper studies LOS prediction for pediatric patients with respiratory diseases using three decision tree methods: Bagging, Adaboost, and Random forest. A data set of 11,206 records retrieved from the hospital information system is used for analysis after preprocessing and transformation through a computation and an expansion method. Two tests, namely bisection test and periodic test, are designed to assess the performance of the prediction methods. Bagging shows the best result on the bisection test (0.296 RMSE, 0.831 R^2, and 0.723 Acc\;\pm\ 1) for the testing set of the whole data test. The performances of the three methods are similar on the periodic test, whereas Adaboost performs slightly better than the other two methods. Results indicate that the three methods are all effective for the LOS prediction. This study also investigates the importance of different data fields to the LOS prediction, and finds that hospital treatment-related data fields contribute more to the LOS prediction than other categories of fields.

Original language	English
Article number	9007437
Pages (from-to)	2651-2662
Number of pages	12
Journal	IEEE Journal of Biomedical and Health Informatics
Volume	24
Issue number	9
DOIs	https://doi.org/10.1109/JBHI.2020.2973285
Publication status	Published - Sept 2020

Keywords

Machine learning
decision tree
length-of-stay prediction

Access to Document

10.1109/JBHI.2020.2973285

Cite this

@article{afa4d20c592b49a68ecde5393b91f588,

title = "Length-of-Stay Prediction for Pediatric Patients with Respiratory Diseases Using Decision Tree Methods",

abstract = "Accurate prediction of a patient's length-of-stay (LOS) in the hospital enables an efficient and effective management of hospital beds. This paper studies LOS prediction for pediatric patients with respiratory diseases using three decision tree methods: Bagging, Adaboost, and Random forest. A data set of 11,206 records retrieved from the hospital information system is used for analysis after preprocessing and transformation through a computation and an expansion method. Two tests, namely bisection test and periodic test, are designed to assess the performance of the prediction methods. Bagging shows the best result on the bisection test (0.296 RMSE, 0.831 R^2, and 0.723 Acc\;\pm\ 1) for the testing set of the whole data test. The performances of the three methods are similar on the periodic test, whereas Adaboost performs slightly better than the other two methods. Results indicate that the three methods are all effective for the LOS prediction. This study also investigates the importance of different data fields to the LOS prediction, and finds that hospital treatment-related data fields contribute more to the LOS prediction than other categories of fields.",

keywords = "Machine learning, decision tree, length-of-stay prediction",

author = "Fei Ma and Limin Yu and Lishan Ye and Yao, {David D.} and Weifen Zhuang",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2020",

month = sep,

doi = "10.1109/JBHI.2020.2973285",

language = "English",

volume = "24",

pages = "2651--2662",

journal = "IEEE Journal of Biomedical and Health Informatics",

issn = "2168-2194",

number = "9",

}

TY - JOUR

T1 - Length-of-Stay Prediction for Pediatric Patients with Respiratory Diseases Using Decision Tree Methods

AU - Ma, Fei

AU - Yu, Limin

AU - Ye, Lishan

AU - Yao, David D.

AU - Zhuang, Weifen

PY - 2020/9

Y1 - 2020/9

N2 - Accurate prediction of a patient's length-of-stay (LOS) in the hospital enables an efficient and effective management of hospital beds. This paper studies LOS prediction for pediatric patients with respiratory diseases using three decision tree methods: Bagging, Adaboost, and Random forest. A data set of 11,206 records retrieved from the hospital information system is used for analysis after preprocessing and transformation through a computation and an expansion method. Two tests, namely bisection test and periodic test, are designed to assess the performance of the prediction methods. Bagging shows the best result on the bisection test (0.296 RMSE, 0.831 R^2, and 0.723 Acc\;\pm\ 1) for the testing set of the whole data test. The performances of the three methods are similar on the periodic test, whereas Adaboost performs slightly better than the other two methods. Results indicate that the three methods are all effective for the LOS prediction. This study also investigates the importance of different data fields to the LOS prediction, and finds that hospital treatment-related data fields contribute more to the LOS prediction than other categories of fields.

AB - Accurate prediction of a patient's length-of-stay (LOS) in the hospital enables an efficient and effective management of hospital beds. This paper studies LOS prediction for pediatric patients with respiratory diseases using three decision tree methods: Bagging, Adaboost, and Random forest. A data set of 11,206 records retrieved from the hospital information system is used for analysis after preprocessing and transformation through a computation and an expansion method. Two tests, namely bisection test and periodic test, are designed to assess the performance of the prediction methods. Bagging shows the best result on the bisection test (0.296 RMSE, 0.831 R^2, and 0.723 Acc\;\pm\ 1) for the testing set of the whole data test. The performances of the three methods are similar on the periodic test, whereas Adaboost performs slightly better than the other two methods. Results indicate that the three methods are all effective for the LOS prediction. This study also investigates the importance of different data fields to the LOS prediction, and finds that hospital treatment-related data fields contribute more to the LOS prediction than other categories of fields.

KW - Machine learning

KW - decision tree

KW - length-of-stay prediction

UR - http://www.scopus.com/inward/record.url?scp=85090491488&partnerID=8YFLogxK

U2 - 10.1109/JBHI.2020.2973285

DO - 10.1109/JBHI.2020.2973285

M3 - Article

C2 - 32092020

AN - SCOPUS:85090491488

SN - 2168-2194

VL - 24

SP - 2651

EP - 2662

JO - IEEE Journal of Biomedical and Health Informatics

JF - IEEE Journal of Biomedical and Health Informatics

IS - 9

M1 - 9007437

ER -

Length-of-Stay Prediction for Pediatric Patients with Respiratory Diseases Using Decision Tree Methods

Abstract

Keywords

Access to Document

Other files and links

Cite this