TY - JOUR
T1 - The identification of significant features towards travel mode choice and its prediction via optimised random forest classifier
T2 - An evaluation for active commuting behavior
AU - Mohd Ali, Nur Fahriza
AU - Mohd Sadullah, Ahmad Farhan
AU - Abdul Majeed, Anwar P.P.
AU - Mohd Razman, Mohd Azraai
AU - Musa, Rabiu Muazu
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2022/6
Y1 - 2022/6
N2 - Introduction: Physical activity is the foundation to staying healthy, but sedentary activities have become not uncommon that ought to be mitigated immediately. The study aims to highlight the role of a transport system that encourages physical activity among users by applying an active door-to-door transport system. Users’ mode choice is studied to understand their preferences for active commuting. The use of machine learning has since been ubiquitous in a myriad of fields, including transportation studies and hence is also investigated towards its efficacy in predicting travel mode choice. Methodology: The application of the Random Forest (RF) model to identify travel mode choice is explored using the Revealed/Stated Preferences (RP/SP) Survey data in Kuantan City during weekdays. A total of 386 respondents were involved in this survey. The efficacy of the tuned RF models towards predicting the travel mode choice is evaluated via the Classification Accuracy (CA) performance indicator. In addition, a Feature Importance study is also carried out in order to identify significant factors that contribute towards travel mode choice. Results: The results from the present investigation demonstrated that the default RF model has acceptable predictability for both training and test dataset of users’ mode choice, with a CA of 70.2% and 69.3%, respectively. Upon identifying the significant features and further refining the hyperparameters of the RF model heuristically, it was shown that with 145 trees, the CA improved to up to 71.6% and 70.1% for both the training and test dataset, respectively. Through the feature selection technique, the most significant features that affect users mode choice are total travel time (TT), waiting time at a public transport stop (WT), region, walking distance from the last stop to destination (WD2), and walking distance from home to the nearest bus stop (WD1). Conclusions: The study has illustrated the efficacy of the optimised RF in predicting travel mode choice as well as identified the significant factors for the selection. The findings of the present study provide significant insight for policymakers to improve the performance of the public transportation system so that the users will benefit in terms of health and well-being from active commuting.
AB - Introduction: Physical activity is the foundation to staying healthy, but sedentary activities have become not uncommon that ought to be mitigated immediately. The study aims to highlight the role of a transport system that encourages physical activity among users by applying an active door-to-door transport system. Users’ mode choice is studied to understand their preferences for active commuting. The use of machine learning has since been ubiquitous in a myriad of fields, including transportation studies and hence is also investigated towards its efficacy in predicting travel mode choice. Methodology: The application of the Random Forest (RF) model to identify travel mode choice is explored using the Revealed/Stated Preferences (RP/SP) Survey data in Kuantan City during weekdays. A total of 386 respondents were involved in this survey. The efficacy of the tuned RF models towards predicting the travel mode choice is evaluated via the Classification Accuracy (CA) performance indicator. In addition, a Feature Importance study is also carried out in order to identify significant factors that contribute towards travel mode choice. Results: The results from the present investigation demonstrated that the default RF model has acceptable predictability for both training and test dataset of users’ mode choice, with a CA of 70.2% and 69.3%, respectively. Upon identifying the significant features and further refining the hyperparameters of the RF model heuristically, it was shown that with 145 trees, the CA improved to up to 71.6% and 70.1% for both the training and test dataset, respectively. Through the feature selection technique, the most significant features that affect users mode choice are total travel time (TT), waiting time at a public transport stop (WT), region, walking distance from the last stop to destination (WD2), and walking distance from home to the nearest bus stop (WD1). Conclusions: The study has illustrated the efficacy of the optimised RF in predicting travel mode choice as well as identified the significant factors for the selection. The findings of the present study provide significant insight for policymakers to improve the performance of the public transportation system so that the users will benefit in terms of health and well-being from active commuting.
KW - Door-to-door journey
KW - Hyperparameter tuning
KW - Private vehicles
KW - Public transport
KW - Random forest classifier
KW - Travel mode choice
UR - http://www.scopus.com/inward/record.url?scp=85127337982&partnerID=8YFLogxK
U2 - 10.1016/j.jth.2022.101362
DO - 10.1016/j.jth.2022.101362
M3 - Article
AN - SCOPUS:85127337982
SN - 2214-1405
VL - 25
JO - Journal of Transport and Health
JF - Journal of Transport and Health
M1 - 101362
ER -