Bicycle crash frequency modeling across different crash severities using a random-forest-based Shapley Additive explanations approach

Tao Li, Ruiqi Wang, Hongliang Ding*, Tiantian Chen, Hyungchul Chung

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Statistical modeling and data-driven studies on bicycle accidents are widespread, however, explanations of the underlying mechanisms remain limited, particularly regarding the impact of key risk factors on the bicycle crash frequency across different crash severities. This study aims to examine the effects of various risk factors on the frequency of bicycle crashes using Random Forest and Shapley Additive Explanations (RF-SHAP), taking into account the different crash severity levels. Data from three years of London crash data (2017 to 2019) is utilized. Population demographics, land use, road infrastructure, and traffic flows, are collected in Greater London. In addition to providing superior predictive accuracy, our proposed method identified critical risk factors at different levels of severity associated with bicycle crashes. The distinct contribution of this study is the identification of the primary factors influencing the severity of bicycle collisions in London through the use of RF-SHAP. The study quantifies both the main and interactive effects of various severity risk factors on bicycle collisions. Results suggest that the proportion of building areas and population density are most critical to bicycle crash numbers in different severity levels. Also, the interaction effects of the risk factors on bicycle crashes are revealed. Specifically, results reveal a negative correlation between traffic flow and overall bicycle crash frequency when the average road network connectivity is below 2.25. After controlling the population density, the proportion of residential areas shows a three-stage pattern of influence on the slight injury crash frequency. Furthermore, a boundary value of 6.3 is identified for the safety impact of road density on fatal and severely-injured bicycle crashes. Study findings should provide insights into cost-effective safety countermeasures for bicycle infrastructures, traffic controls, and safety education. Bicycle safety can be improved through these measures over the long term.

Original languageEnglish
JournalInternational Journal of Injury Control and Safety Promotion
DOIs
Publication statusAccepted/In press - 2025

Keywords

  • bicycle injury severity levels
  • Bicycle safety
  • interactive effects
  • random-forest-based shapley additive explanations
  • zonal risk factors

Fingerprint

Dive into the research topics of 'Bicycle crash frequency modeling across different crash severities using a random-forest-based Shapley Additive explanations approach'. Together they form a unique fingerprint.

Cite this