A Comparative Study of Four Machine Learning Algorithms in Crash Severity Modeling

Document Type : Scientific - Research

Authors
1 MSc Transportation Engineering, Department of Transportation Engineering, School of Civil Engineering, Iran University of Science and Technology, Tehran, Iran
2 Associate Professor, Department of Transportation Engineering, School of Civil Engineering, Iran University of Science and Technology, Tehran, Iran
Abstract
Road crashes cause sociological and economical costs to societies, hence in recent years traffic safety studies has been a major concern. studies have been conducted in regards to crash severity modeling, but the majority of theses studies has traditionally used statistical modeling techniques, and the use of machine learning has been limited. Statistical methods have presupposed some conditions regarding variables and their relations, and if these conditions are violated, inference is affected. A wide spectrum of machine learning algorithm could be employed in crash safety modeling; hence the current study aims to comprehensively compare the performance of 4 widely used machine learning algorithms, namely classification and regression tree, artificial neural network, support vector machine and k-nearest neighbors. Results indicate that CART outperformed other models both in terms of accuracy and the prediction of the less populated crash severity level. Furthermore, the variable importance effected was analyzed using SHAP method, where at fault driver’s age, lighting condition and speed limit were found to be most significant across different models.

Keywords

Subjects


- Abdel-Aty, M. (2003). "Analysis of driver injury severity levels at multiple locations using ordered probit models." Journal of safety research 34(5): 597-603.
 
- Abdel-Aty, M. A., C. L. Chen and J. R. Schott (1998). "An assessment of the effect of driver age on traffic accident involvement using log-linear models." Accident Analysis & Prevention 30(6): 851-861.
 
- Abdelwahab, H. T. and M. A. Abdel-Aty (2001). "Development of artificial neural network models to predict driver injury severity in traffic accidents at signalized intersections." Transportation research record 1746(1): 6-13.
 
- Asadi, R., A. Khattak, H. Vashani, H. R. Almujibah, H. Rabie, S. Asadi and B. Dimitrijevic (2023). "Self-Paced Ensemble-SHAP Approach for the Classification and Interpretation of Crash Severity in Work Zone Areas." Sustainability 15(11): 9076.
 
- Barbone, F., A. McMahon, P. Davey, A. Morris, I. Reid, D. McDevitt and T. MacDonald (1998). "Association of road-traffic accidents with benzodiazepine use." The Lancet 352(9137): 1331-1336.
 
- Chang, L.-Y. and J.-T. Chien (2013). "Analysis of driver injury severity in truck-involved accidents using a non-parametric classification tree model." Safety science 51(1): 17-22.
 
- Chen, C., G. Zhang, Z. Qian, R. A. Tarefder and Z. Tian (2016). "Investigating driver injury severity patterns in rollover crashes using support vector machine models." Accident Analysis & Prevention 90: 128-139.
- Chimba, D. and T. Sando (2009). "The prediction of highway traffic accident injury severity with neuromorphic techniques." Advances in transportation studies 19: 17-26.
 
- Chong, M., A. Abraham and M. Paprzycki (2005). "Traffic accident analysis using machine learning paradigms." Informatica 29 (1).
 
- Delen, D., R. Sharda and M. Bessonov (2006). "Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks." Accident Analysis & Prevention 38(3): 434-444.
 
- Do, L. N., H. L. Vu, B. Q. Vo, Z. Liu and D. Phung (2019). "An effective spatial-temporal attention based neural network for traffic flow prediction." Transportation research part C: emerging technologies 108: 12-28.
 
- Doecke, S. D., C. N. Kloeden, J. K. Dutschke and M. R. Baldock (2018). "Safe speed limits for a safe system: The relationship between speed limit and fatal crash rate for different crash types." Traffic injury prevention 19(4): 404-408.
 
- Fanyu, M., N. Sze, S. Cancan, C. Tiantian and Z. Yiping (2021). "Temporal instability of truck volume composition on non-truck-involved crash severity using uncorrelated and correlated grouped random parameters binary logit models with space-time variations." Analytic Methods in Accident Research 31: 100168.
 
- Gu, X., T. Li, Y. Wang, L. Zhang, Y. Wang and J. Yao (2018). "Traffic fatalities prediction using support vector machine with hybrid particle swarm optimization." Journal of Algorithms & Computational Technology 12(1): 20-29.
 
- Haghshenas, S. S., G. Guido, A. Vitale and V. Astarita (2023). "Assessment of the level of road crash severity: Comparison of intelligence studies." Expert Systems with Applications 234: 121118.
 
- Hanrahan, R. B., P. M. Layde, S. Zhu, C. E. Guse and S. W. Hargarten (2009). "The association of driver age with traffic injury severity in Wisconsin." Traffic injury prevention 10(4): 361-367.
 
- Hou, Q., X. Huo, J. Leng and F. Mannering (2022). "A note on out-of-sample prediction, marginal effects computations, and temporal testing with random parameters crash-injury severity models." Analytic methods in accident research 33: 100191.
 
- Hu, Z., J. Zhou, K. Huang and E. Zhang (2022). "A data-driven approach for traffic crash prediction: A case study in Ningbo, China." International Journal of Intelligent Transportation Systems Research 20(2): 508-518.
 
- Iranitalab, A. and A. Khattak (2017). "Comparison of four statistical and machine learning methods for crash severity prediction." Accident Analysis & Prevention 108: 27-36.
 
- Islam, M. R., K. B. Kelarestaghi, A. Ermagun and S. Banerjee (2019). "Gender Differences in Injury Severity Risk of Single-Vehicle Crashes in Virginia: A Nested Logit Analysis of Heterogeneity." arXiv preprint arXiv:1901.03289.
 
- Kashani, A. T. and A. S. Mohaymany (2011). "Analysis of the traffic injury severity on two-lane, two-way rural roads based on classification tree models." Safety science 49(10): 1314-1320.
 
- Li, J., Y. Yang, Y. Hu, X. Zhu, N. Ma and X. Yuan (2023). "Using Multidimensional Data to Analyze Freeway Real-Time Traffic Crash Precursors Based on XGBoost-SHAP Algorithm." Journal of Advanced Transportation 2023.
 
- Li, P., M. Abdel-Aty and J. Yuan (2020). "Real-time crash risk prediction on arterials based on LSTM-CNN." Accident Analysis & Prevention 135: 105371.
 
- Li, X., D. Lord, Y. Zhang and Y. Xie (2008). "Predicting motor vehicle crashes using support vector machine models." Accident Analysis & Prevention 40(4): 1611-1618.
 
- Li, Z., P. Liu, W. Wang and C. Xu (2012). "Using support vector machine models for crash injury severity analysis." Accident Analysis & Prevention 45: 478-486.
 
- Pakgohar, A., R. S. Tabrizi, M. Khalili and A. Esmaeili (2011). "The role of human factor in incidence and severity of road crashes based on the CART and LR regression: a data mining approach." Procedia Computer Science 3: 764-769.
 
- Park, S. and J. Park (2022). "Multilevel Mixed-Effects Models to Identify Contributing Factors on Freight Vehicle Crash Severity." Sustainability 14(19): 11804.
 
- Patil, S., S. R. Geedipally and D. Lord (2012). "Analysis of crash severities using nested logit model—accounting for the underreporting of crashes." Accident Analysis & Prevention 45: 646-653.
 
- Rahim, M. A. and H. M. Hassan (2021). "A deep learning based traffic crash severity prediction framework." Accident Analysis & Prevention 154: 106090.
 
- Rezapour, M., S. Nazneen and K. Ksaibati (2020). "Application of deep learning techniques in predicting motorcycle crash severity." Engineering Reports 2(7): e12175.
- Rezapour, M., S. S. Wulff and K. Ksaibati (2020). "Bayesian hierarchical modelling of traffic barrier crash severity." International journal of injury control and safety promotion 28(1): 94-102.
 
- Sameen, M. I. and B. Pradhan (2017). "Severity prediction of traffic accidents with recurrent neural networks." Applied Sciences 7(6): 476.
 
- Silva, P. B., M. Andrade and S. Ferreira (2020). "Machine learning applied to road safety modeling: A systematic literature review." Journal of traffic and transportation engineering (English edition) 7(6): 775-790.
 
- Sohn, S. Y. and S. H. Lee (2003). "Data fusion, ensemble and clustering to improve the classification accuracy for the severity of road traffic accidents in Korea." Safety Science 41(1): 1-14.
 
- Sohn, S. Y. and H. Shin (2001). "Pattern recognition for road traffic accident severity in Korea." Ergonomics 44(1): 107-117.
 
- Tang, J., J. Liang, C. Han, Z. Li and H. Huang (2019). "Crash injury severity analysis using a two-layer Stacking framework." Accident Analysis & Prevention 122: 226-238.
- Tang, J., L. Zheng, C. Han, W. Yin, Y. Zhang, Y. Zou and H. Huang (2020). "Statistical and machine-learning methods for clearance time prediction of road incidents: A methodology review." Analytic Methods in Accident Research 27: 100123.
 
- Vajari, M. A., K. Aghabayk, M. Sadeghian and N. Shiwakoti (2020). "A multinomial logit model of motorcycle crash severity at Australian intersections." Journal of safety research 73: 17-24.
 
- Wahab, L. and H. Jiang (2019). "A comparative study on machine learning based algorithms for prediction of motorcycle crash severity." PLoS one 14(4): e0214966.
- Wen, X., Y. Xie, L. Jiang, Z. Pu and T. Ge (2021). "Applications of machine learning methods in traffic crash severity modelling: current status and future directions." Transport reviews 41(6): 855-879.
 
- Wilmot, C. G. and M. Khanal (1999). "Effect of speed limits on speed and safety: a review." Transport Reviews 19(4): 315-329.
 
- Yu, H., Z. Li, G. Zhang, P. Liu and T. Ma (2021). "Fusion convolutional neural network-based interpretation of unobserved heterogeneous factors in driver injury severity outcomes in single-vehicle crashes." Analytic methods in accident research 30, 100157.
 
- Yu, R. and M. Abdel-Aty (2014). "Analyzing crash injury severity for a mountainous freeway incorporating real-time traffic and weather data." Safety science 63: 50-56.
 
- Zeng, Q., W. Gu, X. Zhang, H. Wen, J. Lee and W. Hao (2019). "Analyzing freeway crash severity using a Bayesian spatial generalized ordered logit model with conditional autoregressive priors." Accident Analysis & Prevention 127: 87-95.
 
- Zeng, Q. and H. Huang (2014). "A stable and optimized neural network model for crash injury severity prediction." Accident Analysis & Prevention 73: 351-358.
 
- Zeng, Q., H. Wen, H. Huang, X. Pei and S. Wong (2017). "A multivariate random-parameters Tobit model for analyzing highway crash rates by injury severity." Accident Analysis & Prevention 99: 184-191.
 
- Zhao, X., H. Qi, Y. Yao, M. Guo and Y. Su (2023). "Traffic Order Analysis of Intersection Entrance Based on Aggressive Driving Behavior Data Using CatBoost and SHAP." Journal of Transportation Engineering, Part A: Systems 149(6): 04023037.
 
- Zubaidi, H. A., I. A. Obaid, A. Alnedawi and S. Das (2021). "Motor vehicle driver injury severity analysis utilizing a random parameter binary probit model considering different types of driving licenses in 4-legs roundabouts in South Australia." Safety science 134, 105083.
 
- Zuo, D., C. Qian, D. Xiao, X. Xu and H. Wang (2023). "Data-driven crash prediction by injury severity using a recurrent neural network model based on Keras framework." International Journal of Injury Control and Safety Promotion: 1-10.
Volume 16, Issue 4 - Serial Number 65
Spring 2025
Pages 4845-4860

  • Receive Date 04 September 2023
  • Revise Date 28 September 2023
  • Accept Date 30 September 2023