| 313 | 0 | 710 |
| 下载次数 | 被引频次 | 阅读次数 |
为提高交通事故严重程度的预测准确性,构建一种基于觅食生境选择粒子群优化(foraging habitat selection particle swarm optimization, FHSPSO)算法优化随机森林(random forest, RF)算法关键超参数的FHSPSO-RF模型,以美国华盛顿州西雅图市2022年1月至2023年2月的交通事故数据为基础,选取12项特征指标,采用合成少数类过采样技术,增加重伤和死亡事故的样本数,改善事故类别分布均衡性;将FHSPSO-RF模型与支持向量机(support vector machine, SVM)、K近邻(K-nearest neighbors, KNN)和逻辑回归(logistic regression, LR)等模型进行性能对比,并通过SHAP(Shapley additive explanations)分析法,解析各特征对交通事故严重程度的影响机制。结果表明:过采样后重伤、死亡事故的召回率显著增大,FHSPSO-RF模型的整体性能更均衡;FHSPSO-RF模型的准确率、精确率、召回率、F1分数均高于其他3个模型,对交通事故严重程度的预测效果最好;在所有交通事故类型中,受伤人数和车辆数量均为最关键的驱动因素,对交通事故严重程度具有显著正向影响,行人数量、高冲击碰撞类型(如正面撞击)与复杂道路环境(如交叉口、匝道)是重伤及死亡事故的关键风险组合;财产损失事故与是否碰撞路边停放车辆密切相关。FHSPSORF模型在交通事故严重程度预测中表现出良好的性能与可解释性,可为交通事故风险预测与防控决策提供依据。
Abstract:To enhance the accuracy of traffic accident severity prediction, an FHSPSO-RF model is developed, where foraging habitat selection particle swarm optimization(FHSPSO) is used to optimize key hyperparameters of a random forest(RF). Using traffic accident data from Seattle, Washington, from January 2022 to February 2023, twelve features are selected. The synthetic minority oversampling technique is applied to increase the samples of severe-injury and fatal crashes and improve class balance. The FHSPSO-RF model is compared with support vector machine(SVM), K-nearest neighbors(KNN), and logistic regression(LR). Shapley additive explanations(SHAP) are used to interpret how each feature influences severity. Results indicate that after oversampling, the recall of severe-injury and fatal crashes increases significantly, and the FHSPSO-RF model achieves more balanced overall performance. The model attains higher accuracy, precision, recall, and F1 score than the three benchmarks, yielding the best severity prediction. Across all crash types, the numbers of injuries and vehicles are the most influential drivers with significant positive effects on severity. The number of pedestrians, high-impact collision types such as head-on crashes, and complex road environments such as intersections and ramps form key risk combinations for severe-injury and fatal crashes. Property-damage-only crashes are closely related to whether a parked roadside vehicle is struck. The FHSPSO-RF model demonstrates strong predictive performance and interpretability, providing support for crash risk prediction and prevention-control decision making.
[1] 何雅琴,段雨阳,王晨.基于累积Logistic模型的行人交通事故严重程度分析及对策研究[J].安全与环境学报,2021,21(3):1165-1172.
[2] 胡郁葱,韦湖,曾强.基于空间广义有序Probit模型的高速公路事故严重程度分析[J].华南理工大学学报(自然科学版),2023,51(1):114-122.
[3] 张萌萌,李泽文,徐云帆,等.基于优化CART的交通事故严重程度影响因素分析[J].重庆交通大学学报(自然科学版),2023,42(9):93-98.
[4] 成卫,马铭炜,张小龙.基于贝叶斯网络的高速公路交通事故严重程度预测及致因分析[J].重庆交通大学学报(自然科学版),2023,42(7):87-95.
[5] 潘义勇,徐翔宇.数据不平衡的MobileViT网络交通事故严重程度预测模型[J].吉林大学学报(工学版),2025,55(3):947-953.
[6] VADHWANI D,THAKOR D.An improved XGBoost model to predict the injury severity of person in road crash[J].International Journal of Crashworthiness,2025,30(2):115-124.
[7] KUMAR GANGADHARI R,KUMAR TAREI P.Qualitative investigation of the influential factors behind unsafe trucking behaviors in India[J].Transportation Research Record:Journal of the Transportation Research Board,2021,2675(1):67-78.
[8] 乔建刚,范颖蓉,陶瑞,等.山区高速隧道交通事故严重程度预测及特大事故决策规则提取[J].中国安全生产科学技术,2024,20(7):186-192.
[9] 熊向阳,杨小周,赵银超,等.基于超参数优化随机森林算法的森林生物量遥感反演[J].中南林业科技大学学报,2024,44(5):102-111.
[10] 严格齐,赵婉莹,于镇伟,等.基于超参数优化算法的随机森林模型预测奶牛呼吸频率[J].农业工程学报,2024,40(11):195-203.
[11] 佟敏,崔亚茹,党乐,等.带参数优化的随机森林变压器油中溶解气体浓度预测[J].河北师范大学学报(自然科学版),2022,46(5):468-473.
[12] 陈子廓,史宪睿.基于觅食生境选择的改进粒子群算法[J].辽宁工业大学学报(自然科学版),2022,42(1):19-21.
[13] 陆雅雯,李正权,谭立容,等.基于遗传粒子群算法的超混沌S盒设计[J].江苏大学学报(自然科学版),2024,45(6):701-708.
[14] SHI Y,EBERHART R.A modified particle swarm optimizer[C]//IEEE World Congress on Computational Intelligence.Proceedings of IEEE International Conference on Evolutionary Computation Proceedings.Anchorage,AK,USA.IEEE,1998:69-73.
[15] 中华人民共和国公安部.道路交通管理信息采集规范第3部分:道路交通事故处理信息采集:GA/T 946.3—2011[S].北京:中国标准出版社,2011.
[16] 罗聪,徐克,刘潜,等.安全风险分级管控相关概念辨析[J].中国安全科学学报,2019,29(10):43-50.
[17] 于琪,马路.固有匹配对下不同动力类型车辆交通事故乘员伤害严重程度影响因素研究[J].综合运输,2024,46(9):114-119.
[18] 司贺兵,赵丹.低等级道路交通事故严重程度影响因素分析[J].中国人民公安大学学报(自然科学版),2024,30(3):42-48.
[19] 秦海波,赵哲源,刘兵,等.基于皮尔逊系数的防窃电系统建设方案[J].电气技术与经济,2024(6):1-3.
[20] 张恒,陈焕明,党步伟,等.基于Attention机制和递归思想的LSTM车辆轨迹预测[J].青岛大学学报(工程技术版),2024,39(2):74-82.
[21] 潘比,余靖华,黄译贤,等.基于SMOTE算法和机器学习模型建立原发性肝癌术后的预后预测模型[J].陆军军医大学学报,2024,46(19):2236-2240.
[22] 陈丽萍,王洪海,何舒平.一种基于数据分布的不平衡数据过采样方法[J].安徽大学学报(自然科学版),2024,48(5):26-36.
[23] 李爱华,刘婉昕,陈思帆,等.面向不平衡数据的SMOTE-BO-XGBoost集成信用评分模型研究[PP].中国管理科学.(2024-10-11)[2024-11-5].https://link.cnki.net/doi/10.16381/j.cnki.issn1003-207x.2023.0635.
[24] 王健宇,陈献天,焦朋朋,等.考虑建成环境的交通事故严重程度致因交互效应研究[J].交通运输系统工程与信息,2024,24(2):272-280.
[25] 孙国锋,景云,李和壁,等.基于可解释机器学习框架的列车乘车区段客流分布预测方法[J].交通运输系统工程与信息,2024,24(2):249-262.
基本信息:
DOI:10.3969/j.issn.1672-0032.2026.01.003
中图分类号:U491.31
引用信息:
[1]王淑椒,胡学龙,唐磊.基于FHSPSO-RF的交通事故严重程度预测[J].山东交通学院学报,2026,34(01):25-33.DOI:10.3969/j.issn.1672-0032.2026.01.003.
2025-09-09
2025-09-09
2025-09-09