nav emailalert searchbtn searchbox tablepage yinyongbenwen piczone journalimg journalInfo journalinfonormal searchdiv searchzone qikanlogo popupnotification paper paperNew
2025, 12, v.46 9-14
基于XGBoost算法的T2DM合并颈动脉斑块风险预测及致因研究
基金项目(Foundation): 院内医政管理优化项目(20220206)
邮箱(Email): yingmeix@163.com;
DOI: 10.19745/j.1003-8868.2025212
摘要:

目的:基于梯度提升决策树的集成学习(eXtreme gradient boosting,XGBoost)算法进行2型糖尿病(type 2 diabetes mellitus,T2DM)合并颈动脉斑块的风险预测及影响因素研究,以期为该疾病的发现、预防和治疗提供有价值的指导。方法:收集2019—2023年上海市某三甲医院3 707例T2DM患者的临床数据,包括2 359例横断面数据和1 348例纵向随访数据。采用方差分析法和Pearson相关系数法对数据进行交叉验证和变量筛选。基于横断面数据和纵向随访数据,采用XGBoost算法构建预测模型,并与人工神经网络、支持向量机以及高斯朴素贝叶斯3种机器学习模型进行对比以验证XGBoost模型的性能;采用沙普利加法解释(Shapley additive explanations,SHAP)分析模型的可解释性,采用特征权重图分析模型中各特征的重要性,并基于横断面数据和纵向随访数据对影响T2DM合并颈动脉斑块的因素进行对比。结果:XGBoost模型在横断面数据中准确率为0.77、精确率为0.62、召回率为0.59、F1分数为0.51,在纵向随访数据中准确率为0.75、精确率为0.57、召回率为0.68、F1分数为0.62。综合2类数据下的性能表现,XGBoost模型整体预测效果优于人工神经网络、支持向量机以及高斯朴素贝叶斯3种机器学习模型。SHAP值散点图和特征权重图显示,口干、高血压和年龄是影响T2DM合并颈动脉斑块的主要因素。结论:通过XGBoost算法可以实现对T2DM合并颈动脉斑块的预测。临床实践中应对影响该疾病的相关因素加强监测、评估和干预,以降低罹患该疾病的可能性。

Abstract:

Objective To explore the risk prediction and influencing factors of type 2 diabetes mellitus(T2DM) combined with carotid plaque based on XGBoost algorithm, so as to guide its diagnosis, prevention and treatment. Methods The clinical data from 2019 to 2023 were collected from 3 707 T2DM patients in some Tertiary Grade A hospital of Shanghai, including 2 359 cross-sectional cases and 1 348 longitudinal follow-up cases. Data cross-validation and variable screening were performed using analyses of variance(ANOVA) and Pearson correlation coefficients. A prediction model was constructed based on the cross-sectional and longitudinal follow-up data and the eXtreme gradient boosting(XGBoost) algorithm of the gradient boosting decision tree(GBDT), which had its performance validated by the comparison with three machine learning models of artificial neural network, support vector machine and Gaussian naive Baye; the interpretability of the model established was analyzed using Shapley additive explanations(SHAP) analysis model, the importance of each feature in the model was analyzed using feature weight plots, and the factors affecting combined T2DM and carotid plaque were compared with the cross-sectional data and longitudinal follow-up data. Results The XGBoost model had an accuracy of 0.77, precision of 0.62, recall of 0.59 and F1 score of 0.51 for the cross-sectional data, and an accuracy of 0.75, precision of 0.57, recall of 0.68 and F1 score of 0.62 for the longitudinal followup data, for the two kinds of data which gained advantages over the three machine learning models of artificial neural network, support vector machine and Gaussian naive Baye. Scatter plots of SHAP values and feature weight plots showed that dry mouth, hypertension and age were the main factors affecting combined T2DM and carotid plaque. Conclusion The XGBoost algorithm enables prediction of T2DM combined with carotid plaque, and the influencing factors have to underwent monitoring, evaluation and intervention during clinical operations so as to decrease the morbidity of combined T2DM and carotid plaque. [Chinese Medical Equipment Journal,2025,46(12):9-14]

参考文献

[1]张蕊,赵晓冉,郭宏英,等. 2型糖尿病患者发生颈动脉斑块的影响因素分析[J].中国医药导报,2019,16(20):72-75.

[2]程海涛,范斌,李鹏.融合时域卷积网络与XGBoost的糖尿病患者血糖时序预测[C]//2022中国自动化大会论文集.南京:南京邮电大学计算机学院,南京邮电大学网络安全与可信计算研究所,2022:623-630.

[3]马乾凤,宁传永,赵梦,等.颈动脉粥样硬化斑块稳定性影响因素分析[J].宁夏医学杂志,2023,45(8):708-710.

[4]陶俐均,陈润霖,何土凤,等.心血管疾病低风险群体颈动脉斑块危险因素及预测模型构建[J].中国预防医学杂志,2023,24(9):881-887.

[5]丁安乐,王赞,郭鹏,等. 2型糖尿病足病预测模型的建立[J].齐齐哈尔医学院学报,2022,43(22):2 112-2 118.

[6]郭金旦,高艳艳,高怀林,等. 2型糖尿病风险预测模型性能比较研究[J].中国生物工程杂志,2023,43(11):35-42.

[7]王琳,马锦花,陈慧娟,等. 2型糖尿病合并高血压患者颈动脉粥样硬化斑块形成的危险因素Logistic回归分析[J].罕少疾病杂志,2023,30(5):22-24.

[8]谢翠华,谢慧琴,孙艳,等.不同性别2型糖尿病患者血糖控制与颈动脉斑块、骨密度相关性[J].分子诊断与治疗杂志,2022,14(12):2 106-2 109,2 114.

[9]薛丽丽,王灵杰,石彩云,等. CTA识别症状性颈动脉斑块的放射组学研究[J].中西医结合心脑血管病杂志,2023,21(11):2 083-2 088.

[10]吴伟鹏,苏标瑞,许婉娜,等.影响脑梗死患者近期临床预后危险因素分析[J].按摩与康复医学,2023,14(4):52-55.

[11]孙莹,杨琴,吴冬平,等.初诊2型糖尿病合并颈动脉斑块患者颈动脉斑块灰阶中位数值预测脑梗死的临床研究[J].卒中与神经疾病,2022,29(5):448-452,461.

[12]王国安,龚莉,陈雪品,等.残粒脂蛋白胆固醇与2型糖尿病患者颈动脉斑块的关系研究[J].中国医刊,2023,58(11):1 180-1 183.

[13]STAEF M,OTT C,KANNENKERIL D,et al. Determinants of arterial stiffness in patients with type 2 diabetes mellitus:a cross sectional analysis[J]. Sci Rep,2023,13(1):8944.

[14]李建敦,蒋鹏,李桃,等.基于3类属性预测颈动脉斑块的随机森林方法研究[J].医疗卫生装备,2022,43(5):14-17.

[15]周亚斌,李建敦,陈京京,等.基于随机森林算法的糖尿病性视网膜病变预测方法研究[J].医疗卫生装备,2024,45(11):8-14.

[16]粗糙集基本理论知识——入门必看[EB/OL].(2020-01-04)[2025-03-15]. https://blog.csdn.net/Merry_hj/article/de tails/103833419.

[17]殷豪,林淼,王鹏,等.基于XGBoost的二轮车碾压事故致因研究[J].安全与环境工程,2023,30(5):19-27,45.

[18]STEWART C R,OBI N,EPANE E C,et al. Effects of diabetes on salivary gland protein expression of tetrahydrobiopterin and nitric oxide synthesis and function[J]. J Periodontol,2016,87(6):735-741.

[19]STANKEVICIENE I,PURIENE A,MIELIAUSKAITE D,et al.Detection of xerostomia,Sicca,and Sjogren’s syndromes in a national sample of adults[J]. BMC Oral Health,2021,21(1):552.

[20]AFSANEH ABADI P,KOOPAIE M,MONTAZERI R. Comparison of salivary nitric oxide and oral health in diabetic patients with and without xerostomia[J]. Diabetes Metab Syndr,2020,14(1):11-15.

[21]PEWOWARUK R J,KORCARZ C,TEDLA Y,et al. Carotid artery stiffness mechanisms associated with cardiovascular disease events and incident hypertension:the Multi-Ethnic Study of Atherosclerosis(MESA)[J]. Hypertension,2022,79(3):659-666.

[22]PEWOWARUK R J,TEDLA Y,KORCARZ C E,et al. Carotid artery stiffening with aging:structural versus load-dependent mechanisms in MESA(the Multi-Ethnic Study of Atherosclerosis)[J]. Hypertension,2022,79(1):150-158.

[23]肖贵梅,陈琳琳,刘芳.健康体检者颈动脉斑块检出情况及其危险因素分析[J].右江医学,2022,50(2):145-148.

[24]蓝洋,王宏宇.雌激素水平与血管相关疾病[J].心血管病学进展,2019,40(4):529-532.

基本信息:

DOI:10.19745/j.1003-8868.2025212

中图分类号:R543.4;R587.2

引用信息:

[1]李桃,徐映梅,蒋伏松.基于XGBoost算法的T2DM合并颈动脉斑块风险预测及致因研究[J].医疗卫生装备,2025,46(12):9-14.DOI:10.19745/j.1003-8868.2025212.

基金信息:

院内医政管理优化项目(20220206)

检 索 高级检索

引用

GB/T 7714-2015 格式引文
MLA格式引文
APA格式引文