体检人群脂肪肝风险预测模型的构建与验证*

doi:10.3969/j.issn.1672-5069.2026.02.010

摘要/Abstract

摘要： 目的基于体检中心常规指标构建脂肪肝早期预测模型,实现精准且低成本的筛查方法。方法 2024年2月～2024年5月安徽理工大学第一附属医院体检中心接受体检的人群1212例,使用超声检查诊断脂肪肝,常规临床检测后计算各种指数。采用嵌套交叉验证(10折外层+5折内层)结合随机森林和XGBoost进行特征选择,应用LASSO回归建模。经SHAP值解释变量重要性,应用Bootstrap法(1000次迭代)进行内部验证,并采用随机划分30%数据进行外部验证。结果在纳入的1212例人群中,发现脂肪肝542例(44.7%);最终模型纳入4个关键变量,即甘油三酯-葡萄糖-BMI指数(TyG-BMI)、体脂率、舒张压和单核细胞与高密度脂蛋白胆固醇比值(MHR);模型效能优异,其嵌套交叉验证AUC为0.874(95%CI:0.855～0.893),最终模型AUC为0.880(95%CI:0.861～0.898),乐观校正后AUC为0.878(95%CI:0.860～0.897),外部验证AUC为0.866 (95%CI:0.830～0.902);校准度及稳定性良好(校准斜率≈1,Hosmer-Lemeshow检验P值=0.433,噪声鲁棒性检验AUC=0.878),SHAP分析显示TyG-BMI贡献度最大。结论本研究建立的脂肪肝预测模型判别力高、校准度好、易获取,可转化为体检中心脂肪肝“精准-高效-低成本”的筛查工具。

关键词: 脂肪肝, 机器学习预测模型, 甘油三酯-葡萄糖-体质指数, 体检人群

Abstract: Objective The aim of this study was to set up and validate a precise yet low-cost early prediction model for fatty liver disease based on routine indicators available in health-checkup centers. Methods A retrospective cohort of 1212 individuals for physical examination was analyzed, and the fatty liver was diagnosed based on ultrasonography. Various indexes were calculated based on clinical materials. Nested cross-validation (10-fold outer loop for validation and 5-fold inner loop for tuning) was combined random-forest and XGBoost with LASSO Logistic regression was conducted for feature selection. Variable importance was interpreted with SHAP values. Internal validation was used 1000-bootstrap optimism-corrected AUC and external validation was employed by a 30 % random split. Results Of the 1212 individuals, fatty liver was found in 542 cases(44.7%);the final model retained four variables, e.g.,triglyceride-glucose-body mass index (TyG-BMI), body-fat percentage, diastolic blood pressure and monocyte to high-density lipoprotein cholesterol (MHR); AUC of nested-cross-validation was 0.874 (95 % CI: 0.855-0.893), AUC of final-model was 0.880 (95 % CI: 0.861-0.898), AUC of optimism-corrected was 0.878 (95 % CI: 0.860-0.897) and AUC of external was 0.866 (95 % CI: 0.830-0.902); calibration was excellent (slope ≈ 1; Hosmer-Lemeshow P=0.433) and robust under 30 % Gaussian noise (AUC=0.878); SHAP analysis identified TyG-BMI as the dominant contributor. Conclusion The four-variable model demonstrates high discrimination, excellent calibration, easy acquisition and strong generalizability, which might offer health-checkup centers a“precise, efficient and low-cost” screening tool for fatty liver disease.

Key words: Fatty liver, Machine learning prediction mode, Triglyceride-glucose-body mass index, Health check-up population

张水珠, 丁梦寒, 周淑萍. 体检人群脂肪肝风险预测模型的构建与验证^*[J]. 实用肝脏病杂志, 2026, 29(2): 199-204.

Zhang Shuizhu, Ding Menghan, Zhou Shuping. Establishment and validation of a risk prediction model for fatty liver disease in health checkup individuals[J]. Journal of Practical Hepatology, 2026, 29(2): 199-204.

参考文献

[1] Dai JJ, Zhang YF, Zhang ZH. Global trends and hotspots of treatment for nonalcoholic fatty liver disease: A bibliometric and visualization analysis (2010-2023). World J Gastroenterol, 2023, 29(37): 5339-5360.
[2] Younossi ZM, Golabi P, Paik JM, et al. The global epidemiology of nonalcoholic fatty liver disease (NAFLD) and nonalcoholic steatohepatitis (NASH): a systematic review. Hepatology, 2023, 77(4): 1335-1347.
[3] Keating SE, Sabag A, Hallsworth K, et al. Exercise in the management of metabolic-associated fatty liver disease (MAFLD) in adults: a position statement from exercise and sport science Australia. Sports Med, 2023, 53(12): 2347-2371.
[4] Rinella ME, Neuschwander-Tetri BA, Siddiqui MS, et al. AASLD practice guidance on the clinical assessment and management of nonalcoholic fatty liver disease. Hepatology, 2023, 77(5): 1797-1835.
[5] Shi YY, Liang YF. Prevalence and health management needs of non-alcoholic fatty liver disease in middle-aged and young people undergoing health check-ups: a survey. Heilongjiang Med Sci, 2022, 45(6): 161-162.
[6] Chen G, Shuai XJ, Luo W, et al. Application of the“expert consensus on the management of important abnormal findings in health check-ups (trial version)”in health check-up centers. Health Check-up and Management, 2023, 4(2): 112-116.
[7] Zheng Y, et al. External validation of fatty liver index and hepatic steatosis index in a large-scale Chinese physical examination cohort. BMJ Open, 2021, 11: e047822.
[8] Razmpour F, Daryabeygi-Khotbehsara R, Soleimani D, et al. Application of machine learning in predicting non-alcoholic fatty liver disease using anthropometric and body composition indices. Sci Rep, 2023, 13: 4942.
[9] Deurenberg P, Weststrate JA, Seidell JC. Body mass index as a measure of body fatness: age- and sex-specific prediction formulas. Br J Nutr, 1991, 65(2): 105-114.
[10] Simental-Mendía LE, Rodríguez-Morán M, Guerrero-Romero F. The product of fasting glucose and triglycerides as surrogate for identifying insulin resistance in apparently healthy subjects. Metab Syndr Relat Disord, 2008, 6(4): 299-304.
[11] Lim J, Kim J, Koo SH, et al. Comparison of triglyceride glucose index, and related parameters to predict insulin resistance in Korean adults: an analysis of the 2007-2010 Korean national health and nutrition examination survey. PLoS One, 2019, 14(3): e0212963.
[12] Mansoori A, Nosrati M, Dorchin M, et al. A novel index for diagnosis of type 2 diabetes mellitus: cholesterol, high density lipoprotein, and glucose (CHG) index. J Diabetes Investig, 2025, 16(2): 309-314.
[13] Wu J, Huang L, He H, et al. Red cell distribution width to platelet ratio is associated with increasing in-hospital mortality in critically ill patients with acute kidney injury. Dis Markers, 2022, 2022: 4802702.
[14] ArefhosseiniS, Aghajani T, Tutunchi H, et al. Association of systemic inflammatory indices with anthropometric measures, metabolic factors, and liver function in non-alcoholic fatty liver disease. Sci Rep, 2024, 14: 12829.
[15] Tosu AR, Biter H. Association of systemic immune-inflammation index (SII) with presence of isolated coronary artery ectasia. Arch Med Sci Atheroscler Dis, 2021, 6: e152-e157.
[16] Millán J, Pintó X, Muñoz A, et al. Lipoprotein ratios: physiological significance and clinical usefulness in cardiovascular prevention. Vasc Health Risk Manag, 2009, 5: 757-765.
[17] Soffer DE, Marston NA, Maki KC, et al. Role of apolipoprotein B in the clinical management of cardiovascular risk in adults: an expert clinical consensus from the National Lipid Association. J Clin Lipidol, 2024, 18(5): e647-e663.
[18] Qian X, Wu W, Chen B, et al. Value of triglyceride glucose-body mass index in predicting nonalcoholic fatty liver disease in individuals with type 2 diabetes mellitus. Front Endocrinol (Lausanne), 2025, 15: 1425024.
[19] Huang X, Hu Y, Dong L, et al. TyG-BMI as a superior predictor of MAFLD and pre-MAFLD in Chinese adults: a cross-sectional study. BMC Gastroenterol, 2025, 25: 495.
[20] Roh JH, Park JH, Lee H, et al. A close relationship between non-alcoholic fatty liver disease markerand new-onset hypertension in healthy Korean adults. Korean Circ J, 2020, 50(8): 695-705.
[21] Åberg F, Kantojärvi K, Männistö V, et al. Association between arterial hypertension and liver outcomes using polygenic risk scores: a population-based study. Sci Rep, 2022, 12: 15581.
[22] Kaya Eda,Yilmaz Yusuf.累及全身多系统的疾病:代谢相关(非酒精性)脂肪性肝病.中华肝脏病杂志,2025,33(1):77-87.
[23] Huang H, Wang Q, Shi X, et al. Association between monocyte to high-density lipoprotein cholesterol ratio and nonalcoholic fatty liver disease: across-sectional study. Mediators Inflamm, 2021, 2021: 6642246.
[24] Wang L, Dong J, Xu M, et al. Association between monocyte to high-density lipoprotein cholesterol ratio and risk of non-alcoholic fatty liver disease: a cross-sectional study. Front Med (Lausanne), 2022, 9:898931.
[25] Bril F, Barb D, Portillo-Sanchez P, et al. Metabolic and histological implications of intrahepatic triglyceride content in nonalcoholic fatty liver disease. Hepatology, 2017, 65(4): 1132-1144.
[26] Zhao H, Fang Y, Zhao J, et al. Nonlinear relationship between body fat percentage and NAFLD mediated by METS-IR: threshold effects and subgroup differences. Sci Rep, 2025, 15: 24917.
[27] 崔婷婷,安雪莲,孙德亮,等. 基于SHAP的可解释机器学习的滑坡易发性评价模型. 成都理工大学学报(自然科学版), 2025, 52 (01): 153-172.

体检人群脂肪肝风险预测模型的构建与验证^*

Establishment and validation of a risk prediction model for fatty liver disease in health checkup individuals

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

[1]	党学娟, 谢晓艳, 唐萍, 李敏. 一电力企业男性职工脂肪肝检出情况及影响因素分析研究^*[J]. 实用肝脏病杂志, 2025, 28(3): 474-476.
[2]	黄嘉伟, 纪雅丽, 周玲, 陈金军. 代谢相关脂肪性肝病患者SAF评分和脂肪肝进展抑制算法应用研究^*[J]. 实用肝脏病杂志, 2025, 28(1): 48-51.
[3]	沈玥, 朱宁, 王海. FibroTouch检测参数诊断非酒精性脂肪性肝炎患者效能分析^*[J]. 实用肝脏病杂志, 2025, 28(1): 60-63.
[4]	伏林, 张龄尹, 魏倩. MRI在脂肪肝背景下对局灶性结节性病变定性诊断价值分析^*[J]. 实用肝脏病杂志, 2025, 28(1): 140-143.
[5]	罗良德, 苏建明, 任成果, 申红. 复方甘草酸苷联合非诺贝特治疗代谢相关性脂肪性肝病患者疗效及其对脂代谢的影响^*[J]. 实用肝脏病杂志, 2023, 26(5): 650-653.
[6]	胡灵溪, 安薪宇, 李妹, 刘百成, 南月敏, 王荣琦. 超声衰减参数诊断体检人群脂肪肝临床应用价值分析^*[J]. 实用肝脏病杂志, 2023, 26(4): 488-491.
[7]	陈凤莲, 朱青蓝, 朱伶俐, 陈国飞. 非酒精性脂肪性肝病患者外周血恒定自然杀伤T细胞和CD4⁺/CD8⁺T细胞活化差异研究^*[J]. 实用肝脏病杂志, 2023, 26(1): 31-34.
[8]	徐伟强, 刘淑萍, 李潇萌. 体检人群非酒精性脂肪性肝病检出率及其危险因素分析^*[J]. 实用肝脏病杂志, 2023, 26(1): 35-38.
[9]	刘海霞, 朱云霞, 段忠辉, 赖曼, 陈煜. 近20年妊娠急性脂肪肝患者预后变化和死亡原因分析[J]. 实用肝脏病杂志, 2023, 26(1): 55-58.
[10]	王玉梅, 袁向东, 刘玉玲, 赵子瑜. 提示肥胖症儿童/青少年存在非酒精性脂肪性肝炎的指标分析^*[J]. 实用肝脏病杂志, 2022, 25(6): 808-811.
[11]	黄玮, 范智慧, 陈磊, 王丹. 脂肪肝对超声造影诊断肝内炎性假瘤的影响^*[J]. 实用肝脏病杂志, 2022, 25(6): 897-900.
[12]	杨蕊旭, 范建高. 非酒精性脂肪性肝病相关肝细胞癌流行病学与筛查^*[J]. 实用肝脏病杂志, 2022, 25(2): 153-156.
[13]	王琳, 林雪松, 丁楠. 非酒精性单纯性脂肪肝与非酒精性脂肪性肝炎患者血清脂质组学比较^*[J]. 实用肝脏病杂志, 2022, 25(2): 215-218.
[14]	葛海燕, 匡霞, 周瑞君. 非酒精性脂肪性肝病合并2型糖尿病患者外周血miR-17、miR-20a和miR-20b变化及其临床意义^*[J]. 实用肝脏病杂志, 2021, 24(5): 697-700.
[15]	李拓键, 张超, 陈宗涛. 重庆市800名教职工健康体检脂肪肝检出情况分析^*[J]. 实用肝脏病杂志, 2021, 24(4): 512-515.