Surrogate modeling for LLM black box interpretability
Surrogate model for 10-year cardiovascular disease risk prediction
Age
years
Smoking status
HDL cholesterol
mg/dL
Sex
Systolic BP
mmHg
LDL cholesterol
mg/dL
Diabetes
Diastolic BP
mmHg
Triglycerides
mg/dL
Treatment for hypertension
Height
cm
HbA1c
%
Dyslipidemia
Weight
kg
Creatinine
mg/dL
History of atrial fibrillation
Waist circumference
cm
Uric acid
mg/dL
Chronic kidney disease
Hip circumference
cm
C-reactive protein
mg/dL
Family history of cardiovascular disease in first degree relatives
Total cholesterol
mg/dL
Race/ethnicity
Pooled Cohort Equations
The equation for the developed surrogate model for 10-year CVD risk prediction is as follows:
Score = -65.243 + Age (years) x 0.784 + Sex
[Preprint] Han, C., Kim, D. W., Kim, S., Kim, J., Bae, S., & Yoon, D. A Novel GPT-Derived Scoring System for 10-Year Cardiovascular Disease Risk Estimation. Available at SSRN 4763170.
[1] Goff Jr, D. C., Lloyd-Jones, D. M., Bennett, G., Coady, S., D’agostino, R. B., Gibbons, R., ... & Wilson, P. W. (2014). 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation, 129(25_suppl_2), S49-S73.