Explainable machine learning for predicting disease flares in Axial Spondyloarthritis: a real-world EHR-based pilot study

[thumbnail of Open Access]
Preview
Text (Open Access)
- Published Version
· Available under License Creative Commons Attribution.
[thumbnail of Flare prediction in axial spondyloarthritis DHS Submission with changes.pdf]
Text
- Accepted Version
· Restricted to Repository staff only

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Chan, A., Moon, P., Li, W. ORCID: https://orcid.org/0000-0003-2878-3185 and Bazuaye, E. (2026) Explainable machine learning for predicting disease flares in Axial Spondyloarthritis: a real-world EHR-based pilot study. Digital Health, 12. ISSN 2055-2076 doi: 10.1177/20552076261433513

Abstract/Summary

Objective Flares of axial spondyloarthritis (axSpA) are common yet unpredictable. We aimed to develop and internally validate a machine learning (ML) model to forecast flares 3, 6, 9 and 12 months ahead using routinely collected electronic health record (EHR) data. Methods We performed a retrospective cohort study of 282 axSpA patients (January 2018 to May 2024) in our centre. Flares were defined as a ≥ 2.1 unit rise in Bath Ankylosing Spondylitis Disease Activity Index (BASDAI), ≥ 0.9 unit rise in axSpA Disease Activity Score and clinician confirmed flare. Ninety-eight candidate predictors spanning demographics, patient-reported outcomes, laboratory indices and comorbidities were aggregated into time-series windows. We applied Light Gradient-Boosting Machine (LGBM) and eXtreme Gradient Boosting to forecast the risk of flare. Model interpretability was assessed with SHapley Additive exPlanations (SHAP). Results Of 282 patients, 100 (35.5%) experienced at least one flare. Our LGBM model demonstrated key 3-month metrics of: accuracy 0.846 (95% CI 0.545–0.980), sensitivity 0.833 (95% CI 0.358–0.995), specificity 0.857 (95% CI 0.421–0.996) and area under the receiver operating characteristic curve (AUROC) 0.845 (95% CI 0.615–1.000). Performance decreased modestly at 12 months, AUROC 0.773 (95% CI 0.562–0.984). Top SHAP contributors included comorbidity burden, BASDAI, CRP deviation, lymphocyte count, age and deprivation index. Individual-level SHAP plots enabled personalised risk profiles. Conclusion This proof-of-concept study demonstrates the feasibility of explainable ML models to predict axSpA flares up to 1 year in advance using real-world EHR data. Embedding the algorithm in electronic records could triage high-risk patients to earlier review and therapy adjustment. This approach offers a novel strategy to inform treat-to-target care pathways and support future integration into digital rheumatology systems.

Altmetric Badge

Dimensions Badge

Item Type Article
URI https://centaur.reading.ac.uk/id/eprint/128580
Identification Number/DOI 10.1177/20552076261433513
Refereed Yes
Divisions Interdisciplinary centres and themes > Health Innovation Partnership (HIP)
Henley Business School > Digitalisation, Marketing and Entrepreneurship
Publisher Sage
Download/View statistics View download statistics for this item

Downloads

Downloads per month over past year

University Staff: Request a correction | Centaur Editors: Update this record