Accessibility navigation


Integrating genetics, metabolites, and clinical characteristics in predicting cardiometabolic health outcomes using machine learning algorithms - a systematic review

Zhu, X., Ventura, E. F., Bansal, S., Wijeyesekera, A. ORCID: https://orcid.org/0000-0001-6151-5065 and Vimaleswaran, K. S. ORCID: https://orcid.org/0000-0002-8485-8930 (2025) Integrating genetics, metabolites, and clinical characteristics in predicting cardiometabolic health outcomes using machine learning algorithms - a systematic review. Computers in Biology and Medicine, 186. 109661. ISSN 1879-0534

[thumbnail of Open Access]
Preview
Text (Open Access) - Published Version
· Available under License Creative Commons Attribution.
· Please see our End User Agreement before downloading.

2MB
[thumbnail of Zhu et al_Clean_manuscript.docx] Text - Accepted Version
· Restricted to Repository staff only

864kB

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

To link to this item DOI: 10.1016/j.compbiomed.2025.109661

Abstract/Summary

Background: Machine learning (ML) integration of clinical, metabolite, and genetic data reveals variable results in predicting cardiometabolic health (CMH) outcomes. Therefore, we aim to (1) evaluate whether a multi-modal approach incorporating all three data types using ML algorithms can improve CMH outcome prediction compared to single-modal or paired-modal models, and (2) compare the methodologies used in existing prediction models. Methods: We systematically searched five databases from 1998 to 2024 for ML predictive modelling studies using the multi-modal approach for CMH outcomes. Risk-of-bias assessment tools were used to assess methodological quality. Study characteristics, ML algorithms, data preprocessing, evaluation methods and metrics, feature selections, and feature importance parameters were synthesized narratively to show methodological heterogeneity. Results: Of the four included studies (3 ML algorithms), three were at low risk of bias, and one was at high risk. The multi-modal approach consistently improved T2D and BP prediction compared to single-modal or paired-modal models. Genetics showed the lowest predictive performance in three studies. Logistic regression (n = 2 studies) and random forest (n = 1) were used in T2D studies, while XGBoost was used in one BP study. One study with missing data and variations in feature selection across all studies hindered a comprehensive comparison of feature importance. Conclusions: Our review emphasizes the potential improvement in T2D and BP prediction using ML algorithms with the multi-modal approach. However, further studies using diverse ML algorithms with optimized methodologies on single-modal, paired-modal, and multi-modal models are needed to gain insights into biomarker selection for predicting CMH outcomes.

Item Type:Article
Refereed:Yes
Divisions:Interdisciplinary Research Centres (IDRCs) > Institute for Food, Nutrition and Health (IFNH)
Life Sciences > School of Chemistry, Food and Pharmacy > Department of Food and Nutritional Sciences > Human Nutrition Research Group
ID Code:122531
Publisher:Elsevier

Downloads

Downloads per month over past year

University Staff: Request a correction | Centaur Editors: Update this record

Page navigation