Article contents
PD33 Development And Validation Of A Machine Learning-Based Prediction Model For COVID-19 Diagnosis Using Patients’ Metabolomic Profile Data
Published online by Cambridge University Press: 23 December 2022
Abstract
We aimed to develop and validate machine learning (ML) -based algorithms to predict COVID-19 diagnosis as well as to identify new biomarkers associated with the disease.
Initially, 96 blood samples of patients diagnosed with COVID-19 (Thaizhou Hospital, China) were analyzed through liquid chromatography coupled to mass spectrometry. Samples of patients presenting other pneumonias or severe acute respiratory syndrome, but with negative RT-PCR for SARS-CoV-2, were used as positive controls. Samples from healthy volunteers were used as negative controls. The final database included around 1000 metabolites. Exploratory analyses for the development of ML-based models using principal component analysis (PCA) were performed. Leverage plot versus studentized residuals method was used to detect outliers. Three supervised ML-based models were developed: discriminant analysis by partial least squares (PLS-DA), artificial neural networks discriminant analysis (ANNDA) and k-nearest neighbors (KNN). Samples for the training (70%) and testing sets (30%) were randomly selected using the Kenrad Stone algorithm. Models’ performance was evaluated considering accuracy, sensitivity and specificity. Analyses were conducted in SOLO (Eigenvector-Research).
The PCA model was able to distinguish the three classes of patients’ samples (positive for COVID-19, negative controls, positive controls) with an overall accumulated variance of 94.27 percent. The PLS-DA model presented the best performance (accuracy, sensitivity, and specificity of 93%, 98% and 88%, respectively). Increased levels of the biomarkers uridine (linked to glucose homeostasis, lipid, and amino acid metabolisms), 4-hydroxyphenylacetoylcarnitine (metabolite from the tyrosine metabolism; probably associated with anorexia) and ribothymidine (resulting from oral and fecal microbiota alterations) were significantly associated with COVID-19.
Three different and updated ML-based algorithms were developed to predict COVID-19 diagnosis; PLS-DA led to the most accurate results. High levels of some metabolites were found as potentially predictors of the disease. These biomarkers should be further evaluated as potential therapeutic targets in well-designed clinical trials. These ML-based models can help the early diagnosis of COVID-19 and guide the development of tailored interventions.
- Type
- Poster Debate
- Information
- International Journal of Technology Assessment in Health Care , Volume 38 , Special Issue S1: Abstracts from the HTAi 2022 Meeting in Utrecht, Netherlands , December 2022 , pp. S101 - S102
- Copyright
- © The Author(s), 2022. Published by Cambridge University Press
- 1
- Cited by