Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/47512
Title: Development of multiple microbiome biomarkers using penalized regression methods
Authors: NGUYEN, Thi Huyen 
HAMAD, Ibrahim 
KLEINEWIETFELD, Markus 
AMARATUNGA, Dhammika 
Cabrera, Javier
Sargsyan, Davit
SENGUPTA, Rudradev 
OWOKOTOMO, Olajumoke Evangelina 
Katehakis, Michael N
SHKEDY, Ziv 
Issue Date: 2025
Publisher: Oxford Academic
Source: Journal of applied microbiology, lxaf242
Abstract: Aims Identifying biomarkers that reflect the complex relationship between the microbiome and health outcomes in microbiome studies is essential for advancing the understanding and improving disease management. While past research was focused on a single biomarker modeling approach, this study extends that work by combining multiple taxa to identify a subset of multiple biomarkers relevant to clinical outcomes. Methods and Results We extend the information theory framework for surrogate endpoint evaluation (Alonso and Molenberghs, 2007) by applying LASSO and Elastic Net models (Zou and Hastie, 2005; Hastie et al., 2015) to identify combinations of taxa as biomarkers for clinical outcomes. Feature selection for the biomarker’s construction is done in order to maximize the goodness of fit of the predictive biomarker model. Monte Carlo cross validation is used to enhance the reliability of feature selection. The high salt diet (HSD) study on mice, is used to illustrate the methodology for continuous outcome (tumor size). The top 5 selected genera yielded a correlation of 0.9274 between predicted and observed tumor size, with a 67.92% reduction in uncertainty when the multiple microbiome biomarkers score is known. To illustrate the methodology for binary outcome, the CERTIFI study on Crohn’s disease patients treated with ustekinumab is used. A multiple microbiome biomarkers score, constructed using the top 5 selected families, significantly improved prediction of remission 6 weeks after induction treatment (the clinical outcome of interest). Conclusions This study presents a unified approach for identifying multiple microbiome biomarkers using penalized regression for clinical outcome prediction. The proposed methods are applied to both continuous and binary outcomes. The method enhances the detection of meaningful biomarkers with potential for personalized treatment and disease management.
Keywords: microbiome;biomarkers detection;multiple biomarkers;penalized regression;LASSO/Elastic Net
Document URI: http://hdl.handle.net/1942/47512
ISSN: 1364-5072
e-ISSN: 1365-2672
DOI: 10.1093/jambio/lxaf242
Rights: The Author(s) 2025. Published by Oxford University Press on behalf of Applied Microbiology International. All rights reserved. For commercial re-use, please contact reprints@oup.com for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site-for further information please contact journals.permissions@oup.com This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
Category: A1
Type: Journal Contribution
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
Published manuscript.pdf
  Until 2026-04-15
Peer-reviewed author version2.03 MBAdobe PDFView/Open    Request a copy
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.