Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/28963
Title: A broken promise: microbiome differential abundance methods do not control the false discovery rate
Authors: Hawinkel, Stijn
Mattiello, Federico
BIJNENS, Luc 
THAS, Olivier 
Issue Date: 2019
Publisher: OXFORD UNIV PRESS
Source: BRIEFINGS IN BIOINFORMATICS, 20(1), p. 210-221
Abstract: High-throughput sequencing technologies allow easy characterization of the human microbiome, but the statistical methods to analyze microbiome data are still in their infancy. Differential abundance methods aim at detecting associations between the abundances of bacterial species and subject grouping factors. The results of such methods are important to identify the microbiome as a prognostic or diagnostic biomarker or to demonstrate efficacy of prodrug or antibiotic drugs. Because of a lack of benchmarking studies in the microbiome field, no consensus exists on the performance of the statistical methods. We have compared a large number of popular methods through extensive parametric and nonparametric simulation as well as real data shuffling algorithms. The results are consistent over the different approaches and all point to an alarming excess of false discoveries. This raises great doubts about the reliability of discoveries in past studies and imperils reproducibility of microbiome experiments. To further improve method benchmarking, we introduce a new simulation tool that allows to generate correlated count data following any univariate count distribution; the correlation structure may be inferred from real data. Most simulation studies discard the correlation between species, but our results indicate that this correlation can negatively affect the performance of statistical methods.
Notes: [Hawinkel, Stijn; Mattiello, Federico] Univ Ghent, Dept Math Modelling Stat & Bioinformat, BioStat Grp, Biostat, Ghent, Belgium. [Mattiello, Federico] Roche, Basel, Switzerland. [Bijnens, Luc] Johnson & Johnson, Janssen Pharmaceut Co, Quantitat Sci, Beerse, Belgium. [Bijnens, Luc] Hasselt Univ, Ctr Stat, Stat, Hasselt, Belgium. [Thas, Olivier] Univ Ghent, Dept Math Modelling Stat & Bioinformat, Biostat, Ghent, Belgium. [Thas, Olivier] Univ Wollongong, NIASRA, Wollongong, NSW, Australia.
Keywords: microbiome; differential abundance; simulation; taxa correlation networks; false discovery rate;Biochemical Research Methods; Mathematical & Computational Biology
Document URI: http://hdl.handle.net/1942/28963
ISSN: 1467-5463
e-ISSN: 1477-4054
DOI: 10.1093/bib/bbx104
ISI #: 000456736200019
Category: A1
Type: Journal Contribution
Validations: ecoom 2020
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
oup-accepted-manuscript-2017.pdf
  Restricted Access
Published version1.21 MBAdobe PDFView/Open    Request a copy
SimulationsPaper.pdfPeer-reviewed author version1.73 MBAdobe PDFView/Open
Show full item record

SCOPUSTM   
Citations

19
checked on Sep 5, 2020

WEB OF SCIENCETM
Citations

99
checked on Apr 16, 2024

Page view(s)

174
checked on Sep 7, 2022

Download(s)

652
checked on Sep 7, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.