Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/29955
Title: High-dimensional prediction of binary outcomes in the presence of between-study heterogeneity
Authors: Mbah, Chamberlain
De Neve, Jan
THAS, Olivier 
Issue Date: 2019
Publisher: SAGE PUBLICATIONS LTD
Source: STATISTICAL METHODS IN MEDICAL RESEARCH, 28(9), p. 2848-2867
Abstract: Many prediction methods have been proposed in the literature, but most of them ignore heterogeneity between populations. Either only data from a single study or population is available for model building and evaluation, or when data from multiple studies make up the training dataset, studies are pooled before model building. As a result, prediction models might perform less than expected when applied to new subjects from new study populations. We propose a linear method for building prediction models with high-dimensional data from multiple studies. Our method explicitly addresses between-population variability and tends to select predictors that are predictive in most of the study populations. We employ empirical Bayes estimators and hence avoid selection bias during the variable selection process. Simulation results demonstrate that the new method works better than other linear prediction methods that ignore the between-study variability. Our method is developed for classification into two groups.
Notes: [Mbah, Chamberlain] Univ Ghent, Dept Radiotherapy & Expt Canc Res, C Heymanslaan 10,Radiotherapiepk, B-9000 Ghent, Belgium. [De Neve, Jan] Univ Ghent, Fac Psychol & Educ Sci, Dept Data Anal, Ghent, Belgium. [Thas, Olivier] Hasselt Univ, Ctr Stat, Diepenbeek, Belgium. [Thas, Olivier] Univ Wollongong, Natl Inst Appl Stat Res Australia, Wollongong, NSW, Australia.
Keywords: Health Care Sciences & Services; Mathematical & Computational Biology; Medical Informatics; Statistics & Probability;Empirical Bayes; high-dimensional data; multiple studies; heterogeneity; naive Bayes
Document URI: http://hdl.handle.net/1942/29955
ISSN: 0962-2802
e-ISSN: 1477-0334
DOI: 10.1177/0962280218787544
ISI #: 000484532300020
Category: A1
Type: Journal Contribution
Validations: ecoom 2020
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
mbah2018.pdf
  Restricted Access
Published version501.44 kBAdobe PDFView/Open    Request a copy
Show full item record

Page view(s)

202
checked on Sep 7, 2022

Download(s)

174
checked on Sep 7, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.