Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/29955
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMbah, Chamberlain-
dc.contributor.authorDe Neve, Jan-
dc.contributor.authorTHAS, Olivier-
dc.date.accessioned2019-11-13T11:00:26Z-
dc.date.available2019-11-13T11:00:26Z-
dc.date.issued2019-
dc.identifier.citationSTATISTICAL METHODS IN MEDICAL RESEARCH, 28(9), p. 2848-2867-
dc.identifier.issn0962-2802-
dc.identifier.urihttp://hdl.handle.net/1942/29955-
dc.description.abstractMany prediction methods have been proposed in the literature, but most of them ignore heterogeneity between populations. Either only data from a single study or population is available for model building and evaluation, or when data from multiple studies make up the training dataset, studies are pooled before model building. As a result, prediction models might perform less than expected when applied to new subjects from new study populations. We propose a linear method for building prediction models with high-dimensional data from multiple studies. Our method explicitly addresses between-population variability and tends to select predictors that are predictive in most of the study populations. We employ empirical Bayes estimators and hence avoid selection bias during the variable selection process. Simulation results demonstrate that the new method works better than other linear prediction methods that ignore the between-study variability. Our method is developed for classification into two groups.-
dc.description.sponsorshipThe author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This project received funding from the European Union's Seventh Framework Programme for research, technological development, and demonstration under grant agreement no. 601826.-
dc.language.isoen-
dc.publisherSAGE PUBLICATIONS LTD-
dc.subject.otherHealth Care Sciences & Services; Mathematical & Computational Biology; Medical Informatics; Statistics & Probability-
dc.subject.otherEmpirical Bayes; high-dimensional data; multiple studies; heterogeneity; naive Bayes-
dc.titleHigh-dimensional prediction of binary outcomes in the presence of between-study heterogeneity-
dc.typeJournal Contribution-
dc.identifier.epage2867-
dc.identifier.issue9-
dc.identifier.spage2848-
dc.identifier.volume28-
local.format.pages20-
local.bibliographicCitation.jcatA1-
dc.description.notes[Mbah, Chamberlain] Univ Ghent, Dept Radiotherapy & Expt Canc Res, C Heymanslaan 10,Radiotherapiepk, B-9000 Ghent, Belgium. [De Neve, Jan] Univ Ghent, Fac Psychol & Educ Sci, Dept Data Anal, Ghent, Belgium. [Thas, Olivier] Hasselt Univ, Ctr Stat, Diepenbeek, Belgium. [Thas, Olivier] Univ Wollongong, Natl Inst Appl Stat Res Australia, Wollongong, NSW, Australia.-
local.publisher.placeLONDON-
local.type.refereedRefereed-
local.type.specifiedArticle-
dc.identifier.doi10.1177/0962280218787544-
dc.identifier.isi000484532300020-
item.accessRightsRestricted Access-
item.validationecoom 2020-
item.fulltextWith Fulltext-
item.contributorMbah, Chamberlain-
item.contributorDe Neve, Jan-
item.contributorTHAS, Olivier-
item.fullcitationMbah, Chamberlain; De Neve, Jan & THAS, Olivier (2019) High-dimensional prediction of binary outcomes in the presence of between-study heterogeneity. In: STATISTICAL METHODS IN MEDICAL RESEARCH, 28(9), p. 2848-2867.-
crisitem.journal.issn0962-2802-
crisitem.journal.eissn1477-0334-
Appears in Collections:Research publications
Files in This Item:
File Description SizeFormat 
mbah2018.pdf
  Restricted Access
Published version501.44 kBAdobe PDFView/Open    Request a copy
Show simple item record

Page view(s)

202
checked on Sep 7, 2022

Download(s)

174
checked on Sep 7, 2022

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.