Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/16193
Title: Data mining for longitudinal data under multicollinearity and time dependence using penalized generalized estimating equations
Authors: BLOMMAERT, Adriaan 
HENS, Niel 
Beutels, Ph
Issue Date: 2014
Source: COMPUTATIONAL STATISTICS & DATA ANALYSIS, 71, p. 667-680
Abstract: Penalized generalized estimating equations with Elastic Net or L2-Smoothly Clipped Absolute Deviation penalization are proposed to simultaneously select the most important variables and estimate their effects for longitudinal Gaussian data when multicollinearity is present. The method is able to consistently select and estimate the main effects even when strong correlations are present. In addition, the potential pitfall of time-dependent covariates is clarified. Both asymptotic theory and simulation results reveal the effectiveness of penalization as a data mining tool for longitudinal data, especially when a large number of variables is present. The method is illustrated by mining for the main determinants of life expectancy in Europe. (C) 2013 Elsevier B.V. All rights reserved.
Notes: Blommaert, A (reprint author), Univ Pl 1 S4-11, BE-2610 Antwerp, Belgium. adriaan.blommaert@ua.ac.be; Niel.Hens@uhasselt.be; Philippe.Beutels@ua.ac.be
Keywords: Covariate selection; Generalized estimating equations; Longitudinal data; Multicollinearity; Penalization; Time-dependent covariates
Document URI: http://hdl.handle.net/1942/16193
ISSN: 0167-9473
e-ISSN: 1872-7352
DOI: 10.1016/j.csda.2013.02.023
ISI #: 000328869000050
Category: A1
Type: Journal Contribution
Validations: ecoom 2015
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
blommaert 1.pdf
  Restricted Access
Published version471.57 kBAdobe PDFView/Open    Request a copy
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.