Topics in Modeling Multilevel and Longitudinal Data

RENARD, Didier

Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/20556

Title:	Topics in Modeling Multilevel and Longitudinal Data
Authors:	RENARD, Didier
Advisors:	MOLENBERGHS, Geert
Issue Date:	2002
Abstract:	Dit werkt richt zich op het formuleren van modellen en het schatten van parameters en hun bijhorende precisiematen voor hiërarchische (multilevel) en longitudinale gegevens. Dergelijke gegevens komen zeer frequent voor. We noemen eerst enkele belangrijke voorbeelden. In de Belgische Volksgezondheidsenquˆete (1997 en 2001; Hoofdstuk 2) worden respondenten bevraagd. De selectie van respondenten gebeurt in verscheidene stappen. Eerst wordt een deel van de steekproef toegekend aan de drie regio’s van het land, waarna een proportionele verdeling over de provincies volgt. Binnen elke provincie worden gemeenten geselecteerd, met selectiekans proportioneel aan de grootte van de provincie. Wanneer een gemeente geselecteerd wordt betekent dit dat er 50 interviews zullen afgenomen worden. Bij grotere gemeenten kunnen verscheidene groepen geselecteerd worden. Binnen gemeenten worden huishoudens geselecteerd en binnen huishoudens respondenten. Het zal duidelijk wezen dat er in een zorgvuldige analyse rekening dient gehouden met de stratificatie over regio’s en provincies, en met de hiërarchische selectie van respondenten binnen huishoudens en huishoudens binnen gemeente. Bovendien zijn de selectiekansen verschillend van individu tot individu, waardoor met wegingsfactoren dient rekening gehouden te worden. Eén van de gevolgen is dat respondenten niet noodzakelijk van elkaar onafhankelijk zijn. Summary excerpts: This work has focused on methods aimed at modeling data that are correlated. More specifically, we have been concerned with two types of data frequently encountered in applied statistics: multilevel and longitudinal data. Even though formally, multilevel data structures can be regarded as encompassing nearly all other types of correlated data, including longitudinal data, the distinction between these two data types is maintained since they each have their own pecularities and field of research. A significant part of our work (Chapters 3, 4 and 6) was devoted to models for binary responses. The difficulty in evaluating the likelihood for models with discrete correlated data has led to alternative methods of estimation and to an extensive body of research in the literature. The aim of our work was to study in more depth the use of maximum pairwise likelihood (MPL) as an estimation tool in multilevel models with binary responses. Pairwise likelihood is a special example of pseudo-likelihood, a notion first introduced by Besag (1975) and which amounts to construct a product (of pieces) of likelihoods. Another area where pseudo-likelihood methods would benefit from further research is when data are incomplete (or missing). A problem with incomplete data is that ignoring the missingness mechanism can result in misleading inference. Based on the well-known terminology of Rubin (1976) and Little and Rubin (1987), missing data mechanisms can be classified as missing completely at random (MCAR), missing at random (MAR) and missing not at random (MNAR). A remarkable fact about likelihood-based estimation is that MCAR and MAR mechanisms are ignorable (provided an unrestrictive separability condition on the parameter spaces is satisfied). In practical terms this means that MCAR and MAR missingness mechanisms can simply be ignored in the analysis of the data within the likelihood framework.
Document URI:	http://hdl.handle.net/1942/20556
Category:	T1
Type:	Theses and Dissertations
Appears in Collections:	PhD theses Research publications

Files in This Item:

File	Description	Size	Format
Didier Renard.pdf		986.17 kB	Adobe PDF	View/Open

Show full item record

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Google Scholar^TM