Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/49405
Title: Statistical modeling of federated data through sufficient statistics
Authors: LIMPOCO, Marie Analiz April 
Advisors: Hens, Niel
Faes, Christel
Issue Date: 2026
Abstract: Statistical modeling on individual-level data is indispensable in health research. However, regulations for accessing individual-level data for health research have become more stringent as technology has advanced drastically in recent years. Consequently, individual-level data may not be obtained in a timely manner, and research progress is at risk of being hampered. This thesis addresses the data sharing challenges faced by data providers and data analysts. We propose a framework that enables data analysts to perform statistical inference at the individual level without having access to personal health data. Only summary statistics are shared once, from which pseudo-data are generated. These pseudo-data are used to replace the actual data when estimating generalized linear mixed models (GLMM). Scalability issues are addressed by generating compressed pseudo-data with associated frequency weights. Communication and resource efficiency as well as wide applicability distinguish our proposed framework from existing approaches in the literature.
Keywords: federated data analysis;summary statistics;generalized linear mixed models;pseudo-data
Document URI: http://hdl.handle.net/1942/49405
Category: T1
Type: Theses and Dissertations
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
PhD Limpoco Liz.pdf
  Until 2031-06-27
Published version37.75 MBAdobe PDFView/Open    Request a copy
Show full item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.