Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/34142
Title: Fast and optimal algorithm for case-control matching using registry data: application on the antibiotics use of colorectal cancer patients
Authors: Mamouris, Pavlos; Nassiri, Vahid; Molenberghs, Geert; van den Akker, Marjan; van der Meer, Joep; Vaes, Bert
Issue Date: 2021
Publisher: BMC
Source: BMC Medical Research Methodology, 21 (1) (Art N° 62)
Abstract: Background In case-control studies, most algorithms allow controls to be sampled several times, which is not always optimal. If many controls are available and adjustment for several covariates is necessary, matching without replacement might increase statistical efficiency. Comparing similar units is of utmost importance in observational data, since confounding and selection bias are present. The aim was twofold: first, to create a method that accommodates the option of not resampling a control, and second, to display several scenarios that identify changes in odds ratios (ORs) as the balance of the matched sample increases.
Methods The algorithm was derived iteratively, from the pre-processing steps that prepare the data to its application in a study of the association between antibiotic use and colorectal cancer in the INTEGO registry (Flanders, Belgium). Different scenarios were developed to investigate the fluctuation of ORs using combinations of exact and varying variables, with or without replacement of controls. To achieve balance in the population, we introduced the Comorbidity Index (CI) variable, which is the sum of chronic diseases, as a means of obtaining comparable units from which to draw valid associations.
Results The algorithm is fast and optimal. It is fast because, on simulated data, the run-time of matching is minimal even with millions of patients. It is optimal because the closest control is always captured (through appropriate ordering and auxiliary variables) and because, when a case has only one control, that control is guaranteed to be matched to that case, thus maximizing the number of cases used in the analysis. In total, 72 different scenarios were displayed, showing the fluctuation of ORs and revealing patterns, especially a drop when balancing the population.
Conclusions We created an optimal and computationally efficient algorithm to derive a matched case-control sample with and without replacement of controls. The code and functions are publicly available as open source in an R package. Finally, we emphasize the importance of displaying several scenarios and assessing the differences in ORs when an index is used to balance the population in observational data.
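The matching idea summarized in the abstract (exact variables, a closest control on a varying variable, no reuse of controls, and priority for cases that have only one eligible control) can be illustrated with a minimal sketch. This is not the authors' algorithm or their R package: the function name `match_without_replacement`, the `caliper` parameter, the record fields (`id`, `sex`, `age`) and the toy data are all assumptions made for illustration, and the loop is deliberately simple (quadratic) rather than fast.

```python
def match_without_replacement(cases, controls, exact_keys, nearest_key, caliper):
    """Greedy 1:1 matching; every control is used at most once.

    Hypothetical illustration only, not the published algorithm.
    """
    used = set()      # ids of controls that are already matched
    matches = {}      # case id -> control id

    def eligible(case):
        # Unused controls that agree on all exact variables and lie
        # within the caliper on the varying variable.
        return [
            c for c in controls
            if c["id"] not in used
            and all(c[k] == case[k] for k in exact_keys)
            and abs(c[nearest_key] - case[nearest_key]) <= caliper
        ]

    remaining = list(cases)
    while remaining:
        # Serve the case with the fewest eligible controls first, so a case
        # with a single candidate is not left empty-handed by another case.
        remaining.sort(key=lambda case: len(eligible(case)))
        case = remaining.pop(0)
        candidates = eligible(case)
        if not candidates:
            continue  # no control left for this case; it stays unmatched
        # Closest control on the varying variable; ties broken by id.
        best = min(
            candidates,
            key=lambda c: (abs(c[nearest_key] - case[nearest_key]), c["id"]),
        )
        used.add(best["id"])
        matches[case["id"]] = best["id"]
    return matches


# Toy data (assumed): exact match on sex, age within +/- 2 years.
cases = [
    {"id": "A", "sex": "F", "age": 50},
    {"id": "B", "sex": "F", "age": 53},
]
controls = [
    {"id": "c1", "sex": "F", "age": 51},
    {"id": "c2", "sex": "F", "age": 48},
    {"id": "c3", "sex": "M", "age": 50},
]
matches = match_without_replacement(cases, controls, ["sex"], "age", caliper=2)
print(matches)
```

In this toy example case B has a single eligible control (c1), so it is served first and case A falls back to c2; processing the cases in input order would instead give c1 to A and leave B unmatched.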
Notes: Mamouris, P (corresponding author), Katholieke Univ Leuven, Dept Publ Hlth & Primary Care, Kapucijnenvoer 33,J Bldg, B-3000 Leuven, Belgium.
pavlos.mamouris@kuleuven.be
Keywords: Case-control;Optimal matching;Comorbidity index;Colorectal cancer
Document URI: http://hdl.handle.net/1942/34142
e-ISSN: 1471-2288
DOI: 10.1186/s12874-021-01256-3
ISI #: WOS:000636729400001
Rights: The Author(s). 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Category: A1
Type: Journal Contribution
Validations: ecoom 2022
Appears in Collections: Research publications

Files in This Item:
File: s12874-021-01256-3.pdf
Description: Published version
Size: 1.2 MB
Format: Adobe PDF

Web of Science citations: 13 (checked on Apr 24, 2024)
Page views: 40 (checked on Sep 7, 2022)
Downloads: 16 (checked on Sep 7, 2022)

