Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/43004
Full metadata record
DC FieldValueLanguage
dc.contributor.authorReymond, Mathieu-
dc.contributor.authorHayes, Conor F.-
dc.contributor.authorWILLEM, Lander-
dc.contributor.authorRadulescu, Roxana-
dc.contributor.authorABRAMS, Steven-
dc.contributor.authorRoijers, Diederik M.-
dc.contributor.authorHowley, Enda-
dc.contributor.authorMannion, Patrick-
dc.contributor.authorHENS, Niel-
dc.contributor.authorNowe, Ann-
dc.contributor.authorLIBIN, Pieter-
dc.date.accessioned2024-05-28T08:38:58Z-
dc.date.available2024-05-28T08:38:58Z-
dc.date.issued2024-
dc.date.submitted2024-05-28T06:11:46Z-
dc.identifier.citationEXPERT SYSTEMS WITH APPLICATIONS, 249 (Art N° 123686)-
dc.identifier.urihttp://hdl.handle.net/1942/43004-
dc.description.abstractInfectious disease outbreaks can have a disruptive impact on public health and societal processes. As decisionmaking in the context of epidemic mitigation is multi -dimensional hence complex, reinforcement learning in combination with complex epidemic models provides a methodology to design refined prevention strategies. Current research focuses on optimizing policies with respect to a single objective, such as the pathogen's attack rate. However, as the mitigation of epidemics involves distinct, and possibly conflicting, criteria (i.a., mortality, morbidity, economic cost, well-being), a multi -objective decision approach is warranted to obtain balanced policies. To enhance future decision -making, we propose a deep multi -objective reinforcement learning approach by building upon a state-of-the-art algorithm called Pareto Conditioned Networks (PCN) to obtain a set of solutions for distinct outcomes of the decision problem. We consider different deconfinement strategies after the first Belgian lockdown within the COVID-19 pandemic and aim to minimize both COVID-19 cases (i.e., infections and hospitalizations) and the societal burden induced by the mitigation measures. As such, we connected a multi -objective Markov decision process with a stochastic compartment model designed to approximate the Belgian COVID-19 waves and explore reactive strategies. As these social mitigation measures are implemented in a continuous action space that modulates the contact matrix of the age -structured epidemic model, we extend PCN to this setting. We evaluate the solution set that PCN returns, and observe that it explored the whole range of possible social restrictions, leading to high -quality trade-offs, as it captured the problem dynamics. In this work, we demonstrate that multi -objective reinforcement learning adds value to epidemiological modeling and provides essential insights to balance mitigation policies.-
dc.description.sponsorshipC.F.H. is funded by the University of Galway Hardiman Scholarship, Belgium. This research was supported by funding from the Flemish Government under the ‘‘Onderzoeksprogramma Artificiële Intelligentie (AI) Vlaanderen’’ program. This work also received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant number 101003688 – EpiPose project). P.J.K.L. gratefully acknowledges support from FWO via postdoctoral fellowship, Belgium 1242021N and the Research council of the Vrije Universiteit Brussel (OZR-VUB via grant number OZR3863BOF). N.H. acknowledges support from the Scientific Chair of Evidence-based Vaccinology under the umbrella of the Methusalem framework at the University of Antwerp. N.H. and A.N. acknowledge funding from the iBOF DESCARTES project (reference: iBOF-21-027). L.W. gratefully acknowledges support from FWO postdoctoral fellowship 1234620N. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. A.N. acknowledges the support by the FWO COVID19 research project G0H0420N. P.L. and L.W. acknowledge support from FWO grant G059423N.-
dc.language.isoen-
dc.publisherPERGAMON-ELSEVIER SCIENCE LTD-
dc.rights2024 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).-
dc.subject.otherMulti-objective reinforcement learning-
dc.subject.otherEpidemic control-
dc.subject.otherCOVID-19 epidemic models-
dc.titleExploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning-
dc.typeJournal Contribution-
dc.identifier.volume249-
local.format.pages13-
local.bibliographicCitation.jcatA1-
dc.description.notesReymond, M (corresponding author), Vrije Univ Brussel, Brussels, Belgium.-
dc.description.notesmathieu.reymond@vub.be; c.hayes13@nuigalway.ie;-
dc.description.noteslander.willem@uantwerpen.be; roxana@ai.vub.ac.be;-
dc.description.notessteven.abrams@uantwerpen.be; diederik.roijers@vub.be;-
dc.description.notesenda.howley@nuigalway.ie; patrick.mannion@nuigalway.ie;-
dc.description.notesniel.hens@uhasselt.be; ann.nowe@ai.vub.ac.be; pieter.libin@vub.be-
local.publisher.placeTHE BOULEVARD, LANGFORD LANE, KIDLINGTON, OXFORD OX5 1GB, ENGLAND-
local.type.refereedRefereed-
local.type.specifiedArticle-
local.bibliographicCitation.artnr123686-
local.type.programmeH2020-
local.relation.h2020101003688-
dc.identifier.doi10.1016/j.eswa.2024.123686-
dc.identifier.isi001224116400001-
local.provider.typewosris-
local.description.affiliation[Reymond, Mathieu; Radulescu, Roxana; Roijers, Diederik M.; Nowe, Ann; Libin, Pieter] Vrije Univ Brussel, Brussels, Belgium.-
local.description.affiliation[Hayes, Conor F.; Howley, Enda; Mannion, Patrick] Natl Univ Ireland Galway, Galway, Ireland.-
local.description.affiliation[Willem, Lander; Abrams, Steven] Univ Antwerp, Antwerp, Belgium.-
local.description.affiliation[Hens, Niel] Hasselt Univ, Hasselt, Belgium.-
local.uhasselt.internationalyes-
item.fullcitationReymond, Mathieu; Hayes, Conor F.; WILLEM, Lander; Radulescu, Roxana; ABRAMS, Steven; Roijers, Diederik M.; Howley, Enda; Mannion, Patrick; HENS, Niel; Nowe, Ann & LIBIN, Pieter (2024) Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning. In: EXPERT SYSTEMS WITH APPLICATIONS, 249 (Art N° 123686).-
item.fulltextWith Fulltext-
item.contributorReymond, Mathieu-
item.contributorHayes, Conor F.-
item.contributorWILLEM, Lander-
item.contributorRadulescu, Roxana-
item.contributorABRAMS, Steven-
item.contributorRoijers, Diederik M.-
item.contributorHowley, Enda-
item.contributorMannion, Patrick-
item.contributorHENS, Niel-
item.contributorNowe, Ann-
item.contributorLIBIN, Pieter-
item.accessRightsOpen Access-
crisitem.journal.issn0957-4174-
crisitem.journal.eissn1873-6793-
Appears in Collections:Research publications
Files in This Item:
File Description SizeFormat 
Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning.pdfPublished version915.9 kBAdobe PDFView/Open
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.