Deep reinforcement learning for large-scale epidemic control

LIBIN, Pieter; Moonens, Arno; Verstraeten, Timothy; Perez-Sanjines, Fabian; HENS, Niel; Lemey, Philippe; Nowé, Ann

Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/34337

Full metadata record

DC Field	Value	Language
dc.contributor.author	LIBIN, Pieter	-
dc.contributor.author	Moonens, Arno	-
dc.contributor.author	Verstraeten, Timothy	-
dc.contributor.author	Perez-Sanjines, Fabian	-
dc.contributor.author	HENS, Niel	-
dc.contributor.author	Lemey, Philippe	-
dc.contributor.author	Nowé, Ann	-
dc.date.accessioned	2021-06-23T12:24:20Z	-
dc.date.available	2021-06-23T12:24:20Z	-
dc.date.issued	2021	-
dc.date.submitted	2021-06-17T14:19:12Z	-
dc.identifier.citation	Dong, Y., Ifrim, G., Mladenić, D., Saunders, C., Van Hoecke, S. (Ed.), Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track. ECML PKDD 2020, Springer, p. 155-170.	-
dc.identifier.isbn	978-3-030-67669-8	-
dc.identifier.isbn	978-3-030-67670-4	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	http://hdl.handle.net/1942/34337	-
dc.description.abstract	Epidemics of infectious diseases are an important threat to public health and global economies. Yet, the development of prevention strategies remains a challenging process, as epidemics are non-linear and complex processes. For this reason, we investigate a deep reinforcement learning approach to automatically learn prevention strategies in the context of pandemic influenza. Firstly, we construct a new epidemiological meta-population model, with 379 patches (one for each administrative district in Great Britain), that adequately captures the infection process of pandemic influenza. Our model balances complexity and computational efficiency such that the use of reinforcement learning techniques becomes attainable. Secondly, we set up a ground truth such that we can evaluate the performance of the 'Proximal Policy Optimization' algorithm to learn in a single district of this epidemiological model. Finally, we consider a large-scale problem, by conducting an experiment where we aim to learn a joint policy to control the districts in a community of 11 tightly coupled districts, for which no ground truth can be established. This experiment shows that deep reinforcement learning can be used to learn mitigation policies in complex epidemiological models with a large state space. Moreover, through this experiment, we demonstrate that there can be an advantage to consider collaboration between districts when designing prevention strategies.	-
dc.description.sponsorship	Pieter Libin and Timothy Verstraeten were supported by a PhD grant of the FWO (Fonds Wetenschappelijk Onderzoek - Vlaanderen). This research acknowledges funding from the Flemish Government (AI Research Program) and from the EpiPose project (H2020/101003688). We thank the anonymous reviewers for their insightful comments.	-
dc.language.iso	en	-
dc.publisher	SPRINGER INTERNATIONAL PUBLISHING AG	-
dc.relation.ispartofseries	Lecture Notes in Computer Science	-
dc.rights	Springer Nature Switzerland AG 2021	-
dc.subject	Computer Science - Learning	-
dc.subject	Computer Science - Learning	-
dc.subject	Computer Science - Artificial Intelligence	-
dc.subject	Computer Science - Multiagent Systems	-
dc.subject.other	Computer Science - Learning	-
dc.subject.other	Computer Science - Learning	-
dc.subject.other	Computer Science - Artificial Intelligence	-
dc.subject.other	Computer Science - Multiagent Systems	-
dc.title	Deep reinforcement learning for large-scale epidemic control	-
dc.type	Proceedings Paper	-
local.bibliographicCitation.authors	Dong, Y.	-
local.bibliographicCitation.authors	Ifrim, G.	-
local.bibliographicCitation.authors	Mladenić, D.	-
local.bibliographicCitation.authors	Saunders, C.	-
local.bibliographicCitation.authors	Van Hoecke, S.	-
local.bibliographicCitation.conferencedate	14-18 September 2020	-
local.bibliographicCitation.conferencename	Joint European Conference on Machine Learning and Knowledge Discovery in Databases	-
local.bibliographicCitation.conferenceplace	Gent, Belgium (Virtual)	-
dc.identifier.epage	170	-
dc.identifier.spage	155	-
dc.identifier.volume	12461	-
local.bibliographicCitation.jcat	C1	-
local.publisher.place	GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND	-
local.type.refereed	Refereed	-
local.type.specified	Proceedings Paper	-
local.relation.ispartofseriesnr	12461	-
local.type.programme	H2020	-
local.relation.h2020	101003688	-
dc.identifier.doi	10.1007/978-3-030-67670-4_10	-
dc.identifier.isi	000716884800010	-
dc.identifier.url	http://arxiv.org/abs/2003.13676v1	-
local.provider.type	ArXiv	-
local.bibliographicCitation.btitle	Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track. ECML PKDD 2020	-
local.uhasselt.international	no	-
item.fullcitation	LIBIN, Pieter; Moonens, Arno; Verstraeten, Timothy; Perez-Sanjines, Fabian; HENS, Niel; Lemey, Philippe & Nowé, Ann (2021) Deep reinforcement learning for large-scale epidemic control. In: Dong, Y., Ifrim, G., Mladenić, D., Saunders, C., Van Hoecke, S. (Ed.), Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track. ECML PKDD 2020, Springer, p. 155-170..	-
item.accessRights	Restricted Access	-
item.contributor	LIBIN, Pieter	-
item.contributor	Moonens, Arno	-
item.contributor	Verstraeten, Timothy	-
item.contributor	Perez-Sanjines, Fabian	-
item.contributor	HENS, Niel	-
item.contributor	Lemey, Philippe	-
item.contributor	Nowé, Ann	-
item.fulltext	With Fulltext	-
item.validation	ecoom 2022	-
Appears in Collections:	Research publications

Files in This Item:

File	Description	Size	Format
Pages from 2021_Book_MachineLearningAndKnowledgeDis.pdf Restricted Access	Published version	955.77 kB	Adobe PDF	View/Open Request a copy
2003.13676v1.pdf	Non Peer-reviewed author version	5.73 MB	Adobe PDF	View/Open

Show simple item record

SCOPUS^TM
Citations

25

checked on Oct 8, 2025

WEB OF SCIENCE^TM
Citations

20

checked on Oct 19, 2025

Google Scholar^TM

Check

Files in This Item:

SCOPUSTM Citations

WEB OF SCIENCETM Citations

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations

Google Scholar^TM