Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/34337
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | LIBIN, Pieter | - |
dc.contributor.author | Moonens, Arno | - |
dc.contributor.author | Verstraeten, Timothy | - |
dc.contributor.author | Perez-Sanjines, Fabian | - |
dc.contributor.author | HENS, Niel | - |
dc.contributor.author | Lemey, Philippe | - |
dc.contributor.author | Nowé, Ann | - |
dc.date.accessioned | 2021-06-23T12:24:20Z | - |
dc.date.available | 2021-06-23T12:24:20Z | - |
dc.date.issued | 2021 | - |
dc.date.submitted | 2021-06-17T14:19:12Z | - |
dc.identifier.citation | Dong, Y., Ifrim, G., Mladenić, D., Saunders, C., Van Hoecke, S. (Ed.), Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track. ECML PKDD 2020, Springer, p. 155-170. | - |
dc.identifier.isbn | 978-3-030-67669-8 | - |
dc.identifier.isbn | 978-3-030-67670-4 | - |
dc.identifier.issn | 0302-9743 | - |
dc.identifier.uri | http://hdl.handle.net/1942/34337 | - |
dc.description.abstract | Epidemics of infectious diseases are an important threat to public health and global economies. Yet, the development of prevention strategies remains a challenging process, as epidemics are non-linear and complex processes. For this reason, we investigate a deep reinforcement learning approach to automatically learn prevention strategies in the context of pandemic influenza. Firstly, we construct a new epidemiological meta-population model, with 379 patches (one for each administrative district in Great Britain), that adequately captures the infection process of pandemic influenza. Our model balances complexity and computational efficiency such that the use of reinforcement learning techniques becomes attainable. Secondly, we set up a ground truth such that we can evaluate the performance of the 'Proximal Policy Optimization' algorithm to learn in a single district of this epidemiological model. Finally, we consider a large-scale problem, by conducting an experiment where we aim to learn a joint policy to control the districts in a community of 11 tightly coupled districts, for which no ground truth can be established. This experiment shows that deep reinforcement learning can be used to learn mitigation policies in complex epidemiological models with a large state space. Moreover, through this experiment, we demonstrate that there can be an advantage to consider collaboration between districts when designing prevention strategies. | - |
dc.description.sponsorship | Pieter Libin and Timothy Verstraeten were supported by a PhD grant of the FWO (Fonds Wetenschappelijk Onderzoek - Vlaanderen). This research acknowledges funding from the Flemish Government (AI Research Program) and from the EpiPose project (H2020/101003688). We thank the anonymous reviewers for their insightful comments. | - |
dc.language.iso | en | - |
dc.publisher | SPRINGER INTERNATIONAL PUBLISHING AG | - |
dc.relation.ispartofseries | Lecture Notes in Computer Science | - |
dc.rights | Springer Nature Switzerland AG 2021 | - |
dc.subject | Computer Science - Learning | - |
dc.subject | Computer Science - Artificial Intelligence | - |
dc.subject | Computer Science - Multiagent Systems | - |
dc.subject.other | Computer Science - Learning | - |
dc.subject.other | Computer Science - Artificial Intelligence | - |
dc.subject.other | Computer Science - Multiagent Systems | - |
dc.title | Deep reinforcement learning for large-scale epidemic control | - |
dc.type | Proceedings Paper | - |
local.bibliographicCitation.authors | Dong, Y. | - |
local.bibliographicCitation.authors | Ifrim, G. | - |
local.bibliographicCitation.authors | Mladenić, D. | - |
local.bibliographicCitation.authors | Saunders, C. | - |
local.bibliographicCitation.authors | Van Hoecke, S. | - |
local.bibliographicCitation.conferencedate | 14-18 September 2020 | - |
local.bibliographicCitation.conferencename | Joint European Conference on Machine Learning and Knowledge Discovery in Databases | - |
local.bibliographicCitation.conferenceplace | Gent, Belgium (Virtual) | - |
dc.identifier.epage | 170 | - |
dc.identifier.spage | 155 | - |
dc.identifier.volume | 12461 | - |
local.bibliographicCitation.jcat | C1 | - |
local.publisher.place | GEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND | - |
local.type.refereed | Refereed | - |
local.type.specified | Proceedings Paper | - |
local.relation.ispartofseriesnr | 12461 | - |
local.type.programme | H2020 | - |
local.relation.h2020 | 101003688 | - |
dc.identifier.doi | 10.1007/978-3-030-67670-4_10 | - |
dc.identifier.isi | 000716884800010 | - |
dc.identifier.url | http://arxiv.org/abs/2003.13676v1 | - |
local.provider.type | ArXiv | - |
local.bibliographicCitation.btitle | Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track. ECML PKDD 2020 | - |
local.uhasselt.international | no | - |
item.fulltext | With Fulltext | - |
item.contributor | LIBIN, Pieter | - |
item.contributor | Moonens, Arno | - |
item.contributor | Verstraeten, Timothy | - |
item.contributor | Perez-Sanjines, Fabian | - |
item.contributor | HENS, Niel | - |
item.contributor | Lemey, Philippe | - |
item.contributor | Nowé, Ann | - |
item.fullcitation | LIBIN, Pieter; Moonens, Arno; Verstraeten, Timothy; Perez-Sanjines, Fabian; HENS, Niel; Lemey, Philippe & Nowé, Ann (2021) Deep reinforcement learning for large-scale epidemic control. In: Dong, Y., Ifrim, G., Mladenić, D., Saunders, C., Van Hoecke, S. (Ed.), Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track. ECML PKDD 2020, Springer, p. 155-170. | - |
item.accessRights | Restricted Access | - |
item.validation | ecoom 2022 | - |
Appears in Collections: | Research publications |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Pages from 2021_Book_MachineLearningAndKnowledgeDis.pdf (Restricted Access) | Published version | 955.77 kB | Adobe PDF |
2003.13676v1.pdf | Non-peer-reviewed author version | 5.73 MB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.