Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3534678.3542679acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open access

Reinforcement Learning Enhances the Experts: Large-scale COVID-19 Vaccine Allocation with Multi-factor Contact Network

Published: 14 August 2022 Publication History
  • Get Citation Alerts
  • Abstract

    In the fight against the COVID-19 pandemic, vaccines are the most critical resource but are still in short supply around the world. Therefore, efficient vaccine allocation strategies are urgently called for, especially in large-scale metropolis where uneven health risk is manifested in nearby neighborhoods. However, there exist several key challenges in solving this problem: (1) great complexity in the large scale scenario adds to the difficulty in experts' vaccine allocation decision making; (2) heterogeneous information from all aspects in the metropolis' contact network makes information utilization difficult in decision making; (3) when utilizing the strong decision-making ability of reinforcement learning (RL) to solve the problem, poor explainability limits the credibility of the RL strategies. In this paper, we propose a reinforcement learning enhanced experts method. We deal with the great complexity via a specially designed algorithm aggregating blocks in the metropolis into communities and we hierarchically integrate RL among the communities and experts solution within each community. We design a self-supervised contact network representation algorithm to fuse the heterogeneous information for efficient vaccine allocation decision making. We conduct extensive experiments in three metropolis with real-world data and prove that our method outperforms the best baseline, reducing 9.01% infections and 12.27% deaths.We further demonstrate the explainability of the RL model, adding to its credibility and also enlightening the experts in turn.

    Supplemental Material

    MP4 File
    Presentation video for paper 'Reinforcement Learning Enhances the Experts: Large Scale COVID-19 Vaccines Allocation with Multi-factor Contact Network'

    References

    [1]
    Hamsa Bastani, Kimon Drakopoulos, Vishal Gupta, Ioannis Vlachogiannis, Christos Hadjicristodoulou, Pagona Lagiou, Gkikas Magiorkinis, Dimitrios Paraskevis, and Sotirios Tsiodras. 2021. Efficient and targeted COVID-19 border testing via reinforcement learning. Nature, Vol. 599, 7883 (2021), 108--113.
    [2]
    Bryan P Bednarski, Akash Deep Singh, and William M Jones. 2021. On collaborative reinforcement learning to optimize the redistribution of critical medical supplies throughout the COVID-19 pandemic. Journal of the American Medical Informatics Association, Vol. 28, 4 (2021), 874--878.
    [3]
    Kate M Bubar, Kyle Reinholt, Stephen M Kissler, Marc Lipsitch, Sarah Cobey, Yonatan H Grad, and Daniel B Larremore. 2021. Model-informed COVID-19 vaccine prioritization strategies by age and serostatus. Science, Vol. 371, 6532 (2021), 916--921.
    [4]
    Hui Cao and Simin Huang. 2012. Principles of scarce medical resource allocation in natural disaster relief: a simulation approach. Medical Decision Making, Vol. 32, 3 (2012), 470--476.
    [5]
    Serina Chang, Emma Pierson, Pang Wei Koh, Jaline Gerardin, Beth Redbird, David Grusky, and Jure Leskovec. 2021. Mobility network models of COVID-19 explain inequities and inform reopening. Nature, Vol. 589, 7840 (2021), 82--87.
    [6]
    Serina Chang, Mandy L Wilson, Bryan Lewis, Zakaria Mehrab, Komal K Dudakiya, Emma Pierson, Pang Wei Koh, Jaline Gerardin, Beth Redbird, David Grusky, et almbox. [n.d.]. Supporting covid-19 policy response with large-scale mobility-based modeling. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining.
    [7]
    Lin Chen, Fengli Xu, Zhenyu Han, Kun Tang, Pan Hui, James Evans, and Yong Li. 2021. Strategic COVID-19 vaccine distribution can simultaneously elevate social utility and equity. arXiv preprint arXiv:2111.06689 (2021).
    [8]
    Carlos Del Rio, Saad B Omer, and Preeti N Malani. 2021. Winter of omicron-the evolving COVID-19 pandemic. JAMA (2021).
    [9]
    Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
    [10]
    Ezekiel J Emanuel, Govind Persad, Adam Kern, Allen Buchanan, Cécile Fabre, Daniel Halliday, Joseph Heath, Lisa Herzog, RJ Leland, Ephrem T Lemango, et al. 2020. An ethical framework for global vaccine allocation. Science, Vol. 369, 6509 (2020), 1309--1312.
    [11]
    Jim AC Everett, Clara Colombatto, Edmond Awad, Paulo Boggio, Björn Bos, William J Brady, Megha Chawla, Vladimir Chituc, Dongil Chung, Moritz A Drupp, et al. 2021. Moral dilemmas and trust in leaders during a global health crisis. Nature human behaviour, Vol. 5, 8 (2021), 1074--1088.
    [12]
    Samuel Greydanus, Anurag Koul, Jonathan Dodge, and Alan Fern. 2018. Visualizing and understanding atari agents. In International conference on machine learning. PMLR, 1792--1801.
    [13]
    Qianyue Hao, Lin Chen, Fengli Xu, and Yong Li. 2020. Understanding the urban pandemic spreading of COVID-19 with real world mobility data. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 3485--3492.
    [14]
    Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.
    [15]
    Yuanshuang Jiang, Linfang Hou, Yuxiang Liu, Zhuoye Ding, Yong Zhang, and Shengzhong Feng. 2020. Epidemic Control Based on Reinforcement Learning Approaches. (2020).
    [16]
    Soheyl Khalilpourazari and Hossein Hashemi Doulabi. 2021. Designing a hybrid reinforcement learning based algorithm with application in prediction of the COVID-19 pandemic in Quebec. Annals of Operations Research (2021), 1--45.
    [17]
    Gloria Hyunjung Kwak, Lowell Ling, and Pan Hui. 2021. Deep reinforcement learning approaches for global public health strategies for COVID-19 pandemic. Plos one, Vol. 16, 5 (2021), e0251550.
    [18]
    Laura Matrajt, Julia Eaton, Tiffany Leung, and Elizabeth R Brown. 2021. Vaccine optimization for COVID-19: Who to vaccinate first? Science Advances, Vol. 7, 6 (2021), eabf1374.
    [19]
    John N Nkengasong, Nicaise Ndembi, Akhona Tshangela, and Tajudeen Raji. 2020. COVID-19 vaccines: how to ensure Africa has access.
    [20]
    Abu Quwsar Ohi, MF Mridha, Muhammad Mostafa Monowar, Md Hamid, et al. 2020. Exploring optimal control of epidemic spread using reinforcement learning. Scientific reports, Vol. 10, 1 (2020), 1--19.
    [21]
    World Health Organization et al. 2020. WHO SAGE values framework for the allocation and prioritization of COVID-19 vaccination, 14 September 2020. Technical Report. World Health Organization.
    [22]
    Ferran Parés, Dario Garcia Gasulla, Armand Vilalta, Jonatan Moreno, Eduard Ayguadé, Jesús Labarta, Ulises Cortés, and Toyotaro Suzumura. 2017. Fluid communities: a competitive, scalable and diverse community detection algorithm. In International conference on complex networks and their applications. Springer, 229--240.
    [23]
    Govind Persad, Ezekiel J Emanuel, Samantha Sangenito, Aaron Glickman, Steven Phillips, and Emily A Largent. 2021. Public perspectives on COVID-19 vaccine prioritization. JAMA network open, Vol. 4, 4 (2021), e217943--e217943.
    [24]
    Govind Persad, Monica E Peek, and Ezekiel J Emanuel. 2020. Fairly prioritizing groups for access to COVID-19 vaccines. Jama, Vol. 324, 16 (2020), 1601--1602.
    [25]
    Stefano Giovanni Rizzo. 2020. Balancing precision and recall for costeffective epidemic containment. (2020).
    [26]
    Harald Schmidt, Rebecca Weintraub, Michelle A Williams, Kate Miller, Alison Buttenheim, Emily Sadecki, Helen Wu, Aditi Doiphode, Neha Nagpal, Lawrence O Gostin, et al. 2021. Equitable allocation of COVID-19 vaccines in the United States. Nature Medicine, Vol. 27, 7 (2021), 1298--1307.
    [27]
    John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, and Pieter Abbeel. 2015. High-dimensional continuous control using generalized advantage estimation. arXiv preprint arXiv:1506.02438 (2015).
    [28]
    John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017).
    [29]
    Max Schwarzer, Ankesh Anand, Rishab Goel, R Devon Hjelm, Aaron Courville, and Philip Bachman. 2020. Data-efficient reinforcement learning with self-predictive representations. arXiv preprint arXiv:2007.05929 (2020).
    [30]
    Aravind Srinivas, Michael Laskin, and Pieter Abbeel. 2020. Curl: Contrastive unsupervised representations for reinforcement learning. arXiv preprint arXiv:2004.04136 (2020).
    [31]
    Andrew J Stier, Marc G Berman, and Luis Bettencourt. 2020. COVID-19 attack rate increases with city size. arXiv preprint arXiv:2003.10376 (2020).
    [32]
    Adam Stooke, Kimin Lee, Pieter Abbeel, and Michael Laskin. 2021. Decoupling representation learning from reinforcement learning. In International Conference on Machine Learning. PMLR, 9870--9879.
    [33]
    Bing Xu, Naiyan Wang, Tianqi Chen, and Mu Li. 2015. Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853 (2015).
    [34]
    Denis Yarats, Amy Zhang, Ilya Kostrikov, Brandon Amos, Joelle Pineau, and Rob Fergus. 2019. Improving sample efficiency in model-free reinforcement learning from images. arXiv preprint arXiv:1910.01741 (2019).
    [35]
    Tao Yu, Cuiling Lan, Wenjun Zeng, Mingxiao Feng, Zhizheng Zhang, and Zhibo Chen. 2021. Playvirtual: Augmenting cycle-consistent virtual trajectories for reinforcement learning. Advances in Neural Information Processing Systems, Vol. 34 (2021).

    Cited By

    View all
    • (2023)GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement LearningProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599359(685-697)Online publication date: 6-Aug-2023
    • (2023)GCRL: Efficient Delivery Area Assignment for Last-mile Logistics with Group-based Cooperative Reinforcement Learning2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00269(3522-3534)Online publication date: Apr-2023
    • (2023)Using Reinforcement Learning for Optimizing COVID-19 Vaccine Distribution StrategiesMathematical Modeling and Intelligent Control for Combating Pandemics10.1007/978-3-031-33183-1_10(169-196)Online publication date: 12-May-2023

    Index Terms

    1. Reinforcement Learning Enhances the Experts: Large-scale COVID-19 Vaccine Allocation with Multi-factor Contact Network

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
        August 2022
        5033 pages
        ISBN:9781450393850
        DOI:10.1145/3534678
        This work is licensed under a Creative Commons Attribution International 4.0 License.

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 14 August 2022

        Check for updates

        Author Tags

        1. covid-19 pandemic
        2. model explainability
        3. reinforcement learning
        4. self-supervised representation learning
        5. vaccine allocation.

        Qualifiers

        • Research-article

        Data Availability

        Presentation video for paper 'Reinforcement Learning Enhances the Experts: Large Scale COVID-19 Vaccines Allocation with Multi-factor Contact Network' https://dl.acm.org/doi/10.1145/3534678.3542679#KDD22-health27.mp4

        Funding Sources

        Conference

        KDD '22
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

        Upcoming Conference

        KDD '24

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)517
        • Downloads (Last 6 weeks)135
        Reflects downloads up to 09 Aug 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement LearningProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599359(685-697)Online publication date: 6-Aug-2023
        • (2023)GCRL: Efficient Delivery Area Assignment for Last-mile Logistics with Group-based Cooperative Reinforcement Learning2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00269(3522-3534)Online publication date: Apr-2023
        • (2023)Using Reinforcement Learning for Optimizing COVID-19 Vaccine Distribution StrategiesMathematical Modeling and Intelligent Control for Combating Pandemics10.1007/978-3-031-33183-1_10(169-196)Online publication date: 12-May-2023

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Get Access

        Login options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media