Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.24963/ijcai.2023/660guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article

Limited resource allocation in a non-Markovian world: the case of maternal and child healthcare

Published: 19 August 2023 Publication History

Abstract

The success of many healthcare programs depends on participants' adherence. We consider the problem of scheduling interventions in low resource settings (e.g., placing timely support calls from health workers) to increase adherence and/or engagement. Past works have successfully developed several classes of Restless Multi-armed Bandit (RMAB) based solutions for this problem. Nevertheless, all past RMAB approaches assume that the participants' behaviour follows the Markov property. We demonstrate significant deviations from the Markov assumption on real-world data on a maternal health awareness program from our partner NGO, ARMMAN. Moreover, we extend RMABs to continuous state spaces, a previously understudied area. To tackle the generalised non-Markovian RMAB setting we (i) model each participant's trajectory as a time-series, (ii) leverage the power of timeseries forecasting models to learn complex patterns and dynamics to predict future states, and (iii) propose the Time-series Arm Ranking Index (TARI) policy, a novel algorithm that selects the RMAB arms that will benefit the most from an intervention, given our future state predictions. We evaluate our approach on both synthetic data, and a secondary analysis on real data from ARMMAN, and demonstrate significant increase in engagement compared to the SOTA, deployed Whittle index solution. This translates to 16.3 hours of additional content listened, 90.8% more engagement drops prevented, and reaching more than twice as many high dropout-risk beneficiaries.

References

[1]
Adebiyi A Ariyo, Adewumi O Adewumi, and Charles K Ayo. Stock price prediction using the arima model. In 2014 UKSim-AMSS 16th international conference on computer modelling and simulation, pages 106-112. IEEE, 2014.
[2]
ARMMAN. Maternal and newborn health. https://armman.org/maternal-newborn-health/#1552759270935-9f057600-22c8, 2023. [Online; accessed 8-February-2023].
[3]
ARMMAN. mmitra. https://armman.org/mmitra/, 2023. [Online; accessed 8- February-2023].
[4]
Peter Auer, Nicolo Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine learning, 47:235-256, 2002.
[5]
Shaojie Bai, J Zico Kolter, and Vladlen Koltun. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271, 2018.
[6]
James M. Buchanan. Opportunity Cost, pages 520-525. Palgrave Macmillan UK, London, 1991.
[7]
Guillaume Chevillon. Direct multi-step estimation and forecasting. Journal of Economic Surveys, 21(4):746- 785, 2007.
[8]
Flavio Chierichetti, Ravi Kumar, Prabhakar Raghavan, and Tamas Sarlos. Are web users really markovian? In Proceedings of the 21st International Conference on World Wide Web, WWW '12, page 609-618, New York, NY, USA, 2012. Association for Computing Machinery.
[9]
Panayiotis Danassis, Shresth Verma, Jackson Killian, Aparna Taneja, and Milind Tambe. Limited resource allocation in a non-markovian world: The case of maternal and child healthcare. arXiv preprint arXiv:2305.12640, 2023. https://arxiv.org/abs/2305.12640.
[10]
Joseph Early, Tom Bewley, Christine Evers, and Sarvapali Ramchurn. Non-markovian reward modelling from trajectory labels via interpretable multiple instance learning. arXiv preprint arXiv:2205.15367, 2022.
[11]
GatesFoundation. Global progress and projections for maternal mortality: Maternal deaths per 100,000 live births. https://www.gatesfoundation.org/goalkeepers/report/2021-report/progress-indicators/maternal-mortality/, 2020. [Online; accessed 8-February-2023].
[12]
Abheek Ghosh, Dheeraj Nagaraj, Manish Jain, and Milind Tambe. Indexability is not enough for whittle: Improved, near-optimal algorithms for restless bandits. In Proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems, AAMAS '23. International Foundation for Autonomous Agents and Multiagent Systems, 2023.
[13]
Kevin D Glazebrook, Diego Ruiz-Hernandez, and Christopher Kirkbride. Some indexable families of restless bandit problems. Advances in Applied Probability, 38(3):643-672, 2006.
[14]
Alex Graves and Jürgen Schmidhuber. Framewise phoneme classification with bidirectional lstm and other neural network architectures. Neural networks, 18(5- 6):602-610, 2005.
[15]
HelpMum. Preventing maternal and infant mortality in africa. https://helpmum.org/, 2023. [Online; accessed 8-February-2023].
[16]
Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural computation, 9(8):1735-1780, 1997.
[17]
Yu-Pin Hsu. Age of information: Whittle index for scheduling stochastic arrivals. In 2018 IEEE International Symposium on Information Theory (ISIT), pages 2634-2638. IEEE, 2018.
[18]
Igor Kadota, Elif Uysal-Biyikoglu, Rahul Singh, and Eytan Modiano. Minimizing the age of information in broadcast wireless networks. In 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton), pages 844-851. IEEE, 2016.
[19]
Jasvir Kaur, Manmeet Kaur, Venkatesan Chakrapani, Jacqui Webster, Joseph Alvin Santos, and Rajesh Kumar. Effectiveness of information technology-enabled 'smart eating'health promotion intervention: A cluster randomized controlled trial. PLoS One, 15(1):e0225892, 2020.
[20]
Seyed Mehran Kazemi, Rishab Goel, Sepehr Eghbali, Janahan Ramanan, Jaspreet Sahota, Sanjay Thakur, Stella Wu, Cathal Smyth, Pascal Poupart, and Marcus Brubaker. Time2vec: Learning a vector representation of time. arXiv preprint arXiv:1907.05321, 2019.
[21]
Guokun Lai, Wei-Cheng Chang, Yiming Yang, and Hanxiao Liu. Modeling long-and short-term temporal patterns with deep neural networks. In The 41st international ACM SIGIR conference on research & development in information retrieval, pages 95-104, 2018.
[22]
Minhao Liu, Ailing Zeng, Zhijian Xu, Qiuxia Lai, and Qiang Xu. Time series is a special sequence: Forecasting with sample convolution and interaction. arXiv preprint arXiv:2106.09305, 2021.
[23]
Aditya Mate, Lovish Madaan, Aparna Taneja, Neha Madhiwalla, Shresth Verma, Gargi Singh, Aparna Hegde, Pradeep Varakantham, and Milind Tambe. Field study in deploying restless multi-armed bandits: Assisting non-profits in improving maternal and child health. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 12017- 12025, 2022.
[24]
Yasuko Matsubara, Yasushi Sakurai, Willem G Van Panhuis, and Christos Faloutsos. Funnel: automatic mining of spatially coevolving epidemics. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 105-114, 2014.
[25]
Mark R. Meiss, Bruno Gonçalves, José J. Ramasco, Alessandro Flammini, and Filippo Menczer. Agents, bookmarks and clicks: A topical model of web navigation. In Proceedings of the 21st ACM Conference on Hypertext and Hypermedia, HT '10, page 229-234, New York, NY, USA, 2010. Association for Computing Machinery.
[26]
Rahul Meshram, Aditya Gopalan, and D Manjunath. Optimal recommendation to users that react: Online learning for a class of pomdps. In 2016 IEEE 55th Conference on Decision and Control (CDC), pages 7210-7215. IEEE, 2016.
[27]
Siddharth Nishtala, Harshavardhan Kamarthi, Divy Thakkar, Dhyanesh Narayanan, Anirudh Grama, Aparna Hegde, Ramesh Padmanabhan, Neha Madhiwalla, Suresh Chaudhary, Balaraman Ravindran, and Milind Tambe. Missed calls, automated calls and health support: Using ai to improve maternal health outcomes by increasing program engagement. In AI for Social Good Workshop, 2020.
[28]
Siddhart Nisthala, Lovish Madaan, Aditya Mate, Harshavardhan Kamarthi, Anirudh Grama, Divy Thakkar, Dhyanesh Narayanan, Suresh Chaudhary, Neha Madhiwala, Ramesh Padhmanabhan, Aparna Hegde, Pradeep Varakantham, Balaram Ravindran, and Milind Tambe. Selective intervention planning using restless multi-armed bandits to improve maternal and child health outcomes. In AAMAS workshop on Autonomous Agents for social good, 2021.
[29]
Christos H Papadimitriou and John N Tsitsiklis. The complexity of optimal queueing network control. In Proceedings of IEEE 9th Annual Conference on Structure in Complexity Theory, pages 318-322. IEEE, 1994.
[30]
Angela Pfammatter, Bonnie Spring, Nalini Saligram, Raj Davé, Arun Gowda, Linelle Blais, Monika Arora, Harish Ranjani, Om Ganda, Donald Hedeker, et al. mhealth intervention to improve diabetes risk behaviors in india: a prospective, parallel group cohort study. Journal of medical Internet research, 18(8):e207, 2016.
[31]
Martin L Puterman. Markov decision processes: discrete stochastic dynamic programming. John Wiley & Sons, 2014.
[32]
Yundi Qian, Chao Zhang, Bhaskar Krishnamachari, and Milind Tambe. Restless poachers: Handling exploration-exploitation tradeoffs in security domains. In Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, AAMAS '16, page 123-131, Richland, SC, 2016. International Foundation for Autonomous Agents and Multiagent Systems.
[33]
Amit Sinha and Aditya Mahajan. Robustness of whittle index policy to model approximation. Available at SSRN 4064507, 2022.
[34]
Souhaib Ben Taieb, Rob J Hyndman, et al. Recursive and direct multi-step forecasting: the best of both worlds, volume 19. Department of Econometrics and Business Statistics, Monash Univ., 2012.
[35]
TheWorldBank. Fact sheet: An adjustment to global poverty lines. https://www.worldbank.org/en/news/factsheet/2022/05/02/factsheet-an-adjustment-to-global-poverty-lines, 2023. [Online; accessed 8-February-2023].
[36]
UN. Sustainable development goals. https://sdgs.un.org/goals/goal3, 2023. [Online; accessed 8-February-2023].
[37]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. Advances in neural information processing systems, 30, 2017.
[38]
Shresth Verma, Gargi Singh, Aditya Mate, Paritosh Verma, Sruthi Gorantala, Neha Madhiwalla, Aparna Hegde, Divy Thakkar, Manish Jain, Milind Tambe, and Aparna Taneja. Increasing impact of mobile health programs: Saheli for maternal and childcare. In Innovative Applications of Artificial Intelligence (IAAI), 2023.
[39]
Kai Wang*, Shresth Verma*, Aditya Mate, Sanket Shah, Aparna Taneja, Neha Madhiwalla, Aparna Hegde, and Milind Tambe. Scalable decision-focused learning in restless multi-armed bandits with application to maternal and child health. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023.
[40]
Richard R Weber and Gideon Weiss. On an index policy for restless bandits. Journal of Applied Probability, 27(3):637-648, 1990.
[41]
Qingsong Wen, Tian Zhou, Chaoli Zhang, Weiqi Chen, Ziqing Ma, Junchi Yan, and Liang Sun. Transformers in time series: A survey. arXiv preprint arXiv:2202.07125, 2022.
[42]
Peter Whittle. Restless bandits: Activity allocation in a changing world. Journal of Applied Probability, 25(A):287- 298, 1988.
[43]
Neo Wu, Bradley Green, Xue Ben, and Shawn O'Banion. Deep transformer models for time series forecasting: The influenza prevalence case. arXiv preprint arXiv:2001.08317, 2020.
[44]
Haixu Wu, Jiehui Xu, Jianmin Wang, and Mingsheng Long. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Advances in Neural Information Processing Systems, 34:22419-22430, 2021.
[45]
Ailing Zeng, Muxi Chen, Lei Zhang, and Qiang Xu. Are transformers effective for time series forecasting? In Proceedings of the AAAI Conference on Artificial Intelligence, 2023.
[46]
Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, and Wancai Zhang. Informer: Beyond efficient transformer for long sequence timeseries forecasting. In Proceedings of the AAAI conference on artificial intelligence, volume 35, pages 11106-11115, 2021.

Cited By

View all
  • (2024)Improving health information access in the world's largest maternal mobile health program via bandit algorithmsProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i21.30329(22913-22919)Online publication date: 20-Feb-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
IJCAI '23: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
August 2023
7242 pages
ISBN:978-1-956792-03-4

Sponsors

  • International Joint Conferences on Artifical Intelligence (IJCAI)

Publisher

Unknown publishers

Publication History

Published: 19 August 2023

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Improving health information access in the world's largest maternal mobile health program via bandit algorithmsProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i21.30329(22913-22919)Online publication date: 20-Feb-2024

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media