Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–1 of 1 results for author: Mahmud, R A

.
  1. arXiv:2110.08480  [pdf, other

    cs.AI

    Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network

    Authors: Rafid Ameer Mahmud, Fahim Faisal, Saaduddin Mahmud, Md. Mosaddek Khan

    Abstract: Multi-agent Markov Decision Process (MMDP) has been an effective way of modelling sequential decision making algorithms for multi-agent cooperative environments. A number of algorithms based on centralized and decentralized planning have been developed in this domain. However, dynamically changing environment, coupled with exponential size of the state and joint action space, make it difficult for… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.