research-article

Effective Approximations for Multi-Robot Coordination in Spatially Distributed Tasks

Authors:

Philipp Robbel,

Frans A. Oliehoek,

Wiebe van der HoekAuthors Info & Claims

AAMAS '15: Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems

Pages 881 - 890

Published: 04 May 2015 Publication History

Abstract

Although multi-robot systems have received substantial research attention in recent years, multi-robot coordination still remains a difficult task. Especially, when dealing with spatially distributed tasks and many robots, central control quickly becomes infeasible due to the exponential explosion in the number of joint actions and states. We propose a general algorithm that allows for distributed control, that overcomes the exponential growth in the number of joint actions by aggregating the effect of other agents in the system into a probabilistic model, called subjective approximations, and then choosing the best response. We show for a multi-robot grid world how the algorithm can be implemented in the well studied Multiagent Markov Decision Process framework, as a sub-class called spatial task allocation problems (SPATAPs). In this framework, we show how to tackle SPATAPs using online, distributed planning by combining subjective agent approximations with restriction of attention to current tasks in the world. An empirical evaluation shows that the combination of both strategies allows to scale to very large problems, while providing near-optimal solutions.

References

[1]

Asrar Ahmed, Pradeep Varakantham, and Shih-Fen Cheng. Uncertain congestion games with assorted human agent populations. In Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, pages 44--53, 2012.

Digital Library

[2]

S. Alers, K. Tuyls, B. Ranjbar-Sahraei, D. Claes, and G. Weiss. Insect-inspired robot coordination: foraging and coverage. In Artificial life 14, pages 761--768, 2014.

[3]

Sofia Amador, Steven Okamoto, and Roie Zivan. Dynamic multi-agent task allocation with spatial and temporal constraints. In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, pages 1384--1390, 2014.

Digital Library

[4]

Raphen Becker, Shlomo Zilberstein, Victor Lesser, and Claudia V. Goldman. Transition-independent decentralized Markov decision processes. In AAMAS, pages 41--48, 2003.

Digital Library

[5]

Michael Beetz, Ulrich Klank, Ingo Kresse, Alexis Maldonado, Lorenz Mösenlechner, Dejan Pangercic, Thomas Rühr, and Moritz Tenorth. Robotic Roommates Making Pancakes. In 11th IEEE-RAS International Conference on Humanoid Robots, Bled, Slovenia, October, 26--28 2011.

[6]

Daniel S. Bernstein, Robert Givan, Neil Immerman, and Shlomo Zilberstein. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4):819--840, 2002.

Digital Library

[7]

Craig Boutilier. Planning, learning and coordination in multiagent decision processes. In Proc. of the 6th Conference on Theoretical Aspects of Rationality and Knowledge, pages 195--210, 1996.

Digital Library

[8]

Arne Brutschy, Giovanni Pini, Carlo Pinciroli, Mauro Birattari, and Marco Dorigo. Self-organized task allocation to sequentially interdependent tasks in swarm robotics. Autonomous Agents and Multi-Agent Systems, 28(1):101--125, 2014.

Digital Library

[9]

Jesús Capitán, Matthijs T. J. Spaan, Luis Merino, and Aníbal Ollero. Decentralized multi-robot cooperation with auctioned POMDPs. International Journal of Robotics Research, 32(6):650--671, 2013.

Digital Library

[10]

Yann-Michaël De Hauwere, Peter Vrancx, and Ann Nowé. Learning multi-agent state space representations. In AAMAS, pages 715--722, 2010.

Digital Library

[11]

Jilles S. Dibangoye, Christopher Amato, Olivier Buffet, and François Charpillet. Exploiting separability in multiagent planning with continuous-state MDPs. In Proceedings of the Thirteenth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 1281--1288, 2014.

Digital Library

[12]

Jilles Steeve Dibangoye, Christopher Amato, and Arnaud Doniec. Scaling up decentralized MDPs through heuristic search. In UAI, pages 217--226, 2012.

[13]

Prashant Doshi, Yifeng Zeng, and Qiongyu Chen. Graphical models for interactive POMDPs: representations and solutions. Autonomous Agents and Multi-Agent Systems, 18(3):376--416, 2008.

Digital Library

[14]

Greg Droge and Magnus Egerstedt. Adaptive look-ahead for robotic navigation in unknown environments. In IROS, pages 1134--1139, 2011.

[15]

Piotr J. Gmytrasiewicz and Prashant Doshi. A framework for sequential planning in multi-agent settings. Journal of Artificial Intelligence Research, 24:49--79, 2005.

[16]

Geoffrey J. Gordon, Pradeep Varakantham, William Yeoh, Hoong Chuin Lau, Ajay S. Aravamudhan, and Shih-Fen Cheng. Lagrangian relaxation for large-scale multi-agent planning. In 2012 IEEE/WIC/ACM International Conferences on Intelligent Agent Technology, IAT 2012, Macau, China, December 4--7, 2012, pages 494--501, 2012.

Digital Library

[17]

Olivier Guéant, Jean-Michel Lasry, and Pierre-Louis Lions. Mean field games and applications. In Paris-Princeton Lectures on Mathematical Finance 2010, volume 2003 of Lecture Notes in Mathematics, pages 205--266. Springer Berlin Heidelberg, 2011.

[18]

Carlos Guestrin, Shobha Venkataraman, and Daphne Koller. Context specific multiagent coordination and planning with factored MDPs. In AAAI, pages 253--259, 2002.

Digital Library

[19]

Jesse Hoey, Robert St-Aubin, Alan J. Hu, and Craig Boutilier. SPUDD: Stochastic planning using decision diagrams. In UAI, pages 279--288, 1999.

Digital Library

[20]

Levente Kocsis and Csaba Szepesvári. Bandit based Monte-Carlo planning. In Machine Learning: ECML 2006, volume 4212 of Lecture Notes in Computer Science, pages 282--293. Springer, 2006.

Digital Library

[21]

Jelle R. Kok and Nikos Vlassis. Collaborative multiagent reinforcement learning by payoff propagation. Journal of Machine Learning Research, 7:1789--1828, 2006.

Digital Library

[22]

Nyree Lemmens and Karl Tuyls. Stigmergic landmark optimization. Advances in Complex Systems, 15(8), 2012.

[23]

Han lim Choi, Luc Brunet, and Jonathan P. How. Consensus-based decentralized auctions for robust task allocation. IEEE Transactions on Robotics, 2009.

Digital Library

[24]

Laëtitia Matignon, Laurent Jeanpierre, and Abdel-Illah Mouaddib. Coordinated multi-robot exploration under communication constraints using decentralized markov decision processes. In AAAI, pages 2017--2023, 2012.

[25]

Francisco S. Melo and Manuela Veloso. Learning of coordination: exploiting sparse interactions in multiagent systems. In AAMAS, pages 773--780, 2009.

Digital Library

[26]

Martin L. Puterman. Markov Decision Processes - Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc., 1994.

Digital Library

[27]

Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore, and Martin A. Riedmiller. Distributed value functions. In Proc. of the International Conference on Machine Learning, pages 371--378, 1999.

Digital Library

[28]

Yoav Shoham and Moshe Tennenholtz. On social laws for artificial agent societies: off-line design. Artificial Intelligence, 73(1--2):231--252, 1995.

Digital Library

[29]

Jason Sleight and Edmund H. Durfee. A decision-theoretic characterization of organizational inuences. In AAMAS, pages 323--330, 2012.

Digital Library

[30]

Matthijs T. J. Spaan and Francisco S. Melo. Interaction-driven Markov games for decentralized multiagent planning under uncertainty. In AAMAS, pages 525--532, 2008.

Digital Library

[31]

Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. The MIT Press, March 1998.

Digital Library

[32]

Pradeep Varakantham, Shih-Fen Cheng, Geoffrey J. Gordon, and Asrar Ahmed. Decision support for agent populations in uncertain and congested environments. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, July 22--26, 2012, Toronto, Ontario, Canada., 2012.

[33]

Jianhui Wu and Edmund H. Durfee. Resource-driven mission-phasing techniques for constrained agents in stochastic environments. Journal of Artificial Intelligence Research, 38:415--473, 2010.

[34]

Yu Zhang and Lynne E. Parker. Task allocation with executable coalitions in multirobot tasks. In IEEE International Conference on Robotics and Automation, ICRA 2012, 14--18 May, 2012, St. Paul, Minnesota, USA, pages 3307--3314, 2012.

Cited By

Oliehoek FWitwicki SKaelbling L(2021)A Sufficient Statistic for Influence in Structured Multiagent EnvironmentsJournal of Artificial Intelligence Research10.1613/jair.1.1213670(789-870)Online publication date: 1-May-2021
https://dl.acm.org/doi/10.1613/jair.1.12136
Kano TNaito EAoshima TIshiguro A(2020)Decentralized Control for Swarm Robots That Can Effectively Execute Spatially Distributed TasksArtificial Life10.1162/artl_a_0031726:2(242-259)Online publication date: 1-May-2020
https://dl.acm.org/doi/10.1162/artl_a_00317
Phan TSchmid KBelzner LGabor TFeld SLinnhoff-Popien CElkind EVeloso MAgmon NTaylor M(2019)Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent PoliciesProceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3306127.3332044(2162-2164)Online publication date: 8-May-2019
https://dl.acm.org/doi/10.5555/3306127.3332044
Show More Cited By

Index Terms

Effective Approximations for Multi-Robot Coordination in Spatially Distributed Tasks
1. Computer systems organization
  1. Embedded and cyber-physical systems
    1. Robotics
2. Computing methodologies
  1. Artificial intelligence
    1. Control methods
      1. Robotic planning
    2. Planning and scheduling
      1. Robotic planning

Recommendations

Multi Robot Collision Avoidance with Continuous Curvature Manoeuvres
AIR '13: Proceedings of Conference on Advances In Robotics

In this paper, we address the problem of reactive collision avoidance among multiple robots. However we impose the constraint of continuous curvature on the collision avoidance manoeuvre which adds to the complexity of the problem but is necessary for ...
Active visual sensing and collaboration on mobile robots using hierarchical POMDPs
AAMAS '12: Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1

A key challenge to widespread deployment of mobile robots in the real-world is the ability to robustly and autonomously sense the environment and collaborate with teammates. Real-world domains are characterized by partial observability, non-...
Multi-robot learning using non-deterministic argument games

Multi-robot learning becomes interesting and less complex when the participating robots work in coordination. Both centralised and de-centralised control mechanisms of robots exist in the robot literature. However, the choice of these mechanisms varies ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

AAMAS '15: Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems

May 2015

2072 pages

ISBN:9781450334136

General Chairs:
Gerhard Weiss
Maastricht University, The Netherlands
,
Pınar Yolum
Bogazici University, Turkey
,
Program Chairs:
Rafael H. Bordini
PUCRS, Brazil
,
Edith Elkind
University of Oxford, UK

Sponsors

IFAAMAS

In-Cooperation

SIGAI: ACM Special Interest Group on Artificial Intelligence

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Richland, SC

Publication History

Published: 04 May 2015

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

NWO Innovational Research Incentives Scheme Veni

Conference

AAMAS'15

Sponsor:

AAMAS'15: International Conference on Autonomous Agents and Multiagent Systems

May 4 - 8, 2015

Istanbul, Turkey

Acceptance Rates

AAMAS '15 Paper Acceptance Rate 108 of 670 submissions, 16%;

Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

7
Total Citations
View Citations
206
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)0

Reflects downloads up to 17 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Oliehoek FWitwicki SKaelbling L(2021)A Sufficient Statistic for Influence in Structured Multiagent EnvironmentsJournal of Artificial Intelligence Research10.1613/jair.1.1213670(789-870)Online publication date: 1-May-2021
https://dl.acm.org/doi/10.1613/jair.1.12136
Kano TNaito EAoshima TIshiguro A(2020)Decentralized Control for Swarm Robots That Can Effectively Execute Spatially Distributed TasksArtificial Life10.1162/artl_a_0031726:2(242-259)Online publication date: 1-May-2020
https://dl.acm.org/doi/10.1162/artl_a_00317
Phan TSchmid KBelzner LGabor TFeld SLinnhoff-Popien CElkind EVeloso MAgmon NTaylor M(2019)Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent PoliciesProceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3306127.3332044(2162-2164)Online publication date: 8-May-2019
https://dl.acm.org/doi/10.5555/3306127.3332044
Oliehoek F(2018)Interactive learning and decision makingProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304652.3304829(5703-5708)Online publication date: 13-Jul-2018
https://dl.acm.org/doi/10.5555/3304652.3304829
Claes DOliehoek FBaier HTuyls KLarson KWinikoff MDas SDurfee E(2017)Decentralised Online Planning for Multi-Robot Warehouse CommissioningProceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems10.5555/3091125.3091198(492-500)Online publication date: 8-May-2017
https://dl.acm.org/doi/10.5555/3091125.3091198
Feo Flushing EGambardella LDi Caro GJonker CMarsella SThangarajah JTuyls K(2016)On Decentralized Coordination for Spatial Task Allocation and Scheduling in Heterogeneous TeamsProceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems10.5555/2936924.2937070(988-996)Online publication date: 9-May-2016
https://dl.acm.org/doi/10.5555/2936924.2937070
Belzner LBeck MGabor TRoelle HSauer H(2016)Software engineering for distributed autonomous real-time systemsProceedings of the 2nd International Workshop on Software Engineering for Smart Cyber-Physical Systems10.1145/2897035.2897040(54-57)Online publication date: 14-May-2016
https://dl.acm.org/doi/10.1145/2897035.2897040

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten