Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2772879.2773265acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
research-article

Effective Approximations for Multi-Robot Coordination in Spatially Distributed Tasks

Published: 04 May 2015 Publication History

Abstract

Although multi-robot systems have received substantial research attention in recent years, multi-robot coordination still remains a difficult task. Especially, when dealing with spatially distributed tasks and many robots, central control quickly becomes infeasible due to the exponential explosion in the number of joint actions and states. We propose a general algorithm that allows for distributed control, that overcomes the exponential growth in the number of joint actions by aggregating the effect of other agents in the system into a probabilistic model, called subjective approximations, and then choosing the best response. We show for a multi-robot grid world how the algorithm can be implemented in the well studied Multiagent Markov Decision Process framework, as a sub-class called spatial task allocation problems (SPATAPs). In this framework, we show how to tackle SPATAPs using online, distributed planning by combining subjective agent approximations with restriction of attention to current tasks in the world. An empirical evaluation shows that the combination of both strategies allows to scale to very large problems, while providing near-optimal solutions.

References

[1]
Asrar Ahmed, Pradeep Varakantham, and Shih-Fen Cheng. Uncertain congestion games with assorted human agent populations. In Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, pages 44--53, 2012.
[2]
S. Alers, K. Tuyls, B. Ranjbar-Sahraei, D. Claes, and G. Weiss. Insect-inspired robot coordination: foraging and coverage. In Artificial life 14, pages 761--768, 2014.
[3]
Sofia Amador, Steven Okamoto, and Roie Zivan. Dynamic multi-agent task allocation with spatial and temporal constraints. In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, pages 1384--1390, 2014.
[4]
Raphen Becker, Shlomo Zilberstein, Victor Lesser, and Claudia V. Goldman. Transition-independent decentralized Markov decision processes. In AAMAS, pages 41--48, 2003.
[5]
Michael Beetz, Ulrich Klank, Ingo Kresse, Alexis Maldonado, Lorenz Mösenlechner, Dejan Pangercic, Thomas Rühr, and Moritz Tenorth. Robotic Roommates Making Pancakes. In 11th IEEE-RAS International Conference on Humanoid Robots, Bled, Slovenia, October, 26--28 2011.
[6]
Daniel S. Bernstein, Robert Givan, Neil Immerman, and Shlomo Zilberstein. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4):819--840, 2002.
[7]
Craig Boutilier. Planning, learning and coordination in multiagent decision processes. In Proc. of the 6th Conference on Theoretical Aspects of Rationality and Knowledge, pages 195--210, 1996.
[8]
Arne Brutschy, Giovanni Pini, Carlo Pinciroli, Mauro Birattari, and Marco Dorigo. Self-organized task allocation to sequentially interdependent tasks in swarm robotics. Autonomous Agents and Multi-Agent Systems, 28(1):101--125, 2014.
[9]
Jesús Capitán, Matthijs T. J. Spaan, Luis Merino, and Aníbal Ollero. Decentralized multi-robot cooperation with auctioned POMDPs. International Journal of Robotics Research, 32(6):650--671, 2013.
[10]
Yann-Michaël De Hauwere, Peter Vrancx, and Ann Nowé. Learning multi-agent state space representations. In AAMAS, pages 715--722, 2010.
[11]
Jilles S. Dibangoye, Christopher Amato, Olivier Buffet, and François Charpillet. Exploiting separability in multiagent planning with continuous-state MDPs. In Proceedings of the Thirteenth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 1281--1288, 2014.
[12]
Jilles Steeve Dibangoye, Christopher Amato, and Arnaud Doniec. Scaling up decentralized MDPs through heuristic search. In UAI, pages 217--226, 2012.
[13]
Prashant Doshi, Yifeng Zeng, and Qiongyu Chen. Graphical models for interactive POMDPs: representations and solutions. Autonomous Agents and Multi-Agent Systems, 18(3):376--416, 2008.
[14]
Greg Droge and Magnus Egerstedt. Adaptive look-ahead for robotic navigation in unknown environments. In IROS, pages 1134--1139, 2011.
[15]
Piotr J. Gmytrasiewicz and Prashant Doshi. A framework for sequential planning in multi-agent settings. Journal of Artificial Intelligence Research, 24:49--79, 2005.
[16]
Geoffrey J. Gordon, Pradeep Varakantham, William Yeoh, Hoong Chuin Lau, Ajay S. Aravamudhan, and Shih-Fen Cheng. Lagrangian relaxation for large-scale multi-agent planning. In 2012 IEEE/WIC/ACM International Conferences on Intelligent Agent Technology, IAT 2012, Macau, China, December 4--7, 2012, pages 494--501, 2012.
[17]
Olivier Guéant, Jean-Michel Lasry, and Pierre-Louis Lions. Mean field games and applications. In Paris-Princeton Lectures on Mathematical Finance 2010, volume 2003 of Lecture Notes in Mathematics, pages 205--266. Springer Berlin Heidelberg, 2011.
[18]
Carlos Guestrin, Shobha Venkataraman, and Daphne Koller. Context specific multiagent coordination and planning with factored MDPs. In AAAI, pages 253--259, 2002.
[19]
Jesse Hoey, Robert St-Aubin, Alan J. Hu, and Craig Boutilier. SPUDD: Stochastic planning using decision diagrams. In UAI, pages 279--288, 1999.
[20]
Levente Kocsis and Csaba Szepesvári. Bandit based Monte-Carlo planning. In Machine Learning: ECML 2006, volume 4212 of Lecture Notes in Computer Science, pages 282--293. Springer, 2006.
[21]
Jelle R. Kok and Nikos Vlassis. Collaborative multiagent reinforcement learning by payoff propagation. Journal of Machine Learning Research, 7:1789--1828, 2006.
[22]
Nyree Lemmens and Karl Tuyls. Stigmergic landmark optimization. Advances in Complex Systems, 15(8), 2012.
[23]
Han lim Choi, Luc Brunet, and Jonathan P. How. Consensus-based decentralized auctions for robust task allocation. IEEE Transactions on Robotics, 2009.
[24]
Laëtitia Matignon, Laurent Jeanpierre, and Abdel-Illah Mouaddib. Coordinated multi-robot exploration under communication constraints using decentralized markov decision processes. In AAAI, pages 2017--2023, 2012.
[25]
Francisco S. Melo and Manuela Veloso. Learning of coordination: exploiting sparse interactions in multiagent systems. In AAMAS, pages 773--780, 2009.
[26]
Martin L. Puterman. Markov Decision Processes - Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc., 1994.
[27]
Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore, and Martin A. Riedmiller. Distributed value functions. In Proc. of the International Conference on Machine Learning, pages 371--378, 1999.
[28]
Yoav Shoham and Moshe Tennenholtz. On social laws for artificial agent societies: off-line design. Artificial Intelligence, 73(1--2):231--252, 1995.
[29]
Jason Sleight and Edmund H. Durfee. A decision-theoretic characterization of organizational inuences. In AAMAS, pages 323--330, 2012.
[30]
Matthijs T. J. Spaan and Francisco S. Melo. Interaction-driven Markov games for decentralized multiagent planning under uncertainty. In AAMAS, pages 525--532, 2008.
[31]
Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. The MIT Press, March 1998.
[32]
Pradeep Varakantham, Shih-Fen Cheng, Geoffrey J. Gordon, and Asrar Ahmed. Decision support for agent populations in uncertain and congested environments. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, July 22--26, 2012, Toronto, Ontario, Canada., 2012.
[33]
Jianhui Wu and Edmund H. Durfee. Resource-driven mission-phasing techniques for constrained agents in stochastic environments. Journal of Artificial Intelligence Research, 38:415--473, 2010.
[34]
Yu Zhang and Lynne E. Parker. Task allocation with executable coalitions in multirobot tasks. In IEEE International Conference on Robotics and Automation, ICRA 2012, 14--18 May, 2012, St. Paul, Minnesota, USA, pages 3307--3314, 2012.

Cited By

View all
  • (2021)A Sufficient Statistic for Influence in Structured Multiagent EnvironmentsJournal of Artificial Intelligence Research10.1613/jair.1.1213670(789-870)Online publication date: 1-May-2021
  • (2020)Decentralized Control for Swarm Robots That Can Effectively Execute Spatially Distributed TasksArtificial Life10.1162/artl_a_0031726:2(242-259)Online publication date: 1-May-2020
  • (2019)Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent PoliciesProceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3306127.3332044(2162-2164)Online publication date: 8-May-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
AAMAS '15: Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems
May 2015
2072 pages
ISBN:9781450334136

Sponsors

  • IFAAMAS

In-Cooperation

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Richland, SC

Publication History

Published: 04 May 2015

Check for updates

Author Tags

  1. multi-robot systems
  2. robot coordination
  3. robot planning and plan execution
  4. robot teams
  5. robotics

Qualifiers

  • Research-article

Funding Sources

  • NWO Innovational Research Incentives Scheme Veni

Conference

AAMAS'15
Sponsor:

Acceptance Rates

AAMAS '15 Paper Acceptance Rate 108 of 670 submissions, 16%;
Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 17 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2021)A Sufficient Statistic for Influence in Structured Multiagent EnvironmentsJournal of Artificial Intelligence Research10.1613/jair.1.1213670(789-870)Online publication date: 1-May-2021
  • (2020)Decentralized Control for Swarm Robots That Can Effectively Execute Spatially Distributed TasksArtificial Life10.1162/artl_a_0031726:2(242-259)Online publication date: 1-May-2020
  • (2019)Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent PoliciesProceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3306127.3332044(2162-2164)Online publication date: 8-May-2019
  • (2018)Interactive learning and decision makingProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304652.3304829(5703-5708)Online publication date: 13-Jul-2018
  • (2017)Decentralised Online Planning for Multi-Robot Warehouse CommissioningProceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems10.5555/3091125.3091198(492-500)Online publication date: 8-May-2017
  • (2016)On Decentralized Coordination for Spatial Task Allocation and Scheduling in Heterogeneous TeamsProceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems10.5555/2936924.2937070(988-996)Online publication date: 9-May-2016
  • (2016)Software engineering for distributed autonomous real-time systemsProceedings of the 2nd International Workshop on Software Engineering for Smart Cyber-Physical Systems10.1145/2897035.2897040(54-57)Online publication date: 14-May-2016

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media