Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2908738.2908739guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Leading multiple ad hoc teammates in joint action settings

Published: 01 January 2011 Publication History

Abstract

The growing use of autonomous agents in practice may require agents to cooperate as a team in situations where they have limited prior knowledge about one another, cannot communicate directly, or do not share the same world models. These situations raise the need to design ad hoc team members, i.e., agents that will be able to cooperate without coordination in order to reach an optimal team behavior. This paper considers problem of leading N-agent teams by a single agent toward their optimal joint utility, where the agents compute their next actions based only on their most recent observations of their teammates' actions. We show that compared to previous results in two-agent teams, in larger teams the agent might not be able to lead the team to the action with maximal joint utility. In these cases, the agent's optimal strategy leads the team to the best possible reachable cycle of joint actions. We describe a graphical model of the problem and a polynomial time algorithm for solving it. We then consider the problem of leading teams where the agents' base their actions on a longer history of past observations, showing that the an upper bound computation time exponential in the memory size is very likely to be tight.

References

[1]
Bowling, M., and McCracken, P. 2005. Coordination and adaptation in impromptu teams. In Proc. of AAAI'05, 53-58.
[2]
Chakraborty, D., and Stone, P. 2008. Online multiagent learning against memory bounded adversaries. In Machine Learning and Knowledge Discovery in Databases, volume 5212 of Lecture Notes in Artificial Intelligence, 211-26.
[3]
Garey, M. R., and Johnson, D. S. 1990. Computers and Intractability; A Guide to the Theory of NP-Completeness. W. H. Freeman & Co.
[4]
Jones, E.; Browning, B.; Dias, M.; Argall, B.; Veloso, M.; and Stentz, A. 2006. Dynamically formed heterogeneous robot teams performing tightly-coordinated tasks. In Proc. of ICRA'06, 570-575.
[5]
Karp, R. 1978. A characterization of the minimum cycle mean in a digraph. Discrete Mathematics 23.
[6]
Powers, R., and Shoham, Y. 2005. Learning against opponents with bounded memory. In Proc. of IJCAI'05, 817-822.
[7]
Stone, P., and Kraus, S. 2010. To teach or not to teach? decision making under uncertainty in ad hoc teams. In Proc. of AAMAS'10.
[8]
Stone, P.; Kaminka, G.; Kraus, S.; and Rosenschein, J. 2010. Ad hoc autonomous agent teams: Collaboration without precoordination. In Proc. of AAAI'10.
[9]
Stone, P.; Kaminka, G. A.; and Rosenschein, J. S. 2010. Leading a best-response teammate in an ad hoc team. In Agent-Mediated Electronic Commerce: Designing Trading Strategies and Mechanisms for Electronic Markets (AMEC).
[10]
Young, P. 1993. The evolution of conventions. Econometrica 61(1):57-84.

Cited By

View all
  • (2017)Allocating training instances to learning agents for team formationAutonomous Agents and Multi-Agent Systems10.1007/s10458-016-9355-331:4(905-940)Online publication date: 1-Jul-2017

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
AAAIWS'11-13: Proceedings of the 13th AAAI Conference on Interactive Decision Theory and Game Theory
January 2011
67 pages

Publisher

AAAI Press

Publication History

Published: 01 January 2011

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Allocating training instances to learning agents for team formationAutonomous Agents and Multi-Agent Systems10.1007/s10458-016-9355-331:4(905-940)Online publication date: 1-Jul-2017

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media