Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/2484920.2485118acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
poster

A game-theoretic model and best-response learning method for ad hoc coordination in multiagent systems

Published: 06 May 2013 Publication History
  • Get Citation Alerts
  • Abstract

    The ad hoc coordination problem is to design an ad hoc agent which is able to achieve optimal flexibility and efficiency in a multiagent system that admits no prior coordination between the ad hoc agent and the other agents. We conceptualise this problem formally as a stochastic Bayesian game in which the behaviour of a player is determined by its type. Based on this model, we derive a solution, called Harsanyi-Bellman Ad Hoc Coordination (HBA), which utilises a set of user-defined types to characterise players based on their observed behaviours. We evaluate HBA in the level-based foraging domain, showing that it outperforms several alternative algorithms using just a few user-defined types. We also report on a human-machine experiment in which the humans played Prisoner's Dilemma and Rock-Paper-Scissors against HBA and alternative algorithms. The results show that HBA achieved equal efficiency but a significantly higher welfare and winning rate.

    References

    [1]
    S. Albrecht and S. Ramamoorthy. Comparative evaluation of MAL algorithms in a diverse set of ad hoc team problems. In 11th Autonomous Agents and Multiagent Systems, 2012.
    [2]
    S. Albrecht and S. Ramamoorthy. A game-theoretic model and best-response learning method for ad hoc coordination in multiagent systems. Technical report, School of Informatics, University of Edinburgh, 2012. http://wcms.inf.ed.ac.uk/ipab/autonomy/publications/SAlbrecht_tech_report.pdf.
    [3]
    M. Bowling and P. McCracken. Coordination and adaptation in impromptu teams. In Proceedings of the National Conference on Artificial Intelligence, volume 20, page 53, 2005.
    [4]
    M. Dias, T. Harris, B. Browning, E. Jones, B. Argall, M. Veloso, A. Stentz, and A. Rudnicky. Dynamically formed human-robot teams performing coordinated tasks. In AAAI Spring Symposium, 2006.
    [5]
    J. Harsanyi. Games with incomplete information played by "Bayesian" players. Part I. The basic model. Management Science, 14(3):159--182, 1967.
    [6]
    P. Stone and S. Kraus. To teach or not to teach? Decision making under uncertainty in ad hoc teams. In 9th Autonomous Agents and Multiagent Systems, 2010.

    Cited By

    View all
    • (2021)A Sufficient Statistic for Influence in Structured Multiagent EnvironmentsJournal of Artificial Intelligence Research10.1613/jair.1.1213670(789-870)Online publication date: 1-May-2021
    • (2019)ATSISProceedings of the 28th International Joint Conference on Artificial Intelligence10.5555/3367032.3367058(172-179)Online publication date: 10-Aug-2019
    • (2019)Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork - The STAR FrameworkProceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3306127.3331914(1773-1775)Online publication date: 8-May-2019
    • Show More Cited By

    Index Terms

    1. A game-theoretic model and best-response learning method for ad hoc coordination in multiagent systems

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      AAMAS '13: Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
      May 2013
      1500 pages
      ISBN:9781450319935

      Sponsors

      • IFAAMAS

      In-Cooperation

      Publisher

      International Foundation for Autonomous Agents and Multiagent Systems

      Richland, SC

      Publication History

      Published: 06 May 2013

      Check for updates

      Author Tags

      1. ad hoc coordination
      2. harsanyi-bellman ad hoc coordination (HBA)
      3. stochastic bayesian games (SBG)

      Qualifiers

      • Poster

      Conference

      AAMAS '13
      Sponsor:

      Acceptance Rates

      AAMAS '13 Paper Acceptance Rate 140 of 599 submissions, 23%;
      Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)8
      • Downloads (Last 6 weeks)1

      Other Metrics

      Citations

      Cited By

      View all
      • (2021)A Sufficient Statistic for Influence in Structured Multiagent EnvironmentsJournal of Artificial Intelligence Research10.1613/jair.1.1213670(789-870)Online publication date: 1-May-2021
      • (2019)ATSISProceedings of the 28th International Joint Conference on Artificial Intelligence10.5555/3367032.3367058(172-179)Online publication date: 10-Aug-2019
      • (2019)Teaching Social Behavior through Human Reinforcement for Ad hoc Teamwork - The STAR FrameworkProceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems10.5555/3306127.3331914(1773-1775)Online publication date: 8-May-2019
      • (2017)On Markov games played by bayesian and boundedly-rational playersProceedings of the Thirty-First AAAI Conference on Artificial Intelligence10.5555/3298239.3298305(437-443)Online publication date: 4-Feb-2017
      • (2017)Coordinated versus decentralized exploration in multi-agent multi-armed banditsProceedings of the 26th International Joint Conference on Artificial Intelligence10.5555/3171642.3171667(164-170)Online publication date: 19-Aug-2017
      • (2017)Allocating training instances to learning agents for team formationAutonomous Agents and Multi-Agent Systems10.1007/s10458-016-9355-331:4(905-940)Online publication date: 1-Jul-2017
      • (2017)Can bounded and self-interested agents be teammates? Application to planning in ad hoc teamsAutonomous Agents and Multi-Agent Systems10.1007/s10458-016-9354-431:4(821-860)Online publication date: 1-Jul-2017
      • (2017)Efficiently detecting switches against non-stationary opponentsAutonomous Agents and Multi-Agent Systems10.1007/s10458-016-9352-631:4(767-789)Online publication date: 1-Jul-2017
      • (2014)On convergence and optimality of best-response learning with policy types in multiagent systemsProceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence10.5555/3020751.3020754(12-21)Online publication date: 23-Jul-2014
      • (2013)Ad hoc coordination in multiagent systems with applications to human-machine interactionProceedings of the 2013 international conference on Autonomous agents and multi-agent systems10.5555/2484920.2485253(1415-1416)Online publication date: 6-May-2013

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media