Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3522664.3528609acmconferencesArticle/Chapter ViewAbstractPublication PagescainConference Proceedingsconference-collections
short-paper

Engineering a platform for reinforcement learning workloads

Published: 17 October 2022 Publication History

Abstract

Reinforcement Learning (RL) is an area of machine learning concerned with teaching intelligent agents to take desired actions in a specific environment. The teaching part can be performed in a simulated environment where the agent can learn how to react to the (simulated) current state in order to reach a desired state. Offering Reinforcement Learning as a service with stringent reliability and scalability requirements, entails a set of challenges at both the architectural and implementation level. In this paper we present the Bonsai platform for RL workloads. We discuss the requirements, design and implementation of the Bonsai platform.

References

[1]
Richard S. Sutton and Andrew G. Barto Reinforcement Learning: An Introduction, MIT press, Cambridge, MA, 2018
[2]
Bonsai Platform, URL: https://www.microsoft.com/en-us/ai/autonomous-systems-project-bonsai, accessed in January 2022.
[3]
Inkling DSL language, URL: https://docs.microsoft.com/en-us/bonsai/inkling/basics, accessed in January 2022.
[4]
Azure AKS, URL: https://azure.microsoft.com/en-us/overview/kubernetes-on-azure, accessed in January 2022.
[5]
Azure Container Instances, URL: https://azure.microsoft.com/en-us/services/container-instances, accessed in January 2022.
[6]
The Four Pillars of High Availability, URL: https://alikanso.medium.com/the-four-pillars-of-high-availability-d41c1609e0ba, accessed in January 2022.

Index Terms

  1. Engineering a platform for reinforcement learning workloads

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CAIN '22: Proceedings of the 1st International Conference on AI Engineering: Software Engineering for AI
    May 2022
    254 pages
    ISBN:9781450392754
    DOI:10.1145/3522664
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    • IEEE TCSC: IEEE Technical Committee on Scalable Computing

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. auto-scaling
    2. cloud computing
    3. kubernetes
    4. machine learning
    5. reinforcement learning
    6. reliability

    Qualifiers

    • Short-paper

    Conference

    CAIN '22
    Sponsor:

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 38
      Total Downloads
    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 17 Oct 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media