Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–8 of 8 results for author: Hussing, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17466  [pdf, other

    cs.LG cs.MA

    Distributed Continual Learning

    Authors: Long Le, Marcel Hussing, Eric Eaton

    Abstract: This work studies the intersection of continual and federated learning, in which independent agents face unique tasks in their environments and incrementally develop and share knowledge. We introduce a mathematical framework capturing the essential aspects of distributed continual learning, including agent model and statistical heterogeneity, continual distribution shift, network topology, and com… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2405.16739  [pdf, other

    cs.LG cs.AI eess.SY

    Oracle-Efficient Reinforcement Learning for Max Value Ensembles

    Authors: Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell

    Abstract: Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardinality) and experimentally (where function approximation and policy gradient techniques often scale poorly and suffer from instability and high variance). One line of research attempting to address thes… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  3. arXiv:2403.05996  [pdf, other

    cs.LG cs.AI

    Dissecting Deep RL with High Update Ratios: Combatting Value Overestimation and Divergence

    Authors: Marcel Hussing, Claas Voelcker, Igor Gilitschenski, Amir-massoud Farahmand, Eric Eaton

    Abstract: We show that deep reinforcement learning can maintain its ability to learn without resetting network parameters in settings where the number of gradient updates greatly exceeds the number of environment samples. Under such large update-to-data ratios, a recent study by Nikishin et al. (2022) suggested the emergence of a primacy bias, in which agents overfit early interactions and downplay later ex… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  4. arXiv:2307.07091  [pdf, other

    cs.LG cs.AI cs.RO

    Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning

    Authors: Marcel Hussing, Jorge A. Mendez, Anisha Singrodia, Cassandra Kent, Eric Eaton

    Abstract: Offline reinforcement learning (RL) is a promising direction that allows RL agents to pre-train on large datasets, avoiding the recurrence of expensive data collection. To advance the field, it is crucial to generate large-scale datasets. Compositional RL is particularly appealing for generating such large datasets, since 1) it permits creating many tasks from few components, 2) the task structure… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  5. arXiv:2305.15284  [pdf, other

    cs.LG

    Replicable Reinforcement Learning

    Authors: Eric Eaton, Marcel Hussing, Michael Kearns, Jessica Sorrell

    Abstract: The replicability crisis in the social, behavioral, and data sciences has led to the formulation of algorithm frameworks for replicability -- i.e., a requirement that an algorithm produce identical outputs (with high probability) when run on two different samples from the same underlying distribution. While still in its infancy, provably replicable algorithms have been developed for many fundament… ▽ More

    Submitted 31 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  6. arXiv:2212.03084  [pdf, other

    cs.CV cs.AI cs.LG

    Land Use Prediction using Electro-Optical to SAR Few-Shot Transfer Learning

    Authors: Marcel Hussing, Karen Li, Eric Eaton

    Abstract: Satellite image analysis has important implications for land use, urbanization, and ecosystem monitoring. Deep learning methods can facilitate the analysis of different satellite modalities, such as electro-optical (EO) and synthetic aperture radar (SAR) imagery, by supporting knowledge transfer between the modalities to compensate for individual shortcomings. Recent progress has shown how distrib… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: Published at Tackling Climate Change with Machine Learning workshop at NeurIPS 2022

  7. arXiv:2207.04136  [pdf, other

    cs.LG cs.AI cs.RO

    CompoSuite: A Compositional Reinforcement Learning Benchmark

    Authors: Jorge A. Mendez, Marcel Hussing, Meghna Gummadi, Eric Eaton

    Abstract: We present CompoSuite, an open-source simulated robotic manipulation benchmark for compositional multi-task reinforcement learning (RL). Each CompoSuite task requires a particular robot arm to manipulate one individual object to achieve a task objective while avoiding an obstacle. This compositional definition of the tasks endows CompoSuite with two remarkable properties. First, varying the robot/… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: Published at 1st Conference on Lifelong Learning Agents, 2022; code: https://github.com/Lifelong-ML/CompoSuite

  8. arXiv:1910.02425  [pdf, other

    cs.LG cs.CV stat.ML

    Structured Object-Aware Physics Prediction for Video Modeling and Planning

    Authors: Jannik Kossen, Karl Stelzner, Marcel Hussing, Claas Voelcker, Kristian Kersting

    Abstract: When humans observe a physical system, they can easily locate objects, understand their interactions, and anticipate future behavior, even in settings with complicated and previously unseen interactions. For computers, however, learning such models from videos in an unsupervised fashion is an unsolved research problem. In this paper, we present STOVE, a novel state-space model for videos, which ex… ▽ More

    Submitted 12 February, 2020; v1 submitted 6 October, 2019; originally announced October 2019.

    Comments: Published as a conference paper at 2020 International Conference for Learning Representations