Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 73 results for author: Martius, G

.
  1. arXiv:2407.05920  [pdf, other

    cs.LG

    LPGD: A General Framework for Backpropagation through Embedded Optimization Layers

    Authors: Anselm Paulus, Georg Martius, Vít Musil

    Abstract: Embedding parameterized optimization problems as layers into machine learning architectures serves as a powerful inductive bias. Training such architectures with stochastic gradient descent requires care, as degenerate derivatives of the embedded optimization problem often render the gradients uninformative. We propose Lagrangian Proximal Gradient Descent (LPGD) a flexible framework for training a… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: ICML 2024 conference paper

  2. arXiv:2405.18917  [pdf, other

    cs.LG cs.AI cs.RO

    Causal Action Influence Aware Counterfactual Data Augmentation

    Authors: Núria Armengol Urpí, Marco Bagatella, Marin Vlastelica, Georg Martius

    Abstract: Offline data are both valuable and practical resources for teaching robots complex behaviors. Ideally, learning agents should not be constrained by the scarcity of available demonstrations, but rather generalize beyond the training distribution. However, the complexity of real-world scenarios typically requires huge amounts of data to prevent neural network policies from picking up on spurious cor… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted in 41st International Conference on Machine Learning (ICML 2024)

  3. arXiv:2404.11735  [pdf, other

    cs.LG cs.CV cs.RO

    Learning with 3D rotations, a hitchhiker's guide to SO(3)

    Authors: A. René Geist, Jonas Frey, Mikel Zobro, Anna Levina, Georg Martius

    Abstract: Many settings in machine learning require the selection of a rotation representation. However, choosing a suitable representation from the many available options is challenging. This paper acts as a survey and guide through rotation representations. We walk through their properties that harm or benefit deep learning with gradient-based optimization. By consolidating insights from rotation-based le… ▽ More

    Submitted 19 June, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Published at ICML 2024

  4. arXiv:2404.07110  [pdf, other

    cs.RO cs.CV cs.LG

    Wild Visual Navigation: Fast Traversability Learning via Pre-Trained Models and Online Self-Supervision

    Authors: Matías Mattamala, Jonas Frey, Piotr Libera, Nived Chebrolu, Georg Martius, Cesar Cadena, Marco Hutter, Maurice Fallon

    Abstract: Natural environments such as forests and grasslands are challenging for robotic navigation because of the false perception of rigid obstacles from high grass, twigs, or bushes. In this work, we present Wild Visual Navigation (WVN), an online self-supervised learning system for visual traversability estimation. The system is able to continuously adapt from a short human demonstration in the field,… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Extended version of arXiv:2305.08510

  5. arXiv:2402.05371  [pdf, other

    cs.RO

    Learning to Control Emulated Muscles in Real Robots: Towards Exploiting Bio-Inspired Actuator Morphology

    Authors: Pierre Schumacher, Lorenz Krause, Jan Schneider, Dieter Büchler, Georg Martius, Daniel Haeufle

    Abstract: Recent studies have demonstrated the immense potential of exploiting muscle actuator morphology for natural and robust movement -- in simulation. A validation on real robotic hardware is yet missing. In this study, we emulate muscle actuator properties on hardware in real-time, taking advantage of modern and affordable electric motors. We demonstrate that our setup can emulate a simplified muscle… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  6. arXiv:2402.03913  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech

    Machine learning stochastic differential equations for the evolution of order parameters of classical many-body systems in and out of equilibrium

    Authors: Francesco Carnazza, Federico Carollo, Sabine Andergassen, Georg Martius, Miriam Klopotek, Igor Lesanovsky

    Abstract: We develop a machine learning algorithm to infer the emergent stochastic equation governing the evolution of an order parameter of a many-body system. We train our neural network to independently learn the directed force acting on the order parameter as well as an effective diffusive noise. We illustrate our approach using the classical Ising model endowed with Glauber dynamics, and the contact pr… ▽ More

    Submitted 4 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 11 pages, 6 figure, 1 table

  7. Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling

    Authors: Jakob Hollenstein, Georg Martius, Justus Piater

    Abstract: Proximal Policy Optimization (PPO), a popular on-policy deep reinforcement learning method, employs a stochastic policy for exploration. In this paper, we propose a colored noise-based stochastic policy variant of PPO. Previous research highlighted the importance of temporal correlation in action noise for effective exploration in off-policy reinforcement learning. Building on this, we investigate… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Journal ref: (2024) Proceedings of the AAAI Conference on Artificial Intelligence, 38(11), 12466-12472

  8. arXiv:2312.01473  [pdf, other

    cs.LG

    Regularity as Intrinsic Reward for Free Play

    Authors: Cansu Sancaktar, Justus Piater, Georg Martius

    Abstract: We propose regularity as a novel reward signal for intrinsically-motivated reinforcement learning. Taking inspiration from child development, we postulate that striving for structure and order helps guide exploration towards a subspace of tasks that are not favored by naive uncertainty-based intrinsic rewards. Our generalized formulation of Regularity as Intrinsic Reward (RaIR) allows us to operat… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 camera-ready version. Project webpage at http://sites.google.com/view/rair-project

  9. arXiv:2311.16996  [pdf, other

    cs.LG cs.AI

    Goal-conditioned Offline Planning from Curious Exploration

    Authors: Marco Bagatella, Georg Martius

    Abstract: Curiosity has established itself as a powerful exploration strategy in deep reinforcement learning. Notably, leveraging expected future novelty as intrinsic motivation has been shown to efficiently generate exploratory trajectories, as well as a robust dynamics model. We consider the challenge of extracting goal-conditioned behavior from the products of such unsupervised exploration techniques, wi… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  10. arXiv:2311.04358  [pdf, other

    cond-mat.stat-mech cond-mat.soft

    Machine learning of a density functional for anisotropic patchy particles

    Authors: Alessandro Simon, Jens Weimar, Georg Martius, Martin Oettel

    Abstract: Anisotropic patchy particles have become an archetypical statistical model system for associating fluids. Here we formulate an approach to the Kern-Frenkel model via classical density functional theory to describe the positionally and orientationally resolved equilibrium density distributions in flat wall geometries. The density functional is split into a reference part for the orientationally ave… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  11. arXiv:2311.04056  [pdf, other

    cs.LG cs.AI

    Multi-View Causal Representation Learning with Partial Observability

    Authors: Dingling Yao, Danru Xu, Sébastien Lachapelle, Sara Magliacane, Perouz Taslakian, Georg Martius, Julius von Kügelgen, Francesco Locatello

    Abstract: We present a unified framework for studying the identifiability of representations learned from simultaneously observed views, such as different data modalities. We allow a partially observed setting in which each view constitutes a nonlinear mixture of a subset of underlying latent variables, which can be causally related. We prove that the information shared across all subsets of any number of v… ▽ More

    Submitted 8 March, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 28 pages, 10 figures, 11 tables

  12. arXiv:2310.02440  [pdf, other

    cs.RO cs.AI

    Learning Diverse Skills for Local Navigation under Multi-constraint Optimality

    Authors: Jin Cheng, Marin Vlastelica, Pavel Kolev, Chenhao Li, Georg Martius

    Abstract: Despite many successful applications of data-driven control in robotics, extracting meaningful diverse behaviors remains a challenge. Typically, task performance needs to be compromised in order to achieve diversity. In many scenarios, task requirements are specified as a multitude of reward terms, each requiring a different trade-off. In this work, we take a constrained optimization viewpoint on… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 7 pages, 6 figures, in submission to ICRA 2024

  13. arXiv:2309.12927  [pdf, other

    cs.NE q-bio.NC

    Emergent mechanisms for long timescales depend on training curriculum and affect performance in memory tasks

    Authors: Sina Khajehabdollahi, Roxana Zeraati, Emmanouil Giannakakis, Tim Jakob Schäfer, Georg Martius, Anna Levina

    Abstract: Recurrent neural networks (RNNs) in the brain and in silico excel at solving tasks with intricate temporal dependencies. Long timescales required for solving such tasks can arise from properties of individual neurons (single-neuron timescale, $τ$, e.g., membrane time constant in biological neurons) or recurrent interactions among them (network-mediated timescale). However, the contribution of each… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Journal ref: The Twelfth International Conference on Learning Representations (2024)

  14. arXiv:2309.05582  [pdf, other

    cs.LG cs.AI cs.RO

    Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning

    Authors: Marin Vlastelica, Sebastian Blaes, Cristina Pineri, Georg Martius

    Abstract: We introduce a simple but effective method for managing risk in model-based reinforcement learning with trajectory sampling that involves probabilistic safety constraints and balancing of optimism in the face of epistemic uncertainty and pessimism in the face of aleatoric uncertainty of an ensemble of stochastic neural networks.Various experiments indicate that the separation of uncertainties is e… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  15. arXiv:2309.02976  [pdf, other

    cs.RO cs.LG

    Natural and Robust Walking using Reinforcement Learning without Demonstrations in High-Dimensional Musculoskeletal Models

    Authors: Pierre Schumacher, Thomas Geijtenbeek, Vittorio Caggiano, Vikash Kumar, Syn Schmitt, Georg Martius, Daniel F. B. Haeufle

    Abstract: Humans excel at robust bipedal walking in complex natural environments. In each step, they adequately tune the interaction of biomechanical muscle dynamics and neuronal signals to be robust against uncertainties in ground conditions. However, it is still not fully understood how the nervous system resolves the musculoskeletal redundancy to solve the multi-objective control problem considering stab… ▽ More

    Submitted 7 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  16. arXiv:2308.07741  [pdf, other

    cs.RO cs.LG

    Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World

    Authors: Nico Gürtler, Felix Widmaier, Cansu Sancaktar, Sebastian Blaes, Pavel Kolev, Stefan Bauer, Manuel Wüthrich, Markus Wulfmeier, Martin Riedmiller, Arthur Allshire, Qiang Wang, Robert McCarthy, Hangyeol Kim, Jongchan Baek, Wookyong Kwon, Shanliang Qian, Yasunori Toshimitsu, Mike Yan Michelis, Amirhossein Kazemipour, Arman Raayatsanati, Hehui Zheng, Barnabas Gavin Cangan, Bernhard Schölkopf, Georg Martius

    Abstract: Experimentation on real robots is demanding in terms of time and costs. For this reason, a large part of the reinforcement learning (RL) community uses simulators to develop and benchmark algorithms. However, insights gained in simulation do not necessarily translate to real robots, in particular for tasks involving complex interactions with the environment. The Real Robot Challenge 2022 therefore… ▽ More

    Submitted 24 November, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Typo in author list fixed

  17. arXiv:2307.15690  [pdf, other

    cs.LG cs.RO

    Benchmarking Offline Reinforcement Learning on Real-Robot Hardware

    Authors: Nico Gürtler, Sebastian Blaes, Pavel Kolev, Felix Widmaier, Manuel Wüthrich, Stefan Bauer, Bernhard Schölkopf, Georg Martius

    Abstract: Learning policies from previously recorded data is a promising direction for real-world robotics tasks, as online learning is often infeasible. Dexterous manipulation in particular remains an open problem in its general form. The combination of offline reinforcement learning with large diverse datasets, however, has the potential to lead to a breakthrough in this challenging domain analogously to… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: The Eleventh International Conference on Learning Representations. 2022. Published at ICLR 2023. Datasets available at https://github.com/rr-learning/trifinger_rl_datasets

  18. arXiv:2307.11373  [pdf, other

    cs.LG cs.AI cs.RO

    Offline Diversity Maximization Under Imitation Constraints

    Authors: Marin Vlastelica, Jin Cheng, Georg Martius, Pavel Kolev

    Abstract: There has been significant recent progress in the area of unsupervised skill discovery, utilizing various information-theoretic objectives as measures of diversity. Despite these advances, challenges remain: current methods require significant online interaction, fail to leverage vast amounts of available task-agnostic data and typically lack a quantitative measure of skill utility. We address the… ▽ More

    Submitted 21 June, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: RLC 2024

  19. arXiv:2306.16922  [pdf, other

    cs.NE cs.AI cs.LG q-bio.NC

    The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks

    Authors: Aaron Spieler, Nasim Rahaman, Georg Martius, Bernhard Schölkopf, Anna Levina

    Abstract: Biological cortical neurons are remarkably sophisticated computational devices, temporally integrating their vast synaptic input over an intricate dendritic tree, subject to complex, nonlinearly interacting internal biological processes. A recent study proposed to characterize this complexity by fitting accurate surrogate models to replicate the input-output relationship of a detailed biophysical… ▽ More

    Submitted 17 March, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 25 pages, 14 figures, 13 tables, additional experiments and clarifications, accepted to ICLR 2024

  20. arXiv:2306.07067  [pdf, other

    cs.NE nlin.AO nlin.CG

    Locally adaptive cellular automata for goal-oriented self-organization

    Authors: Sina Khajehabdollahi, Emmanouil Giannakakis, Victor Buendia, Georg Martius, Anna Levina

    Abstract: The essential ingredient for studying the phenomena of emergence is the ability to generate and manipulate emergent systems that span large scales. Cellular automata are the model class particularly known for their effective scalability but are also typically constrained by fixed local rules. In this paper, we propose a new model class of adaptive cellular automata that allows for the generation o… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  21. arXiv:2306.04829  [pdf, other

    cs.CV cs.LG

    Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities

    Authors: Andrii Zadaianchuk, Maximilian Seitzer, Georg Martius

    Abstract: Unsupervised video-based object-centric learning is a promising avenue to learn structured representations from large, unlabeled video collections, but previous approaches have only managed to scale to real-world datasets in restricted domains. Recently, it was shown that the reconstruction of pre-trained self-supervised features leads to object-centric representations on unconstrained real-world… ▽ More

    Submitted 8 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. Website and code available at https://martius-lab.github.io/videosaur

  22. arXiv:2306.03935  [pdf, other

    quant-ph cond-mat.dis-nn cond-mat.quant-gas cond-mat.stat-mech

    Inferring interpretable dynamical generators of local quantum observables from projective measurements through machine learning

    Authors: Giovanni Cemin, Francesco Carnazza, Sabine Andergassen, Georg Martius, Federico Carollo, Igor Lesanovsky

    Abstract: To characterize the dynamical behavior of many-body quantum systems, one is usually interested in the evolution of so-called order-parameters rather than in characterizing the full quantum state. In many situations, these quantities coincide with the expectation value of local observables, such as the magnetization or the particle density. In experiment, however, these expectation values can only… ▽ More

    Submitted 20 February, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 7+4 pages, 3+5 figures

  23. arXiv:2306.03655  [pdf, other

    cs.LG math.OC

    Online Learning under Adversarial Nonlinear Constraints

    Authors: Pavel Kolev, Georg Martius, Michael Muehlebach

    Abstract: In many applications, learning systems are required to process continuous non-stationary data streams. We study this problem in an online learning framework and propose an algorithm that can deal with adversarial time-varying and nonlinear constraints. As we show in our work, the algorithm called Constraint Violation Velocity Projection (CVV-Pro) achieves $\sqrt{T}$ regret and converges to the fea… ▽ More

    Submitted 13 October, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  24. arXiv:2305.13341  [pdf, other

    physics.data-an cs.AI cs.LG stat.ME

    Discovering Causal Relations and Equations from Data

    Authors: Gustau Camps-Valls, Andreas Gerhardus, Urmi Ninad, Gherardo Varando, Georg Martius, Emili Balaguer-Ballester, Ricardo Vinuesa, Emiliano Diaz, Laure Zanna, Jakob Runge

    Abstract: Physics is a field of science that has traditionally used the scientific method to answer questions about why natural phenomena occur and to make testable models that explain the phenomena. Discovering equations, laws and principles that are invariant, robust and causal explanations of the world has been fundamental in physical sciences throughout the centuries. Discoveries emerge from observing t… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 137 pages

  25. arXiv:2304.10990  [pdf, other

    cs.RO

    Minsight: A Fingertip-Sized Vision-Based Tactile Sensor for Robotic Manipulation

    Authors: Iris Andrussow, Huanbo Sun, Katherine J. Kuchenbecker, Georg Martius

    Abstract: Intelligent interaction with the physical world requires perceptual abilities beyond vision and hearing; vibrant tactile sensing is essential for autonomous robots to dexterously manipulate unfamiliar objects or safely contact humans. Therefore, robotic manipulators need high-resolution touch sensors that are compact, robust, inexpensive, and efficient. The soft vision-based haptic sensor presente… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  26. arXiv:2304.04664  [pdf, other

    physics.ao-ph cs.LG

    Inductive biases in deep learning models for weather prediction

    Authors: Jannik Thuemmel, Matthias Karlbauer, Sebastian Otte, Christiane Zarfl, Georg Martius, Nicole Ludwig, Thomas Scholten, Ulrich Friedrich, Volker Wulfmeyer, Bedartha Goswami, Martin V. Butz

    Abstract: Deep learning has gained immense popularity in the Earth sciences as it enables us to formulate purely data-driven models of complex Earth system processes. Deep learning-based weather prediction (DLWP) models have made significant progress in the last few years, achieving forecast skills comparable to established numerical weather prediction models with comparatively lesser computational costs. I… ▽ More

    Submitted 30 April, 2024; v1 submitted 6 April, 2023; originally announced April 2023.

  27. When to be critical? Performance and evolvability in different regimes of neural Ising agents

    Authors: Sina Khajehabdollahi, Jan Prosi, Emmanouil Giannakakis, Georg Martius, Anna Levina

    Abstract: It has long been hypothesized that operating close to the critical state is beneficial for natural, artificial and their evolutionary systems. We put this hypothesis to test in a system of evolving foraging agents controlled by neural networks that can adapt agents' dynamical regime throughout evolution. Surprisingly, we find that all populations that discover solutions, evolve to be subcritical.… ▽ More

    Submitted 24 November, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2103.12184

    Journal ref: Artificial Life (2022) 28 (4): 458-478

  28. arXiv:2303.09628  [pdf, other

    cs.LG cs.RO

    Efficient Learning of High Level Plans from Play

    Authors: Núria Armengol Urpí, Marco Bagatella, Otmar Hilliges, Georg Martius, Stelian Coros

    Abstract: Real-world robotic manipulation tasks remain an elusive challenge, since they involve both fine-grained environment interaction, as well as the ability to plan for long-horizon goals. Although deep reinforcement learning (RL) methods have shown encouraging results when planning end-to-end in high-dimensional environments, they remain fundamentally limited by poor sample efficiency due to inefficie… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted to the International Conference on Robotics and Automation 2023

  29. arXiv:2209.07899  [pdf, other

    cs.RO cs.AI cs.LG

    Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions

    Authors: Chenhao Li, Sebastian Blaes, Pavel Kolev, Marin Vlastelica, Jonas Frey, Georg Martius

    Abstract: Learning diverse skills is one of the main challenges in robotics. To this end, imitation learning approaches have achieved impressive results. These methods require explicitly labeled datasets or assume consistent skill execution to enable learning and active control of individual behaviors, which limits their applicability. In this work, we propose a cooperative adversarial method for obtaining… ▽ More

    Submitted 11 February, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

  30. arXiv:2207.03952  [pdf, other

    cs.RO cs.LG

    Learning with Muscles: Benefits for Data-Efficiency and Robustness in Anthropomorphic Tasks

    Authors: Isabell Wochner, Pierre Schumacher, Georg Martius, Dieter Büchler, Syn Schmitt, Daniel F. B. Haeufle

    Abstract: Humans are able to outperform robots in terms of robustness, versatility, and learning of new tasks in a wide variety of movements. We hypothesize that highly nonlinear muscle dynamics play a large role in providing inherent stability, which is favorable to learning. While recent advances have been made in applying modern learning techniques to muscle-actuated systems both in simulation as well as… ▽ More

    Submitted 16 January, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

  31. arXiv:2206.11693  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

    Authors: Chenhao Li, Marin Vlastelica, Sebastian Blaes, Jonas Frey, Felix Grimminger, Georg Martius

    Abstract: Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method fo… ▽ More

    Submitted 21 November, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

  32. arXiv:2206.11403  [pdf, other

    cs.LG cs.AI cs.RO

    Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation

    Authors: Cansu Sancaktar, Sebastian Blaes, Georg Martius

    Abstract: It has been a long-standing dream to design artificial agents that explore their environment efficiently via intrinsic motivation, similar to how children perform curious free play. Despite recent advances in intrinsically motivated reinforcement learning (RL), sample-efficient exploration in object manipulation scenarios remains a significant challenge as most of the relevant information lies in… ▽ More

    Submitted 26 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version

  33. arXiv:2206.02416  [pdf, other

    stat.ML cs.AI cs.LG

    Embrace the Gap: VAEs Perform Independent Mechanism Analysis

    Authors: Patrik Reizinger, Luigi Gresele, Jack Brady, Julius von Kügelgen, Dominik Zietlow, Bernhard Schölkopf, Georg Martius, Wieland Brendel, Michel Besserve

    Abstract: Variational autoencoders (VAEs) are a popular framework for modeling complex data distributions; they can be efficiently trained via variational inference by maximizing the evidence lower bound (ELBO), at the expense of a gap to the exact (log-)marginal likelihood. While VAEs are commonly used for representation learning, it is unclear why ELBO maximization would yield useful representations, sinc… ▽ More

    Submitted 27 January, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: NeurIPS2022 final version

  34. arXiv:2206.02042  [pdf, other

    cs.LG cs.AI

    Developing hierarchical anticipations via neural network-based event segmentation

    Authors: Christian Gumbsch, Maurits Adam, Birgit Elsner, Georg Martius, Martin V. Butz

    Abstract: Humans can make predictions on various time scales and hierarchical levels. Thereby, the learning of event encodings seems to play a crucial role. In this work we model the development of hierarchical predictions via autonomously learned latent event codes. We present a hierarchical recurrent neural network architecture, whose inductive learning biases foster the development of sparsely changing l… ▽ More

    Submitted 28 August, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: accepted at ICDL 2022

  35. arXiv:2206.00484  [pdf, other

    cs.RO cs.LG

    DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems

    Authors: Pierre Schumacher, Daniel Häufle, Dieter Büchler, Syn Schmitt, Georg Martius

    Abstract: Muscle-actuated organisms are capable of learning an unparalleled diversity of dexterous movements despite their vast amount of muscles. Reinforcement learning (RL) on large musculoskeletal models, however, has not been able to show similar performance. We conjecture that ineffective exploration in large overactuated action spaces is a key problem. This is supported by the finding that common expl… ▽ More

    Submitted 27 April, 2023; v1 submitted 30 May, 2022; originally announced June 2022.

  36. arXiv:2205.15213  [pdf, other

    cs.LG

    Backpropagation through Combinatorial Algorithms: Identity with Projection Works

    Authors: Subham Sekhar Sahoo, Anselm Paulus, Marin Vlastelica, Vít Musil, Volodymyr Kuleshov, Georg Martius

    Abstract: Embedding discrete solvers as differentiable layers has given modern deep learning architectures combinatorial expressivity and discrete reasoning capabilities. The derivative of these solvers is zero or undefined, therefore a meaningful replacement is crucial for effective gradient-based learning. Prior works rely on smoothing the solver with input perturbations, relaxing the solver to continuous… ▽ More

    Submitted 17 March, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: ICLR 2023 conference paper. The first two authors contributed equally

  37. arXiv:2203.09168  [pdf, other

    cs.LG stat.ML

    On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural Networks

    Authors: Maximilian Seitzer, Arash Tavakoli, Dimitrije Antic, Georg Martius

    Abstract: Capturing aleatoric uncertainty is a critical part of many machine learning systems. In deep learning, a common approach to this end is to train a neural network to estimate the parameters of a heteroscedastic Gaussian distribution by maximizing the logarithm of the likelihood function under the observed data. In this work, we examine this approach and identify potential hazards associated with th… ▽ More

    Submitted 1 April, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: ICLR 2022 camera-ready version. Code available at http://github.com/martius-lab/beta-nll

  38. arXiv:2201.11599  [pdf, other

    quant-ph cond-mat.quant-gas

    Inferring Markovian quantum master equations of few-body observables in interacting spin chains

    Authors: Francesco Carnazza, Federico Carollo, Dominik Zietlow, Sabine Andergassen, Georg Martius, Igor Lesanovsky

    Abstract: Full information about a many-body quantum system is usually out-of-reach due to the exponential growth -- with the size of the system -- of the number of parameters needed to encode its state. Nonetheless, in order to understand the complex phenomenology that can be observed in these systems, it is often sufficient to consider dynamical or stationary properties of local observables or, at most, o… ▽ More

    Submitted 25 July, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 24 pages, 4 figures

    Journal ref: New J. Phys. 24 073033 (2022)

  39. arXiv:2112.03100  [pdf, other

    cs.LG

    Hierarchical Reinforcement Learning with Timed Subgoals

    Authors: Nico Gürtler, Dieter Büchler, Georg Martius

    Abstract: Hierarchical reinforcement learning (HRL) holds great potential for sample-efficient learning on challenging long-horizon tasks. In particular, letting a higher level assign subgoals to a lower level has been shown to enable fast learning on difficult problems. However, such subgoal-based methods have been designed with static reinforcement learning environments in mind and consequently struggle w… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Published at NeurIPS 2021. Code available at https://github.com/martius-lab/HiTS

  40. arXiv:2111.05934  [pdf, other

    cs.RO cs.CV cs.LG eess.SY

    A soft thumb-sized vision-based sensor with accurate all-round force perception

    Authors: Huanbo Sun, Katherine J. Kuchenbecker, Georg Martius

    Abstract: Vision-based haptic sensors have emerged as a promising approach to robotic touch due to affordable high-resolution cameras and successful computer-vision techniques. However, their physical design and the information they provide do not yet meet the requirements of real applications. We present a robust, soft, low-cost, vision-based, thumb-sized 3D haptic sensor named Insight: it continually prov… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: 1 table, 5 figures, 24 pages for the main manuscript. 5 tables, 12 figures, 27 pages for the supplementary material. 8 supplementary videos

  41. arXiv:2110.15949  [pdf, other

    cs.LG cs.AI

    Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains

    Authors: Christian Gumbsch, Martin V. Butz, Georg Martius

    Abstract: A common approach to prediction and planning in partially observable domains is to use recurrent neural networks (RNNs), which ideally develop and maintain a latent memory about hidden, task-relevant factors. We hypothesize that many of these hidden factors in the physical world are constant over time, changing only sparsely. To study this hypothesis, we propose Gated $L_0$ Regularized Dynamics (G… ▽ More

    Submitted 13 January, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021

  42. arXiv:2110.06149  [pdf, other

    cs.LG cs.AI

    Planning from Pixels in Environments with Combinatorially Hard Search Spaces

    Authors: Marco Bagatella, Mirek Olšák, Michal Rolínek, Georg Martius

    Abstract: The ability to form complex plans based on raw visual input is a litmus test for current capabilities of artificial intelligence, as it requires a seamless combination of visual processing and abstract algorithmic execution, two traditionally separate areas of computer science. A recent surge of interest in this field brought advances that yield good performance in tasks ranging from arcade games… ▽ More

    Submitted 18 March, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  43. arXiv:2109.04150  [pdf, other

    cs.LG cs.RO

    Self-supervised Reinforcement Learning with Independently Controllable Subgoals

    Authors: Andrii Zadaianchuk, Georg Martius, Fanny Yang

    Abstract: To successfully tackle challenging manipulation tasks, autonomous agents must learn a diverse set of skills and how to combine them. Recently, self-supervised agents that set their own abstract goals by exploiting the discovered structure in the environment were shown to perform well on many different tasks. In particular, some of them were applied to learn basic manipulation skills in composition… ▽ More

    Submitted 30 January, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  44. arXiv:2106.03443  [pdf, other

    cs.LG

    Causal Influence Detection for Improving Efficiency in Reinforcement Learning

    Authors: Maximilian Seitzer, Bernhard Schölkopf, Georg Martius

    Abstract: Many reinforcement learning (RL) environments consist of independent entities that interact sparsely. In such environments, RL agents have only limited influence over other entities in any particular situation. Our idea in this work is that learning can be efficiently guided by knowing when and what the agent can influence with its actions. To achieve this, we introduce a measure of \emph{situatio… ▽ More

    Submitted 2 December, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera-ready version. Code available at http://github.com/martius-lab/cid-in-rl

  45. arXiv:2105.11914  [pdf, other

    cs.RO eess.SP

    Theory and Design of Super-resolution Haptic Skins

    Authors: Huanbo Sun, Georg Martius

    Abstract: Haptic feedback is important to make robots more dexterous and effective in unstructured environments. High-resolution haptic sensors are still not widely available, and their application is often bound by the resolution-robustness dilemma. A route towards high-resolution and robust skin embeds a few sensor units (taxels) into a flexible surface material and uses signal processing to achieve sensi… ▽ More

    Submitted 24 August, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  46. Assessing aesthetics of generated abstract images using correlation structure

    Authors: Sina Khajehabdollahi, Georg Martius, Anna Levina

    Abstract: Can we generate abstract aesthetic images without bias from natural or human selected image corpi? Are aesthetic images singled out in their correlation functions? In this paper we give answers to these and more questions. We generate images using compositional pattern-producing networks with random weights and varying architecture. We demonstrate that even with the randomly selected weights the c… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

    Journal ref: 2019 IEEE Symposium Series on Computational Intelligence (SSCI), 306-313

  47. arXiv:2105.06331  [pdf, other

    cs.LG

    Informed Equation Learning

    Authors: Matthias Werner, Andrej Junginger, Philipp Hennig, Georg Martius

    Abstract: Distilling data into compact and interpretable analytic equations is one of the goals of science. Instead, contemporary supervised machine learning methods mostly produce unstructured and dense maps from input to output. Particularly in deep learning, this property is owed to the generic nature of simple standard link functions. To learn equations rather than maps, standard non-linearities can be… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

  48. arXiv:2105.02343  [pdf, other

    cs.LG

    CombOptNet: Fit the Right NP-Hard Problem by Learning Integer Programming Constraints

    Authors: Anselm Paulus, Michal Rolínek, Vít Musil, Brandon Amos, Georg Martius

    Abstract: Bridging logical and algorithmic reasoning with modern machine learning techniques is a fundamental challenge with potentially transformative impact. On the algorithmic side, many NP-hard problems can be expressed as integer programs, in which the constraints play the role of their "combinatorial specification." In this work, we aim to integrate integer programming solvers into neural network arch… ▽ More

    Submitted 11 April, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: ICML 2021 conference paper

  49. arXiv:2103.12184  [pdf, other

    cs.NE cond-mat.dis-nn cs.MA nlin.AO q-bio.PE

    The dynamical regime and its importance for evolvability, task performance and generalization

    Authors: Jan Prosi, Sina Khajehabdollahi, Emmanouil Giannakakis, Georg Martius, Anna Levina

    Abstract: It has long been hypothesized that operating close to the critical state is beneficial for natural and artificial systems. We test this hypothesis by evolving foraging agents controlled by neural networks that can change the system's dynamical regime throughout evolution. Surprisingly, we find that all populations, regardless of their initial regime, evolve to be subcritical in simple tasks and ev… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 8 Pages, 7 Figures, Artificial Life Conference 2021

  50. arXiv:2102.07456  [pdf, other

    cs.LG cs.AI cs.DM

    Neuro-algorithmic Policies enable Fast Combinatorial Generalization

    Authors: Marin Vlastelica, Michal Rolínek, Georg Martius

    Abstract: Although model-based and model-free approaches to learning the control of systems have achieved impressive results on standard benchmarks, generalization to task variations is still lacking. Recent results suggest that generalization for standard architectures improves only after obtaining exhaustive amounts of data. We give evidence that generalization capabilities are in many cases bottlenecked… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 15 pages