Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–7 of 7 results for author: Babadi, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.00249  [pdf, other

    cs.RO

    Suicidal Pedestrian: Generation of Safety-Critical Scenarios for Autonomous Vehicles

    Authors: Yuhang Yang, Kalle Kujanpaa, Amin Babadi, Joni Pajarinen, Alexander Ilin

    Abstract: Developing reliable autonomous driving algorithms poses challenges in testing, particularly when it comes to safety-critical traffic scenarios involving pedestrians. An open question is how to simulate rare events, not necessarily found in autonomous driving datasets or scripted simulations, but which can occur in testing, and, in the end may lead to severe pedestrian related accidents. This paper… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 6 pages; 5 figures; 2 tables

  2. arXiv:2210.01426  [pdf, other

    cs.AI cs.LG cs.RO

    Continuous Monte Carlo Graph Search

    Authors: Kalle Kujanpää, Amin Babadi, Yi Zhao, Juho Kannala, Alexander Ilin, Joni Pajarinen

    Abstract: Online planning is crucial for high performance in many complex sequential decision-making tasks. Monte Carlo Tree Search (MCTS) employs a principled mechanism for trading off exploration for exploitation for efficient online planning, and it outperforms comparison methods in many discrete decision-making domains such as Go, Chess, and Shogi. Subsequently, extensions of MCTS to continuous domains… ▽ More

    Submitted 7 February, 2024; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted at AAMAS 2024 (full paper & oral)

  3. arXiv:2009.10337  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Learning Task-Agnostic Action Spaces for Movement Optimization

    Authors: Amin Babadi, Michiel van de Panne, C. Karen Liu, Perttu Hämäläinen

    Abstract: We propose a novel method for exploring the dynamics of physically based animated characters, and learning a task-agnostic action space that makes movement optimization easier. Like several previous papers, we parameterize actions as target states, and learn a short-horizon goal-conditioned low-level control policy that drives the agent's state towards the targets. Our novel contribution is that w… ▽ More

    Submitted 23 July, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: Accepted as a regular paper by IEEE Transactions on Visualization and Computer Graphics (TVCG) in July 2021

  4. arXiv:1909.07869  [pdf, other

    cs.LG stat.ML

    Visualizing Movement Control Optimization Landscapes

    Authors: Perttu Hämäläinen, Juuso Toikka, Amin Babadi, C. Karen Liu

    Abstract: A large body of animation research focuses on optimization of movement control, either as action sequences or policy parameters. However, as closed-form expressions of the objective functions are often not available, our understanding of the optimization problems is limited. Building on recent work on analyzing neural network training, we contribute novel visualizations of high-dimensional control… ▽ More

    Submitted 22 August, 2020; v1 submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted to IEEE Transactions on Visualization and Computer Graphics (IEEE TVCG)

  5. arXiv:1907.11842  [pdf, other

    cs.GR cs.LG cs.RO

    Self-Imitation Learning of Locomotion Movements through Termination Curriculum

    Authors: Amin Babadi, Kourosh Naderi, Perttu Hämäläinen

    Abstract: Animation and machine learning research have shown great advancements in the past decade, leading to robust and powerful methods for learning complex physically-based animations. However, learning can take hours or days, especially if no reference movement data is available. In this paper, we propose and evaluate a novel combination of techniques for accelerating the learning of stable locomotion… ▽ More

    Submitted 20 September, 2019; v1 submitted 27 July, 2019; originally announced July 2019.

    Comments: 2019 ACM SIGGRAPH Conference on Motion, Interaction and Games (MIG 2019)

  6. arXiv:1810.02541  [pdf, other

    cs.LG stat.ML

    PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

    Authors: Perttu Hämäläinen, Amin Babadi, Xiaoxiao Ma, Jaakko Lehtinen

    Abstract: Proximal Policy Optimization (PPO) is a highly popular model-free reinforcement learning (RL) approach. However, we observe that in a continuous action space, PPO can prematurely shrink the exploration variance, which leads to slow progress and may make the algorithm prone to getting stuck in local optima. Drawing inspiration from CMA-ES, a black-box evolutionary optimization method designed for r… ▽ More

    Submitted 3 November, 2020; v1 submitted 5 October, 2018; originally announced October 2018.

    Comments: This paper has been accepted to IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2020). The arxiv version also includes an appendix that covers more results

  7. arXiv:1808.06201  [pdf, other

    cs.GR

    Intelligent Middle-Level Game Control

    Authors: Amin Babadi, Kourosh Naderi, Perttu Hämäläinen

    Abstract: We propose the concept of intelligent middle-level game control, which lies on a continuum of control abstraction levels between the following two dual opposites: 1) high-level control that translates player's simple commands into complex actions (such as pressing Space key for jumping), and 2) low-level control which simulates real-life complexities by directly manipulating, e.g., joint rotations… ▽ More

    Submitted 19 August, 2018; originally announced August 2018.

    Comments: 2018 IEEE Conference on Computational Intelligence and Games (IEEE CIG 2018)