Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–8 of 8 results for author: Konen, W

Searching in archive stat. Search in all archives.
.
  1. arXiv:2204.13307  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    AlphaZero-Inspired Game Learning: Faster Training by Using MCTS Only at Test Time

    Authors: Johannes Scheiermann, Wolfgang Konen

    Abstract: Recently, the seminal algorithms AlphaGo and AlphaZero have started a new era in game learning and deep reinforcement learning. While the achievements of AlphaGo and AlphaZero - playing Go and other complex games at super human level - are truly impressive, these architectures have the drawback that they require high computational resources. Many researchers are looking for methods that are simila… ▽ More

    Submitted 24 September, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: 11 pages, 10 figures

  2. arXiv:2111.14375  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Final Adaptation Reinforcement Learning for N-Player Games

    Authors: Wolfgang Konen, Samineh Bagheri

    Abstract: This paper covers n-tuple-based reinforcement learning (RL) algorithms for games. We present new algorithms for TD-, SARSA- and Q-learning which work seamlessly on various games with arbitrary number of players. This is achieved by taking a player-centered view where each player propagates his/her rewards back to previous rounds. We add a new element called Final Adaptation RL (FARL) to all these… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 23 pages

  3. arXiv:1907.06508  [pdf, other

    cs.AI cs.LG stat.ML

    General Board Game Playing for Education and Research in Generic AI Game Learning

    Authors: Wolfgang Konen

    Abstract: We present a new general board game (GBG) playing and learning framework. GBG defines the common interfaces for board games, game states and their AI agents. It allows one to run competitions of different agents on different games. It standardizes those parts of board game playing and learning that otherwise would be tedious and repetitive parts in coding. GBG is suitable for arbitrary 1-, 2-, ...… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: 8 pages, for: Conference on Games (CoG), London, 2019. Index Terms: game learning, general game playing, AI, temporal difference learning, board games, n-tuple systems

  4. arXiv:1904.08397  [pdf, other

    stat.ML cs.LG math.OC

    SACOBRA with Online Whitening for Solving Optimization Problems with High Conditioning

    Authors: Samineh Bagheri, Wolfgang Konen, Thomas Bäck

    Abstract: Real-world optimization problems often have expensive objective functions in terms of cost and time. It is desirable to find near-optimal solutions with very few function evaluations. Surrogate-assisted optimizers tend to reduce the required number of function evaluations by replacing the real function with an efficient mathematical model built on few evaluated points. Problems with a high conditi… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Comments: 20 pages, 10 figures

  5. arXiv:1512.09251  [pdf, other

    math.OC cs.NE stat.ML

    Solving the G-problems in less than 500 iterations: Improved efficient constrained optimization by surrogate modeling and adaptive parameter control

    Authors: Samineh Bagheri, Wolfgang Konen, Michael Emmerich, Thomas Bäck

    Abstract: Constrained optimization of high-dimensional numerical problems plays an important role in many scientific and industrial applications. Function evaluations in many industrial applications are severely limited and no analytical information about objective function and constraint functions is available. For such expensive black-box optimization tasks, the constraint optimization algorithm COBRA was… ▽ More

    Submitted 31 December, 2015; originally announced December 2015.

  6. arXiv:1105.1951  [pdf, other

    nlin.AO cs.LG stat.ML

    Self-configuration from a Machine-Learning Perspective

    Authors: Wolfgang Konen

    Abstract: The goal of machine learning is to provide solutions which are trained by data or by experience coming from the environment. Many training algorithms exist and some brilliant successes were achieved. But even in structured environments for machine learning (e.g. data mining or board games), most applications beyond the level of toy problems need careful hand-tuning or human ingenuity (i.e. detecti… ▽ More

    Submitted 5 September, 2011; v1 submitted 10 May, 2011; originally announced May 2011.

    Comments: 12 pages, 5 figures, Dagstuhl seminar 11181 "Organic Computing - Design of Self-Organizing Systems", May 2011

    Report number: DPA-11181

  7. arXiv:0912.1064  [pdf, other

    stat.ML stat.ME

    On the numeric stability of the SFA implementation sfa-tk

    Authors: Wolfgang Konen

    Abstract: Slow feature analysis (SFA) is a method for extracting slowly varying features from a quickly varying multidimensional signal. An open source Matlab-implementation sfa-tk makes SFA easily useable. We show here that under certain circumstances, namely when the covariance matrix of the nonlinearly expanded data does not have full rank, this implementation runs into numerical instabilities. We prop… ▽ More

    Submitted 5 December, 2009; originally announced December 2009.

    Comments: 12 pages

  8. arXiv:0911.4397  [pdf, other

    stat.ML

    How slow is slow? SFA detects signals that are slower than the driving force

    Authors: Wolfgang Konen, Patrick Koch

    Abstract: Slow feature analysis (SFA) is a method for extracting slowly varying driving forces from quickly varying nonstationary time series. We show here that it is possible for SFA to detect a component which is even slower than the driving force itself (e.g. the envelope of a modulated sine wave). It is shown that it depends on circumstances like the embedding dimension, the time series predictability… ▽ More

    Submitted 23 November, 2009; originally announced November 2009.