Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–21 of 21 results for author: Hein, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11751  [pdf, other

    cs.LG

    Why long model-based rollouts are no reason for bad Q-value estimates

    Authors: Philipp Wissmann, Daniel Hein, Steffen Udluft, Volker Tresp

    Abstract: This paper explores the use of model-based offline reinforcement learning with long model rollouts. While some literature criticizes this approach due to compounding errors, many practitioners have found success in real-world applications. The paper aims to demonstrate that long rollouts do not necessarily result in exponentially growing errors and can actually produce better Q-value estimates tha… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted at ESANN 2024

  2. arXiv:2407.10856  [pdf, other

    eess.IV cs.CV physics.med-ph

    Physics-Inspired Generative Models in Medical Imaging: A Review

    Authors: Dennis Hein, Afshin Bozorgpour, Dorit Merhof, Ge Wang

    Abstract: Physics-inspired Generative Models (GMs), in particular Diffusion Models (DMs) and Poisson Flow Models (PFMs), enhance Bayesian methods and promise great utility in medical imaging. This review examines the transformative role of such generative methods. First, a variety of physics-inspired GMs, including Denoising Diffusion Probabilistic Models (DDPMs), Score-based Diffusion Models (SDMs), and Po… ▽ More

    Submitted 23 August, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2404.10017  [pdf, other

    quant-ph cs.AI cs.LG

    Model-based Offline Quantum Reinforcement Learning

    Authors: Simon Eisenmann, Daniel Hein, Steffen Udluft, Thomas A. Runkler

    Abstract: This paper presents the first algorithm for model-based offline quantum reinforcement learning and demonstrates its functionality on the cart-pole benchmark. The model and the policy to be optimized are each implemented as variational quantum circuits. The model is trained by gradient descent to fit a pre-recorded data set. The policy is optimized with a gradient-free optimization scheme using the… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  4. arXiv:2402.08159  [pdf, other

    eess.IV cs.CV

    Poisson flow consistency models for low-dose CT image denoising

    Authors: Dennis Hein, Adam Wang, Ge Wang

    Abstract: Diffusion and Poisson flow models have demonstrated remarkable success for a wide range of generative tasks. Nevertheless, their iterative nature results in computationally expensive sampling and the number of function evaluations (NFE) required can be orders of magnitude larger than for single-step methods. Consistency models are a recent class of deep generative models which enable single-step s… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  5. arXiv:2312.09754  [pdf, other

    eess.IV cs.CV physics.med-ph

    PPFM: Image denoising in photon-counting CT using single-step posterior sampling Poisson flow generative models

    Authors: Dennis Hein, Staffan Holmin, Timothy Szczykutowicz, Jonathan S Maltz, Mats Danielsson, Ge Wang, Mats Persson

    Abstract: Diffusion and Poisson flow models have shown impressive performance in a wide range of generative tasks, including low-dose CT image denoising. However, one limitation in general, and for clinical applications in particular, is slow sampling. Due to their iterative nature, the number of function evaluations (NFE) required is usually on the order of $10-10^3$, both for conditional and unconditional… ▽ More

    Submitted 19 December, 2023; v1 submitted 15 December, 2023; originally announced December 2023.

  6. arXiv:2309.02562  [pdf

    cs.CV cs.AI

    Recurrence-Free Survival Prediction for Anal Squamous Cell Carcinoma Chemoradiotherapy using Planning CT-based Radiomics Model

    Authors: Shanshan Tang, Kai Wang, David Hein, Gloria Lin, Nina N. Sanford, Jing Wang

    Abstract: Objectives: Approximately 30% of non-metastatic anal squamous cell carcinoma (ASCC) patients will experience recurrence after chemoradiotherapy (CRT), and currently available clinical variables are poor predictors of treatment response. We aimed to develop a model leveraging information extracted from radiation pretreatment planning CT to predict recurrence-free survival (RFS) in ASCC patients aft… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  7. Learning Control Policies for Variable Objectives from Offline Data

    Authors: Marc Weber, Phillip Swazinna, Daniel Hein, Steffen Udluft, Volkmar Sterzing

    Abstract: Offline reinforcement learning provides a viable approach to obtain advanced control strategies for dynamical systems, in particular when direct interaction with the environment is not available. In this paper, we introduce a conceptual extension for model-based policy search methods, called variable objective policy (VOP). With this approach, policies are trained to generalize efficiently over a… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 8 pages, 7 figures

    Journal ref: 2023 IEEE Symposium Series on Computational Intelligence

  8. arXiv:2206.04741  [pdf

    quant-ph cs.LG

    Quantum Policy Iteration via Amplitude Estimation and Grover Search -- Towards Quantum Advantage for Reinforcement Learning

    Authors: Simon Wiedemann, Daniel Hein, Steffen Udluft, Christian Mendl

    Abstract: We present a full implementation and simulation of a novel quantum reinforcement learning method. Our work is a detailed and formal proof of concept for how quantum algorithms can be used to solve reinforcement learning problems and shows that, given access to error-free, efficient quantum realizations of the agent and environment, quantum methods can yield provable improvements over classical Mon… ▽ More

    Submitted 10 May, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

  9. arXiv:2201.05433  [pdf, ps, other

    cs.LG

    Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning

    Authors: Phillip Swazinna, Steffen Udluft, Daniel Hein, Thomas Runkler

    Abstract: Offline reinforcement learning (RL) Algorithms are often designed with environments such as MuJoCo in mind, in which the planning horizon is extremely long and no noise exists. We compare model-free, model-based, as well as hybrid offline RL approaches on various industrial benchmark (IB) datasets to test the algorithms in settings closer to real world problems, including complex noise and partial… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: Submitted to IFAC Conference on Intelligent Control and Automation Sciences (ICONS)2022

  10. arXiv:2108.13381  [pdf, other

    cs.AI cs.LG cs.NE cs.SE eess.SY

    Trustworthy AI for Process Automation on a Chylla-Haase Polymerization Reactor

    Authors: Daniel Hein, Daniel Labisch

    Abstract: In this paper, genetic programming reinforcement learning (GPRL) is utilized to generate human-interpretable control policies for a Chylla-Haase polymerization reactor. Such continuously stirred tank reactors (CSTRs) with jacket cooling are widely used in the chemical industry, in the production of fine chemicals, pigments, polymers, and medical products. Despite appearing rather simple, controlli… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Journal ref: Proceedings of the Genetic and Evolutionary Computation Conference Companion GECCO 21 (2021)

  11. arXiv:2107.05479  [pdf, other

    cs.LG

    Behavior Constraining in Weight Space for Offline Reinforcement Learning

    Authors: Phillip Swazinna, Steffen Udluft, Daniel Hein, Thomas Runkler

    Abstract: In offline reinforcement learning, a policy needs to be learned from a single pre-collected dataset. Typically, policies are thus regularized during training to behave similarly to the data generating policy, by adding a penalty based on a divergence between action distributions of generating and trained policy. We propose a new algorithm, which constrains the policy directly in its weight space i… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted at ESANN 2021

  12. arXiv:2007.09964  [pdf, other

    cs.LG cs.AI cs.RO cs.SC eess.SY

    Interpretable Control by Reinforcement Learning

    Authors: Daniel Hein, Steffen Limmer, Thomas A. Runkler

    Abstract: In this paper, three recently introduced reinforcement learning (RL) methods are used to generate human-interpretable policies for the cart-pole balancing benchmark. The novel RL methods learn human-interpretable policies in the form of compact fuzzy controllers and simple algebraic equations. The representations as well as the achieved control performances are compared with two classical controll… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  13. arXiv:2001.07295  [pdf, other

    cs.AI cs.MM cs.SE

    AutoMATES: Automated Model Assembly from Text, Equations, and Software

    Authors: Adarsh Pyarelal, Marco A. Valenzuela-Escarcega, Rebecca Sharp, Paul D. Hein, Jon Stephens, Pratik Bhandari, HeuiChan Lim, Saumya Debray, Clayton T. Morrison

    Abstract: Models of complicated systems can be represented in different ways - in scientific papers, they are represented using natural language text as well as equations. But to be of real use, they must also be implemented as software, thus making code a third form of representing models. We introduce the AutoMATES project, which aims to build semantically-rich unified representations of models from scien… ▽ More

    Submitted 20 January, 2020; originally announced January 2020.

    Comments: 8 pages, 6 figures, accepted to Modeling the World's Systems 2019

    ACM Class: D.3.3; D.3.4; H.1.0; I.2.2; I.2.5; I.2.7; I.6.4; I.6.5

  14. arXiv:1912.06290  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Meta-Learning Initializations for Image Segmentation

    Authors: Sean M. Hendryx, Andrew B. Leach, Paul D. Hein, Clayton T. Morrison

    Abstract: We extend first-order model agnostic meta-learning algorithms (including FOMAML and Reptile) to image segmentation, present a novel neural network architecture built for fast learning which we call EfficientLab, and leverage a formal definition of the test error of meta-learning algorithms to decrease error on out of distribution tasks. We show state of the art results on the FSS-1000 dataset by m… ▽ More

    Submitted 7 May, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

  15. arXiv:1812.06199  [pdf, other

    cs.CL cs.LG stat.ML

    Inter-sentence Relation Extraction for Associating Biological Context with Events in Biomedical Texts

    Authors: Enrique Noriega-Atala, Paul D. Hein, Shraddha S. Thumsi, Zechy Wong, Xia Wang, Clayton T. Morrison

    Abstract: We present an analysis of the problem of identifying biological context and associating it with biochemical events in biomedical texts. This constitutes a non-trivial, inter-sentential relation extraction task. We focus on biological context as descriptions of the species, tissue type and cell type that are associated with biochemical events. We describe the properties of an annotated corpus of co… ▽ More

    Submitted 14 December, 2018; originally announced December 2018.

  16. Generating Interpretable Fuzzy Controllers using Particle Swarm Optimization and Genetic Programming

    Authors: Daniel Hein, Steffen Udluft, Thomas A. Runkler

    Abstract: Autonomously training interpretable control strategies, called policies, using pre-existing plant trajectory data is of great interest in industrial applications. Fuzzy controllers have been used in industry for decades as interpretable and efficient system controllers. In this study, we introduce a fuzzy genetic programming (GP) approach called fuzzy GP reinforcement learning (FGPRL) that can sel… ▽ More

    Submitted 29 April, 2018; originally announced April 2018.

    Comments: Accepted at Genetic and Evolutionary Computation Conference 2018 (GECCO '18)

  17. arXiv:1712.04170  [pdf, other

    cs.AI cs.NE eess.SY

    Interpretable Policies for Reinforcement Learning by Genetic Programming

    Authors: Daniel Hein, Steffen Udluft, Thomas A. Runkler

    Abstract: The search for interpretable reinforcement learning policies is of high academic and industrial interest. Especially for industrial systems, domain experts are more likely to deploy autonomously learned controllers if they are understandable and convenient to evaluate. Basic algebraic equations are supposed to meet these requirements, as long as they are restricted to an adequate complexity. Here… ▽ More

    Submitted 4 April, 2018; v1 submitted 12 December, 2017; originally announced December 2017.

  18. arXiv:1709.09480  [pdf, other

    cs.AI cs.LG eess.SY

    A Benchmark Environment Motivated by Industrial Control Problems

    Authors: Daniel Hein, Stefan Depeweg, Michel Tokic, Steffen Udluft, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing

    Abstract: In the research area of reinforcement learning (RL), frequently novel and promising methods are developed and introduced to the RL community. However, although many researchers are keen to apply their methods on real-world problems, implementing such methods in real industry environments often is a frustrating and tedious process. Generally, academic research groups have only limited access to rea… ▽ More

    Submitted 24 November, 2022; v1 submitted 27 September, 2017; originally announced September 2017.

    Journal ref: 2017 IEEE Symposium Series on Computational Intelligence (SSCI)

  19. arXiv:1705.07262  [pdf, ps, other

    cs.LG cs.AI cs.NE eess.SY

    Batch Reinforcement Learning on the Industrial Benchmark: First Experiences

    Authors: Daniel Hein, Steffen Udluft, Michel Tokic, Alexander Hentschel, Thomas A. Runkler, Volkmar Sterzing

    Abstract: The Particle Swarm Optimization Policy (PSO-P) has been recently introduced and proven to produce remarkable results on interacting with academic reinforcement learning benchmarks in an off-policy, batch-based setting. To further investigate the properties and feasibility on real-world applications, this paper investigates PSO-P on the so-called Industrial Benchmark (IB), a novel reinforcement lea… ▽ More

    Submitted 27 July, 2017; v1 submitted 20 May, 2017; originally announced May 2017.

    Journal ref: 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, 2017, pp. 4214-4221

  20. arXiv:1610.05984  [pdf, other

    cs.NE cs.AI cs.LG eess.SY

    Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies

    Authors: Daniel Hein, Alexander Hentschel, Thomas Runkler, Steffen Udluft

    Abstract: Fuzzy controllers are efficient and interpretable system controllers for continuous state and action spaces. To date, such controllers have been constructed manually or trained automatically either using expert-generated problem-specific cost functions or incorporating detailed knowledge about the optimal control strategy. Both requirements for automatic training processes are not found in most re… ▽ More

    Submitted 15 August, 2017; v1 submitted 19 October, 2016; originally announced October 2016.

    Journal ref: Engineering Applications of Artificial Intelligence, Volume 65C, October 2017, Pages 87-98

  21. arXiv:1610.03793  [pdf, ps, other

    cs.LG

    Introduction to the "Industrial Benchmark"

    Authors: Daniel Hein, Alexander Hentschel, Volkmar Sterzing, Michel Tokic, Steffen Udluft

    Abstract: A novel reinforcement learning benchmark, called Industrial Benchmark, is introduced. The Industrial Benchmark aims at being be realistic in the sense, that it includes a variety of aspects that we found to be vital in industrial applications. It is not designed to be an approximation of any real system, but to pose the same hardness and complexity.

    Submitted 28 September, 2017; v1 submitted 12 October, 2016; originally announced October 2016.

    Comments: 11 pages