Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–6 of 6 results for author: Mahmud, S M N

.
  1. arXiv:2210.06637  [pdf, other

    eess.SY

    Output Feedback Adaptive Optimal Control of Affine Nonlinear systems with a Linear Measurement Model

    Authors: Tochukwu Elijah Ogri, S. M. Nahid Mahmud, Zachary I. Bell, Rushikesh Kamalapurkar

    Abstract: Real-world control applications in complex and uncertain environments require adaptability to handle model uncertainties and robustness against disturbances. This paper presents an online, output-feedback, critic-only, model-based reinforcement learning architecture that simultaneously learns and implements an optimal controller while maintaining stability during the learning phase. Using multipli… ▽ More

    Submitted 3 April, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: 16 pages, 5 figures, submitted to 2023 IEEE Conference on Control Technology and Applications

  2. arXiv:2204.01409  [pdf, other

    eess.SY

    Safe Controller for Output Feedback Linear Systems using Model-Based Reinforcement Learning

    Authors: S M Nahid Mahmud, Moad Abudia, Scott A Nivison, Zachary I. Bell, Rushikesh Kamalapurkar

    Abstract: The objective of this research is to enable safety-critical systems to simultaneously learn and execute optimal control policies in a safe manner to achieve complex autonomy. Learning optimal policies via trial and error, i.e., traditional reinforcement learning, is difficult to implement in safety-critical systems, particularly when task restarts are unavailable. Safe model-based reinforcement le… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.00271

  3. arXiv:2110.00271  [pdf, other

    eess.SY

    Safety aware model-based reinforcement learning for optimal control of a class of output-feedback nonlinear systems

    Authors: S M Nahid Mahmud, Moad Abudia, Scott A Nivison, Zachary I. Bell, Rushikesh Kamalapurkar

    Abstract: The ability to learn and execute optimal control policies safely is critical to realization of complex autonomy, especially where task restarts are not available and/or the systems are safety-critical. Safety requirements are often expressed in terms of state and/or control constraints. Methods such as barrier transformation and control barrier functions have been successfully used, in conjunction… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.12666

  4. arXiv:2008.08972  [pdf, other

    eess.SY

    Online inverse reinforcement learning with limited data

    Authors: Ryan Self, S M Nahid Mahmud, Katrine Hareland, Rushikesh Kamalapurkar

    Abstract: This paper addresses the problem of online inverse reinforcement learning for systems with limited data and uncertain dynamics. In the developed approach, the state and control trajectories are recorded online by observing an agent perform a task, and reward function estimation is performed in real-time using a novel inverse reinforcement learning approach. Parameter estimation is performed concur… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: 8 pages, 5 figures. arXiv admin note: text overlap with arXiv:2003.03912

  5. arXiv:2007.12666  [pdf, other

    eess.SY math.OC

    Safe Model-Based Reinforcement Learning for Systems with Parametric Uncertainties

    Authors: S M Nahid Mahmud, Scott A Nivison, Zachary I. Bell, Rushikesh Kamalapurkar

    Abstract: Reinforcement learning has been established over the past decade as an effective tool to find optimal control policies for dynamical systems, with recent focus on approaches that guarantee safety during the learning and/or execution phases. In general, safety guarantees are critical in reinforcement learning when the system is safety-critical and/or task restarts are not practically feasible. In o… ▽ More

    Submitted 5 October, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: This manuscript has been accepted in Frontiers in Robotics and AI. doi: 10.3389/frobt.2021.733104

  6. Online Simultaneous State and Parameter Estimation

    Authors: Ryan Self, Moad Abudia, S. M. Nahid Mahmud, Rushikesh Kamalapurkar

    Abstract: In this paper, a concurrent learning based adaptive observer is developed for a class of second-order nonlinear time-invariant systems with uncertain dynamics. The developed technique results in uniformly ultimately bounded state and parameter estimation errors. As opposed to persistent excitation which is required for parameter convergence in traditional adaptive control methods, the developed te… ▽ More

    Submitted 13 October, 2020; v1 submitted 21 March, 2017; originally announced March 2017.

    Comments: arXiv admin note: text overlap with arXiv:1609.05879