Search Results for author: Hossein Nejatbakhsh Esfahani

Found 7 papers, 0 papers with code

Policy Gradient Reinforcement Learning for Uncertain Polytopic LPV Systems based on MHE-MPC

no code implementations10 Jun 2022 Hossein Nejatbakhsh Esfahani, Sebastien Gros

In this paper, we propose a learning-based Model Predictive Control (MPC) approach for the polytopic Linear Parameter-Varying (LPV) systems with inexact scheduling parameters (as exogenous signals with inexact bounds), where the Linear Time Invariant (LTI) models (vertices) captured by combinations of the scheduling parameters becomes wrong.

Model Predictive Control reinforcement-learning +2

Quasi-Newton Iteration in Deterministic Policy Gradient

no code implementations25 Mar 2022 Arash Bahari Kordabad, Hossein Nejatbakhsh Esfahani, WenQi Cai, Sebastien Gros

We show that the approximate Hessian converges to the exact Hessian at the optimal policy, and allows for a superlinear convergence in the learning, provided that the policy parametrization is rich.

reinforcement-learning Reinforcement Learning (RL)

Backstepping-based Integral Sliding Mode Control with Time Delay Estimation for Autonomous Underwater Vehicles

no code implementations19 Nov 2021 Hossein Nejatbakhsh Esfahani, Behdad Aminian, Esten Ingar Grøtli, Sebastien Gros

The aim of this paper is to propose a high performance control approach for trajectory tracking of Autonomous Underwater Vehicles (AUVs).

Approximate Robust NMPC using Reinforcement Learning

no code implementations6 Apr 2021 Hossein Nejatbakhsh Esfahani, Arash Bahari Kordabad, Sebastien Gros

We present a Reinforcement Learning-based Robust Nonlinear Model Predictive Control (RL-RNMPC) framework for controlling nonlinear systems in the presence of disturbances and uncertainties.

Model Predictive Control reinforcement-learning +1

Bias Correction in Deterministic Policy Gradient Using Robust MPC

no code implementations6 Apr 2021 Arash Bahari Kordabad, Hossein Nejatbakhsh Esfahani, Sebastien Gros

In this paper, we discuss the deterministic policy gradient using the Actor-Critic methods based on the linear compatible advantage function approximator, where the input spaces are continuous.

Model Predictive Control

Reinforcement Learning based on MPC/MHE for Unmodeled and Partially Observable Dynamics

no code implementations22 Mar 2021 Hossein Nejatbakhsh Esfahani, Arash Bahari Kordabad, Sebastien Gros

This paper proposes an observer-based framework for solving Partially Observable Markov Decision Processes (POMDPs) when an accurate model is not available.

Model Predictive Control reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.