Search Results for author: Arash Bahari Kordabad

Found 9 papers, 0 papers with code

Equivalence of Optimality Criteria for Markov Decision Process and Model Predictive Control

no code implementations9 Oct 2022 Arash Bahari Kordabad, Mario Zanon, Sebastien Gros

This paper shows that the optimal policy and value functions of a Markov Decision Process (MDP), either discounted or not, can be captured by a finite-horizon undiscounted Optimal Control Problem (OCP), even if based on an inexact model.

Model Predictive Control reinforcement-learning +1

Functional Stability of Discounted Markov Decision Processes Using Economic MPC Dissipativity Theory

no code implementations31 Mar 2022 Arash Bahari Kordabad, Sebastien Gros

This paper discusses the functional stability of closed-loop Markov Chains under optimal policies resulting from a discounted optimality criterion, forming Markov Decision Processes (MDPs).

Model Predictive Control Q-Learning +1

Quasi-Newton Iteration in Deterministic Policy Gradient

no code implementations25 Mar 2022 Arash Bahari Kordabad, Hossein Nejatbakhsh Esfahani, WenQi Cai, Sebastien Gros

We show that the approximate Hessian converges to the exact Hessian at the optimal policy, and allows for a superlinear convergence in the learning, provided that the policy parametrization is rich.

reinforcement-learning Reinforcement Learning (RL)

Verification of Dissipativity and Evaluation of Storage Function in Economic Nonlinear MPC using Q-Learning

no code implementations24 May 2021 Arash Bahari Kordabad, Sebastien Gros

In the Economic Nonlinear Model Predictive (ENMPC) context, closed-loop stability relates to the existence of a storage function satisfying a dissipation inequality.

Q-Learning Reinforcement Learning (RL)

Approximate Robust NMPC using Reinforcement Learning

no code implementations6 Apr 2021 Hossein Nejatbakhsh Esfahani, Arash Bahari Kordabad, Sebastien Gros

We present a Reinforcement Learning-based Robust Nonlinear Model Predictive Control (RL-RNMPC) framework for controlling nonlinear systems in the presence of disturbances and uncertainties.

Model Predictive Control reinforcement-learning +1

MPC-based Reinforcement Learning for Economic Problems with Application to Battery Storage

no code implementations6 Apr 2021 Arash Bahari Kordabad, WenQi Cai, Sebastien Gros

In this paper, we are interested in optimal control problems with purely economic costs, which often yield optimal policies having a (nearly) bang-bang structure.

Model Predictive Control reinforcement-learning +1

Bias Correction in Deterministic Policy Gradient Using Robust MPC

no code implementations6 Apr 2021 Arash Bahari Kordabad, Hossein Nejatbakhsh Esfahani, Sebastien Gros

In this paper, we discuss the deterministic policy gradient using the Actor-Critic methods based on the linear compatible advantage function approximator, where the input spaces are continuous.

Model Predictive Control

Reinforcement Learning based on MPC/MHE for Unmodeled and Partially Observable Dynamics

no code implementations22 Mar 2021 Hossein Nejatbakhsh Esfahani, Arash Bahari Kordabad, Sebastien Gros

This paper proposes an observer-based framework for solving Partially Observable Markov Decision Processes (POMDPs) when an accurate model is not available.

Model Predictive Control reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.