Search Results for author: Prashansa Panda

Found 2 papers, 0 papers with code

Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation

no code implementations • 2 Feb 2024 • Prashansa Panda, Shalabh Bhatnagar

In recent years, there has been a lot of research activity focused on carrying out non-asymptotic convergence analyses for actor-critic algorithms.

Paper
Add Code

Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms

no code implementations • 25 Oct 2023 • Prashansa Panda, Shalabh Bhatnagar

Actor Critic methods have found immense applications on a wide range of Reinforcement Learning tasks especially when the state-action space is large.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.