Search Results for author: Prashant Mehta

Found 2 papers, 0 papers with code

Sufficient Exploration for Convex Q-learning

no code implementations17 Oct 2022 Fan Lu, Prashant Mehta, Sean Meyn, Gergely Neu

The main contributions follow: (i) The dual of convex Q-learning is not precisely Manne's LP or a version of logistic Q-learning, but has similar structure that reveals the need for regularization to avoid over-fitting.

OpenAI Gym Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.