no code implementations • 12 Jun 2023 • Dustin Morrill, Thomas J. Walsh, Daniel Hernandez, Peter R. Wurman, Peter Stone
Empirical results demonstrate that RPOSST finds a small set of test cases that identify high quality policies in a toy one-shot game, poker datasets, and a high-fidelity racing simulator.
no code implementations • 1 Nov 2022 • Varun Kompella, Thomas J. Walsh, Samuel Barrett, Peter Wurman, Peter Stone
Experience replay (ER) is a crucial component of many deep reinforcement learning (RL) systems.
no code implementations • 26 Sep 2013 • Alborz Geramifard, Thomas J. Walsh, Nicholas Roy, Jonathan How
Matching pursuit (MP) methods are a promising class of feature construction algorithms for value function approximation.
no code implementations • NeurIPS 2011 • Thomas J. Walsh, Daniel K. Hewlett, Clayton T. Morrison
We present theoretical and empirical results for a framework that combines the benefits of apprenticeship and autonomous reinforcement learning.