no code implementations • 2 Oct 2023 • Emmeran Johnson, Ciara Pike-Burke, Patrick Rebeschini
An algorithm is sample-efficient if it uses a number of queries $n$ to the environment that is polynomial in the dimension $d$ of the problem.
reinforcement-learning