no code implementations • 16 Jan 2024 • Benjamin Clément, Hélène Sauzéon, Didier Roy, Pierre-Yves Oudeyer
In this context, the ZPDES algorithm, based on the Learning Progress Hypothesis (LPH) and multi-armed bandit machine learning techniques, sequences exercises that maximize learning progress (LP).