no code implementations • 9 May 2024 • Owen Randall, Martin Müller, Ting Han Wei, Ryan Hayward
We propose Expected Work Search (EWS), a new game solving algorithm.
no code implementations • 25 Sep 2019 • Chao GAO, Martin Mueller, Ryan Hayward, Hengshuai Yao, Shangling Jui
A three-head network architecture has been recently proposed that can learn a third action-value head on a fixed dataset the same as for two-head net.
no code implementations • 18 Sep 2019 • David Spies, Jia-Huai You, Ryan Hayward
We present an approach to representing large sets of mutual exclusions, also known as mutexes or mutex constraints.
1 code implementation • 31 Jul 2019 • David Spies, Jia-Huai You, Ryan Hayward
Experiments to compare the two approaches with the only known cost-optimal planner in SAT reveal good potentials for stepless planning in ASP.
no code implementations • ICLR 2018 • Chao Gao, Martin Mueller, Ryan Hayward
As policy gradient method is a kind of generalized policy iteration, we show how these differences in policy iteration are reflected in policy gradient for AMGs.
no code implementations • 24 Apr 2016 • Kenny Young, Ryan Hayward, Gautham Vasan
DeepMind's recent spectacular success in using deep convolutional neural nets and machine learning to build superhuman level agents --- e. g. for Atari games via deep Q-learning and for the game of Go via Reinforcement Learning --- raises many questions, including to what extent these methods will succeed in other domains.