Search Results for author: Ryan Hayward

Found 7 papers, 1 papers with code

Expected Work Search: Combining Win Rate and Proof Size Estimation

no code implementations • 9 May 2024 • Owen Randall, Martin Müller, Ting Han Wei, Ryan Hayward

We propose Expected Work Search (EWS), a new game solving algorithm.

Paper
Add Code

Three-Head Neural Network Architecture for AlphaZero Learning

no code implementations • 25 Sep 2019 • Chao GAO, Martin Mueller, Ryan Hayward, Hengshuai Yao, Shangling Jui

A three-head network architecture has been recently proposed that can learn a third action-value head on a fixed dataset the same as for two-head net.

Paper
Add Code

Mutex Graphs and Multicliques: Reducing Grounding Size for Planning

no code implementations • 18 Sep 2019 • David Spies, Jia-Huai You, Ryan Hayward

We present an approach to representing large sets of mutual exclusions, also known as mutexes or mutex constraints.

Paper
Add Code

Domain-Independent Cost-Optimal Planning in ASP

1 code implementation • 31 Jul 2019 • David Spies, Jia-Huai You, Ryan Hayward

Experiments to compare the two approaches with the only known cost-optimal planner in SAT reveal good potentials for stepless planning in ASP.

Paper
Code

Adversarial Policy Gradient for Alternating Markov Games

no code implementations • ICLR 2018 • Chao Gao, Martin Mueller, Ryan Hayward

As policy gradient method is a kind of generalized policy iteration, we show how these differences in policy iteration are reflected in policy gradient for AMGs.

Policy Gradient Methods

Paper
Add Code

Neurohex: A Deep Q-learning Hex Agent

no code implementations • 24 Apr 2016 • Kenny Young, Ryan Hayward, Gautham Vasan

DeepMind's recent spectacular success in using deep convolutional neural nets and machine learning to build superhuman level agents --- e. g. for Atari games via deep Q-learning and for the game of Go via Reinforcement Learning --- raises many questions, including to what extent these methods will succeed in other domains.

Atari Games Game of Go +1

Paper
Add Code

Solving Substitution Ciphers with Combined Language Models

no code implementations • COLING 2014 • Bradley Hauer, Ryan Hayward, Grzegorz Kondrak

Optical Character Recognition (OCR) Transliteration

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.