1 code implementation • 2 Dec 2018 • Sijia Xu, Hongyu Kuang, Zhi Zhuang, Renjie Hu, Yang Liu, Huyang Sun
These rules are not scalable and efficient enough to cope with the enormous yet partially observed state space in the game.
reinforcement-learning Reinforcement Learning (RL) +1