no code implementations • 6 Sep 2021 • Fengpei Yuan, Ran Zhang, Dania Bilal, Xiaopeng Zhao
A Q-learning (QL) algorithm is then designed to achieve the best conversation strategy for the robot.
Q-Learning