1 code implementation • 1 Feb 2024 • Weijie Xu, Zicheng Huang, Wenxiang Hu, Xi Fang, Rajesh Kumar Cherukuri, Naumaan Nayyar, Lorenzo Malandri, Srinivasan H. Sengamedu
The data generation pipeline is transferable and can be easily adapted to generate labeled conversation data in other domains.
no code implementations • 4 May 2015 • Naumaan Nayyar, Dileep Kalathil, Rahul Jain
The objective is to design a policy that maximizes the expected reward over a time horizon in the single-player setting, and the sum of expected rewards across players in the multiplayer setting.
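One way to write the two objectives above is sketched here. The notation is an assumption made for illustration and is not taken from the paper: $\pi$ is a policy, $T$ the time horizon, $M$ the number of players, $r_t$ the reward collected at time $t$, and $r_{i,t}$ the reward collected by player $i$ at time $t$.

```latex
% Single-player setting (assumed notation): expected cumulative reward over the horizon T
\max_{\pi} \; \mathbb{E}^{\pi}\!\left[ \sum_{t=1}^{T} r_t \right]

% Multiplayer setting (assumed notation): sum of the players' expected cumulative rewards,
% with \pi_i the policy followed by player i
\max_{\pi_1, \dots, \pi_M} \; \sum_{i=1}^{M} \mathbb{E}^{\pi_i}\!\left[ \sum_{t=1}^{T} r_{i,t} \right]
```

Under this reading, the multiplayer objective reduces to the single-player one when $M = 1$.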