no code implementations • 11 Apr 2024 • Soichiro Nishimori, Xin-Qiang Cai, Johannes Ackermann, Masashi Sugiyama
In this paper, we investigate an offline reinforcement learning (RL) problem where datasets are collected from two domains.
no code implementations • 10 Apr 2024 • Xingyu Song, Zhan Li, Shi Chen, Xin-Qiang Cai, Kazuyuki Demachi
Action recognition, an essential component of computer vision, plays a pivotal role in multiple applications.
no code implementations • 6 Feb 2024 • Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama
In Reinforcement Learning (RL), it is commonly assumed that an immediate reward signal is generated for each action taken by the agent, helping the agent maximize cumulative rewards to obtain the optimal policy.
no code implementations • 16 Sep 2023 • Kaiyan Zhao, Qiyu Wu, Xin-Qiang Cai, Yoshimasa Tsuruoka
Learning multi-lingual sentence embeddings is a fundamental task in natural language processing.
no code implementations • 17 Jun 2021 • Xin-Qiang Cai, Yao-Xiang Ding, Zi-Xuan Chen, Yuan Jiang, Masashi Sugiyama, Zhi-Hua Zhou
In many real-world imitation learning tasks, the demonstrator and the learner have to act under different observation spaces.
no code implementations • 9 Sep 2019 • Xin-Qiang Cai, Yao-Xiang Ding, Yuan Jiang, Zhi-Hua Zhou
One of the key issues for imitation learning lies in making policy learned from limited samples to generalize well in the whole state-action space.