1 code implementation • 28 Feb 2024 • Yuiga Wada, Kanta Kaneda, Daichi Saito, Komei Sugiura
Establishing an automatic evaluation metric that closely aligns with human judgments is essential for effectively developing image captioning models.
1 code implementation • 12 Nov 2023 • Kanta Kaneda, Ryosuke Korekata, Yuiga Wada, Shunya Nagashima, Motonari Kambara, Yui Iioka, Haruka Matsuo, Yuto Imai, Takayuki Nishimura, Komei Sugiura
This paper focuses on the DialFRED task, which is the task of embodied instruction following in a setting where an agent can actively ask questions about the task.
1 code implementation • 7 Nov 2023 • Yuiga Wada, Kanta Kaneda, Komei Sugiura
Image captioning studies heavily rely on automatic evaluation metrics such as BLEU and METEOR.
no code implementations • 17 Jul 2023 • Yui Iioka, Yu Yoshida, Yuiga Wada, Shumpei Hatanaka, Komei Sugiura
In this study, we aim to develop a model that comprehends a natural language instruction (e. g., "Go to the living room and get the nearest pillow to the radio art on the wall") and generates a segmentation mask for the target everyday object.