no code implementations • 17 Jul 2023 • Yui Iioka, Yu Yoshida, Yuiga Wada, Shumpei Hatanaka, Komei Sugiura
In this study, we aim to develop a model that comprehends a natural language instruction (e. g., "Go to the living room and get the nearest pillow to the radio art on the wall") and generates a segmentation mask for the target everyday object.