no code implementations • LREC 2020 • Taichi Nishimura, Suzushi Tomori, Hayato Hashimoto, Atsushi Hashimoto, Yoko Yamakata, Jun Harashima, Yoshitaka Ushiku, Shinsuke Mori
Visual grounding is provided as bounding boxes to image sequences of recipes, and each bounding box is linked to an element of the workflow.
no code implementations • IJCNLP 2017 • Atsushi Ushiku, Hayato Hashimoto, Atsushi Hashimoto, Shinsuke Mori
In this paper, we focus on procedure execution videos, in which a human makes or repairs something, and propose a method for generating procedural texts from them.