no code implementations • 4 Mar 2024 • Xin Zhang, Tao Xiao, GePeng Ji, Xuan Wu, Keren Fu, Qijun Zhao
The prompt fed to the motion stream is learned through self-supervised optical flow estimation.
Motion Estimation, Object Detection