1 code implementation • NeurIPS 2023 • Yuxin Guo, Shijie Ma, Hu Su, Zhiqing Wang, Yuhao Zhao, Wei Zou, Siyang Sun, Yun Zheng
Audio-Visual Source Localization (AVSL) aims to locate sounding objects within video frames given the paired audio clips.
Contrastive Learning