Search Results for author: Diwei Huang

Found 2 papers, 1 paper with code

MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling

no code implementations • 22 May 2024 • Diwei Huang, Kunyang Lin, Peihao Chen, Qing Du, Mingkui Tan

Few-shot audio-visual acoustics modeling seeks to synthesize the room impulse response in arbitrary locations with few-shot observations.

Decoder

Learning Vision-and-Language Navigation from YouTube Videos

1 code implementation • ICCV 2023 • Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan

In this paper, we propose to learn an agent from YouTube videos by creating a large-scale dataset that comprises reasonable path-instruction pairs from house tour videos and pre-training the agent on it.

Navigate • Vision and Language Navigation
