Search Results for author: Diwei Huang

Found 2 papers, 1 paper with code

MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling

No code implementations • 22 May 2024 • Diwei Huang, Kunyang Lin, Peihao Chen, Qing Du, Mingkui Tan

Few-shot audio-visual acoustics modeling seeks to synthesize the room impulse response at arbitrary locations from only a few observations.

Decoder

Learning Vision-and-Language Navigation from YouTube Videos

1 code implementation • ICCV 2023 • Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan

In this paper, we propose to learn an agent from YouTube videos by creating a large-scale dataset of reasonable path-instruction pairs from house tour videos and pre-training the agent on it.

Navigate, Vision and Language Navigation
