Search Results for author: Qiudan Zhang

Found 3 papers, 0 papers with code

Weakly-Supervised 3D Scene Graph Generation via Visual-Linguistic Assisted Pseudo-labeling

no code implementations • 3 Apr 2024 • Xu Wang, YiFan Li, Qiudan Zhang, Wenhui Wu, Mark Junjie Li, Jianmin Jiang

However, previous 3D scene graph generation methods utilize a fully supervised learning manner and require a large amount of entity-level annotation data of objects and relations, which is extremely resource-consuming and tedious to obtain.

3D Scene Graph Generation, Graph Generation +1

Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment

no code implementations • 15 Dec 2023 • Xiaoxu Xu, Yitian Yuan, Qiudan Zhang, Wenhui Wu, Zequn Jie, Lin Ma, Xu Wang

During the inference stage, the learned text-3D correspondence will help us ground the text queries to the 3D target objects even without 2D images.

Natural Language Queries, Scene Understanding +1

Learning to Explore Intrinsic Saliency for Stereoscopic Video

no code implementations • CVPR 2019 • Qiudan Zhang, Xu Wang, Shiqi Wang, Shikai Li, Sam Kwong, Jianmin Jiang

Finally, a Convolutional Long Short-Term Memory (Conv-LSTM) based fusion network is developed to model the instantaneous interactions between spatio-temporal and depth attributes, such that the ultimate stereoscopic saliency maps over time are produced.

Saliency Prediction, Video Saliency Detection +1
