Zero-shot Scene Classification (unified classes)

2 papers with code • 1 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Zero-shot Scene Classification (unified classes)

Trend	Dataset	Best Model	Paper	Code	Compare
	NYU Depth v2	LanguageBind			See all

Datasets

NYUv2

Most implemented papers

Most implemented Social Latest No code

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

pku-yuangroup/languagebind • • 3 Oct 2023

We thus propose VIDAL-10M with Video, Infrared, Depth, Audio and their corresponding Language, naming as VIDAL-10M.

Paper
Code

ImageBind: One Embedding Space To Bind Them All

facebookresearch/imagebind • • CVPR 2023

We show that all combinations of paired data are not necessary to train such a joint embedding, and only image-paired data is sufficient to bind the modalities together.