no code implementations • 9 Mar 2023 • Mesut Erhan Unal, Adriana Kovashka
In this paper, we tackle HOI detection with the weakest supervision setting in the literature, using only image-level interaction labels, with the help of a pretrained vision-language model (VLM) and a large language model (LLM).
no code implementations • 29 Sep 2021 • Mesut Erhan Unal, Adriana Kovashka
We present a framework to better leverage natural language supervision for a specific downstream task, namely weakly-supervised object detection (WSOD).