Search Results for author: Jacob Samuel Huffman

Found 1 paper, 0 papers with code

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation

no code implementations • 30 Apr 2024 • Yunhao Ge, Xiaohui Zeng, Jacob Samuel Huffman, Tsung-Yi Lin, Ming-Yu Liu, Yin Cui

Visual Fact Checker (VFC) consists of three steps: 1) proposal, where image-to-text captioning models propose multiple initial captions; 2) verification, where a large language model (LLM) uses tools such as object detection and VQA models to fact-check the proposed captions; 3) captioning, where an LLM generates the final caption by summarizing the caption proposals and the fact-check verification results.
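
The three steps can be sketched as a small pipeline. This is only an illustrative outline under stated assumptions, not the authors' implementation: `run_vfc`, the type aliases, and the toy captioners, verifier, and summarizer are all hypothetical stand-ins for the captioning models, the tool-using LLM, and the summarizing LLM described in the abstract.

```python
from typing import Callable, List, Tuple

# Hypothetical interfaces (assumptions, not the paper's API).
Captioner = Callable[[str], str]                    # image path -> proposed caption
Verifier = Callable[[str, str], Tuple[str, bool]]   # (image, caption) -> (caption, supported?)
Summarizer = Callable[[List[Tuple[str, bool]]], str]


def run_vfc(image: str,
            captioners: List[Captioner],
            verify: Verifier,
            summarize: Summarizer) -> str:
    # 1) Proposal: each captioning model proposes an initial caption.
    proposals = [captioner(image) for captioner in captioners]
    # 2) Verification: each proposal is fact-checked against the image.
    checked = [verify(image, caption) for caption in proposals]
    # 3) Captioning: proposals and check results are summarized into one caption.
    return summarize(checked)


# Toy stand-ins so the sketch runs without any model.
DETECTED_OBJECTS = {"dog", "ball"}   # pretend output of an object detector
KNOWN_NOUNS = {"dog", "cat", "ball"}


def captioner_a(image: str) -> str:
    return "a dog chasing a ball"


def captioner_b(image: str) -> str:
    return "a cat chasing a ball"


def toy_verify(image: str, caption: str) -> Tuple[str, bool]:
    # A caption counts as supported only if every object it mentions was detected.
    mentioned = {word for word in caption.split() if word in KNOWN_NOUNS}
    return caption, mentioned <= DETECTED_OBJECTS


def toy_summarize(checked: List[Tuple[str, bool]]) -> str:
    # Keep only the captions that passed verification.
    return "; ".join(caption for caption, ok in checked if ok)


final_caption = run_vfc("photo.jpg", [captioner_a, captioner_b],
                        toy_verify, toy_summarize)
print(final_caption)
```

In this toy run the caption mentioning a cat is dropped because no cat was "detected". A real system would replace the stand-ins with actual captioning, detection, VQA, and LLM calls.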

Caption Generation • Hallucination • +7 more
