no code implementations • 15 Apr 2024 • Ziniu Zhang, Shulin Tian, Liangyu Chen, Ziwei Liu
To answer this question, we present MMInA, a multihop and multimodal benchmark for evaluating embodied agents on compositional Internet tasks, with several appealing properties: 1) Evolving real-world multimodal websites.