1 code implementation • 9 Dec 2023 • Ran Zhang, Aida Kostikova, Christoph Leiter, Jonas Belouadi, Daniil Larionov, Yanran Chen, Vivian Fresen, Steffen Eger
Artificial Intelligence (AI) has witnessed rapid growth, especially in the subfields Natural Language Processing (NLP), Machine Learning (ML) and Computer Vision (CV).
1 code implementation • 30 Oct 2023 • Christoph Leiter, Juri Opitz, Daniel Deutsch, Yang Gao, Rotem Dror, Steffen Eger
Specifically, we propose a novel competition setting in which we select a list of allowed LLMs and disallow fine-tuning to ensure a focus on prompting.
1 code implementation • 31 Jul 2023 • Steffen Eger, Christoph Leiter, Jonas Belouadi, Ran Zhang, Aida Kostikova, Daniil Larionov, Yanran Chen, Vivian Fresen
In particular, we compile a list of the 40 most popular papers based on normalized citation counts from the first half of 2023.
no code implementations • 22 Jun 2023 • Christoph Leiter, Piyawat Lertvittayakumjorn, Marina Fomicheva, Wei Zhao, Yang Gao, Steffen Eger
In this context, we also discuss the latest state-of-the-art approaches to explainable metrics based on generative models such as ChatGPT and GPT4.
no code implementations • 20 Feb 2023 • Christoph Leiter, Ran Zhang, Yanran Chen, Jonas Belouadi, Daniil Larionov, Vivian Fresen, Steffen Eger
ChatGPT, a chatbot developed by OpenAI, has gained widespread popularity and media attention since its release in November 2022.
1 code implementation • 20 Dec 2022 • Christoph Leiter, Hoa Nguyen, Steffen Eger
We then combine this segment-level score with the original metric to obtain a better metric.
1 code implementation • 20 Sep 2022 • Daniil Larionov, Jens Grünwald, Christoph Leiter, Steffen Eger
In this work, we provide a comprehensive evaluation of efficiency for MT evaluation metrics.
1 code implementation • 21 Mar 2022 • Christoph Leiter, Piyawat Lertvittayakumjorn, Marina Fomicheva, Wei Zhao, Yang Gao, Steffen Eger
We also provide a synthesizing overview over recent approaches for explainable machine translation metrics and discuss how they relate to those goals and properties.