2 code implementations • 7 Feb 2024 • Sebastian Schmidt, Ines Zelch, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast
In this paper, we thus take a first step to investigate whether LLMs can also be used as a countermeasure, i. e., to block generated native ads.
no code implementations • 8 Nov 2023 • Lukas Gienapp, Harrisen Scells, Niklas Deckers, Janek Bevendorff, Shuai Wang, Johannes Kiesel, Shahbaz Syed, Maik Fröbe, Guido Zuccon, Benno Stein, Matthias Hagen, Martin Potthast
To lay a foundation for developing new evaluation methods for generative retrieval systems, we survey the relevant literature from the fields of information retrieval and natural language processing, identify search tasks and system architectures in generative retrieval, develop a new user model, and study its operationalization.
1 code implementation • 30 May 2023 • Maik Fröbe, Jan Heinrich Reimer, Sean MacAvaney, Niklas Deckers, Simon Reich, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast
Standardization is achieved when a retrieval approach implements PyTerrier's interfaces and the input and output of an experiment are compatible with ir_datasets and ir_measures.
no code implementations • 4 Nov 2022 • Janek Bevendorff, Philipp Sauer, Lukas Gienapp, Wolfgang Kircheis, Erik Körner, Benno Stein, Martin Potthast
The rapidly growing volume of scientific publications offers an interesting challenge for research on methods for analyzing the authorship of documents with one or more authors.
1 code implementation • 22 Nov 2021 • Janek Bevendorff, Martin Potthast, Benno Stein
Web search and other large-scale web data analytics rely on processing archives of web pages stored in a standardized and efficient format.
no code implementations • 21 Nov 2021 • Maik Fröbe, Matthias Hagen, Janek Bevendorff, Michael Völske, Benno Stein, Christopher Schröder, Robby Wagner, Lukas Gienapp, Martin Potthast
Commercial web search engines employ near-duplicate detection to ensure that users see each relevant result only once, albeit the underlying web crawls typically include (near-)duplicates of many web pages.
no code implementations • ACL 2020 • Janek Bevendorff, Khalid Al Khatib, Martin Potthast, Benno Stein
This paper introduces the Webis Gmane Email Corpus 2019, the largest publicly available and fully preprocessed email corpus to date.
1 code implementation • ACL 2019 • Janek Bevendorff, Martin Potthast, Matthias Hagen, Benno Stein
Authorship verification is the task of determining whether two texts were written by the same author.
1 code implementation • ACL 2019 • Janek Bevendorff, Matthias Hagen, Benno Stein, Martin Potthast
The PAN series of shared tasks is well known for its continuous and high quality research in the field of digital text forensics.
1 code implementation • NAACL 2019 • Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast
Authorship verification is the problem of inferring whether two texts were written by the same author.
no code implementations • WS 2017 • Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, Yamen Ajjour, Jana Puschmann, Jiani Qu, Jonas Dorsch, Viorel Morari, Janek Bevendorff, Benno Stein
Computational argumentation is expected to play a critical role in the future of web search.
1 code implementation • ACL 2018 • Martin Potthast, Johannes Kiesel, Kevin Reinartz, Janek Bevendorff, Benno Stein
The articles originated from 9 well-known political publishers, 3 each from the mainstream, the hyperpartisan left-wing, and the hyperpartisan right-wing.