Search Results for author: Julia Persson

Found 1 paper, 1 paper with code

Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark

1 code implementation • 27 May 2023 • Jason Hoelscher-Obermaier, Julia Persson, Esben Kran, Ioannis Konstas, Fazl Barez

We use this improved benchmark to evaluate recent model editing techniques and find that they suffer from low specificity.

Model Editing, Specificity

