Editing a classifier by rewriting its prediction rules

We present a methodology for modifying the behavior of a classifier by directly rewriting its prediction rules. Our approach requires virtually no additional data collection and can be applied to a variety of settings, including adapting a model to new environments and modifying it to ignore spurious features. Our code is available at https://github.com/MadryLab/EditingClassifiers.


