Search Results for author: Jeffrey G. Wang

Found 1 papers, 1 papers with code

Pandora's White-Box: Increased Training Data Leakage in Open LLMs

1 code implementation • 26 Feb 2024 • Jeffrey G. Wang, Jason Wang, Marvin Li, Seth Neel

In fine-tuning, we find that given access to the loss of the fine-tuned and base models, a fine-tuned loss ratio attack FLoRA is able to achieve near perfect MIA peformance.

Language Modelling

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.