Search Results for author: Marvin Li

Found 3 papers, 1 papers with code

Critical windows: non-asymptotic theory for feature emergence in diffusion models

no code implementations3 Mar 2024 Marvin Li, Sitan Chen

Additionally, preliminary experiments on Stable Diffusion suggest critical windows may serve as a useful tool for diagnosing fairness and privacy violations in real-world diffusion models.

Fairness Image Generation

Pandora's White-Box: Increased Training Data Leakage in Open LLMs

1 code implementation26 Feb 2024 Jeffrey G. Wang, Jason Wang, Marvin Li, Seth Neel

In fine-tuning, we find that given access to the loss of the fine-tuned and base models, a fine-tuned loss ratio attack FLoRA is able to achieve near perfect MIA peformance.

Language Modelling

MoPe: Model Perturbation-based Privacy Attacks on Language Models

no code implementations22 Oct 2023 Marvin Li, Jason Wang, Jeffrey Wang, Seth Neel

In this paper, we present Model Perturbations (MoPe), a new method to identify with high confidence if a given text is in the training data of a pre-trained language model, given white-box access to the models parameters.

Language Modelling Memorization

Cannot find the paper you are looking for? You can Submit a new open access paper.