TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text-based Image Editing	PIE-Bench	Null-Text Inversion+Prompt-to-Prompt	CLIPSIM	24.75	# 8
Text-based Image Editing	PIE-Bench	Null-Text Inversion+Prompt-to-Prompt	Structure Distance	13.44	# 3
Text-based Image Editing	PIE-Bench	Null-Text Inversion+Prompt-to-Prompt	Background PSNR	27.03	# 4
Text-based Image Editing	PIE-Bench	Null-Text Inversion+Prompt-to-Prompt	Background LPIPS	60.67	# 5

Null-text Inversion for Editing Real Images using Guided Diffusion Models

CVPR 2023 · Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, Daniel Cohen-Or

Recent text-guided diffusion models provide powerful image generation capabilities. Currently, a massive effort is given to enable the modification of these images using text only as means to offer intuitive and versatile editing. To edit a real image using these state-of-the-art tools, one must first invert the image with a meaningful text prompt into the pretrained model's domain. In this paper, we introduce an accurate inversion technique and thus facilitate an intuitive text-based modification of the image. Our proposed inversion consists of two novel key components: (i) Pivotal inversion for diffusion models. While current methods aim at mapping random noise samples to a single input image, we use a single pivotal noise vector for each timestamp and optimize around it. We demonstrate that a direct inversion is inadequate on its own, but does provide a good anchor for our optimization. (ii) NULL-text optimization, where we only modify the unconditional textual embedding that is used for classifier-free guidance, rather than the input text embedding. This allows for keeping both the model weights and the conditional embedding intact and hence enables applying prompt-based editing while avoiding the cumbersome tuning of the model's weights. Our Null-text inversion, based on the publicly available Stable Diffusion model, is extensively evaluated on a variety of images and prompt editing, showing high-fidelity editing of real images.
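
The two components described in the abstract can be illustrated with a small NumPy sketch. Everything here is a toy assumption and not the authors' implementation: `eps_model` is a fixed random function standing in for the Stable Diffusion U-Net, the embeddings are 4-dimensional vectors, the noise schedule `alpha_bar` is made up, and the null embedding is tuned with plain gradient descent using an analytic gradient and a hand-picked step size. The sketch only shows the control flow: invert with guidance scale 1 to get a pivot trajectory, then tune one unconditional ("null") embedding per timestep so that guided sampling follows that trajectory back to the input.

```python
import numpy as np

rng = np.random.default_rng(0)
D, T, W_GUIDE = 4, 10, 7.5  # toy latent size, number of steps, guidance scale

# Toy stand-ins (assumptions): a fixed random "network" instead of a U-Net.
W = 0.3 * rng.standard_normal((D, D))
U = np.eye(D) + 0.05 * rng.standard_normal((D, D))
alpha_bar = np.linspace(0.999, 0.1, T + 1)  # index 0 = clean, T = noisiest

def eps_model(z, emb):
    """Toy noise predictor: nonlinear in z, linear in the text embedding."""
    return np.tanh(W @ z) + U @ emb

def guided_eps(z, cond, null, w):
    """Classifier-free guidance: extrapolate from unconditional to conditional."""
    e_u, e_c = eps_model(z, null), eps_model(z, cond)
    return e_u + w * (e_c - e_u)

def step_coeffs(ab_from, ab_to):
    """Coefficients of the deterministic DDIM update z_to = c1 * z + c2 * eps."""
    c1 = np.sqrt(ab_to / ab_from)
    c2 = np.sqrt(1 - ab_to) - c1 * np.sqrt(1 - ab_from)
    return c1, c2

def ddim_step(z, eps, ab_from, ab_to):
    c1, c2 = step_coeffs(ab_from, ab_to)
    return c1 * z + c2 * eps

def pivotal_inversion(z0, cond):
    """DDIM inversion with guidance scale 1; returns the pivots [z*_0, ..., z*_T]."""
    traj = [z0]
    for t in range(T):
        z = traj[-1]
        traj.append(ddim_step(z, eps_model(z, cond), alpha_bar[t], alpha_bar[t + 1]))
    return traj

def reconstruct(z_T, cond, nulls):
    """Guided DDIM sampling from z_T; nulls[i] is used at timestep T - i."""
    z = z_T
    for i, t in enumerate(range(T, 0, -1)):
        eps = guided_eps(z, cond, nulls[i], W_GUIDE)
        z = ddim_step(z, eps, alpha_bar[t], alpha_bar[t - 1])
    return z

def null_text_optimization(traj, cond, n_iters=100):
    """Tune one null embedding per timestep so each guided step lands on its pivot."""
    z_bar, null, nulls = traj[-1], np.zeros(D), []
    sigma_max = np.linalg.norm(U, 2)
    for t in range(T, 0, -1):
        _, c2 = step_coeffs(alpha_bar[t], alpha_bar[t - 1])
        k = (1 - W_GUIDE) * c2            # d(prediction)/d(null) equals k * U
        lr = 0.5 / (k * sigma_max) ** 2   # step size chosen for this toy model
        for _ in range(n_iters):
            eps = guided_eps(z_bar, cond, null, W_GUIDE)
            pred = ddim_step(z_bar, eps, alpha_bar[t], alpha_bar[t - 1])
            grad = 2 * k * U.T @ (pred - traj[t - 1])  # gradient of squared error
            null = null - lr * grad
        nulls.append(null.copy())         # the next timestep starts from this value
        eps = guided_eps(z_bar, cond, null, W_GUIDE)
        z_bar = ddim_step(z_bar, eps, alpha_bar[t], alpha_bar[t - 1])
    return nulls

z0 = rng.standard_normal(D)    # stands in for the encoded real image
cond = rng.standard_normal(D)  # stands in for the source prompt embedding
traj = pivotal_inversion(z0, cond)
plain = reconstruct(traj[-1], cond, [np.zeros(D)] * T)
tuned = reconstruct(traj[-1], cond, null_text_optimization(traj, cond))
err_plain = np.linalg.norm(plain - z0)
err_tuned = np.linalg.norm(tuned - z0)
print(f"reconstruction error, fixed null embedding: {err_plain:.3e}")
print(f"reconstruction error, tuned null embeddings: {err_tuned:.3e}")
```

In this toy setting, sampling with guidance from the inverted latent and a fixed null embedding drifts away from `z0`, while the per-timestep tuned embeddings bring the reconstruction back to it. Neither the model weights (`W`, `U`) nor the conditional embedding `cond` are modified, which is the property the abstract highlights as what keeps prompt-based editing applicable afterwards.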

Code

google/prompt-to-prompt official

2,870

thepowerfuldeez/null-text-inversion

phymhan/prompt-to-prompt

qwopqwop200/semantic-image-editing-…

Tasks

Image Generation

Text-based Image Editing

Datasets

PIE-Bench

Results from the Paper

Ranked #4 on Text-based Image Editing on PIE-Bench

Methods

Diffusion
