Skip to content

Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024

License

Notifications You must be signed in to change notification settings

ToyotaResearchInstitute/gradient-estimation-sampler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Gradient estimation sampler

Frank Permenter, Chenyang Yuan

Paper (ICML 2024)

This repo contains the code for the gradient estimation sampler introduced in the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective".

Example outputs on text-to-image Stable Diffusion when limited to N = 10 function evaluations. FID scores for text-to-image generation on MS-COCO 30K.

Using the sampler

We provide an implementation of the gradient estimation sampler as GEScheduler in gescheduler/scheduling_gradient_estimation.py, which is designed to be compatible with pipelines from the huggingface diffusers library. For example, to use this scheduler in a text-to-image pipeline:

import torch
from diffusers import StableDiffusionPipeline
from gescheduler.scheduling_gradient_estimation import GEScheduler

pipe = StableDiffusionPipeline.from_pretrained('stabilityai/stable-diffusion-2-1-base').to('cuda')
pipe.scheduler = GEScheduler.from_config(pipe.scheduler.config)
prompt = "Cluttered house in the woods, anime, oil painting, high resolution, cottagecore, ghibli inspired, 4k"
img = pipe(prompt, num_inference_steps=10, generator=torch.manual_seed(0)).images[0]
img.save('result.png')

To reproduce experiments in paper

Follow the instructions in experiments/README.md

Acknowledgement

The gradient estimation scheduler uses code adapted from huggingface diffusers library, whereas the experiment code is adapted from the PNDM repository.

Citation

@inproceedings{
permenter2024interpreting,
title={Interpreting and Improving Diffusion Models from an Optimization Perspective},
author={Frank Permenter and Chenyang Yuan},
booktitle={Forty-first International Conference on Machine Learning},
year={2024},
url={https://openreview.net/forum?id=o2ND9v0CeK}
}

About

Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages