This is an unofficial PyTorch (Lightning) implementation of EDM (*Elucidating the Design Space of Diffusion-Based Generative Models*) and its follow-up, *Analyzing and Improving the Training Dynamics of Diffusion Models*.
- Config G.
- Post-hoc EMA (see the sketch after this list).
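As a minimal sketch of what post-hoc EMA builds on, here is the power-function EMA update from the EDM2 paper, where the decay follows beta_t = (1 - 1/t)^(gamma + 1). The `PowerEMA` class and its interface are illustrative, not tinyedm's actual API; in the full method, snapshots at two values of gamma are saved during training and combined afterwards by least squares to reconstruct arbitrary EMA profiles, a step omitted here.

```python
# Illustrative sketch (not tinyedm's API) of the power-function EMA update
# from the EDM2 paper: beta_t = (1 - 1/t)^(gamma + 1).
import copy

import torch


class PowerEMA:
    def __init__(self, model: torch.nn.Module, gamma: float = 6.94):
        # gamma = 6.94 roughly corresponds to sigma_rel = 0.10 in the paper.
        self.gamma = gamma
        self.ema = copy.deepcopy(model).eval().requires_grad_(False)

    @torch.no_grad()
    def update(self, model: torch.nn.Module, step: int):
        # At t = 1, beta = 0, so the EMA starts as an exact copy of the weights.
        beta = (1.0 - 1.0 / max(step, 1)) ** (self.gamma + 1.0)
        for p_ema, p in zip(self.ema.parameters(), model.parameters()):
            p_ema.lerp_(p, 1.0 - beta)  # p_ema <- beta * p_ema + (1 - beta) * p
```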
Install from source:

```bash
git clone https://github.com/YichengDWu/tinyedm.git
cd tinyedm && pip install .
```

Train on MNIST or CIFAR-10:

```bash
python experiments/train.py --config-name=mnist
python experiments/train.py --config-name=cifar10
```

To download the ImageNet dataset, follow these steps:
- Visit the ImageNet website: http://www.image-net.org/
- Register for an account and request access to the dataset.
- Once approved, follow the instructions provided by ImageNet to download the dataset.
After downloading the ImageNet dataset, extract the files to a directory. When running the feature extraction script, use the `--data-dir` option to specify the path to this directory.
For example:
```bash
python src/tinyedm/datamodules/extract_latents.py --data-dir ./datasets/imagenet/train --out-dir ./datasets/imagenet/latents/train
```

To generate samples from a trained checkpoint:

```bash
python src/tinyedm/generate.py \
    --ckpt_path /path/to/checkpoint.ckpt \
    --load_ema \
    --output_dir /path/to/output \
    --num_samples 50000 \
    --image_size 32 \
    --num_classes 10 \
    --batch_size 128 \
    --num_workers 16 \
    --num_steps 32
```

Results:

| Dataset | Params | Type | Epochs | FID |
|---|---|---|---|---|
| CIFAR-10 | 35.6 M | unconditional | 1700 | 4.0 |
- FP16 mixed-precision training on CIFAR-10 sometimes overflows, so this implementation uses bf16 mixed precision instead (shown after these notes), which may cost some model accuracy.
- The skip-connection scale factors are learned by a small network (sketched after these notes), inspired by *ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection*. Experiments show that this improves results.
- Using the paper's multi-task learning showed no improvement in my experiments; it may be more effective over long training runs, but I do not have the compute to verify this.
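As a hedged example of the bf16 note above (how tinyedm wires this into its configs is an assumption on my part), bf16 mixed precision can be enabled through the standard PyTorch Lightning `Trainer` flag:

```python
# Assumption: a standard Lightning 2.x Trainer; "bf16-mixed" runs bfloat16
# autocast with fp32 master weights.
import lightning as L

trainer = L.Trainer(
    precision="bf16-mixed",  # avoids the overflows seen with "16-mixed" on CIFAR-10
    max_epochs=1700,         # matches the CIFAR-10 run in the table above
)
```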
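The learned skip-scale idea can be sketched as follows. This is a hypothetical module in the spirit of ScaleLong, not tinyedm's actual code; `LearnedSkipScale`, `emb_dim`, and the sigmoid choice are all illustrative assumptions.

```python
# Hypothetical sketch: predict a per-channel scale for a U-Net skip connection
# from the noise-level embedding, instead of a fixed constant like 1/sqrt(2).
import torch
import torch.nn as nn


class LearnedSkipScale(nn.Module):
    def __init__(self, emb_dim: int, channels: int):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(emb_dim, channels),
            nn.Sigmoid(),  # keep scales in (0, 1) for stability
        )

    def forward(self, skip: torch.Tensor, emb: torch.Tensor) -> torch.Tensor:
        # skip: (B, C, H, W) feature map from the encoder;
        # emb:  (B, emb_dim) noise-level embedding.
        scale = self.mlp(emb)
        return skip * scale[:, :, None, None]
```

The decoder block would then concatenate the rescaled skip with its input as usual; only the scaling of the skip branch changes.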