LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

Luo, Simian, et al. "Lcm-lora: A universal stable-diffusion acceleration module." arXiv preprint arXiv:2311.05556 (2023).

참고:

1. Introduction

A method to generate high quality images with large text-to-image generation models
- e.g., specifically SDXL (Stable Diffusion XL)
Make it dramatically faster.
Works for both..
- (1) Not only for SDXL
- (2) But also for fine-tuned SDXL without going through another training process

Summary = Doing most of the work in the latent space makes the process…

\(\rightarrow\) The proposed method!

LDMs = Still process with many iterations

\(\rightarrow\) Why not combine with consistency models?

\(\rightarrow\) Latent consistency models

For faster inference

\(\rightarrow\) Directly remove all of the noise in order to skip steps in the denoising process.

Step 1) Use a pre-trained LDM weights
Step 2) fine-tuning the LDM weights
- Too costly (\(\because\) SDXL)
- Solution = LoRA