Diffusion model

From Rice Wiki
Revision as of 22:59, 4 July 2024 by Rice (talk | contribs)

In a nutshell, diffusion models work by making something more random/noisy, and through the process finding the inverse, which is making noisy things less random, thus generating data.

Diffusion models are frequently used for computer vision tasks such as text-to-image. This page will focus exclusively on that use case.

Principle

More specifically, diffusion models consists of two steps: forward diffusion, which turns an image into noise, and backward diffusion, which turns noise back into an image.

Forward diffusion

Mathematically, forward diffusion is defined as , where q is the forward diffusion process,

Sources