Diffusion model: Difference between revisions

From Rice Wiki
No edit summary
Line 7: Line 7:


=== Forward diffusion ===
=== Forward diffusion ===
Mathematically, each step of the forward diffusion process is defined as <math>q(x_t|x_{t-1})=N(x_t, \sqrt{1-\beta_t}, \beta_tI)</math>, where ''q'' is the process, ''N'' is a normal distribution, ''x<sub>t</sub>'' is the process output at step ''t'', the square root is the mean, and the last one being the variance.
Mathematically, each step of the forward diffusion process is defined as <math>q(x_t|x_{t-1})=N(x_t, \sqrt{1-\beta_t}, \beta_tI)</math>, where ''x<sub>t</sub>'' is the process output at step ''t'', ''q'' is a distribution at the current step depending on the previous step, ''N'' is a normal distribution, the square root is the mean, and the last one being the variance.


= Sources =
= Sources =

Revision as of 23:19, 4 July 2024

In a nutshell, diffusion models work by making something more random/noisy, and through the process finding the inverse, which is making noisy things less random, thus generating data.

Diffusion models are frequently used for computer vision tasks such as text-to-image. This page will focus exclusively on that use case.

Principle

More specifically, diffusion models consists of two steps: forward diffusion, which turns an image into noise, and backward diffusion, which turns noise back into an image.

Forward diffusion

Mathematically, each step of the forward diffusion process is defined as , where xt is the process output at step t, q is a distribution at the current step depending on the previous step, N is a normal distribution, the square root is the mean, and the last one being the variance.

Sources