Diffusion model in web browser

第一集: https://www.bilibili.com/video/BV1tz4y1h7q1 | 正态分布 | 基本设定 | 公式推导 |
第二集: https://www.bilibili.com/video/BV1xQ4y1w7ex | 神经网络 | 概率空间 | 边缘概率 | 各向同性高斯分布 |
第三集: https://www.bilibili.com/video/BV1hZ421y7id | 三维动画展示全过程
第四集: https://www.bilibili.com/video/BV1gK421b7W9 | 神经网络的学习目标以及训练
第五集: https://www.bilibili.com/video/BV12y421z7Mh/ | 花絮 | 热度图制作细节揭秘

UPDATE:

2024-07-14 : Update online sample to use WebGPU if possible
2024-07-15 : Added DDIM sampling method

1. DDPM Introduction

$q$ - a fixed (or predefined) forward diffusion process of adding Gaussian noise to an image gradually, until ending up with pure noise
$p_{θ}$ - a learned reverse denoising diffusion process, where a neural network is trained to gradually denoise an image starting from pure noise, until ending up with an actual image.

Both the forward and reverse process indexed by $t$ happen for some number of finite time steps $T$ (the DDPM authors use $T$ =1000). You start with $t = 0$ where you sample a real image $x_{0}$ from your data distribution, and the forward process samples some noise from a Gaussian distribution at each time step $t$ , which is added to the image of the previous time step. Given a sufficiently large $T$ and a well behaved schedule for adding noise at each time step, you end up with what is called an isotropic Gaussian distribution at $t = T$ via a gradual process

2. Forward Process $q$

$x_{0} \overset{q (x_{1} | x_{0})}{\to} x_{1} \overset{q (x_{2} | x_{1})}{\to} x_{2} \to \dots \to x_{T - 1} \overset{q (x_{t} | x_{t - 1})}{\to} x_{T}$

This process is a markov chain, $x_{t}$ only depends on $x_{t - 1}$ . $q (x_{t} | x_{t - 1})$ adds Gaussian noise at each time step $t$ , according to a known variance schedule $β_{t}$

$x_{t} = \sqrt{1 - β_{t}} \times x_{t - 1} + \sqrt{β_{t}} \times ϵ_{t}$

$β_{t}$ is not constant at each time step $t$ . In fact one defines a so-called "variance schedule", which can be linear, quadratic, cosine, etc.

$0 < β_{1} < β_{2} < β_{3} < \dots < β_{T} < 1$

$ϵ_{t}$ Gaussian noise, sampled from standard normal distribution.

$x_{t} = \sqrt{1 - β_{t}} \times x_{t - 1} + \sqrt{β_{t}} \times ϵ_{t}$

Define $a_{t} = 1 - β_{t}$

$x_{t} = \sqrt{a_{t}} \times x_{t - 1} + \sqrt{1 - a_{t}} \times ϵ_{t}$

2.1 Relationship between $x_{t}$ and $x_{t - 2}$

$x_{t - 1} = \sqrt{a_{t - 1}} \times x_{t - 2} + \sqrt{1 - a_{t - 1}} \times ϵ_{t - 1}$

$⇓$

$x_{t} = \sqrt{a_{t}} (\sqrt{a_{t - 1}} \times x_{t - 2} + \sqrt{1 - a_{t - 1}} ϵ_{t - 1}) + \sqrt{1 - a_{t}} \times ϵ_{t}$

$⇓$

$x_{t} = \sqrt{a_{t} a_{t - 1}} \times x_{t - 2} + \sqrt{a_{t} (1 - a_{t - 1})} ϵ_{t - 1} + \sqrt{1 - a_{t}} \times ϵ_{t}$

Because

N (μ_{1}, σ_{1}^{2}) + N (μ_{2}, σ_{2}^{2}) = N (μ_{1} + μ_{2}, σ_{1}^{2} + σ_{2}^{2})

Proof

$x_{t} = \sqrt{a_{t} a_{t - 1}} \times x_{t - 2} + \sqrt{a_{t} (1 - a_{t - 1}) + 1 - a_{t}} \times ϵ$

$⇓$

$x_{t} = \sqrt{a_{t} a_{t - 1}} \times x_{t - 2} + \sqrt{1 - a_{t} a_{t - 1}} \times ϵ$

2.2 Relationship between $x_{t}$ and $x_{t - 3}$

$x_{t - 2} = \sqrt{a_{t - 2}} \times x_{t - 3} + \sqrt{1 - a_{t - 2}} \times ϵ_{t - 2}$

$⇓$

$x_{t} = \sqrt{a_{t} a_{t - 1}} (\sqrt{a_{t - 2}} \times x_{t - 3} + \sqrt{1 - a_{t - 2}} ϵ_{t - 2}) + \sqrt{1 - a_{t} a_{t - 1}} \times ϵ$

$⇓$

$x_{t} = \sqrt{a_{t} a_{t - 1} a_{t - 2}} \times x_{t - 3} + \sqrt{a_{t} a_{t - 1} (1 - a_{t - 2})} ϵ_{t - 2} + \sqrt{1 - a_{t} a_{t - 1}} \times ϵ$

$⇓$

$x_{t} = \sqrt{a_{t} a_{t - 1} a_{t - 2}} \times x_{t - 3} + \sqrt{a_{t} a_{t - 1} - a_{t} a_{t - 1} a_{t - 2}} ϵ_{t - 2} + \sqrt{1 - a_{t} a_{t - 1}} \times ϵ$

$⇓$

$x_{t} = \sqrt{a_{t} a_{t - 1} a_{t - 2}} \times x_{t - 3} + \sqrt{(a_{t} a_{t - 1} - a_{t} a_{t - 1} a_{t - 2}) + 1 - a_{t} a_{t - 1}} \times ϵ$

$⇓$

$x_{t} = \sqrt{a_{t} a_{t - 1} a_{t - 2}} \times x_{t - 3} + \sqrt{1 - a_{t} a_{t - 1} a_{t - 2}} \times ϵ$

2.3 Relationship between $x_{t}$ and $x_{0}$

$x_{t} = \sqrt{a_{t} a_{t - 1}} \times x_{t - 2} + \sqrt{1 - a_{t} a_{t - 1}} \times ϵ$
$x_{t} = \sqrt{a_{t} a_{t - 1} a_{t - 2}} \times x_{t - 3} + \sqrt{1 - a_{t} a_{t - 1} a_{t - 2}} \times ϵ$
$x_{t} = \sqrt{a_{t} a_{t - 1} a_{t - 2} a_{t - 3} . . . a_{t - (k - 2)} a_{t - (k - 1)}} \times x_{t - k} + \sqrt{1 - a_{t} a_{t - 1} a_{t - 2} a_{t - 3} . . . a_{t - (k - 2)} a_{t - (k - 1)}} \times ϵ$
$x_{t} = \sqrt{a_{t} a_{t - 1} a_{t - 2} a_{t - 3} . . . a_{2} a_{1}} \times x_{0} + \sqrt{1 - a_{t} a_{t - 1} a_{t - 2} a_{t - 3} . . . a_{2} a_{1}} \times ϵ$

$$\bar{a}{t} := a{t}a_{t-1}a_{t-2}a_{t-3}...a_{2}a_{1}$$

$x_{t} = \sqrt{{\bar{a}}_{t}} \times x_{0} + \sqrt{1 - {\bar{a}}_{t}} \times ϵ, ϵ \sim N (0, I)$

$⇓$

$q (x_{t} | x_{0}) = \frac{1}{\sqrt{2 π} \sqrt{1 - {\bar{a}}_{t}}} e^{(- \frac{1}{2} \frac{(x_{t} - \sqrt{{\bar{a}}_{t}} x_{0})^{2}}{1 - {\bar{a}}_{t}})}$

3.Reverse Process $p$

Because $P (A | B) = \frac{P (B | A) P (A)}{P (B)}$

$p (x_{t - 1} | x_{t}, x_{0}) = \frac{q (x_{t} | x_{t - 1}, x_{0}) \times q (x_{t - 1} | x_{0})}{q (x_{t} | x_{0})}$

$x_{t} = \sqrt{a_{t}} x_{t - 1} + \sqrt{1 - a_{t}} \times ϵ$	~	$N (\sqrt{a_{t}} x_{t - 1}, 1 - a_{t})$
$x_{t - 1} = \sqrt{{\bar{a}}_{t - 1}} x_{0} + \sqrt{1 - {\bar{a}}_{t - 1}} \times ϵ$	~	$N (\sqrt{{\bar{a}}_{t - 1}} x_{0}, 1 - {\bar{a}}_{t - 1})$
$x_{t} = \sqrt{{\bar{a}}_{t}} x_{0} + \sqrt{1 - {\bar{a}}_{t}} \times ϵ$	~	$N (\sqrt{{\bar{a}}_{t}} x_{0}, 1 - {\bar{a}}_{t})$

$q (x_{t} | x_{t - 1}, x_{0}) = \frac{1}{\sqrt{2 π} \sqrt{1 - a_{t}}} e^{(- \frac{1}{2} \frac{(x_{t} - \sqrt{a_{t}} x_{t - 1})^{2}}{1 - a_{t}})}$

$q (x_{t - 1} | x_{0}) = \frac{1}{\sqrt{2 π} \sqrt{1 - {\bar{a}}_{t - 1}}} e^{(- \frac{1}{2} \frac{(x_{t - 1} - \sqrt{{\bar{a}}_{t - 1}} x_{0})^{2}}{1 - {\bar{a}}_{t - 1}})}$

$q (x_{t} | x_{0}) = \frac{1}{\sqrt{2 π} \sqrt{1 - {\bar{a}}_{t}}} e^{(- \frac{1}{2} \frac{(x_{t} - \sqrt{{\bar{a}}_{t}} x_{0})^{2}}{1 - {\bar{a}}_{t}})}$

$\frac{q (x_{t} | x_{t - 1}, x_{0}) \times q (x_{t - 1} | x_{0})}{q (x_{t} | x_{0})} = [\frac{1}{\sqrt{2 π} \sqrt{1 - a_{t}}} e^{(- \frac{1}{2} \frac{(x_{t} - \sqrt{a_{t}} x_{t - 1})^{2}}{1 - a_{t}})}] * [\frac{1}{\sqrt{2 π} \sqrt{1 - {\bar{a}}_{t - 1}}} e^{(- \frac{1}{2} \frac{(x_{t - 1} - \sqrt{{\bar{a}}_{t - 1}} x_{0})^{2}}{1 - {\bar{a}}_{t - 1}})}] \div [\frac{1}{\sqrt{2 π} \sqrt{1 - {\bar{a}}_{t}}} e^{(- \frac{1}{2} \frac{(x_{t} - \sqrt{{\bar{a}}_{t}} x_{0})^{2}}{1 - {\bar{a}}_{t}})}]$

$⇓$

$\frac{\sqrt{2 π} \sqrt{1 - {\bar{a}}_{t}}}{\sqrt{2 π} \sqrt{1 - a_{t}} \sqrt{2 π} \sqrt{1 - {\bar{a}}_{t - 1}}} e^{[- \frac{1}{2} (\frac{(x_{t} - \sqrt{a_{t}} x_{t - 1})^{2}}{1 - a_{t}} + \frac{(x_{t - 1} - \sqrt{{\bar{a}}_{t - 1}} x_{0})^{2}}{1 - {\bar{a}}_{t - 1}} - \frac{(x_{t} - \sqrt{{\bar{a}}_{t}} x_{0})^{2}}{1 - {\bar{a}}_{t}})]}$

$⇓$

$\frac{1}{\sqrt{2 π} (\frac{\sqrt{1 - a_{t}} \sqrt{1 - {\bar{a}}_{t - 1}}}{\sqrt{1 - {\bar{a}}_{t}}})} \exp [- \frac{1}{2} (\frac{(x_{t} - \sqrt{a_{t}} x_{t - 1})^{2}}{1 - a_{t}} + \frac{(x_{t - 1} - \sqrt{{\bar{a}}_{t - 1}} x_{0})^{2}}{1 - {\bar{a}}_{t - 1}} - \frac{(x_{t} - \sqrt{{\bar{a}}_{t}} x_{0})^{2}}{1 - {\bar{a}}_{t}})]$

$⇓$

$\frac{1}{\sqrt{2 π} (\frac{\sqrt{1 - a_{t}} \sqrt{1 - {\bar{a}}_{t - 1}}}{\sqrt{1 - {\bar{a}}_{t}}})} \exp [- \frac{1}{2} (\frac{x_{t}^{2} - 2 \sqrt{a_{t}} x_{t} x_{t - 1} + a_{t} x_{t - 1}^{2}}{1 - a_{t}} + \frac{x_{t - 1}^{2} - 2 \sqrt{{\bar{a}}_{t - 1}} x_{0} x_{t - 1} + {\bar{a}}_{t - 1} x_{0}^{2}}{1 - {\bar{a}}_{t - 1}} - \frac{(x_{t} - \sqrt{{\bar{a}}_{t}} x_{0})^{2}}{1 - {\bar{a}}_{t}})]$

$⇓$

$\frac{1}{\sqrt{2 π} (\frac{\sqrt{1 - a_{t}} \sqrt{1 - {\bar{a}}_{t - 1}}}{\sqrt{1 - {\bar{a}}_{t}}})} \exp [- \frac{1}{2} \frac{{(x_{t - 1} - (\frac{\sqrt{a_{t}} (1 - {\bar{a}}_{t - 1})}{1 - {\bar{a}}_{t}} x_{t} + \frac{\sqrt{{\bar{a}}_{t - 1}} (1 - a_{t})}{1 - {\bar{a}}_{t}} x_{0}))}^{2}}{{(\frac{\sqrt{1 - a_{t}} \sqrt{1 - {\bar{a}}_{t - 1}}}{\sqrt{1 - {\bar{a}}_{t}}})}^{2}}]$

$⇓$

$p (x_{t - 1} | x_{t}) \sim N (\frac{\sqrt{a_{t}} (1 - {\bar{a}}_{t - 1})}{1 - {\bar{a}}_{t}} x_{t} + \frac{\sqrt{{\bar{a}}_{t - 1}} (1 - a_{t})}{1 - {\bar{a}}_{t}} x_{0}, {(\frac{\sqrt{1 - a_{t}} \sqrt{1 - {\bar{a}}_{t - 1}}}{\sqrt{1 - {\bar{a}}_{t}}})}^{2})$

Because $x_{t} = \sqrt{{\bar{a}}_{t}} \times x_{0} + \sqrt{1 - {\bar{a}}_{t}} \times ϵ$ , $x_{0} = \frac{x_{t} - \sqrt{1 - {\bar{a}}_{t}} \times ϵ}{\sqrt{{\bar{a}}_{t}}}$ . Substitute $x_{0}$ with this formula.

$p (x_{t - 1} | x_{t}) \sim N (\frac{\sqrt{a_{t}} (1 - {\bar{a}}_{t - 1})}{1 - {\bar{a}}_{t}} x_{t} + \frac{\sqrt{{\bar{a}}_{t - 1}} (1 - a_{t})}{1 - {\bar{a}}_{t}} \times \frac{x_{t} - \sqrt{1 - {\bar{a}}_{t}} \times ϵ}{\sqrt{{\bar{a}}_{t}}}, \frac{β_{t} (1 - {\bar{a}}_{t - 1})}{1 - {\bar{a}}_{t}})$

Note: This README.md is intended solely for previewing on the Github page. If you wish to view the rendered page locally, please consult README.raw.md.

Name	Name	Last commit message	Last commit date
Latest commit wangjia184 update tensorflow.js to latest version Dec 25, 2024 5ea111b · Dec 25, 2024 History 57 Commits
dataset/64x64/train/nolabel	dataset/64x64/train/nolabel	clean dataset	Jul 14, 2024
docs	docs	update tensorflow.js to latest version	Dec 25, 2024
web	web	update tensorflow.js to latest version	Dec 25, 2024
LICENSE	LICENSE	Initial commit	Feb 24, 2023
README.md	README.md	Added DDIM	Jul 15, 2024
README.raw.md	README.raw.md	add	Dec 24, 2023
chain.png	chain.png	Add files via upload	Mar 14, 2023
ddpm.py	ddpm.py	+ prev	Feb 26, 2023
ddpm_tf.py	ddpm_tf.py	initial commit	Feb 24, 2023
denoise.jpg	denoise.jpg	Add files via upload	Mar 27, 2023
export.sh	export.sh	+ prev	Feb 26, 2023
package-lock.json	package-lock.json	optimize the code to use webgpu backend	Jul 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diffusion model in web browser

1. DDPM Introduction

2. Forward Process $q$

2.1 Relationship between $x_{t}$ and $x_{t - 2}$

2.2 Relationship between $x_{t}$ and $x_{t - 3}$

2.3 Relationship between $x_{t}$ and $x_{0}$

3.Reverse Process $p$

About

Releases

Packages

Contributors 2

Languages

License

wangjia184/diffusion_model

Folders and files

Latest commit

History

Repository files navigation

Diffusion model in web browser

1. DDPM Introduction

2. Forward Process q

2.1 Relationship between x t and x t − 2

2.2 Relationship between x t and x t − 3

2.3 Relationship between x t and x 0

3.Reverse Process p

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

2. Forward Process $q$

2.1 Relationship between $x_{t}$ and $x_{t - 2}$

2.2 Relationship between $x_{t}$ and $x_{t - 3}$

2.3 Relationship between $x_{t}$ and $x_{0}$

3.Reverse Process $p$

Packages