
Codebook embedding does not update #14

Open
zhxgj opened this issue Apr 24, 2020 · 4 comments

Comments

@zhxgj

zhxgj commented Apr 24, 2020

I found ctx.needs_input_grad[1] is False during training VQ-VAE. Is this correct, and does it mean the embedding of the codebook does not update during training?

if ctx.needs_input_grad[1]:
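For context, the check in question lives in the backward of a custom torch.autograd.Function. Below is a minimal sketch of a straight-through vector-quantization Function of the kind typically used here (names and details are illustrative, not the repo's exact code):

```python
import torch
from torch.autograd import Function

class VQStraightThrough(Function):
    @staticmethod
    def forward(ctx, inputs, codebook):
        # Nearest-neighbour lookup: pick the closest codebook row per input.
        distances = torch.cdist(inputs, codebook)   # (N, K)
        indices = distances.argmin(dim=1)           # (N,)
        ctx.save_for_backward(indices, codebook)
        ctx.mark_non_differentiable(indices)
        return codebook[indices], indices

    @staticmethod
    def backward(ctx, grad_output, grad_indices):
        grad_inputs = grad_codebook = None
        if ctx.needs_input_grad[0]:
            # Straight-through estimator: copy the gradient to the encoder output.
            grad_inputs = grad_output.clone()
        if ctx.needs_input_grad[1]:
            # The branch discussed in this issue: scatter the output
            # gradient back into the selected codebook rows.
            indices, codebook = ctx.saved_tensors
            grad_codebook = torch.zeros_like(codebook)
            grad_codebook.index_add_(0, indices, grad_output)
        return grad_inputs, grad_codebook
```

Note that ctx.needs_input_grad[1] is True only if the codebook tensor passed into apply() requires a gradient at that call site.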

@zhangbo2008

I agree with the commenter above. It is so weird.

@chenaoxuan

> I found ctx.needs_input_grad[1] is False during training VQ-VAE. Is this correct, and does it mean the embedding of the codebook does not update during training?
>
> if ctx.needs_input_grad[1]:

This part of the code is never executed! But I printed model.codebook.embedding.weight.data and found that the codebook does get updated!
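One plausible explanation, sketched under the standard VQ-VAE training objective (all names here are illustrative): the codebook loss reaches embedding.weight through an ordinary, differentiable embedding lookup, a second path that bypasses the custom Function entirely.

```python
import torch
import torch.nn.functional as F

embedding = torch.nn.Embedding(num_embeddings=512, embedding_dim=64)
z_e = torch.randn(8, 64, requires_grad=True)      # stand-in for encoder output

# Nearest-code assignment; done without grad, as in the custom Function.
with torch.no_grad():
    indices = torch.cdist(z_e, embedding.weight).argmin(dim=1)

z_q = embedding(indices)                          # plain differentiable lookup

codebook_loss = F.mse_loss(z_q, z_e.detach())     # pulls codes toward encoder outputs
commit_loss = F.mse_loss(z_e, z_q.detach())       # pulls encoder toward codes

(codebook_loss + 0.25 * commit_loss).backward()
print(embedding.weight.grad.abs().sum())          # non-zero: the codebook gets gradients
```

So the codebook weights can update through this second path even if the needs_input_grad[1] branch never runs.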

@Roller44

Roller44 commented Jul 12, 2023

Actually, ctx.needs_input_grad[0] and ctx.needs_input_grad[1] are set to true and false alternately.
For the 1st step, ctx.needs_input_grad[0] is true and ctx.needs_input_grad[1] is false.
For the 2nd step, ctx.needs_input_grad[0] becomes false and ctx.needs_input_grad[1] becomes true.
For the 3rd step, ctx.needs_input_grad[0] is true and ctx.needs_input_grad[1] is false.
And so on.

This setting is reasonable because there are two "agents", namely the codebook and the autoencoder, each updating w.r.t. a different part of the loss function.

@RipeMangoBox

RipeMangoBox commented May 10, 2024

> Actually, ctx.needs_input_grad[0] and ctx.needs_input_grad[1] are set to true and false alternately. For the 1st step, ctx.needs_input_grad[0] is true and ctx.needs_input_grad[1] is false. For the 2nd step, ctx.needs_input_grad[0] becomes false and ctx.needs_input_grad[1] becomes true. For the 3rd step, ctx.needs_input_grad[0] is true and ctx.needs_input_grad[1] is false. And so on.
>
> This setting is reasonable because there are two "agents", namely the codebook and the autoencoder, each updating w.r.t. a different part of the loss function.

I debugged the code and found that ctx.needs_input_grad[1] is always False; it is not set to true and false alternately.
A basic fact is that a variable $A$ not requiring a gradient does not mean it will not be updated during optimization. The attribute requires_grad (which ctx.needs_input_grad mirrors) describes whether a gradient should be calculated for $A$ through this particular path. In other words, it controls gradient computation through this Function, not whether $A$ itself can be updated: $A$ can still receive gradients, and optimizer updates, through another path in the graph!

Therefore, even though ctx.needs_input_grad[1] is always False, the codebook can still be updated.
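A quick probe (a hypothetical minimal Function, purely for illustration) confirms that ctx.needs_input_grad simply mirrors the requires_grad flags of the Function's inputs at call time, so it will not alternate on its own:

```python
import torch
from torch.autograd import Function

class Probe(Function):
    @staticmethod
    def forward(ctx, a, b):
        return a + b

    @staticmethod
    def backward(ctx, g):
        # Each entry is True iff the corresponding input required a gradient.
        print("needs_input_grad:", ctx.needs_input_grad)
        return (g if ctx.needs_input_grad[0] else None,
                g if ctx.needs_input_grad[1] else None)

a = torch.ones(3, requires_grad=True)
b = torch.ones(3, requires_grad=False)   # like a codebook fed in detached
Probe.apply(a, b).sum().backward()       # prints: needs_input_grad: (True, False)
```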
