AIMV2 as the encoder, unfreezing it and setting the learning rate to 2e-6 results in the LLaVA-NEXT model achieving a loss of 0 #21

1359347500cwc · 2024-12-02T07:21:17Z

When using AIMV2 as the encoder, unfreezing it and setting the learning rate to 2e-6 leads to the LLaVA-NEXT model reaching a loss of 0 after 3000-4000 steps. The original paper kept the encoder frozen. Why is it not recommended to unfreeze it for training? If I decide to unfreeze it, what learning rate should I set?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AIMV2 as the encoder, unfreezing it and setting the learning rate to 2e-6 results in the LLaVA-NEXT model achieving a loss of 0 #21

AIMV2 as the encoder, unfreezing it and setting the learning rate to 2e-6 results in the LLaVA-NEXT model achieving a loss of 0 #21

1359347500cwc commented Dec 2, 2024

AIMV2 as the encoder, unfreezing it and setting the learning rate to 2e-6 results in the LLaVA-NEXT model achieving a loss of 0 #21

AIMV2 as the encoder, unfreezing it and setting the learning rate to 2e-6 results in the LLaVA-NEXT model achieving a loss of 0 #21

Comments

1359347500cwc commented Dec 2, 2024