-
Notifications
You must be signed in to change notification settings - Fork 217
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Model stability problem #54
Comments
mobilevit blocks num_heads is 4 would be too small? |
How many epochs are you training? |
epochs=3000 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi!
Thank you for the great work. I used the mobilevit blocks for my model to low level task. at begin it has good performance , but I get different performance when I run it once again. my model is stable if I remove the mobilevit blocks. Do you know what problem would make the model instability, I use following basic parameter:
max_lr:1e-4
min_lr:1e-6
optim
name: adamw
scheduler:
name: "cosine"
in_channels:96
transformer_dim : 144
ffn_dim = 288
n_transformer_blocks=2
The text was updated successfully, but these errors were encountered: