Model stability problem #54

Open
Xinjie-Wei opened this issue Sep 28, 2022 · 3 comments

Comments

@Xinjie-Wei

Hi!
Thank you for the great work. I used MobileViT blocks in my model for a low-level task. The first run performed well, but I get different performance when I run it again. The model is stable if I remove the MobileViT blocks. Do you know what could be causing the instability? I use the following basic parameters:
max_lr: 1e-4
min_lr: 1e-6
optim:
  name: adamw
scheduler:
  name: "cosine"
in_channels: 96
transformer_dim: 144
ffn_dim: 288
n_transformer_blocks: 2

@Xinjie-Wei
Author

The MobileViT blocks use num_heads = 4. Could that be too small?
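For context on the head count, here is a minimal sketch of the per-head width implied by the numbers in this thread. It assumes standard multi-head attention, where the embedding dimension is split evenly across heads; the helper names `head_dim` and `valid_head_counts` are hypothetical and not part of the repository.

```python
def head_dim(transformer_dim: int, num_heads: int) -> int:
    """Per-head width when the embedding is split evenly across heads."""
    if transformer_dim % num_heads != 0:
        raise ValueError("transformer_dim must be divisible by num_heads")
    return transformer_dim // num_heads


def valid_head_counts(transformer_dim: int, max_heads: int) -> list[int]:
    """Head counts up to max_heads that divide transformer_dim evenly."""
    return [h for h in range(1, max_heads + 1) if transformer_dim % h == 0]


# Values taken from the issue: transformer_dim = 144, num_heads = 4.
print(head_dim(144, 4))
print(valid_head_counts(144, 16))
```

With transformer_dim = 144 and 4 heads, each head is 36 channels wide. Whether that affects run-to-run stability is not something this arithmetic can answer; it only shows which head counts are compatible with the posted dimension.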

@sacmehta
Collaborator

How many epochs are you training?

@Xinjie-Wei
Author

epochs=3000
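To make the posted schedule concrete, here is a rough sketch of how the learning rate would evolve over 3000 epochs with max_lr = 1e-4 and min_lr = 1e-6. It assumes the textbook cosine annealing formula with no warmup and per-epoch updates; the scheduler actually used in the repository may differ, and `cosine_lr` is a hypothetical helper written only for illustration.

```python
import math


def cosine_lr(epoch: int, total_epochs: int, max_lr: float, min_lr: float) -> float:
    """Textbook cosine annealing from max_lr (epoch 0) down to min_lr (last epoch)."""
    progress = epoch / total_epochs  # fraction of training completed, in [0, 1]
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


# Values taken from the issue: max_lr = 1e-4, min_lr = 1e-6, epochs = 3000.
for epoch in (0, 750, 1500, 2250, 3000):
    print(epoch, f"{cosine_lr(epoch, 3000, 1e-4, 1e-6):.3e}")
```

Under these assumptions the rate starts at max_lr, passes the midpoint of the two bounds halfway through training, and reaches min_lr at the final epoch.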

2 participants