Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training Settings #47

Open
Yeez-lee opened this issue Jul 20, 2024 · 3 comments
Open

Training Settings #47

Yeez-lee opened this issue Jul 20, 2024 · 3 comments

Comments

@Yeez-lee
Copy link

Hi, I am curious about your MDTv2_s2 settings. What is your lr_anneal_steps? Should I set it to 0? And how can I control the training steps? For example, how can I set the overall training steps to 400k? Thank you!

@gasvn
Copy link
Collaborator

gasvn commented Jul 22, 2024

We keep the same learning rate in our setting. And we just kill the running program when it reaches the required training step.

@Yeez-lee
Copy link
Author

Yeez-lee commented Jul 24, 2024

We keep the same learning rate in our setting. And we just kill the running program when it reaches the required training step.

Thank you! One more question is about cfg_scale. Do you always keep 3.8 for S, B, and XL models during inference?

@gasvn
Copy link
Collaborator

gasvn commented Jul 24, 2024

Yes, the value is obtained with MDTv1, and I keep using 3.8 for all models of MDTv2

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants