Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

中断训练后恢复训练时epoch计数不对 #644

Open
daijin81 opened this issue Mar 9, 2025 · 0 comments
Open

中断训练后恢复训练时epoch计数不对 #644

daijin81 opened this issue Mar 9, 2025 · 0 comments

Comments

@daijin81
Copy link

daijin81 commented Mar 9, 2025

第二次中断训练后从save state继续第三次训练的时候,epoch计数会无视train_state.json里的记录数据并以第二次开始训练的数量作为第三次训练的起始计数, 并非从第一次开始算起的全局计数,不知道计数不对会不会对学习率的调度产生影响。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant