[go: nahoru, domu]

Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question]: 训练中途报错中断后,重新训练如何在已训练的checkpoint上继续训练而不是重新开始训练 #7952

Open
Matter-Charles opened this issue Feb 2, 2024 · 3 comments
Assignees
Labels
question Further information is requested

Comments

@Matter-Charles
Copy link

请提出你的问题

训练中途报错中断后,重新跑finetune.py代码发现模型重新训练,从checkpiont-100开始,是否有参数可以选择从之前训练过的某个checkpoint开始继续训练?

@Matter-Charles Matter-Charles added the question Further information is requested label Feb 2, 2024
@gongel
Copy link
Member
gongel commented Feb 6, 2024

重跑就可以了哈,自动继续训练。

Copy link

This issue is stale because it has been open for 60 days with no activity. 当前issue 60天内无活动,被标记为stale。

@github-actions github-actions bot added the stale label Apr 27, 2024
@github-actions github-actions bot removed the stale label May 8, 2024
@w5688414
Copy link
Contributor

可以类似这样,指定自己的checkpoint。

checkpoint = training_args.resume_from_checkpoint

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

4 participants