
[QA] max_token for the data when fine-tuning InternLM2 #687

Closed · Answered by ZwwWayne
MING-ZCH asked this question in Q&A


For reference: as a trade-off between efficiency and effectiveness, we currently train on 32K-long texts and then extrapolate to 200K.
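To make the 32K cap concrete for data preparation, here is a minimal sketch assuming the Hugging Face `transformers` tokenizer API; the checkpoint name and the `encode_sample` helper are illustrative assumptions, not the project's actual fine-tuning pipeline:

```python
# Minimal sketch: cap fine-tuning samples at the 32K training length
# mentioned in the answer above. Checkpoint name and helper are assumptions.
from transformers import AutoTokenizer

MODEL = "internlm/internlm2-chat-7b"   # assumed checkpoint
MAX_TOKENS = 32 * 1024                 # 32K token window, per the answer

tokenizer = AutoTokenizer.from_pretrained(MODEL, trust_remote_code=True)

def encode_sample(text: str):
    """Tokenize one fine-tuning sample, truncating to the 32K window."""
    return tokenizer(
        text,
        truncation=True,
        max_length=MAX_TOKENS,
        return_tensors="pt",
    )
```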

Replies: 1 comment · 1 reply (from @MING-ZCH)
Answer selected by ZwwWayne
Category: Q&A
Labels: question (Further information is requested)
2 participants
This discussion was converted from issue #681 on January 31, 2024 13:41.