You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
众所周知AutoDL平台的系统盘不可以选择扩容,然初次train需要transformers库预下载书生LLM至
.cache
缓存文件下,然.cache文件目录在系统盘上,所以LLM文件过大导致系统盘不足,从而导致模型下载中断报错。解决方法如下:删除缓存文件
执行命令:
rm -rf ~/.cache
将数据盘文件夹路径与系统盘上.cache文件建立软连接(这样就可以将数据盘路径
/root/autodl-tmp
下的内容引入到~/.cache
中,从而更好地管理缓存文件,模型文件则会下载至/root/autodl-tmp
路径下)执行命令:
ln -s /root/autodl-tmp ~/.cache
验证软连接是否成功(如未出现以下内容则重新操作步骤1、2)
依次执行命令:
cd ~
和ll
有如下显示则说明软连接建立成功.cache -> /root/autodl-tmp/
以上操作结束后就可以开始train
xtuner train internlm_20b_qlora_oasst1_512_e3
Beta Was this translation helpful? Give feedback.
All reactions