Skip to content

Conversation

aicodex
Copy link

@aicodex aicodex commented Jul 29, 2025

What does this PR do?

实现channel_loss #5137

使用方法:
训练的yaml加上:
channel_loss: true
report_to: tensorboard
logging_dir: tensorboard输出文件夹

再在dataset_info.json加上channel: “你要作为channel的列名”
例如:
"channel_loss_20250627_500": {
"file_name": "channel_loss_20250627/train_selected_data_500.json",
"formatting": "sharegpt",
"columns": {
"messages": "conversations",
"system": "system",
"channel": "type"
}
}

最后训练过程中:
tensorboard --logdir=tensorboard输出文件夹 --port=8888
访问localhost:8888就能看到了。

目前在asend_npu/gpu测试通过了。

Before submitting

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant