Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RM和PPO支持 #2879

Open
JoeYing1019 opened this issue Jan 7, 2025 · 1 comment
Open

RM和PPO支持 #2879

JoeYing1019 opened this issue Jan 7, 2025 · 1 comment
Labels
enhancement New feature or request

Comments

@JoeYing1019
Copy link

请问RM和PPO现在完全支持了吗,RM训练目前调用的是PeftModelForSequenceClassification,而不是AutoModelForCausalLMWithValueHead,希望能尽快帮忙支持一下(尤其是VLM的RM和PPO训练)

@Jintao-Huang Jintao-Huang added the enhancement New feature or request label Jan 7, 2025
@Jintao-Huang
Copy link
Collaborator

等一下SequenceClassificationWrapper

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants