Does Ludwig Support PPO? #3949
-
Hi all, I didn't see anything on the doc site so am asking here: does Ludwig support PPO training? And what would an example .yaml config file look like to do this? I assume the config would need to include parameters for a supervised fine-tuned model, a reward model, and any parameters for the PPO loss calculations and gradient updates for the SFT model. Thanks!
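For reference, the "PPO loss calculations" the question mentions usually means the clipped surrogate objective. This is a minimal, framework-agnostic sketch of that loss in NumPy, not Ludwig code — the function name, parameters, and the `clip_eps=0.2` default are illustrative assumptions, not part of any Ludwig API:

```python
import numpy as np

def ppo_clip_loss(logp_new, logp_old, advantages, clip_eps=0.2):
    """Clipped PPO surrogate loss (illustrative sketch, not Ludwig code)."""
    # Probability ratio between the updated policy and the frozen SFT policy.
    ratio = np.exp(logp_new - logp_old)
    # Unclipped and clipped surrogate terms.
    unclipped = ratio * advantages
    clipped = np.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # Take the pessimistic (element-wise minimum) objective and negate,
    # since we minimize the loss.
    return -np.mean(np.minimum(unclipped, clipped))

# When the policies agree (logp_new == logp_old), the ratio is 1 and the
# loss reduces to -mean(advantages).
logp = np.log(np.array([0.5, 0.4]))
loss = ppo_clip_loss(logp, logp, np.array([1.0, -1.0]))  # → 0.0
```

In a full PPO loop this loss would be combined with a value-function loss and an entropy bonus, with the reward model supplying the scores from which advantages are computed.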
Answered by arnavgarg1 · Feb 29, 2024
-
Hi @braunagn! Unfortunately, Ludwig currently doesn't support PPO or DPO, but it is something we intend to add in the next few months. Would you be interested in contributing support for either of them?
Answer selected by braunagn