
Multi loss base, PDU, LLM as a Judge, Llama 2 13B, MUSE finetuning #121


Open · wants to merge 9 commits into base: main

Conversation

tahaEntesari
Contributor

What does this PR do?

Addresses #118
This PR adds several new features:

  • We add a new multi_loss base trainer class that all multi-loss trainers extend. To simplify the linear scalarization of multiple losses, we aggregate the preferences into a single input, trainer.method_args.preferences, which takes a list of weights instead of separate gamma and alpha inputs.
  • We add our new PDU trainer. We provide models unlearned with this method on huggingface.co/tamarsonha. For details of the unlearning method, see the paper.
  • We add a new evaluation script that uses an LLM as a judge for evaluation.
  • We add a new Llama 2 13B model and provide pretrained checkpoints for the MUSE dataset on huggingface.co/tamarsonha. The models are pretrained for 10 epochs.
  • We add new fine-tuning configurations for the MUSE dataset. To streamline fine-tuning, we provide the merged dataset needed for fine-tuning on MUSE at huggingface.co/tamarsonha.
  • To address the changes made by the new multi_loss base trainer, we searched the repository for the gamma and alpha keywords and updated the documentation accordingly.
  • We add documentation on PDU in the community folder.
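The preferences change can be sketched roughly as follows. This is an illustrative example only: `scalarize` and the weight values are hypothetical names for exposition, not the repo's actual trainer code.

```python
# Illustrative sketch (hypothetical names, not the actual trainer code):
# linear scalarization of several loss terms via a single preferences
# list, replacing separate gamma/alpha scalar inputs.

def scalarize(losses, preferences):
    """Combine per-objective losses into one scalar with one weight each."""
    if len(losses) != len(preferences):
        raise ValueError("need exactly one preference weight per loss")
    return sum(w * l for w, l in zip(preferences, losses))

# Old style:  total = gamma * forget_loss + alpha * retain_loss
# New style:  total = scalarize([forget_loss, retain_loss], [gamma, alpha])
print(scalarize([2.0, 0.5], [1.0, 4.0]))  # 4.0
```

Under this scheme, adding a third loss only means appending one more entry to each list, rather than introducing a new named hyperparameter.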

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Have you gone through the contributions guide?
  • Are your changes documented? Read documentation guidelines here.

tahaEntesari and others added 2 commits May 30, 2025 16:35
Added llm_judge and all the files and configs required for it
Added multi_loss base trainer and made corresponding changes in other trainers.
Changed linear scalarization parameter input format to a list, allowing more losses to be added in a simple fashion. Made appropriate changes to all trainers and also documentation to reflect this.
Added finetuning for MUSE with the corresponding dataset
Added Llama 2 13B model. We provide checkpoints for this model on huggingface.co/tamarsonha
@Dornavineeth Dornavineeth requested review from molereddy and Dornavineeth and removed request for molereddy June 9, 2025 05:10
@armanhtm

Hi Dorna and the OU team,

I hope you're doing well. Thank you so much for your excellent repository; it has been incredibly helpful for us (the PDU team) in implementing and evaluating our method. I truly appreciate your time and effort.

I wanted to kindly ask if you could review our code and pull request, provide any feedback, and possibly merge it into your repository. We believe that PDU offers a new and interesting perspective on this problem, and we're excited to see more people engage with and use our method to make a meaningful impact.

Thank you once again for your amazing repository!

@molereddy
Collaborator

Hi, we've seen the PR, but didn't get a chance to read it carefully. Both of us maintainers just graduated, so it's been busy for us. Things might be slower as we find and hand over to newer maintainers, but we will try to get to reviewing PDU and get it in. If you have any cleanup or code structure improvements you can make, please do those in the meantime, as that would make it easier for us. Thank you very much for your contribution!

@molereddy
Collaborator

There are 40 changed files, and in particular I see changes to many files unrelated to your PR, as you seem to have done some standardization. Have you checked all the changed points and tested enough of them to know this won't break things? We don't have unit tests in the code and can't run things ourselves right now, so we are relying on contributors verifying all their changes.

Collaborator

(Unless I'm missing something) this is a metric, and it is implemented in evals, which must only contain evaluators. Look at this example from the same folder: configs/eval/muse.yaml shows what an evaluator is -- an evaluation suite comprising several individual metrics. The YAML config for an LLM judge handler must be in the metrics folder of the eval suite it belongs to.

Contributor Author

The LLM judge is, in principle, an evaluator, not just a metric. The LLM judge performs its evaluation based on some metrics like accuracy, forgetting success, etc. As such, I do not think it should be a metric.
In the current implementation, the metrics are fixed. However, in a future release, it could be made more modular and contain some other metrics. I unfortunately don't have the time right now to make changes regarding this.
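The evaluator-vs-metric distinction being debated here can be sketched as follows. The class names are hypothetical and only illustrate the layering, not the repo's actual interfaces.

```python
# Hypothetical sketch: a metric scores one aspect of model outputs,
# while an evaluator is a suite that runs several metrics and
# aggregates their results into a single report.

class Accuracy:
    def evaluate(self, outputs):
        # fraction of outputs judged correct
        return sum(o["correct"] for o in outputs) / len(outputs)

class LLMJudgeEvaluator:
    """Evaluator: composes individual metrics (accuracy, forgetting success, ...)."""
    def __init__(self, metrics):
        self.metrics = metrics  # mapping of metric name -> metric object

    def evaluate(self, outputs):
        return {name: m.evaluate(outputs) for name, m in self.metrics.items()}

judge = LLMJudgeEvaluator({"accuracy": Accuracy()})
print(judge.evaluate([{"correct": 1}, {"correct": 0}]))  # {'accuracy': 0.5}
```

Under this reading, making the judge's internal metrics configurable in a future release would only mean passing a different metrics mapping.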

Collaborator
@molereddy molereddy Jun 21, 2025

depends on resolution of other comments

Collaborator
@molereddy molereddy Jun 21, 2025

It seems you are contributing multiloss as a new base trainer for most unlearning methods. That's not ideal, because there are other open PRs and other users who are using the old structure.

  • The multiloss trainer mentions primal-dual too many times to serve as a base trainer for other methods, especially ones as simple as graddiff.

If you make this structural change, the implementation should be much cleaner: a general base trainer and a more specific primal-dual class that modifies it further, while also ensuring consistency with other PRs.

Or simply leave the other methods as they are and add PDU separately.

Collaborator

Don't think we need to add new arguments in the default training args. You can add new arguments using `+`; the attribute doesn't need to already exist in the config. https://github.com/locuslab/open-unlearning/blob/main/docs/hydra.md has this documented.

Contributor Author

I am aware of the `+` syntax for new arguments. I just felt this is something important and good to have by default in the default parameters. I'll leave the final decision regarding this to you.

Collaborator

depends on resolution of other comments

Collaborator

seems to me this should be a metric. @Dornavineeth thoughts?

Collaborator

depends on resolution of other comments

Collaborator

If this is a default prompt_generator file with a generic create_prompt, it shouldn't be this specific. Also, prompts might be better supplied through the config files than hard-coded here.

Collaborator

Refer to the prior comment on the multiloss YAML.

@molereddy
Collaborator

You may want to link your paper here: https://github.com/locuslab/open-unlearning/blob/main/docs/links.md

@molereddy
Collaborator

Thank you for citing our work in your paper. We've just updated the README with a modified citation to our paper on OpenUnlearning. It would be great if you could change the citation to point to that instead!

@tahaEntesari
Contributor Author

I will revert my changes regarding the multi_loss file.
I had been very busy and hadn't been able to check this thread. I'll try to make the required changes soon.

# Conflicts:
#	community/methods/PDU/README.md
#	community/methods/PDU/run.sh
#	configs/trainer/PDU.yaml
#	src/trainer/unlearn/pdu.py
@tahaEntesari
Contributor Author

I have reverted all the changes that were made to standardize the base loss function.
Please take a look at the new changes and let me know if there are other concerns.

@molereddy
Collaborator

Can decide how to go about things on the basis of @Dornavineeth's review.
