Add UNDIAL, a new unlearning algorithm #89
* testing commit * Fixes * cleanup
Fix tofu_unlearn.sh for IdKDPO method.
Revert "Dpo fix"
* IdkDPO script fix in tofu_unlearn.sh (locuslab#65)
* Fix hyperlinks in README
* Download I don't know data in setup_data.py
* Fix tofu_unlearn.sh for IdkDPO
---------
Co-authored-by: Anmol Mekala <49127549+molereddy@users.noreply.github.com>
* overwrite=True
* RMU added
* Fix ref model device
* ruff fix
* RMU updated
* Update rmu.py
* Update README.md: add RMU
* Added references and renamed functions
---------
Co-authored-by: Anmol Mekala <49127549+molereddy@users.noreply.github.com>
…on (locuslab#8)
* docs: updates, small corrections, re-formats
* modified ruff commands
* modified ruff commands
* CI/CD minor updates
* added contributing + leaderboard
* fix minor spelling misatkes
* docs: bunch of minor updates
* docs fixes
---------
Co-authored-by: molereddy <m.anmolreddy@gmail.com>
Thank you for this! Can you make these additions?
Summary of steps to contribute
We've provided a template here.
@molereddy Thank you for the suggestions. I fixed the parts I could understand and sent a private message about the things I don't understand. I hope that at least what I did in this PR can be merged. Let me know if there are any fatal issues. Thank you!
I've merged our upstream changes onto your branch. If you can pull, test the code, and add some results, that would be great. Otherwise, mark the PR for review: we can then test on our end and merge the code in the coming days.
Sorry for the delay. I had a bit of trouble running my method on the 7B model, which requires a LoRA implementation because my method needs to run two 7B models in parallel. It would be great if LoRA could be supported. I would really appreciate it if you could test my method like all the others, using the same standard and following the hyperparameter search described in the PR description (which should not be hard given that you have the pipeline ready). I have also marked the PR as ready for review.
Dornavineeth left a comment
Great work.
It is currently tested only on the TOFU 1B model.
What does this PR do?
Before submitting