QLoRA and DDP #4
Open
Labels: question, wontfix
Thanks for the great repo.
I have two questions about training the models (specifically WizardCoder):
1. Have you tried training with QLoRA, and not just LoRA? Are you considering adding it to the repo?
2. The example usage (https://github.com/shibing624/CodeAssist#train-wizardcoder-model) runs without DDP (DistributedDataParallel), using only DP (DataParallel). Are you sure this is the optimal setting? We got significantly higher training speed with DDP.