QLoRA and DDP

Thanks for the great repo!

I have two questions about training the models (specifically WizardCoder):

1. Have you tried training with QLoRA, and not just LoRA? Are you considering adding it to the repo?

2. The example usage (https://github.com/shibing624/CodeAssist#train-wizardcoder-model) runs with DP only, not DDP.
Are you sure this is the optimal setting? We got significantly higher training throughput with DDP.
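
To make the second question concrete, here is a minimal sketch of the two launch modes being compared. It assumes a single node, a training entry point named `train.py`, and a `--do_train` flag; both names are hypothetical placeholders, not taken from the repo. The only real CLI used is `torchrun` with its `--nproc_per_node` option. Whether the script actually runs under DDP also depends on the trainer reading the environment that `torchrun` sets, which I have not verified for this repo.

```python
import shlex


def launch_command(script, script_args, num_gpus, use_ddp):
    """Return the argv list for a single-node training launch.

    `script` and `script_args` are placeholders for the real training
    entry point and its flags (hypothetical in this sketch).
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    if use_ddp:
        # torchrun starts one process per GPU; the training script (or its
        # trainer) is then expected to wrap the model in DistributedDataParallel.
        launcher = ["torchrun", "--nproc_per_node", str(num_gpus)]
    else:
        # A single Python process; with several visible GPUs this is the
        # setup in which a trainer typically falls back to DataParallel.
        launcher = ["python"]
    return launcher + [script] + list(script_args)


if __name__ == "__main__":
    args = ["--do_train"]  # hypothetical flag, for illustration only
    print("DP :", shlex.join(launch_command("train.py", args, 2, use_ddp=False)))
    print("DDP:", shlex.join(launch_command("train.py", args, 2, use_ddp=True)))
```

The point of the comparison is that DP keeps one process and splits each batch across GPUs, while DDP runs one process per GPU, which is the setup where we saw the higher throughput.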
