Skip to content

QLoRA and DDP #4

@mrT23

Description

@mrT23

Thanks for the great repo

i have two questions about training the models (specifically WizardCoder):

  1. have you tried training with QLoRa, and not just LoRa ? are you considering adding it to the repo ?

  2. the example usage (https://github.com/shibing624/CodeAssist#train-wizardcoder-model) is without ddp, only dp.
    are you sure this is the optimal setting? We got significantly higher training rates with ddp

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requestedwontfixThis will not be worked on

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions