Skip to content

Feature: add LoRA fine-tuning support for pi0 policies #944

@wadeKeith

Description

@wadeKeith

Motivation

Full fine-tuning of pi0 models is expensive. LoRA would enable efficient task-specific adaptation with minimal VRAM.

Proposal

Add a LoRAPolicy wrapper that applies LoRA adapters to attention layers while keeping the action head trainable.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions