Skip to content

Latest commit

 

History

History
233 lines (200 loc) · 4.61 KB

File metadata and controls

233 lines (200 loc) · 4.61 KB
title Examples
description Collection of examples for models, training, inference, and clusters
hide
toc

Training

    <p>
        Fine-tune Llama 3.1 8B with SFT and QLoRA, single-node or distributed across multiple nodes.
    </p>
</a>

<a href="/docs/examples/training/axolotl"
   class="feature-cell">
    <h3>
        Axolotl
    </h3>

    <p>
        Fine-tune Llama models with FSDP and QLoRA, single-node or distributed across multiple nodes.
    </p>
</a>

<a href="/docs/examples/training/ray-ragen"
   class="feature-cell">
    <h3>
        Ray+RAGEN
    </h3>

    <p>
        Fine-tune an agent on multiple nodes with RAGEN, verl, and Ray.
    </p>
</a>

<a href="/docs/examples/training/miles"
   class="feature-cell">
    <h3>
        Miles
    </h3>

    <p>
        RL-fine-tune Qwen2.5-32B with Miles.
    </p>
</a>

Clusters

    <p>
        Set up GCP A4 and A3 clusters with optimized networking
    </p>
</a>
<a href="/docs/examples/clusters/efa"
   class="feature-cell sky">
    <h3>
        AWS
    </h3>

    <p>
        Set up AWS EFA clusters with optimized networking
    </p>
</a>
<a href="/docs/examples/clusters/lambda"
   class="feature-cell sky">
    <h3>
        Lambda
    </h3>

    <p>
        Set up Lambda clusters with optimized networking
    </p>
</a>
<a href="/docs/examples/clusters/crusoe"
   class="feature-cell sky">
    <h3>
        Crusoe
    </h3>

    <p>
        Set up Crusoe clusters with optimized networking
    </p>
</a>
<a href="/docs/examples/clusters/nebius"
   class="feature-cell sky">
    <h3>
        Nebius
    </h3>

    <p>
        Set up Nebius clusters with optimized networking
    </p>
</a>
<a href="/docs/examples/clusters/nccl-rccl-tests"
   class="feature-cell sky">
    <h3>
        NCCL/RCCL tests
    </h3>

    <p>
        Run multi-node NCCL tests with MPI
    </p>
</a>

Inference

Deploy Qwen3.6-27B with SGLang

Deploy Qwen3.6-27B with vLLM

Deploy a DeepSeek distilled model with NIM

Deploy Qwen3 with TensorRT-LLM

Models

    <p>
        Deploy DeepSeek V4 with SGLang on B200:8
    </p>
</a>

<a href="/docs/examples/models/qwen36"
   class="feature-cell">
    <h3>
        Qwen 3.6
    </h3>

    <p>
        Deploy Qwen3.6-27B with SGLang on NVIDIA or AMD
    </p>
</a>

Accelerators

    <p>
        Deploy and fine-tune LLMs on AMD
    </p>
</a>

<a href="/docs/examples/accelerators/tpu"
   class="feature-cell sky">
    <h3>
        TPU
    </h3>

    <p>
        Deploy and fine-tune LLMs on TPU
    </p>
</a>

<a href="/docs/examples/accelerators/tenstorrent"
   class="feature-cell sky">
    <h3>
        Tenstorrent
    </h3>

    <p>
        Deploy and fine-tune LLMs on Tenstorrent
    </p>
</a>