
Commit 7364f8f

[Docs] Renamed Fine-tuning to Single-node training for more clarity and consistency

1 parent 874099c

40 files changed: 85 additions & 552 deletions

docs/blog/posts/intel-gaudi.md (2 additions & 2 deletions)

````diff
@@ -98,7 +98,7 @@ model using [Optimum for Intel Gaudi :material-arrow-top-right-thin:{ .external
 and [DeepSpeed :material-arrow-top-right-thin:{ .external }](https://docs.habana.ai/en/latest/PyTorch/DeepSpeed/DeepSpeed_User_Guide/DeepSpeed_User_Guide.html#deepspeed-user-guide){:target="_blank"} with
 the [`lvwerra/stack-exchange-paired` :material-arrow-top-right-thin:{ .external }](https://huggingface.co/datasets/lvwerra/stack-exchange-paired){:target="_blank"} dataset:
 
-<div editor-title="examples/fine-tuning/trl/intel/.dstack.yml">
+<div editor-title="examples/single-node-training/trl/intel/.dstack.yml">
 
 ```yaml
 type: task
@@ -152,7 +152,7 @@ Submit the task using the [`dstack apply`](../../docs/reference/cli/dstack/apply
 <div class="termy">
 
 ```shell
-$ dstack apply -f examples/fine-tuning/trl/intel/.dstack.yml -R
+$ dstack apply -f examples/single-node-training/trl/intel/.dstack.yml -R
 ```
 
 </div>
````
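The renamed `.dstack.yml` is truncated in this diff right after `type: task`. For orientation, here is a minimal sketch of the shape such a dstack task takes; everything below the first line is an assumption, not the actual file contents:

```yaml
type: task
# Hypothetical values throughout -- the diff above truncates the real
# file at `type: task`.
name: trl-train-gaudi
env:
  - HF_TOKEN
commands:
  # Assumed entry point, inferred from the surrounding prose
  # (Optimum for Intel Gaudi + DeepSpeed on stack-exchange-paired)
  - pip install -r requirements.txt
  - python examples/single-node-training/trl/intel/train.py
resources:
  gpu: gaudi2:8  # assumption: eight Gaudi 2 accelerators
```

The commit itself only touches the `examples/fine-tuning/` → `examples/single-node-training/` paths; the rest of the file is unchanged.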

docs/blog/posts/tpu-on-gcp.md (3 additions & 3 deletions)

````diff
@@ -158,7 +158,7 @@ Below is an example of fine-tuning Llama 3.1 8B using [Optimum TPU :material-arr
 and the [Abirate/english_quotes :material-arrow-top-right-thin:{ .external }](https://huggingface.co/datasets/Abirate/english_quotes){:target="_blank"}
 dataset.
 
-<div editor-title="examples/fine-tuning/optimum-tpu/llama31/train.dstack.yml">
+<div editor-title="examples/single-node-training/optimum-tpu/llama31/train.dstack.yml">
 
 ```yaml
 type: task
@@ -171,8 +171,8 @@ env:
 commands:
   - git clone -b add_llama_31_support https://github.com/dstackai/optimum-tpu.git
   - mkdir -p optimum-tpu/examples/custom/
-  - cp examples/fine-tuning/optimum-tpu/llama31/train.py optimum-tpu/examples/custom/train.py
-  - cp examples/fine-tuning/optimum-tpu/llama31/config.yaml optimum-tpu/examples/custom/config.yaml
+  - cp examples/single-node-training/optimum-tpu/llama31/train.py optimum-tpu/examples/custom/train.py
+  - cp examples/single-node-training/optimum-tpu/llama31/config.yaml optimum-tpu/examples/custom/config.yaml
   - cd optimum-tpu
   - pip install -e . -f https://storage.googleapis.com/libtpu-releases/index.html
   - pip install datasets evaluate
````
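Since most of the `commands` block is visible as diff context, the renamed `train.dstack.yml` can be pieced together roughly as below; the `name`, `env`, final training command, and `resources` entries are assumptions, as the diff truncates the file around them:

```yaml
type: task
# Assembled from the context lines in the diff above; name, env, the
# final command, and resources are assumptions.
name: optimum-tpu-llama31
env:
  - HF_TOKEN  # assumed: gated Llama 3.1 weights need a Hugging Face token
commands:
  - git clone -b add_llama_31_support https://github.com/dstackai/optimum-tpu.git
  - mkdir -p optimum-tpu/examples/custom/
  - cp examples/single-node-training/optimum-tpu/llama31/train.py optimum-tpu/examples/custom/train.py
  - cp examples/single-node-training/optimum-tpu/llama31/config.yaml optimum-tpu/examples/custom/config.yaml
  - cd optimum-tpu
  - pip install -e . -f https://storage.googleapis.com/libtpu-releases/index.html
  - pip install datasets evaluate
  - python examples/custom/train.py  # hypothetical final step, not shown in the diff
resources:
  gpu: v5litepod-8  # hypothetical TPU spec; the real resources block is truncated
```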

docs/docs/concepts/tasks.md (3 additions & 3 deletions)

````diff
@@ -10,7 +10,7 @@ The filename must end with `.dstack.yml` (e.g. `.dstack.yml` or `dev.dstack.yml`
 
 [//]: # (TODO: Make tabs - single machine & distributed tasks & web app)
 
-<div editor-title="examples/fine-tuning/axolotl/train.dstack.yml">
+<div editor-title="examples/single-node-training/axolotl/train.dstack.yml">
 
 ```yaml
 type: task
@@ -26,7 +26,7 @@ env:
   - WANDB_API_KEY
 # Commands of the task
 commands:
-  - accelerate launch -m axolotl.cli.train examples/fine-tuning/axolotl/config.yaml
+  - accelerate launch -m axolotl.cli.train examples/single-node-training/axolotl/config.yaml
 
 resources:
   gpu:
@@ -461,4 +461,4 @@ it does not block other runs with lower priority from scheduling.
 !!! info "What's next?"
     1. Read about [dev environments](dev-environments.md), [services](services.md), and [repos](repos.md)
     2. Learn how to manage [fleets](fleets.md)
-    3. Check the [Axolotl](/examples/fine-tuning/axolotl) example
+    3. Check the [Axolotl](/examples/single-node-training/axolotl) example
````
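Joining the fragments from the first two hunks, the renamed Axolotl task plausibly reads as follows; the `name`, `image`, `HF_TOKEN` entry, and the GPU constraint are assumptions, since the diff truncates those parts:

```yaml
type: task
# Assembled from the context lines above; name, image, HF_TOKEN, and
# the GPU constraint are assumptions.
name: axolotl-train
image: axolotlai/axolotl:main-latest  # hypothetical image tag
env:
  - HF_TOKEN  # assumed to sit alongside WANDB_API_KEY
  - WANDB_API_KEY
# Commands of the task
commands:
  - accelerate launch -m axolotl.cli.train examples/single-node-training/axolotl/config.yaml

resources:
  gpu:
    memory: 24GB..  # hypothetical range; the actual constraint is not shown
```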

docs/examples.md (34 additions & 58 deletions)

````diff
@@ -12,10 +12,10 @@ hide:
 }
 </style>
 
-## Fine-tuning
+## Single-node training
 
 <div class="tx-landing__highlights_grid">
-<a href="/examples/fine-tuning/axolotl"
+<a href="/examples/single-node-training/axolotl"
 class="feature-cell">
 <h3>
 Axolotl
@@ -26,7 +26,7 @@ hide:
 </p>
 </a>
 
-<a href="/examples/fine-tuning/trl"
+<a href="/examples/single-node-training/trl"
 class="feature-cell">
 <h3>
 TRL
@@ -38,85 +38,86 @@ hide:
 </a>
 </div>
 
-## Clusters
+## Distributed training
 
 <div class="tx-landing__highlights_grid">
-<a href="/examples/clusters/nccl-tests"
+<a href="/examples/distributed-training/ray-ragen"
 class="feature-cell sky">
 <h3>
-NCCL tests
+Ray+RAGEN
 </h3>
 
 <p>
-Run multi-node NCCL tests with MPI
+Fine-tune an agent on multiple nodes
+with RAGEN, verl, and Ray.
 </p>
 </a>
-<a href="/examples/clusters/rccl-tests"
+<a href="/examples/distributed-training/trl"
 class="feature-cell sky">
 <h3>
-RCCL tests
+TRL
 </h3>
 
 <p>
-Run multi-node RCCL tests with MPI
+Fine-tune LLM on multiple nodes
+with TRL, Accelerate, and Deepspeed.
 </p>
 </a>
-<a href="/examples/clusters/a3mega"
+<a href="/examples/distributed-training/axolotl"
 class="feature-cell sky">
 <h3>
-A3 Mega
+Axolotl
 </h3>
 
 <p>
-Set up GCP A3 Mega clusters with optimized networking
+Fine-tune LLM on multiple nodes
+with Axolotl.
 </p>
 </a>
-<a href="/examples/clusters/a3high"
+</div>
+
+
+## Clusters
+
+<div class="tx-landing__highlights_grid">
+<a href="/examples/clusters/nccl-tests"
 class="feature-cell sky">
 <h3>
-A3 High
+NCCL tests
 </h3>
 
 <p>
-Set up GCP A3 High clusters with optimized networking
+Run multi-node NCCL tests with MPI
 </p>
 </a>
-</div>
-
-## Distributed training
-
-<div class="tx-landing__highlights_grid">
-<a href="/examples/distributed-training/ray-ragen"
+<a href="/examples/clusters/rccl-tests"
 class="feature-cell sky">
 <h3>
-Ray+RAGEN
+RCCL tests
 </h3>
 
 <p>
-Fine-tune an agent on multiple nodes
-with RAGEN, verl, and Ray.
+Run multi-node RCCL tests with MPI
 </p>
 </a>
-<a href="/examples/distributed-training/trl"
+<a href="/examples/clusters/a3mega"
 class="feature-cell sky">
 <h3>
-TRL
+A3 Mega
 </h3>
 
 <p>
-Fine-tune LLM on multiple nodes
-with TRL, Accelerate, and Deepspeed.
+Set up GCP A3 Mega clusters with optimized networking
 </p>
 </a>
-<a href="/examples/distributed-training/axolotl"
+<a href="/examples/clusters/a3high"
 class="feature-cell sky">
 <h3>
-Axolotl
+A3 High
 </h3>
 
 <p>
-Fine-tune LLM on multiple nodes
-with Axolotl.
+Set up GCP A3 High clusters with optimized networking
 </p>
 </a>
 </div>
@@ -219,31 +220,6 @@ hide:
 </a>
 </div>
 
-## LLMs
-
-<div class="tx-landing__highlights_grid">
-<a href="/examples/llms/deepseek"
-class="feature-cell sky">
-<h3>
-Deepseek
-</h3>
-
-<p>
-Deploy and train Deepseek models
-</p>
-</a>
-<a href="/examples/llms/llama"
-class="feature-cell sky">
-<h3>
-Llama
-</h3>
-
-<p>
-Deploy Llama 4 models
-</p>
-</a>
-</div>
-
 ## Misc
 
 <div class="tx-landing__highlights_grid">
````
File renamed without changes.
File renamed without changes.

docs/overrides/main.html (1 addition & 2 deletions)

````diff
@@ -117,12 +117,11 @@
 
 <div class="tx-footer__section">
 <div class="tx-footer__section-title">Examples</div>
-<a href="/examples#fine-tuning" class="tx-footer__section-link">Fine-tuning</a>
+<a href="/examples#fine-tuning" class="tx-footer__section-link">Single-node training</a>
 <a href="/examples#clusters" class="tx-footer__section-link">Clusters</a>
 <a href="/examples#distributed-training" class="tx-footer__section-link">Distributed training</a>
 <a href="/examples#inference" class="tx-footer__section-link">Inference</a>
 <a href="/examples#accelerators" class="tx-footer__section-link">Accelerators</a>
-<a href="/examples#llms" class="tx-footer__section-link">LLMs</a>
 <!-- <a href="/examples#misc" class="tx-footer__section-link">Misc</a> -->
 </div>
 
````

examples/accelerators/amd/README.md (6 additions & 6 deletions)

````diff
@@ -114,7 +114,7 @@ To request multiple GPUs, specify the quantity after the GPU name, separated by
 and the [`mlabonne/guanaco-llama2-1k` :material-arrow-top-right-thin:{ .external }](https://huggingface.co/datasets/mlabonne/guanaco-llama2-1k){:target="_blank"}
 dataset.
 
-<div editor-title="examples/fine-tuning/trl/amd/.dstack.yml">
+<div editor-title="examples/single-node-training/trl/amd/.dstack.yml">
 
 ```yaml
 type: task
@@ -140,7 +140,7 @@ To request multiple GPUs, specify the quantity after the GPU name, separated by
   - pip install peft
   - pip install transformers datasets huggingface-hub scipy
   - cd ..
-  - python examples/fine-tuning/trl/amd/train.py
+  - python examples/single-node-training/trl/amd/train.py
 
 # Uncomment to leverage spot instances
 #spot_policy: auto
@@ -157,7 +157,7 @@ To request multiple GPUs, specify the quantity after the GPU name, separated by
 and the [tatsu-lab/alpaca :material-arrow-top-right-thin:{ .external }](https://huggingface.co/datasets/tatsu-lab/alpaca){:target="_blank"}
 dataset.
 
-<div editor-title="examples/fine-tuning/axolotl/amd/.dstack.yml">
+<div editor-title="examples/single-node-training/axolotl/amd/.dstack.yml">
 
 ```yaml
 type: task
@@ -213,7 +213,7 @@ To request multiple GPUs, specify the quantity after the GPU name, separated by
 
 > To speed up installation of `flash-attention` and `xformers `, we use pre-built binaries uploaded to S3.
 > You can find the tasks that build and upload the binaries
-> in [`examples/fine-tuning/axolotl/amd/` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/fine-tuning/axolotl/amd/){:target="_blank"}.
+> in [`examples/single-node-training/axolotl/amd/` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/single-node-training/axolotl/amd/){:target="_blank"}.
 
 ## Running a configuration
 
@@ -238,8 +238,8 @@ $ dstack apply -f examples/inference/vllm/amd/.dstack.yml
 The source-code of this example can be found in
 [`examples/inference/tgi/amd` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/inference/tgi/amd){:target="_blank"},
 [`examples/inference/vllm/amd` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/inference/vllm/amd){:target="_blank"},
-[`examples/fine-tuning/axolotl/amd` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/fine-tuning/axolotl/amd){:target="_blank"} and
-[`examples/fine-tuning/trl/amd` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/fine-tuning/trl/amd){:target="_blank"}
+[`examples/single-node-training/axolotl/amd` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/single-node-training/axolotl/amd){:target="_blank"} and
+[`examples/single-node-training/trl/amd` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/single-node-training/trl/amd){:target="_blank"}
 
 ## What's next?
 
````
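From the first two hunks, the renamed TRL-on-AMD task assembles into roughly the sketch below; the clone/install steps before `pip install peft`, plus the `name`, `image`, and GPU spec, are assumptions, as the diff truncates those parts:

```yaml
type: task
# Assembled from the visible hunks; the first three commands, name,
# image, and the GPU spec are assumptions.
name: trl-train-amd
image: rocm/pytorch:latest  # hypothetical ROCm image
commands:
  - git clone https://github.com/huggingface/trl  # assumed earlier step
  - cd trl                                        # assumed earlier step
  - pip install .                                 # assumed earlier step
  - pip install peft
  - pip install transformers datasets huggingface-hub scipy
  - cd ..
  - python examples/single-node-training/trl/amd/train.py

# Uncomment to leverage spot instances
#spot_policy: auto

resources:
  gpu: MI300X  # assumption: an AMD accelerator, per the README's subject
```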
examples/accelerators/intel/README.md (1 addition & 1 deletion)

````diff
@@ -102,7 +102,7 @@ using [Optimum for Intel Gaudi :material-arrow-top-right-thin:{ .external }](htt
 and [DeepSpeed :material-arrow-top-right-thin:{ .external }](https://docs.habana.ai/en/latest/PyTorch/DeepSpeed/DeepSpeed_User_Guide/DeepSpeed_User_Guide.html#deepspeed-user-guide){:target="_blank"} with
 the [`lvwerra/stack-exchange-paired` :material-arrow-top-right-thin:{ .external }](https://huggingface.co/datasets/lvwerra/stack-exchange-paired){:target="_blank"} dataset.
 
-<div editor-title="examples/fine-tuning/trl/intel/.dstack.yml">
+<div editor-title="examples/single-node-training/trl/intel/.dstack.yml">
 
 ```yaml
 type: task
````

examples/accelerators/tpu/README.md (5 additions & 5 deletions)

````diff
@@ -127,7 +127,7 @@ Below is an example of fine-tuning Llama 3.1 8B using [Optimum TPU :material-arr
 and the [`Abirate/english_quotes` :material-arrow-top-right-thin:{ .external }](https://huggingface.co/datasets/Abirate/english_quotes){:target="_blank"}
 dataset.
 
-<div editor-title="examples/fine-tuning/optimum-tpu/llama31/.dstack.yml">
+<div editor-title="examples/single-node-training/optimum-tpu/llama31/.dstack.yml">
 
 ```yaml
 type: task
@@ -139,8 +139,8 @@ env:
 commands:
   - git clone -b add_llama_31_support https://github.com/dstackai/optimum-tpu.git
   - mkdir -p optimum-tpu/examples/custom/
-  - cp examples/fine-tuning/optimum-tpu/llama31/train.py optimum-tpu/examples/custom/train.py
-  - cp examples/fine-tuning/optimum-tpu/llama31/config.yaml optimum-tpu/examples/custom/config.yaml
+  - cp examples/single-node-training/optimum-tpu/llama31/train.py optimum-tpu/examples/custom/train.py
+  - cp examples/single-node-training/optimum-tpu/llama31/config.yaml optimum-tpu/examples/custom/config.yaml
   - cd optimum-tpu
   - pip install -e . -f https://storage.googleapis.com/libtpu-releases/index.html
   - pip install datasets evaluate
@@ -155,7 +155,7 @@ resources:
 </div>
 
 [//]: # (### Fine-Tuning with TRL)
-[//]: # (Use the example `examples/fine-tuning/optimum-tpu/gemma/train.dstack.yml` to Finetune `Gemma-2B` model using `trl` with `dstack` and `optimum-tpu`. )
+[//]: # (Use the example `examples/single-node-training/optimum-tpu/gemma/train.dstack.yml` to Finetune `Gemma-2B` model using `trl` with `dstack` and `optimum-tpu`. )
 
 ### Memory requirements
 
@@ -181,7 +181,7 @@ Note, `v5litepod` is optimized for fine-tuning transformer-based models. Each co
 The source-code of this example can be found in
 [`examples/inference/tgi/tpu` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/inference/tgi/tpu){:target="_blank"},
 [`examples/inference/vllm/tpu` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/inference/vllm/tpu){:target="_blank"},
-and [`examples/fine-tuning/optimum-tpu` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/fine-tuning/trl){:target="_blank"}.
+and [`examples/single-node-training/optimum-tpu` :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/examples/single-node-training/trl){:target="_blank"}.
 
 ## What's next?
 
````
