Commit 48663a4

docs: apply tech pubs launcher copyedits
Signed-off-by: Zeel <desaizeel2128@gmail.com>
1 parent 37c2e07 commit 48663a4

2 files changed: +11 -11 lines changed

docs/launcher/overview.md

Lines changed: 6 additions & 6 deletions
@@ -8,20 +8,20 @@ NeMo AutoModel provides several ways to launch training. The right choice depend
 |---|---|---|---|
 | **Local Workstation** | Getting started, debugging, single-node training | 1-8 on one machine | [Local Workstation](./local-workstation.md) |
 | **NeMo-Run** | Managed execution on Slurm, Kubernetes, Docker, local | 1+ | [NeMo-Run](./nemo-run.md) |
-| **SkyPilot** | Cloud training (AWS, GCP, Azure) with spot pricing | Any | [SkyPilot](./skypilot.md) |
+| **SkyPilot** | Cloud training or Kubernetes clusters | Any | [SkyPilot](./skypilot.md) |
 | **Slurm** | Multi-node batch jobs on HPC clusters | 8+ across nodes | [Slurm](./slurm.md) |
 
 ### I Have 1–2 GPUs on My Workstation
 
-Use the **interactive** launcher. No scheduler or cluster software needed:
+Use the **interactive** launcher. No scheduler or cluster software is needed:
 
 ```bash
 automodel examples/llm_finetune/llama3_2/llama3_2_1b_squad.yaml
 ```
 
 See the [Local Workstation](./local-workstation.md) guide.
 
-### I have access to a Slurm cluster
+### I Have Access to a Slurm Cluster
 
 Add a `slurm:` section to your YAML config and submit with the same `automodel` command. The CLI generates the `torchrun` invocation and calls `sbatch` for you:

@@ -31,7 +31,7 @@ automodel config_with_slurm.yaml
 
 See the [Slurm](./slurm.md) guide.
 
-### I want managed job submission (Slurm, Kubernetes, Docker)
+### I Want Managed Job Submission (Slurm, Kubernetes, Docker)
 
 Add a `nemo_run:` section to your YAML config. NeMo-Run loads a pre-configured executor for your compute target and submits the job:

@@ -41,7 +41,7 @@ automodel config_with_nemo_run.yaml
 
 See the [NeMo-Run](./nemo-run.md) guide.
 
-### I want to train on the cloud
+### I Want to Train on the Cloud
 
 Add a `skypilot:` section to your YAML config. SkyPilot provisions VMs on any major cloud and handles spot-instance preemption automatically:

@@ -51,7 +51,7 @@ automodel config_with_skypilot.yaml
 
 See the [SkyPilot](./skypilot.md) guide.
 
-### I want to train on Kubernetes with SkyPilot
+### I Want to Train on Kubernetes with SkyPilot
 
 Use the same `skypilot:` launcher, but set `cloud: kubernetes`. This is a good fit when your team already has a GPU-backed Kubernetes cluster and you want SkyPilot to handle job submission and multi-node orchestration:

docs/launcher/skypilot.md

Lines changed: 5 additions & 5 deletions
@@ -106,7 +106,7 @@ model:
 
 ## Cloud Examples
 
-### AWS - On-demand A10G
+### AWS On-Demand A10G
 
 ```yaml
 skypilot:
@@ -118,7 +118,7 @@ skypilot:
 hf_token: ${HF_TOKEN}
 ```
 
-### GCP - Spot V100, 8 GPUs (single node)
+### GCP Spot V100, 8 GPUs (Single Node)
 
 ```yaml
 skypilot:
@@ -130,7 +130,7 @@ skypilot:
 hf_token: ${HF_TOKEN}
 ```
 
-### Multi-node distributed training (2 x 8 x A100)
+### Multi-Node Distributed Training (2 x 8 x A100)
 
 ```yaml
 skypilot:
@@ -142,7 +142,7 @@ skypilot:
 hf_token: ${HF_TOKEN}
 ```
 
-For multi-node jobs the launcher automatically adds the SkyPilot rendezvous environment variables (`$SKYPILOT_NODE_RANK`, `$SKYPILOT_NUM_NODES`, `$SKYPILOT_NODE_IPS`) to the `torchrun` command.
+For multi-node jobs, the launcher automatically adds the SkyPilot rendezvous environment variables (`$SKYPILOT_NODE_RANK`, `$SKYPILOT_NUM_NODES`, `$SKYPILOT_NODE_IPS`) to the `torchrun` command.
 
 ## Monitor and Manage Jobs

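Reviewer note on the multi-node sentence in the hunk above: the diff does not show how the launcher wires the SkyPilot variables into `torchrun`, so the following is only a minimal sketch of one plausible mapping, not the launcher's actual code. It assumes `SKYPILOT_NODE_IPS` holds one IP per line with the head node first. The helper name `build_torchrun_args` is hypothetical, and the fallback port 29500 is an assumption chosen for illustration.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of NeMo AutoModel): turn SkyPilot's
# rendezvous environment variables into torchrun rendezvous flags.
build_torchrun_args() {
  # Assumption: the first line of SKYPILOT_NODE_IPS is the head node,
  # which every node uses as the rendezvous address.
  head_ip="$(printf '%s\n' "${SKYPILOT_NODE_IPS}" | head -n1)"

  # MASTER_PORT is optional here; 29500 is an assumed fallback.
  echo "--nnodes=${SKYPILOT_NUM_NODES} --node_rank=${SKYPILOT_NODE_RANK} --master_addr=${head_ip} --master_port=${MASTER_PORT:-29500}"
}
```

Inside a SkyPilot task, the result could be spliced into a command such as `torchrun $(build_torchrun_args) train.py`, where `train.py` stands in for whatever entry point the launcher actually runs.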
@@ -176,7 +176,7 @@ automodel config_with_skypilot.yaml \
 --model.pretrained_model_name_or_path meta-llama/Llama-3.2-3B
 ```
 
-## Kubernetes users
+## Kubernetes Users
 
 If you want to run on a Kubernetes cluster, use `cloud: kubernetes` and follow the dedicated [SkyPilot + Kubernetes tutorial](./skypilot-kubernetes.md). That guide includes:
