docs/docs/concepts/dev-environments.md
+4 -4 (4 additions & 4 deletions)
@@ -133,7 +133,7 @@ The `gpu` property lets you specify vendor, model, memory, and count — e.g., `

If vendor is omitted, `dstack` infers it from the model or defaults to `nvidia`.

-??? info "Google Cloud TPU"
+<!-- ??? info "Google Cloud TPU"
To use TPUs, specify its architecture via the `gpu` property.

```yaml
@@ -146,7 +146,7 @@ If vendor is omitted, `dstack` infers it from the model or defaults to `nvidia`.
gpu: v2-8
```

-Currently, only 8 TPU cores can be specified, supporting single TPU device workloads. Multi-TPU support is coming soon.
+Currently, only 8 TPU cores can be specified, supporting single TPU device workloads. Multi-TPU support is coming soon. -->

??? info "Shared memory"
If you are using parallel communicating processes (e.g., dataloaders in PyTorch), you may need to configure
@@ -159,8 +159,8 @@ If vendor is omitted, `dstack` infers it from the model or defaults to `nvidia`.

#### Default image

-If you don't specify `image`, `dstack` uses its base Docker image pre-configured with
-`uv`, `python`, `pip`, essential CUDA drivers, and NCCL tests (under `/opt/nccl-tests/build`).
+If you don't specify `image`, `dstack` uses its [base :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/tree/master/docker/base){:target="_blank"} Docker image pre-configured with
+`uv`, `python`, `pip`, essential CUDA drivers, `mpirun`, and NCCL tests (under `/opt/nccl-tests/build`).

Set the `python` property to pre-install a specific version of Python.
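
As an illustration of the "Default image" text in this hunk, a dev environment that omits `image` and pins Python might look like the sketch below. The `name`, the Python version, the `ide` value, and the GPU spec are placeholders chosen for the example, not values taken from this PR.

```yaml
type: dev-environment
# Hypothetical name, chosen for illustration
name: vscode-default-image

# No `image` is set, so `dstack` falls back to its base Docker image.
# Assumed version; quoted so YAML does not parse it as a float.
python: "3.12"
ide: vscode

resources:
  # Vendor omitted; per the docs above, it is inferred from the model
  # or defaults to `nvidia`
  gpu: 24GB
```
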
docs/docs/concepts/services.md
+2 -2 (2 additions & 2 deletions)
@@ -433,8 +433,8 @@ If vendor is omitted, `dstack` infers it from the model or defaults to `nvidia`.

#### Default image

-If you don't specify `image`, `dstack` uses its base Docker image pre-configured with
-`uv`, `python`, `pip`, essential CUDA drivers, and NCCL tests (under `/opt/nccl-tests/build`).
+If you don't specify `image`, `dstack` uses its [base :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/tree/master/docker/base){:target="_blank"} Docker image pre-configured with
+`uv`, `python`, `pip`, essential CUDA drivers, `mpirun`, and NCCL tests (under `/opt/nccl-tests/build`).

Set the `python` property to pre-install a specific version of Python.
docs/docs/concepts/tasks.md
+2 -4 (2 additions & 4 deletions)
@@ -229,8 +229,6 @@ If vendor is omitted, `dstack` infers it from the model or defaults to `nvidia`.
<!-- ??? info "Google Cloud TPU"
To use TPUs, specify its architecture via the `gpu` property.

-<!-- TODO: Add a TRL TPU example -->
-
```yaml
type: task
name: train
@@ -259,8 +257,8 @@ If vendor is omitted, `dstack` infers it from the model or defaults to `nvidia`.

#### Default image

-If you don't specify `image`, `dstack` uses its base Docker image pre-configured with
-`uv`, `python`, `pip`, essential CUDA drivers, and NCCL tests (under `/opt/nccl-tests/build`).
+If you don't specify `image`, `dstack` uses its [base :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/tree/master/docker/base){:target="_blank"} Docker image pre-configured with
+`uv`, `python`, `pip`, essential CUDA drivers, `mpirun`, and NCCL tests (under `/opt/nccl-tests/build`).

Set the `python` property to pre-install a specific version of Python.
examples/clusters/nccl-tests/README.md
+8 -7 (8 additions & 7 deletions)
@@ -33,21 +33,22 @@ commands:
sleep infinity
fi

+# Uncomment if the `kubernetes` backend requires it for `/dev/infiniband` access
+#privileged: true
+
resources:
  gpu: nvidia:1..8
  shm_size: 16GB
```

</div>

-<!-- TODO: Need to stop using our EFA image - either make our default image cluster-friendly, or recommend using NGC or other images -->
+!!! info "Default image"
+If you don't specify `image`, `dstack` uses its [base :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/tree/master/docker/base){:target="_blank"} Docker image pre-configured with
+`uv`, `python`, `pip`, essential CUDA drivers, `mpirun`, and NCCL tests (under `/opt/nccl-tests/build`).

-!!! info "Docker image"
-The `dstackai/efa` image used in the example comes with MPI and NCCL tests pre-installed. While it is optimized for
-[AWS EFA :material-arrow-top-right-thin:{ .external }](https://aws.amazon.com/hpc/efa/){:target="_blank"}, it can also
-be used with regular TCP/IP network adapters and InfiniBand.
-
-See the [source code :material-arrow-top-right-thin:{ .external }](https://github.com/dstackai/dstack/blob/master/docker/efa) for the image.
+!!! info "Privileged"
+In some cases, the backend (e.g., `kubernetes`) may require `privileged: true` to access the high-speed interconnect (e.g., InfiniBand).
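
For reference, the "Privileged" note could translate into a single-node task like the following sketch. It assumes the default image is used (no `image`), that the NCCL tests directory mentioned above contains an `all_reduce_perf` binary, and that the backend actually needs privileged mode. The `name` and the test flags are illustrative, not taken from this PR.

```yaml
type: task
# Hypothetical name
name: nccl-test-privileged

# Only needed when the backend (e.g., `kubernetes`) requires it
# to reach the high-speed interconnect
privileged: true

commands:
  # Assumed binary name under the path given in the docs;
  # -g 1 runs the test on a single GPU
  - /opt/nccl-tests/build/all_reduce_perf -b 8 -e 128M -f 2 -g 1

resources:
  gpu: nvidia:1..8
  shm_size: 16GB
```

If the backend reaches the interconnect without elevated permissions, leave `privileged` out, as the commented-out line in the README hunk does.
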