|
945 | 945 | </span> |
946 | 946 | </a> |
947 | 947 |
|
| 948 | +</li> |
| 949 | + |
| 950 | + <li class="md-nav__item"> |
| 951 | + <a href="#cloud-blocks" class="md-nav__link"> |
| 952 | + <span class="md-ellipsis"> |
| 953 | + |
| 954 | + <span class="md-typeset"> |
| 955 | + Blocks |
| 956 | + </span> |
| 957 | + |
| 958 | + </span> |
| 959 | + </a> |
| 960 | + |
948 | 961 | </li> |
949 | 962 |
|
950 | 963 | <li class="md-nav__item"> |
|
1067 | 1080 | </ul> |
1068 | 1081 | </nav> |
1069 | 1082 |
|
| 1083 | +</li> |
| 1084 | + |
| 1085 | + <li class="md-nav__item"> |
| 1086 | + <a href="#ssh-blocks" class="md-nav__link"> |
| 1087 | + <span class="md-ellipsis"> |
| 1088 | + |
| 1089 | + <span class="md-typeset"> |
| 1090 | + Blocks |
| 1091 | + </span> |
| 1092 | + |
| 1093 | + </span> |
| 1094 | + </a> |
| 1095 | + |
1070 | 1096 | </li> |
1071 | 1097 |
|
1072 | 1098 | <li class="md-nav__item"> |
|
3710 | 3736 | </span> |
3711 | 3737 | </a> |
3712 | 3738 |
|
| 3739 | +</li> |
| 3740 | + |
| 3741 | + <li class="md-nav__item"> |
| 3742 | + <a href="#cloud-blocks" class="md-nav__link"> |
| 3743 | + <span class="md-ellipsis"> |
| 3744 | + |
| 3745 | + <span class="md-typeset"> |
| 3746 | + Blocks |
| 3747 | + </span> |
| 3748 | + |
| 3749 | + </span> |
| 3750 | + </a> |
| 3751 | + |
3713 | 3752 | </li> |
3714 | 3753 |
|
3715 | 3754 | <li class="md-nav__item"> |
|
3832 | 3871 | </ul> |
3833 | 3872 | </nav> |
3834 | 3873 |
|
| 3874 | +</li> |
| 3875 | + |
| 3876 | + <li class="md-nav__item"> |
| 3877 | + <a href="#ssh-blocks" class="md-nav__link"> |
| 3878 | + <span class="md-ellipsis"> |
| 3879 | + |
| 3880 | + <span class="md-typeset"> |
| 3881 | + Blocks |
| 3882 | + </span> |
| 3883 | + |
| 3884 | + </span> |
| 3885 | + </a> |
| 3886 | + |
3835 | 3887 | </li> |
3836 | 3888 |
|
3837 | 3889 | <li class="md-nav__item"> |
@@ -4058,6 +4110,24 @@ <h4 id="resources">Resources<a class="headerlink" href="#resources" title="Perma |
4058 | 4110 | </code></pre></div> |
4059 | 4111 | <p>Currently, only 8 TPU cores can be specified, supporting single TPU device workloads. Multi-TPU support is coming soon.</p> |
4060 | 4112 | </details> |
| 4113 | +<h4 id="cloud-blocks">Blocks<a class="headerlink" href="#cloud-blocks" title="Permanent link">¶</a></h4> |
| 4114 | +<p>For cloud fleets, <code>blocks</code> function the same way as in SSH fleets. |
| 4115 | +See the <a href="#ssh-blocks"><code>Blocks</code></a> section under SSH fleets for details on the blocks concept.</p> |
| 4116 | +<div editor-title=".dstack.yml"> |
| 4117 | + |
| 4118 | +<div class="highlight"><pre><span></span><code><span class="nt">type</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">fleet</span> |
| 4119 | + |
| 4120 | +<span class="nt">name</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">my-fleet</span> |
| 4121 | + |
| 4122 | +<span class="nt">resources</span><span class="p">:</span> |
| 4123 | +<span class="w"> </span><span class="nt">gpu</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">NVIDIA:80GB:8</span> |
| 4124 | + |
| 4125 | +<span class="c1"># Split into 4 blocks, each with 2 GPUs</span> |
| 4126 | +<span class="nt">blocks</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">4</span> |
| 4127 | +</code></pre></div> |
| 4128 | + |
| 4129 | +</div> |
| 4130 | + |
4061 | 4131 | <h4 id="idle-duration">Idle duration<a class="headerlink" href="#idle-duration" title="Permanent link">¶</a></h4> |
4062 | 4132 | <p>By default, fleet instances stay <code>idle</code> for 3 days and can be reused within that time. |
4063 | 4133 | If the fleet is not reused within this period, it is automatically terminated.</p> |
@@ -4192,6 +4262,38 @@ <h5 id="network">Network<a class="headerlink" href="#network" title="Permanent l |
4192 | 4262 | <p>By default, <code>dstack</code> automatically detects the network shared by the hosts. |
4193 | 4263 | However, it's possible to configure it explicitly via |
4194 | 4264 | the <a href="../../reference/dstack.yml/fleet/#network"><code>network</code></a> property.</p> |
| 4265 | +<h4 id="ssh-blocks">Blocks<a class="headerlink" href="#ssh-blocks" title="Permanent link">¶</a></h4> |
| 4266 | +<p>By default, a single job occupies the entire instance, so if the instance has 8 GPUs, the job will use all of them.</p> |
| 4267 | +<p>To make it more efficient, you can set the blocks property to specify how many blocks you’d like the instance to be |
| 4268 | +divided into, allowing multiple jobs to use these blocks concurrently.</p> |
| 4269 | +<div editor-title=".dstack.yml"> |
| 4270 | + |
| 4271 | + <div class="highlight"><pre><span></span><code><span class="nt">type</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">fleet</span> |
| 4272 | +<span class="nt">name</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">my-fleet</span> |
| 4273 | + |
| 4274 | +<span class="nt">ssh_config</span><span class="p">:</span> |
| 4275 | +<span class="w"> </span><span class="nt">user</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">ubuntu</span> |
| 4276 | +<span class="w"> </span><span class="nt">identity_file</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">~/.ssh/id_rsa</span> |
| 4277 | +<span class="w"> </span><span class="nt">hosts</span><span class="p">:</span> |
| 4278 | +<span class="w"> </span><span class="p p-Indicator">-</span><span class="w"> </span><span class="nt">hostname</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">3.255.177.51</span> |
| 4279 | +<span class="w"> </span><span class="nt">blocks</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">4</span> |
| 4280 | +<span class="w"> </span><span class="p p-Indicator">-</span><span class="w"> </span><span class="nt">hostaname</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">3.255.177.52</span> |
| 4281 | +<span class="w"> </span><span class="c1"># As many as possible, according to numbers of GPUs and CPUs</span> |
| 4282 | +<span class="w"> </span><span class="nt">blocks</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">auto</span> |
| 4283 | +<span class="w"> </span><span class="p p-Indicator">-</span><span class="w"> </span><span class="nt">hostaname</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">3.255.177.53</span> |
| 4284 | +<span class="w"> </span><span class="c1"># Do not sclice. This is the default value, may be omitted</span> |
| 4285 | +<span class="w"> </span><span class="nt">blocks</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">1</span> |
| 4286 | +</code></pre></div> |
| 4287 | + |
| 4288 | +</div> |
| 4289 | + |
| 4290 | +<p>For instance, with 8 GPUs, 128 CPUs, and 2TB of memory, setting blocks to 8 would assign 1 GPU, 16 CPUs, and 256 GB of |
| 4291 | +memory to each block. These blocks can be used concurrently, and a single job can occupy multiple blocks if needed.</p> |
| 4292 | +<blockquote> |
| 4293 | +<p>GPUs and CPUs must be divisible by the number of blocks. All resources (GPU, CPU, memory) are split proportionally, |
| 4294 | +except disk storage, which is shared.</p> |
| 4295 | +</blockquote> |
| 4296 | +<p>You can also set <code>blocks</code> to <code>auto</code>, which automatically sets the number of blocks to match the number of GPUs.</p> |
4195 | 4297 | <h4 id="environment-variables">Environment variables<a class="headerlink" href="#environment-variables" title="Permanent link">¶</a></h4> |
4196 | 4298 | <p>If needed, you can specify environment variables that will be used by <code>dstack-shim</code> and passed to containers.</p> |
4197 | 4299 | <p>For example, these variables can be used to configure a proxy:</p> |
|
0 commit comments