Skip to content

Commit 0b961ed

Browse files
Deploying to gh-pages from @ dstackai/dstack@96406f1 🚀
1 parent f852501 commit 0b961ed

File tree

3 files changed

+104
-2
lines changed

3 files changed

+104
-2
lines changed

docs/concepts/fleets/index.html

Lines changed: 102 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -945,6 +945,19 @@
945945
</span>
946946
</a>
947947

948+
</li>
949+
950+
<li class="md-nav__item">
951+
<a href="#cloud-blocks" class="md-nav__link">
952+
<span class="md-ellipsis">
953+
954+
<span class="md-typeset">
955+
Blocks
956+
</span>
957+
958+
</span>
959+
</a>
960+
948961
</li>
949962

950963
<li class="md-nav__item">
@@ -1067,6 +1080,19 @@
10671080
</ul>
10681081
</nav>
10691082

1083+
</li>
1084+
1085+
<li class="md-nav__item">
1086+
<a href="#ssh-blocks" class="md-nav__link">
1087+
<span class="md-ellipsis">
1088+
1089+
<span class="md-typeset">
1090+
Blocks
1091+
</span>
1092+
1093+
</span>
1094+
</a>
1095+
10701096
</li>
10711097

10721098
<li class="md-nav__item">
@@ -3710,6 +3736,19 @@
37103736
</span>
37113737
</a>
37123738

3739+
</li>
3740+
3741+
<li class="md-nav__item">
3742+
<a href="#cloud-blocks" class="md-nav__link">
3743+
<span class="md-ellipsis">
3744+
3745+
<span class="md-typeset">
3746+
Blocks
3747+
</span>
3748+
3749+
</span>
3750+
</a>
3751+
37133752
</li>
37143753

37153754
<li class="md-nav__item">
@@ -3832,6 +3871,19 @@
38323871
</ul>
38333872
</nav>
38343873

3874+
</li>
3875+
3876+
<li class="md-nav__item">
3877+
<a href="#ssh-blocks" class="md-nav__link">
3878+
<span class="md-ellipsis">
3879+
3880+
<span class="md-typeset">
3881+
Blocks
3882+
</span>
3883+
3884+
</span>
3885+
</a>
3886+
38353887
</li>
38363888

38373889
<li class="md-nav__item">
@@ -4058,6 +4110,24 @@ <h4 id="resources">Resources<a class="headerlink" href="#resources" title="Perma
40584110
</code></pre></div>
40594111
<p>Currently, only 8 TPU cores can be specified, supporting single TPU device workloads. Multi-TPU support is coming soon.</p>
40604112
</details>
4113+
<h4 id="cloud-blocks">Blocks<a class="headerlink" href="#cloud-blocks" title="Permanent link">&para;</a></h4>
4114+
<p>For cloud fleets, <code>blocks</code> function the same way as in SSH fleets.
4115+
See the <a href="#ssh-blocks"><code>Blocks</code></a> section under SSH fleets for details on the blocks concept.</p>
4116+
<div editor-title=".dstack.yml">
4117+
4118+
<div class="highlight"><pre><span></span><code><span class="nt">type</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">fleet</span>
4119+
4120+
<span class="nt">name</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">my-fleet</span>
4121+
4122+
<span class="nt">resources</span><span class="p">:</span>
4123+
<span class="w"> </span><span class="nt">gpu</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">NVIDIA:80GB:8</span>
4124+
4125+
<span class="c1"># Split into 4 blocks, each with 2 GPUs</span>
4126+
<span class="nt">blocks</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">4</span>
4127+
</code></pre></div>
4128+
4129+
</div>
4130+
40614131
<h4 id="idle-duration">Idle duration<a class="headerlink" href="#idle-duration" title="Permanent link">&para;</a></h4>
40624132
<p>By default, fleet instances stay <code>idle</code> for 3 days and can be reused within that time.
40634133
If the fleet is not reused within this period, it is automatically terminated.</p>
@@ -4192,6 +4262,38 @@ <h5 id="network">Network<a class="headerlink" href="#network" title="Permanent l
41924262
<p>By default, <code>dstack</code> automatically detects the network shared by the hosts.
41934263
However, it's possible to configure it explicitly via
41944264
the <a href="../../reference/dstack.yml/fleet/#network"><code>network</code></a> property.</p>
4265+
<h4 id="ssh-blocks">Blocks<a class="headerlink" href="#ssh-blocks" title="Permanent link">&para;</a></h4>
4266+
<p>By default, a single job occupies the entire instance, so if the instance has 8 GPUs, the job will use all of them.</p>
4267+
<p>To make it more efficient, you can set the blocks property to specify how many blocks you’d like the instance to be
4268+
divided into, allowing multiple jobs to use these blocks concurrently.</p>
4269+
<div editor-title=".dstack.yml">
4270+
4271+
<div class="highlight"><pre><span></span><code><span class="nt">type</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">fleet</span>
4272+
<span class="nt">name</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">my-fleet</span>
4273+
4274+
<span class="nt">ssh_config</span><span class="p">:</span>
4275+
<span class="w"> </span><span class="nt">user</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">ubuntu</span>
4276+
<span class="w"> </span><span class="nt">identity_file</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">~/.ssh/id_rsa</span>
4277+
<span class="w"> </span><span class="nt">hosts</span><span class="p">:</span>
4278+
<span class="w"> </span><span class="p p-Indicator">-</span><span class="w"> </span><span class="nt">hostname</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">3.255.177.51</span>
4279+
<span class="w"> </span><span class="nt">blocks</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">4</span>
4280+
<span class="w"> </span><span class="p p-Indicator">-</span><span class="w"> </span><span class="nt">hostaname</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">3.255.177.52</span>
4281+
<span class="w"> </span><span class="c1"># As many as possible, according to numbers of GPUs and CPUs</span>
4282+
<span class="w"> </span><span class="nt">blocks</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">auto</span>
4283+
<span class="w"> </span><span class="p p-Indicator">-</span><span class="w"> </span><span class="nt">hostaname</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">3.255.177.53</span>
4284+
<span class="w"> </span><span class="c1"># Do not sclice. This is the default value, may be omitted</span>
4285+
<span class="w"> </span><span class="nt">blocks</span><span class="p">:</span><span class="w"> </span><span class="l l-Scalar l-Scalar-Plain">1</span>
4286+
</code></pre></div>
4287+
4288+
</div>
4289+
4290+
<p>For instance, with 8 GPUs, 128 CPUs, and 2TB of memory, setting blocks to 8 would assign 1 GPU, 16 CPUs, and 256 GB of
4291+
memory to each block. These blocks can be used concurrently, and a single job can occupy multiple blocks if needed.</p>
4292+
<blockquote>
4293+
<p>GPUs and CPUs must be divisible by the number of blocks. All resources (GPU, CPU, memory) are split proportionally,
4294+
except disk storage, which is shared.</p>
4295+
</blockquote>
4296+
<p>You can also set <code>blocks</code> to <code>auto</code>, which automatically sets the number of blocks to match the number of GPUs.</p>
41954297
<h4 id="environment-variables">Environment variables<a class="headerlink" href="#environment-variables" title="Permanent link">&para;</a></h4>
41964298
<p>If needed, you can specify environment variables that will be used by <code>dstack-shim</code> and passed to containers.</p>
41974299
<p>For example, these variables can be used to configure a proxy:</p>

docs/reference/api/rest/openapi.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

search/search_index.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)