0.18.11-v1
0.18.11
The update includes all the features and bug fixes from version 0.18.11.
AMD
With the latest update, you can now specify an AMD GPU under resources. Below is an example.
type: service
name: amd-service-tgi
image: ghcr.io/huggingface/text-generation-inference:sha-a379d55-rocm
env:
- HUGGING_FACE_HUB_TOKEN
- MODEL_ID=meta-llama/Meta-Llama-3.1-70B-Instruct
- TRUST_REMOTE_CODE=true
- ROCM_USE_FLASH_ATTN_V2_TRITON=true
commands:
- text-generation-launcher --port 8000
port: 8000
resources:
gpu: MI300X
disk: 150GB
spot_policy: auto
model:
type: chat
name: meta-llama/Meta-Llama-3.1-70B-Instruct
format: openaiNote
AMD accelerators are currently supported only with the runpod backend. Support for on-prem fleets and more backends
is coming soon.
GPU vendors
The gpu property now accepts the vendor attribute, with supported values: nvidia, tpu, and amd.
Alternatively, you can also prefix the GPU name with the vendor name followed by a colon, for example: tpu:v2-8 or amd:192GB, etc. This change ensures consistency in GPU requirements configuration across vendors.
Encryption
dstack now supports encryption of sensitive data, such as backend credentials, user tokens, etc. Learn more on the reference page.
Storing logs in AWS CloudWatch
By default, the dstack server stores run logs in ~/.dstack/server/projects/<project name>/logs. To store logs in AWS CloudWatch, set the SERVER_CLOUDWATCH_LOG_GROUP environment variable.
Project manager role
With this update, it's now possible to assign any user as a project manager. This role grants permission to manage project users but does not allow management of backends or resources.
Default permissions
By default, all users can create and manage their own projects. If you want only global admins to create projects, add the following to ~/.dstack/server/config.yml:
default_permissions:
allow_non_admins_create_projects: falseOther
- [Bugfix] Provision AWS instances in all eligible availability zones by @r4victor in dstackai/dstack#1585
- [Feature] Support the
vendorproperty underresources.gpu@un-def in dstackai/dstack#1558 - [UI] Fix logs appearance in the dark theme by @olgenn in dstackai/dstack#1579
- [Docs] Document projects #1547 by @peterschmidt85 in dstackai/dstack#1548
- [Internal] Improve gateway auth issues troubleshooting by @jvstme in dstackai/dstack#1569
- [Internal] Force
rootin Kubernetes runs by @jvstme in dstackai/dstack#1555
Full changelog: dstackai/dstack@0.18.10...0.18.11