Commit 1a795a4

Update docs
1 parent 0d53c15 commit 1a795a4

2 files changed

Lines changed: 19 additions & 2 deletions

docs/docs/guides/metrics.md

Lines changed: 19 additions & 1 deletion
@@ -2,7 +2,7 @@
 
 ## Prometheus
 
-To collect and export fleet and run metrics to Prometheus, enable the
+To collect and export fleet and run as well as server health metrics to Prometheus, enable the
 `DSTACK_ENABLE_PROMETHEUS_METRICS` environment variable and configure Prometheus to fetch metrics from
 `<dstack server URL>/metrics`.
 
@@ -140,3 +140,21 @@ telemetry, and more.
 | `dstack_run_type` | *string* | Run configuration type | `task`, `dev-environment` |
 | `dstack_backend` | *string* | Backend | `aws`, `runpod` |
 | `dstack_gpu` | *string?* | GPU name | `H100` |
+
+### Server health metrics
+
+These are operational metrics to monitor the health of the dstack server. For now, these only include HTTP metrics, but more will be added later.
+
+=== "Metrics"
+| Name | Type | Description | Examples |
+|------------------------------------------|-----------|-----------------------------------|--------------|
+| `dstack_server_requests_total` | *counter* | Total number of HTTP requests | `100.0` |
+| `dstack_server_request_duration_seconds` | *histogram* | HTTP request duration in seconds | `1.0`|
+
+=== "Labels"
+| Name | Type | Description | Examples |
+|------------------------|-----------|:--------------|----------------------------------------|
+| `method` | *string?* | HTTP method | `POST` |
+| `endpoint` | *string* | Endpoint path | `/api/project/main/repos/get` |
+| `http_status` | *string* | HTTP status code | `200` |
+| `project_name` | *string* | Project name | `main` |
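
Reviewer note: to make the new metrics and labels concrete, here is a minimal Python sketch of how a consumer might read them from the `/metrics` endpoint's text output. It is not part of this commit. The sample payload and its values are made up, the helper names (`parse_samples`, `requests_by_status`, `mean_duration_seconds`) are hypothetical, and it assumes that every series carries all four documented labels and that the histogram is exposed with the usual Prometheus `_sum` and `_count` series. The parser is deliberately simplified and skips lines it does not understand, such as samples with a trailing timestamp.

```python
import re
from collections import defaultdict

# Hypothetical sample of what `<dstack server URL>/metrics` might return.
# Metric and label names come from the docs table; the values are invented.
SAMPLE = '''\
# HELP dstack_server_requests_total Total number of HTTP requests
# TYPE dstack_server_requests_total counter
dstack_server_requests_total{method="POST",endpoint="/api/project/main/repos/get",http_status="200",project_name="main"} 100.0
dstack_server_requests_total{method="POST",endpoint="/api/project/main/repos/get",http_status="500",project_name="main"} 4.0
dstack_server_request_duration_seconds_sum{method="POST",endpoint="/api/project/main/repos/get",http_status="200",project_name="main"} 25.0
dstack_server_request_duration_seconds_count{method="POST",endpoint="/api/project/main/repos/get",http_status="200",project_name="main"} 100.0
'''

# One sample line: metric name, optional {labels}, then a numeric value.
SAMPLE_LINE = re.compile(
    r'^(?P<name>[a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(?P<labels>.*)\})?\s+(?P<value>\S+)$'
)
# One label pair inside the braces: key="value" (escaped characters allowed).
LABEL = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')


def parse_samples(text):
    """Return a list of (metric_name, labels_dict, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        match = SAMPLE_LINE.match(line)
        if match is None:
            continue  # simplified parser: ignore anything unexpected
        labels = dict(LABEL.findall(match.group("labels") or ""))
        samples.append((match.group("name"), labels, float(match.group("value"))))
    return samples


def requests_by_status(samples):
    """Sum dstack_server_requests_total per http_status label."""
    totals = defaultdict(float)
    for name, labels, value in samples:
        if name == "dstack_server_requests_total":
            totals[labels.get("http_status", "unknown")] += value
    return dict(totals)


def mean_duration_seconds(samples):
    """Mean request duration from the histogram's _sum and _count series."""
    total = sum(v for n, _, v in samples
                if n == "dstack_server_request_duration_seconds_sum")
    count = sum(v for n, _, v in samples
                if n == "dstack_server_request_duration_seconds_count")
    return total / count if count else None


if __name__ == "__main__":
    parsed = parse_samples(SAMPLE)
    print(requests_by_status(parsed))
    print(mean_duration_seconds(parsed))
```

In a real deployment these aggregations would more likely be done with PromQL on the Prometheus side; the sketch only illustrates how the counter, the histogram series, and the labels relate to one another.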

pyproject.toml

Lines changed: 0 additions & 1 deletion
@@ -35,7 +35,6 @@ dependencies = [
 "gpuhunt==0.1.6",
 "argcomplete>=3.5.0",
 "ignore-python>=0.2.0",
-"prometheus-fastapi-instrumentator>=7.1.0",
 ]
 
 [project.urls]
