You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix(docker-stats,podman-stats): restore per-container CPU and memory perfdata (fix#1104)
v2026041002 replaced the per-container cpu_usage / mem_usage perfdata
with aggregates ('containers_running', 'cpu' for docker; plus
'block_input', 'block_output', 'images', 'net_rx', 'net_tx', 'ram'
for podman) on the rationale that container names come and go and
bloat the time-series backend. That broke the long-term trending of
individual workloads, which is the primary use case for these checks.
Re-emit <container>_cpu_usage and <container>_mem_usage per running
container alongside the aggregates. Names are still shortened via
shorten() unless --full-name is passed, matching v2025022501
semantics.
Extend the unit-test assertions to pin the per-container perfdata
labels so this can't regress silently again.
Bump __version__ to 2026051201.
Copy file name to clipboardExpand all lines: CHANGELOG.md
+5Lines changed: 5 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -85,6 +85,11 @@ Grafana:
85
85
*`schemaVersion` fixed to `42`; Grafana 12 was failing to import the date-encoded value
86
86
87
87
88
+
Monitoring Plugins:
89
+
90
+
* docker-stats, podman-stats: per-container CPU and memory perfdata restored. The previous release reported only aggregate values, breaking long-term trending of individual containers ([#1104](https://github.com/Linuxfabrik/monitoring-plugins/issues/1104))
Copy file name to clipboardExpand all lines: check-plugins/docker-stats/README.md
+4Lines changed: 4 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -108,8 +108,12 @@ myconti_ds_1 ! 0.0 ! 11.42
108
108
109
109
## Perfdata / Metrics
110
110
111
+
The plugin emits one CPU and one memory metric per container so individual workloads can be plotted long-term. Because container names appear and disappear as workloads come and go, the time-series backend (Graphite, InfluxDB, ...) will keep stale entries until they are pruned.
112
+
111
113
| Name | Type | Description |
112
114
|----|----|----|
115
+
|`<container>_cpu_usage`| Percentage | Per-container CPU usage, normalized by host CPU count. |
116
+
|`<container>_mem_usage`| Percentage | Per-container memory usage, relative to the container memory limit or host memory. |
113
117
| containers_running | Number | Number of running containers. |
Copy file name to clipboardExpand all lines: check-plugins/podman-stats/README.md
+12-8Lines changed: 12 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -109,16 +109,20 @@ myconti_ds_1 ! 0.0 ! 11.42
109
109
110
110
## Perfdata / Metrics
111
111
112
+
The plugin emits one CPU and one memory metric per container so individual workloads can be plotted long-term. Because container names appear and disappear as workloads come and go, the time-series backend (Graphite, InfluxDB, ...) will keep stale entries until they are pruned.
113
+
112
114
| Name | Type | Description |
113
115
|----|----|----|
114
-
| block_input | Bytes | Total data read from block device across all containers. |
115
-
| block_output | Bytes | Total data written to block device across all containers. |
116
-
| containers_running | Number | Number of running containers. |
117
-
| cpu | Number | Number of host CPUs. |
118
-
| images | Number | Number of images. |
119
-
| net_rx | Bytes | Total network bytes received across all containers. |
120
-
| net_tx | Bytes | Total network bytes transmitted across all containers. |
121
-
| ram | Bytes | Total host memory. |
116
+
|`<container>_cpu_usage`| Percentage | Per-container CPU usage, normalized by host CPU count. |
117
+
|`<container>_mem_usage`| Percentage | Per-container memory usage, relative to the container memory limit or host memory. |
118
+
| block_input | Bytes | Total data read from block device across all containers. |
119
+
| block_output | Bytes | Total data written to block device across all containers. |
120
+
| containers_running | Number | Number of running containers. |
121
+
| cpu | Number | Number of host CPUs. |
122
+
| images | Number | Number of images. |
123
+
| net_rx | Bytes | Total network bytes received across all containers. |
124
+
| net_tx | Bytes | Total network bytes transmitted across all containers. |
0 commit comments