|
| 1 | +Prometheus |
| 2 | +========== |
| 3 | + |
| 4 | +By enabling this toolset, HolmesGPT will be able to generate graphs from prometheus metrics as well as help you write and |
| 5 | +validate prometheus queries. |
| 6 | + |
| 7 | +There is also an option for Holmes to analyze prometheus metrics. When enabled, HolmesGPT can detect memory leak patterns, |
| 8 | +CPU throttling, high latency for your APIs, etc. The configuration field to enable prometheus metrics analysis is |
| 9 | +``tool_calls_return_data``. |
| 10 | + |
| 11 | +Configuration |
| 12 | +------------- |
| 13 | + |
| 14 | +.. md-tab-set:: |
| 15 | + |
| 16 | + .. md-tab-item:: Robusta Helm Chat |
| 17 | + |
| 18 | + .. code-block:: yaml |
| 19 | +
|
| 20 | + holmes: |
| 21 | + toolsets: |
| 22 | + prometheus/metrics: |
| 23 | + enabled: true |
| 24 | + config: |
| 25 | + prometheus_url: ... |
| 26 | + metrics_labels_time_window_hrs: 48 # default value |
| 27 | + metrics_labels_cache_duration_hrs: 12 # default value |
| 28 | + fetch_labels_with_labels_api: false # default value |
| 29 | + fetch_metadata_with_series_api: false # default value |
| 30 | + tool_calls_return_data: false # default value |
| 31 | + headers: |
| 32 | + Authorization: "Basic <base_64_encoded_string>" |
| 33 | +
|
| 34 | +
|
| 35 | + .. include:: ./_toolset_configuration.inc.rst |
| 36 | + |
| 37 | + .. md-tab-item:: Holmes CLI |
| 38 | + |
| 39 | + Add the following to **~/.holmes/config.yaml**, creating the file if it doesn't exist: |
| 40 | + |
| 41 | + .. code-block:: yaml |
| 42 | +
|
| 43 | + toolsets: |
| 44 | + prometheus/metrics: |
| 45 | + enabled: true |
| 46 | + config: |
| 47 | + prometheus_url: ... |
| 48 | + metrics_labels_time_window_hrs: 48 # default value |
| 49 | + metrics_labels_cache_duration_hrs: 12 # default value |
| 50 | + fetch_labels_with_labels_api: false # default value |
| 51 | + fetch_metadata_with_series_api: false # default value |
| 52 | + tool_calls_return_data: false # default value |
| 53 | + headers: |
| 54 | + Authorization: "Basic <base_64_encoded_string>" |
| 55 | +
|
| 56 | +It is also possible to set the ``PROMETHEUS_URL`` environment variable instead of the above ``prometheus_url`` config key. |
| 57 | + |
| 58 | +Prior to generating a PromQL query, HolmesQPT tends to list the available metrics. This is done to ensure the metrics used |
| 59 | +in PromQL are actually available. |
| 60 | + |
| 61 | +Below is the full list of options for this toolset: |
| 62 | + |
| 63 | +- **metrics_labels_time_window_hrs** Represents the time window, in hours, over which labels are fetched. This avoids fetching obsolete labels. Set it to ``null`` to let HolmesGPT fetch labels regardless of when they were generated. |
| 64 | +- **metrics_labels_cache_duration_hrs** How long are labels cached, in hours. Set it to ``null`` to disable caching. |
| 65 | +- **fetch_labels_with_labels_api** Uses prometheus `labels API <https://prometheus.io/docs/prometheus/latest/querying/api/#getting-label-names>`_ to fetch labels instead of the `series API <https://prometheus.io/docs/prometheus/latest/querying/api/#finding-series-by-label-matchers>`_. In some cases setting to True can improve the performance of the toolset, however there will be an increased number of HTTP calls to prometheus. You can experiment with both as they are functionally identical. |
| 66 | +- **fetch_metadata_with_series_api** Uses the `series API <https://prometheus.io/docs/prometheus/latest/querying/api/#finding-series-by-label-matchers>`_ instead of the `metadata API <https://prometheus.io/docs/prometheus/latest/querying/api/#querying-metric-metadata>`_. You should only set this value to `true` if the metadata API is disabled or not working. HolmesGPT's ability to select the right metric will be negatively impacted because the series API does not return key metadata like the metrics/series description or their type (gauge, histogram, etc.). |
| 67 | +- **tool_calls_return_data** Experimental. If true, the prometheus data will be available to HolmesGPT. In some cases, HolmesGPT will be able to detect memory leaks or other anomalies. This is disabled by default to reduce the likelyhood of reaching the input token limit. |
| 68 | +- **headers** Extra headers to pass to all prometheus http requests. Use this to pass authentication. Prometheus `supports basic authentication <https://prometheus.io/docs/guides/basic-auth/>`_. |
| 69 | + |
| 70 | +Capabilities |
| 71 | +------------ |
| 72 | +.. include:: ./_toolset_capabilities.inc.rst |
| 73 | + |
| 74 | +.. list-table:: |
| 75 | + :header-rows: 1 |
| 76 | + :widths: 30 70 |
| 77 | + |
| 78 | + * - Tool Name |
| 79 | + - Description |
| 80 | + * - list_available_metrics |
| 81 | + - List all the available metrics to query from prometheus, including their types (counter, gauge, histogram, summary) and available labels. |
| 82 | + * - execute_prometheus_instant_query |
| 83 | + - Execute an instant PromQL query |
| 84 | + * - execute_prometheus_range_query |
| 85 | + - Execute a PromQL range query |
0 commit comments