localstack
diff --git a/‎.gitignore‎
Lines changed: 1 addition & 0 deletions b/‎.gitignore‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎prometheus/README.md‎
Lines changed: 8 additions & 7 deletions b/‎prometheus/README.md‎
Lines changed: 8 additions & 7 deletions
diff --git a/‎prometheus/docs/event_analysis.md‎
Lines changed: 104 additions & 0 deletions b/‎prometheus/docs/event_analysis.md‎
Lines changed: 104 additions & 0 deletions
diff --git a/‎prometheus/docs/images/avg_propagation_delay.png‎
79.9 KB b/‎prometheus/docs/images/avg_propagation_delay.png‎
79.9 KB
diff --git a/‎prometheus/docs/images/batch_efficiency_ratio.png‎
79.5 KB b/‎prometheus/docs/images/batch_efficiency_ratio.png‎
79.5 KB
diff --git a/‎prometheus/docs/images/empty_poll_responses.png‎
95.1 KB b/‎prometheus/docs/images/empty_poll_responses.png‎
95.1 KB
diff --git a/‎prometheus/docs/images/event_processing_duration.png‎
80.4 KB b/‎prometheus/docs/images/event_processing_duration.png‎
80.4 KB
diff --git a/‎prometheus/docs/images/high_latency_event_processing.png‎
114 KB b/‎prometheus/docs/images/high_latency_event_processing.png‎
114 KB
diff --git a/‎prometheus/docs/images/in_flight_events.png‎
91.4 KB b/‎prometheus/docs/images/in_flight_events.png‎
91.4 KB
diff --git a/‎prometheus/docs/images/in_flight_requests.png‎
140 KB b/‎prometheus/docs/images/in_flight_requests.png‎
140 KB
@@ -133,3 +133,4 @@ dmypy.json
 .vscode
 
 node_modules/
+.DS_Store
@@ -115,15 +115,16 @@ services:
       - "./prometheus_config.yml:/etc/prometheus/prometheus.yml" # Assumes prometheus_config.yml exists in your CWD
 ```
 
-## Available Metrics
+## Metrics
 
-The Prometheus extension exposes various LocalStack metrics through the `/_extension/metrics` endpoint, including:
-- Request counts by service
-- Request latencies
-- Resource utilization
-- Error rates
+The Prometheus extension exposes various LocalStack and system metrics through the `/_extension/metrics` endpoint.
 
-For a complete list of available metrics, visit the endpoint directly at `localhost.localstack.cloud:4566/_extension/metrics` when LocalStack is running.
+For a complete list of available metrics, view the:
+- [LocalStack Metrics documentation](./docs/localstack_metrics.md) 
+- [System Metrics documentation](./docs/system_metrics.md) 
+- Otherwise, visit the endpoint directly at `localhost.localstack.cloud:4566/_extension/metrics` when LocalStack is running.
+
+We've also included a [collection of PromQL queries](./docs/event_analysis.md) that are useful for analyzing LocalStack event source mappings performance.
 
 ## Licensing
 
 
@@ -0,0 +1,104 @@
+# PromQL Queries for Event Processing Statistics
+
+The following queries can be used to analyse performance of LocalStack's event processing capabilties.
+
+## Average Propagation Delay from Event Source to Poller
+
+The average amount of time a record has to wait before being processed during the last 5 minutes. A high propagation delay indicates that our event pollers are taking too long to ingest new events from an event source.
+
+```
+rate(localstack_event_propagation_delay_seconds_sum[5m]) / rate(localstack_event_propagation_delay_seconds_count[5m])
+```
+
+**Example**:
+![Average Propagation Delay](images/avg_propagation_delay.png)
+
+## Batch Efficiency
+
+A ratio showing how efficiently are our pollers retrieving records from an event source relative to how large their maximum batch size is. A higher number indicates that batch sizes could be increased.
+
+```
+rate(localstack_batch_size_efficiency_ratio_sum[1m]) / rate(localstack_batch_size_efficiency_ratio_count[1m])
+```
+
+Example:
+![Batch Efficiency Ratio](images/batch_efficiency_ratio.png)
+
+## Records Per Poll
+
+The average number of records being pulled in by an event poller per minute. When used in conjunction with batch efficiency, you can interpret the performance of your batching configuration.
+
+```
+rate(localstack_records_per_poll_sum[1m]) / rate(localstack_records_per_poll_count[1m])
+```
+
+Example:
+
+![Records Per Poll](images/records_per_poll.png)
+
+## In-Flight Events
+
+Gauges how many events are currently being processed by a target at a given point in time. If event processing is taking long, this is a good way of measuring back-pressure on the system.
+
+```
+localstack_in_flight_events
+```
+
+Example:
+![In-Flight Events](images/in_flight_events.png)
+
+## Event Processing Duration
+
+The average duration per minute that targets are processing events for.
+
+```
+rate(localstack_process_event_duration_seconds_sum[1m]) / rate(localstack_process_event_duration_seconds_count[1m])
+```
+
+Example:
+
+![Event Processing Duration](images/event_processing_duration.png)
+
+## High Latency Event Processing
+
+Retrieve the 95th percentile of processing times in a 5m interval grouped by LocalStack service and operation. Useful for analysing the tail-latency of event processing since this is likely where bottlenecks in performance start to show.
+
+```
+histogram_quantile(0.95, sum by(service, operation, le) (rate(localstack_request_processing_duration_seconds_bucket[5m])))
+```
+
+Example:
+![High Latency Event Processing](images/high_latency_event_processing.png)
+
+## Empty Poll Responses
+
+The approximate number of empty poll requests in a 5 minute interval.
+
+```
+rate(localstack_poll_miss_total[5m]) * 60
+```
+
+Example:
+![Empty Poll Responses](images/empty_poll_responses.png)
+
+## Number of LocalStack requests Processed
+
+The average number of request processed by the LocalStack gateway per minute. This is grouped by service type (i.e SQS) and operation type (i.e ReceiveMessage)
+
+```
+sum by(service, operation) (rate(localstack_request_processing_duration_seconds_count[1m]) * 60)
+```
+
+Example:
+![Requests Processed](images/requests_processed.png)
+
+## In-Flight Requests Against LocalStack Gateway
+
+Measures how many requests the Kinesis, SQS, DynamoDB, and Lambda services are currently processing in a given minute interval. Useful for seeing how hard a given service is currently being hit and the operation type.
+
+```
+sum_over_time(localstack_in_flight_requests{service=~"dynamodb|kinesis|sqs|lambda"}[1m])
+```
+
+Example:
+![In-Flight Requests](images/in_flight_requests.png)
Original file line number	Diff line number	Diff line change
`@@ -133,3 +133,4 @@ dmypy.json`
`133`	`133`	`.vscode`
`134`	`134`
`135`	`135`	`node_modules/`
	`136`	`+.DS_Store`