[Task] Default transform metrics like throughput, latency & watermark progress for Beam Samza Runner #26456

@Sanil15

Description


What needs to happen?

Summary

  • Added support for emitting throughput, watermark progress, and latency metrics
  • A MetricOp is attached to each input and output
  • Throughput metrics are emitted for both input and output
  • Watermark progress metrics are emitted while processing the output watermark
  • Latency metrics are emitted by collecting the average arrival time at input and output per watermark for each transform, using the formula: Latency = Avg Arrival Time of Elements at Output - Avg Arrival Time of Elements at Input
  • For GroupByKey translator-based operators, we aggregate the average arrival time per window ID and emit the latency, using the above formula, only for elements under the same window
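The latency formula above can be illustrated with a minimal Java sketch. This is not the Samza runner's actual MetricOp code: the class `TransformLatencySketch`, its methods, and the use of a string window ID as the aggregation key are all hypothetical names chosen for illustration. It assumes arrival times are recorded in milliseconds and that latency is computed once per window (or once per watermark, using a single fixed key for non-GroupByKey transforms).

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of per-window latency aggregation; not the runner's real class. */
final class TransformLatencySketch {

  /** Running mean of arrival timestamps; avoids overflow from summing epoch millis. */
  static final class ArrivalStats {
    private double mean;
    private long count;

    void add(long arrivalMillis) {
      count++;
      mean += (arrivalMillis - mean) / count;
    }

    double average() {
      return mean;
    }
  }

  // Arrival statistics keyed by window ID (an assumed key type for this sketch).
  private final Map<String, ArrivalStats> inputByWindow = new HashMap<>();
  private final Map<String, ArrivalStats> outputByWindow = new HashMap<>();

  /** Records the arrival time of one element at the transform's input. */
  void recordInput(String windowId, long arrivalMillis) {
    inputByWindow.computeIfAbsent(windowId, k -> new ArrivalStats()).add(arrivalMillis);
  }

  /** Records the arrival time of one element at the transform's output. */
  void recordOutput(String windowId, long arrivalMillis) {
    outputByWindow.computeIfAbsent(windowId, k -> new ArrivalStats()).add(arrivalMillis);
  }

  /**
   * Computes Latency = avg output arrival time - avg input arrival time for one window,
   * then clears that window's state. Returns NaN if either side saw no elements.
   */
  double emitLatency(String windowId) {
    ArrivalStats in = inputByWindow.remove(windowId);
    ArrivalStats out = outputByWindow.remove(windowId);
    if (in == null || out == null) {
      return Double.NaN;
    }
    return out.average() - in.average();
  }

  public static void main(String[] args) {
    TransformLatencySketch sketch = new TransformLatencySketch();
    sketch.recordInput("w1", 1000L);
    sketch.recordInput("w1", 1200L);
    sketch.recordOutput("w1", 1500L);
    sketch.recordOutput("w1", 1700L);
    // Average input arrival is 1100 ms, average output arrival is 1600 ms.
    System.out.println(sketch.emitLatency("w1")); // prints 500.0
  }
}
```

Keying the state by window ID keeps elements from different windows from being mixed into one average, which is the constraint the GroupByKey bullet describes. A transform without windowed aggregation could use one constant key and call `emitLatency` each time a watermark is processed.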

Issue Priority

Priority: 3 (nice-to-have improvement)

Issue Components

  • Component: Python SDK
  • Component: Java SDK
  • Component: Go SDK
  • Component: Typescript SDK
  • Component: IO connector
  • Component: Beam examples
  • Component: Beam playground
  • Component: Beam katas
  • Component: Website
  • Component: Spark Runner
  • Component: Flink Runner
  • Component: Samza Runner
  • Component: Twister2 Runner
  • Component: Hazelcast Jet Runner
  • Component: Google Cloud Dataflow Runner
