rpk: Refactor benchmark by StephanDollberg · Pull Request #30007 · redpanda-data/redpanda

StephanDollberg · 2026-03-31T09:50:08Z

Non-functional refactor of rpk benchmark to prepare for introduction of other subcommands like consume.

Plus two commits on top that add more freedom in topic selection (reset and preexisting).

Backports Required

Release Notes

none

Copilot

Pull request overview

This PR refactors the hidden rpk benchmark CLI into a subcommand-based structure (starting with produce) and updates the ducktape RpkBenchmarkService wrapper + tests to invoke the new CLI shape. It also introduces new topic-handling flexibility in the CLI via --reset-topic and --use-existing-topic.

Changes:

Refactor rpk benchmark into rpk benchmark produce, moving produce logic into a new produce.go command implementation.
Add topic lifecycle options (--reset-topic, --use-existing-topic) and refactor shared benchmark setup/teardown into reusable helpers.
Update ducktape service wrapper and tests/perf harnesses to pass a benchmark mode (currently "produce").
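
A minimal sketch of the subcommand shape described in the list above, not the PR's actual code. rpk builds its commands with cobra; this sketch uses only Go's standard `flag` package so that it runs without dependencies. The names `parseBenchmark` and `produceConfig`, and the `--topic` flag with its default value, are hypothetical. The `--use-existing-topic` spelling follows the review overview.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
)

// produceConfig holds the flags of the "produce" mode in this sketch.
type produceConfig struct {
	topic            string // hypothetical flag, not taken from the PR
	resetTopic       bool
	useExistingTopic bool
}

// parseBenchmark dispatches on the first argument, which must name a mode.
// Only "produce" exists here; a bare invocation is rejected.
func parseBenchmark(args []string) (string, produceConfig, error) {
	var cfg produceConfig
	if len(args) == 0 {
		return "", cfg, errors.New(`benchmark: a mode such as "produce" is required`)
	}
	switch mode := args[0]; mode {
	case "produce":
		fs := flag.NewFlagSet("benchmark produce", flag.ContinueOnError)
		fs.StringVar(&cfg.topic, "topic", "rpk-benchmark", "topic to produce to (assumed flag)")
		fs.BoolVar(&cfg.resetTopic, "reset-topic", false, "delete and recreate the topic first")
		fs.BoolVar(&cfg.useExistingTopic, "use-existing-topic", false, "run against an existing topic")
		if err := fs.Parse(args[1:]); err != nil {
			return "", cfg, err
		}
		return mode, cfg, nil
	default:
		return "", cfg, fmt.Errorf("benchmark: unknown mode %q", mode)
	}
}

func main() {
	// Fixed demonstration arguments instead of os.Args.
	mode, cfg, err := parseBenchmark([]string{"produce", "--reset-topic"})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Printf("mode=%s topic=%s reset=%t existing=%t\n",
		mode, cfg.topic, cfg.resetTopic, cfg.useExistingTopic)
}
```

Adding a further mode such as consume would then amount to another `case` with its own flag set, which is presumably the motivation for requiring the mode explicitly.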

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 1 comment.

File	Description
tests/rptest/tests/rpk_benchmark_service_test.py	Refactors smoke test helper and updates test to run `produce` mode.
tests/rptest/services/rpk_benchmark_service.py	Updates rpk invocation to `benchmark <mode>` and plumbs `mode` through service.
tests/rptest/perf/rpk_benchmark_test.py	Updates perf test harness to run the benchmark with an explicit mode.
src/go/rpk/pkg/cli/benchmark/benchmark.go	Introduces shared benchmark config/run lifecycle, topic setup options, and hooks in subcommands.
src/go/rpk/pkg/cli/benchmark/produce.go	Adds the `benchmark produce` subcommand implementation.
src/go/rpk/pkg/cli/benchmark/BUILD	Adds `produce.go` to the Bazel target sources.

vbotbuildovich · 2026-03-31T10:43:48Z

Retry command for Build#82536

please wait until all jobs are finished before running the slash command

/ci-repeat 1
skip-redpanda-build
skip-units
skip-rebase
tests/rptest/tests/cluster_features_test.py::FeaturesNodeJoinTest.test_old_node_join
tests/rptest/tests/cluster_features_test.py::FeaturesMultiNodeUpgradeTest.test_upgrade
tests/rptest/tests/license_enforcement_test.py::LicenseEnforcementTest.test_license_enforcement@{"clean_node_after_recovery":false,"clean_node_before_recovery":true}
tests/rptest/tests/license_enforcement_test.py::LicenseEnforcementTest.test_license_enforcement@{"clean_node_after_recovery":true,"clean_node_before_recovery":false}
tests/rptest/tests/cluster_features_test.py::FeaturesMultiNodeUpgradeTest.test_rollback
tests/rptest/tests/cluster_features_test.py::FeaturesSingleNodeUpgradeTest.test_upgrade
tests/rptest/tests/license_enforcement_test.py::LicenseEnforcementTest.test_license_enforcement@{"clean_node_after_recovery":true,"clean_node_before_recovery":true}
tests/rptest/tests/license_enforcement_test.py::LicenseEnforcementTest.test_license_enforcement@{"clean_node_after_recovery":false,"clean_node_before_recovery":false}

vbotbuildovich · 2026-03-31T11:04:52Z

CI test results

test results on build#82536

test_class	test_method	test_arguments	test_kind	job_url	test_status	passed	reason	test_history
FeaturesMultiNodeUpgradeTest	test_rollback	null	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4354-62f5-4bf1-abcd-464a576ada44	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3520, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=FeaturesMultiNodeUpgradeTest&test_method=test_rollback
FeaturesMultiNodeUpgradeTest	test_rollback	null	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4356-5949-4e0a-8156-1486acb7ba62	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3520, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=FeaturesMultiNodeUpgradeTest&test_method=test_rollback
FeaturesMultiNodeUpgradeTest	test_upgrade	null	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4354-62f5-4192-804e-7256b9525b5f	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3520, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=FeaturesMultiNodeUpgradeTest&test_method=test_upgrade
FeaturesMultiNodeUpgradeTest	test_upgrade	null	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4356-5949-4494-908a-d85f6eb69ff0	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3520, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=FeaturesMultiNodeUpgradeTest&test_method=test_upgrade
FeaturesNodeJoinTest	test_old_node_join	null	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4354-62f6-4c2a-9782-0a6c85d6f2d3	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3513, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=FeaturesNodeJoinTest&test_method=test_old_node_join
FeaturesNodeJoinTest	test_old_node_join	null	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4356-594a-49b2-adc6-8b4ccc43ab64	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3513, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=FeaturesNodeJoinTest&test_method=test_old_node_join
FeaturesSingleNodeUpgradeTest	test_upgrade	null	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4354-62f2-4aca-840e-d58f3c1a01c1	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3513, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=FeaturesSingleNodeUpgradeTest&test_method=test_upgrade
FeaturesSingleNodeUpgradeTest	test_upgrade	null	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4356-5945-4fb7-8598-71dc09ec0ec2	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3513, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=FeaturesSingleNodeUpgradeTest&test_method=test_upgrade
LicenseEnforcementTest	test_license_enforcement	{"clean_node_after_recovery": false, "clean_node_before_recovery": false}	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4354-62f6-45eb-8ba8-2cd561944355	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3493, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=LicenseEnforcementTest&test_method=test_license_enforcement
LicenseEnforcementTest	test_license_enforcement	{"clean_node_after_recovery": false, "clean_node_before_recovery": false}	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4356-594e-4b37-a4bf-c260d79d5d01	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3493, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=LicenseEnforcementTest&test_method=test_license_enforcement
LicenseEnforcementTest	test_license_enforcement	{"clean_node_after_recovery": true, "clean_node_before_recovery": false}	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4354-62f7-4fbd-ab09-f92a385fb236	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3493, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=LicenseEnforcementTest&test_method=test_license_enforcement
LicenseEnforcementTest	test_license_enforcement	{"clean_node_after_recovery": true, "clean_node_before_recovery": false}	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4356-594f-4b52-9886-fbb742ece4ec	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3493, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=LicenseEnforcementTest&test_method=test_license_enforcement
LicenseEnforcementTest	test_license_enforcement	{"clean_node_after_recovery": false, "clean_node_before_recovery": true}	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4354-62f2-4baa-b2c7-a73ddf2a78e2	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3493, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=LicenseEnforcementTest&test_method=test_license_enforcement
LicenseEnforcementTest	test_license_enforcement	{"clean_node_after_recovery": false, "clean_node_before_recovery": true}	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4356-5944-4846-95ec-2d450a7ccaae	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3493, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=LicenseEnforcementTest&test_method=test_license_enforcement
LicenseEnforcementTest	test_license_enforcement	{"clean_node_after_recovery": true, "clean_node_before_recovery": true}	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4354-62f2-4aca-840e-d58f3c1a01c1	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3493, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=LicenseEnforcementTest&test_method=test_license_enforcement
LicenseEnforcementTest	test_license_enforcement	{"clean_node_after_recovery": true, "clean_node_before_recovery": true}	integration	https://buildkite.com/redpanda/redpanda/builds/82536#019d4356-5945-4fb7-8598-71dc09ec0ec2	FAIL	0/11	Test FAILS after retries.Significant increase in flaky rate(baseline=0.3493, p0=0.0000, reject_threshold=0.0100)	https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=LicenseEnforcementTest&test_method=test_license_enforcement

graham-rp

It looks like this breaks existing behavior (rpk benchmark now errors out). Would rpk benchmark --mode work instead?

StephanDollberg · 2026-04-07T08:25:19Z

Yes, this changes that intentionally. No need to maintain that.

Make the current (and only) produce mode be explicit by requiring a "produce" subcommand. This is in preparation for adding "consume".

Pure non-functional refactor of moving everything purely produce related to its own file and into subfunctions. Again in preparation for adding other subcommands like consume.

When cancelling the command we delete the test topic we created. Sometimes the topic might still leak (SIGKILL or rpk is killed during topic creation before RP has responded). Provide a `--reset-topic` flag which deletes and recreates the topic similar to what OMB provides.
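
To illustrate why a topic can still leak, here is a hedged sketch of cleanup-on-cancel in Go. It is not the PR's code: `runBenchmark`, `simulateCancelledRun`, and the callback signatures are hypothetical. The cleanup runs when the context is cancelled by SIGINT or SIGTERM, but a process cannot catch SIGKILL, so in that case the deferred delete never executes.

```go
package main

import (
	"context"
	"fmt"
	"os"
	"os/signal"
	"syscall"
)

// runBenchmark creates the topic, runs work, and always attempts to delete
// the topic afterwards. All callbacks are stand-ins for real admin calls.
func runBenchmark(ctx context.Context, create, remove func() error, work func(context.Context) error) error {
	if err := create(); err != nil {
		return fmt.Errorf("create topic: %w", err)
	}
	// Runs on normal return and on cancellation via ctx.
	// It cannot run if the process receives SIGKILL.
	defer func() {
		if err := remove(); err != nil {
			fmt.Fprintln(os.Stderr, "topic cleanup failed:", err)
		}
	}()
	return work(ctx)
}

// simulateCancelledRun drives runBenchmark with an already-cancelled
// context and records the order in which the steps happen.
func simulateCancelledRun() []string {
	var steps []string
	ctx, cancel := context.WithCancel(context.Background())
	cancel()
	_ = runBenchmark(ctx,
		func() error { steps = append(steps, "create"); return nil },
		func() error { steps = append(steps, "delete"); return nil },
		func(ctx context.Context) error {
			<-ctx.Done()
			steps = append(steps, "cancelled")
			return ctx.Err()
		},
	)
	return steps
}

func main() {
	// A real command would derive its context from signals like this.
	ctx, stop := signal.NotifyContext(context.Background(), os.Interrupt, syscall.SIGTERM)
	defer stop()

	noop := func() error { return nil }
	err := runBenchmark(ctx, noop, noop, func(context.Context) error { return nil })
	fmt.Println(simulateCancelledRun(), err)
}
```

A delete-and-recreate step at startup covers the cases this pattern cannot: the leftover topic from a killed run is removed before the next run begins.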

Add a flag to use a preexisting topic. Can sometimes be useful for various reasons (existing data etc.).
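
One way to picture how the two topic options could interact is a small planning function. This is a sketch under stated assumptions rather than the behaviour of the merged code: `topicOptions`, `topicPlan`, and `planTopic` are hypothetical names, and both the rejection of the two flags together and the decision to leave an existing topic in place after the run are assumptions the PR text does not confirm.

```go
package main

import (
	"errors"
	"fmt"
)

// topicOptions mirrors the two flags discussed in this PR.
type topicOptions struct {
	resetTopic       bool
	useExistingTopic bool
}

// topicPlan lists which topic operations surround the benchmark run.
type topicPlan struct {
	deleteBefore bool
	createBefore bool
	deleteAfter  bool
}

// planTopic maps the flags to topic operations.
func planTopic(opts topicOptions) (topicPlan, error) {
	switch {
	case opts.resetTopic && opts.useExistingTopic:
		// Assumption: resetting would destroy the data the existing
		// topic was kept for, so the combination is rejected.
		return topicPlan{}, errors.New("reset-topic and use-existing-topic cannot be combined")
	case opts.useExistingTopic:
		// Assumption: a topic the benchmark did not create is left untouched.
		return topicPlan{}, nil
	case opts.resetTopic:
		// Delete any leaked topic, recreate it, and clean up afterwards.
		return topicPlan{deleteBefore: true, createBefore: true, deleteAfter: true}, nil
	default:
		// Default described in the PR: create the topic, delete it when done.
		return topicPlan{createBefore: true, deleteAfter: true}, nil
	}
}

func main() {
	for _, opts := range []topicOptions{
		{},
		{resetTopic: true},
		{useExistingTopic: true},
		{resetTopic: true, useExistingTopic: true},
	} {
		plan, err := planTopic(opts)
		fmt.Printf("%+v -> %+v err=%v\n", opts, plan, err)
	}
}
```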

StephanDollberg · 2026-04-07T13:50:57Z

Pure rebase

StephanDollberg requested review from ballard26 and travisdowns March 31, 2026 09:50

StephanDollberg requested review from a team, kbatuigas and r-vasquez as code owners March 31, 2026 09:50

Copilot AI review requested due to automatic review settings March 31, 2026 09:50

github-actions bot added area/rpk area/build labels Mar 31, 2026

Copilot started reviewing on behalf of StephanDollberg March 31, 2026 09:50

Copilot AI reviewed Mar 31, 2026

Comment thread src/go/rpk/pkg/cli/benchmark/benchmark.go

graham-rp reviewed Apr 6, 2026

StephanDollberg added 4 commits April 7, 2026 14:50

rpk: add explicit mode subcommand to benchmark

006c6c9

Make the current (and only) produce mode be explicit by requiring a "produce" subcommand. This is in preparation for adding "consume".

rpk: refactor produce benchmark mode wiring

ba4e8c8

Pure non-functional refactor of moving everything purely produce related to its own file and into subfunctions. Again in preparation for adding other subcommands like consume.

rpk: Add --use-prexisting-topic to benchmark

986b62e

Add a flag to use a preexisting topic. Can sometimes be useful for various reasons (existing data etc.).

StephanDollberg force-pushed the stephan/rpk-benchmark-refactor branch from 822fef8 to 986b62e April 7, 2026 13:50

graham-rp approved these changes Apr 7, 2026

travisdowns approved these changes Apr 7, 2026

StephanDollberg merged commit 4d29664 into dev Apr 8, 2026
28 checks passed

StephanDollberg deleted the stephan/rpk-benchmark-refactor branch April 8, 2026 07:48
