WIP: OCPBUGS-77056: add external certs benchmark scripts#1374
Conversation
Signed-off-by: Brett Tofel <btofel@redhat.com>
|
Skipping CI for Draft Pull Request. |
|
Important Review skippedDraft detected. Please check the settings in the CodeRabbit UI or the You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Tip Try Coding Plans. Let us write the prompt for your AI agent so you can ship faster (with fewer bugs). Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
@bentito: This pull request references Jira Issue OCPBUGS-77056, which is valid. 3 validation(s) were run on this bug
Requesting review from QA contact: The bug has been updated to refer to the pull request using the external bug tracker. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
The fix for OCPBUGS-77056 that enables faster startup behavior benchmarked in this PR is implemented and proposed in the following linked pulls:
|
|
This PR is primarily to tie together changes in router and library-go and to illustrate a benchmarking scheme for the Perf/Scale team. @SachinNinganure please let me know if the benchmarking scripts helps test changes to the router made in the linked PRs. We are hoping you're able to work testing with these PRs in place into some of your work on https://issues.redhat.com/browse/CORENET-6842 Primarily we're looking to make sure that parallelizing these calls to speed up the router's handling of adding routes with external certs, is not simply shifting the problem to the API server or elsewhere. So when we save time in the router pod are we causing CPU or memory problems elsewhere on the cluster. Thanks! |
I have started putting some note in this doc. https://docs.google.com/document/d/1tEZBNEaKLvXI8l1KIcHKCQgbAzkv0_YQSzkXcrlq56Y/edit?tab=t.0 I have just tried to understand the benchmark scripts and yes they directly test both the pr-s. for 6842 I will be running our ingress perf test, you may take a look at doc for more info I will be running the benchmark scripts you created for the linked pr verification and additionally , create the perf-scale-grafana boards for the resource utilization and other verification. Thank you |
This PR adds an orchestrator (
00-run-all.sh) and supporting scripts to benchmark the OpenShift router's startup time when handlingspec.tls.externalCertificateroutes, aimed at validating fixes for OCPBUGS-77056.The benchmark harness tests how quickly the router can load N external certificates, comparing the existing bug (where a global write lock causes serial registrations) against a patched image that allows concurrent processing.