Commit 9ea7370
benchmarks(agentic): switch to with-subagents corpus + idle-gap cap
Roll the aiperf submodule to dde0cc53, which:
- Adds the semianalysis_cc_traces_weka_with_subagents public-dataset
  entry pointing at semianalysisai/cc-traces-weka-with-subagents-051926
- Switches the inferencex-agentx-mvp scenario to that corpus and to the
  new --trace-idle-gap-cap-seconds=60.0 lock (drops the legacy
  --use-think-time-only + --inter-turn-delay-cap-seconds pair)
Update benchmark_lib.sh's resolve_trace_source() to download the new
dataset and pass --public-dataset semianalysis_cc_traces_weka_with_subagents,
and refresh the build_replay_cmd() comment to reflect the new lock.
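
Reviewer note: the following is a minimal sketch of what an idle-gap cap could do during trace replay, assuming the flag clamps any pause between consecutive requests to at most the configured number of seconds. `cap_idle_gaps` is a hypothetical helper written for illustration, not aiperf's implementation; only the 60.0 value comes from the commit message.

```python
from typing import List

# Value taken from --trace-idle-gap-cap-seconds=60.0 in the commit message.
IDLE_GAP_CAP_SECONDS = 60.0


def cap_idle_gaps(timestamps: List[float], cap: float = IDLE_GAP_CAP_SECONDS) -> List[float]:
    """Return replay timestamps in which no gap between consecutive requests exceeds `cap`.

    Hypothetical helper: assumes `timestamps` is sorted ascending and in seconds.
    The real semantics of the aiperf flag may differ.
    """
    if not timestamps:
        return []
    capped = [timestamps[0]]
    for prev, cur in zip(timestamps, timestamps[1:]):
        # Keep short gaps as recorded; shrink long idle periods to the cap.
        gap = min(cur - prev, cap)
        capped.append(capped[-1] + gap)
    return capped


if __name__ == "__main__":
    print(cap_idle_gaps([0.0, 5.0, 305.0, 310.0]))  # [0.0, 5.0, 65.0, 70.0]
```

Under this assumption, the 300-second pause in the sample trace shrinks to 60 seconds while the two 5-second gaps are replayed unchanged.
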
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Signed-off-by: Cam Quilici <cjquilici@gmail.com>
1 parent bd290a0 commit 9ea7370
2 files changed
Lines changed: 15 additions & 11 deletions
[Diff content not captured. Two hunks survive as line numbers only: old lines 902-917 become new lines 902-918 (7 deletions, 8 additions), and old lines 955-963 become new lines 956-967 (3 deletions, 6 additions).]
- docs/cli-options.md (+1 -1)
- src/aiperf/common/config/loadgen_config.py (+4)
- src/aiperf/common/scenario/base.py (+14 -3)
- src/aiperf/common/scenario/inferencex_agentx_mvp.py (+5 -3)
- src/aiperf/common/scenario/validator.py (+46 -20)
- src/aiperf/plugin/enums.py (+1 -1)
- src/aiperf/plugin/plugins.yaml (+22)
- tests/component_integration/test_scenario_validator_to_exporter_integration.py (+10 -7)
- tests/unit/common/config/test_user_config_mooncake_trace.py (+1 -8)
- tests/unit/common/config/test_user_config_scenario_hook.py (+8)
- tests/unit/common/scenario/test_scenario_registry.py (+4 -3)
- tests/unit/common/scenario/test_scenario_validator.py (+32 -12)
- tests/unit/common/scenario/test_scenario_validator_advanced_adversarial.py (+11 -8)
- tests/unit/common/scenario/test_scenario_validator_adversarial.py (+29 -30)
- tests/unit/dataset/loader/test_weka_async_subagent.py (+1)
- tests/unit/dataset/loader/test_weka_trace_block_size.py (+1)
- tests/unit/dataset/loader/test_weka_trace_byte_exact_corpus.py (+1)
- tests/unit/dataset/loader/test_weka_trace_filters_adversarial.py (+1)
- tests/unit/dataset/loader/test_weka_trace_graph_adversarial.py (+1)
- tests/unit/dataset/loader/test_weka_trace_hash_coherence.py (+1)
- tests/unit/dataset/loader/test_weka_trace_model_rewrite.py (+1)
- tests/unit/dataset/loader/test_weka_trace_reproducibility.py (+1)