feat(benchmarks): Complete benchmark infrastructure with history and profiling
Add comprehensive benchmark infrastructure for performance tracking:
- Row count configuration: quick=500K, default=1M, full=2M rows
- Profile naming: bench_<component>_<scenario>_<timestamp>_<commit>.prof
- History archiving: Every run archived with git commit info
- Diff command: Compare any two history files and flag changes that exceed a threshold
- History analysis: DataFrame utilities (long/wide format) for custom queries
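As a rough illustration of the row-count presets and the profile naming scheme listed above, here is a minimal sketch. The helper name `profile_name`, its parameters, and the timestamp format are assumptions made for this example; the commit does not show the actual implementation.

```python
from datetime import datetime, timezone

# Row counts per mode, as listed in this commit message.
ROW_COUNTS = {"quick": 500_000, "default": 1_000_000, "full": 2_000_000}


def profile_name(component, scenario, commit, when=None):
    """Build bench_<component>_<scenario>_<timestamp>_<commit>.prof.

    Hypothetical helper: the timestamp format below is an assumption.
    """
    when = when or datetime.now(timezone.utc)
    stamp = when.strftime("%Y%m%dT%H%M%S")
    return f"bench_{component}_{scenario}_{stamp}_{commit}.prof"


if __name__ == "__main__":
    print(ROW_COUNTS["full"])
    print(profile_name("materialize", "full", "abc1234"))
```

Including the short commit hash in the file name lets a profile be matched to the code that produced it without opening the file.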
New files:
- history_analysis.py: Load history into pandas DataFrames
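The long and wide formats mentioned for history analysis might look roughly like the sketch below. The record layout (`commit`, `metrics`) and the function names `to_long` and `to_wide` are assumptions for illustration only; the real history JSON schema and the API of history_analysis.py are not shown in this commit.

```python
import pandas as pd

# Hypothetical history records; the actual schema may differ.
runs = [
    {"commit": "abc1234", "metrics": {"join_ms": 120.0, "scan_ms": 40.0}},
    {"commit": "def5678", "metrics": {"join_ms": 100.0, "scan_ms": 42.0}},
]


def to_long(runs):
    """One row per (commit, metric) pair."""
    rows = [
        {"commit": r["commit"], "metric": name, "value": value}
        for r in runs
        for name, value in r["metrics"].items()
    ]
    return pd.DataFrame(rows, columns=["commit", "metric", "value"])


def to_wide(long_df):
    """One row per commit, one column per metric."""
    return long_df.pivot(index="commit", columns="metric", values="value")


long_df = to_long(runs)
wide_df = to_wide(long_df)
```

The long format suits filtering and grouping by metric, while the wide format makes it easy to compare several metrics for the same commit side by side.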
Modified files:
- benchmark_materialize_aliases.py: --full flag, profile naming, row counts
- baseline_utils.py: diff command, get_git_info()
- run_benchmark.sh: --full flag passthrough
- README.md: Documentation for new features
Usage:
./run_benchmark.sh --full # Full analysis with profiling
python baseline_utils.py diff A.json B.json # Compare runs
python history_analysis.py list results/history/ # List metrics
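The threshold check behind the diff command could work along the lines of this sketch. The function name `diff_metrics`, the flat metric dictionaries, and the 5% default threshold are assumptions for illustration; the actual logic in baseline_utils.py is not shown in this commit.

```python
def diff_metrics(old, new, threshold=0.05):
    """Return {metric: relative_change} for metrics present in both runs
    whose relative change exceeds the threshold (hypothetical helper)."""
    flagged = {}
    for name in sorted(old.keys() & new.keys()):
        if old[name] == 0:
            continue  # relative change is undefined for a zero baseline
        change = (new[name] - old[name]) / old[name]
        if abs(change) > threshold:
            flagged[name] = change
    return flagged


if __name__ == "__main__":
    before = {"join_ms": 100.0, "scan_ms": 100.0}
    after = {"join_ms": 110.0, "scan_ms": 101.0}
    print(diff_metrics(before, after))
```

Using a relative rather than absolute change keeps one threshold meaningful across metrics with very different magnitudes.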
Part of benchmark infrastructure for Phase 3 join optimization.