[experiment] Benchmark -Zthreads=sync on rustc-perf benchmarks#158357
[experiment] Benchmark -Zthreads=sync on rustc-perf benchmarks#158357petrochenkov wants to merge 1 commit into
-Zthreads=sync on rustc-perf benchmarks#158357Conversation
|
r? @oli-obk rustbot has assigned @oli-obk. Use Why was this reviewer chosen?The reviewer was selected based on:
|
|
@bors try @rust-timer queue |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
[experiment] Benchmark `-Zthreads=sync` on rustc-perf benchmarks
This comment has been minimized.
This comment has been minimized.
|
Finished benchmarking commit (345cec8): comparison URL. Overall result: no relevant changes - no action neededBenchmarking means the PR may be perf-sensitive. Consider adding rollup=never if this change is not fit for rolling up. @rustbot label: -S-waiting-on-perf -perf-regression Instruction countThis perf run didn't have relevant results for this metric. Max RSS (memory usage)This perf run didn't have relevant results for this metric. CyclesResults (primary -2.3%, secondary 12.8%)A less reliable metric. May be of interest, but not used to determine the overall result above.
Binary sizeThis perf run didn't have relevant results for this metric. Bootstrap: 510.446s -> 503.041s (-1.45%) |
|
This is surprising, the "fast path" for This may be an opportunity to simplify things. |
|
I assume the perf suite doesn't pass |
|
Ah, right, |
|
This PR was rebased onto a different main commit. Here's a range-diff highlighting what actually changed. Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers. |
|
@bors try @rust-timer queue |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
[experiment] Benchmark `-Zthreads=sync` on rustc-perf benchmarks
This comment has been minimized.
This comment has been minimized.
|
Finished benchmarking commit (52eea4b): comparison URL. Overall result: ❌✅ regressions and improvements - please read:Benchmarking means the PR may be perf-sensitive. It's automatically marked not fit for rolling up. Overriding is possible but disadvised: it risks changing compiler perf. Next, please: If you can, justify the regressions found in this try perf run in writing along with @bors rollup=never Instruction countOur most reliable metric. Used to determine the overall result above. However, even this metric can be noisy.
Max RSS (memory usage)Results (primary 17.2%, secondary 19.0%)A less reliable metric. May be of interest, but not used to determine the overall result above.
CyclesResults (primary 7.8%, secondary 12.0%)A less reliable metric. May be of interest, but not used to determine the overall result above.
Binary sizeThis perf run didn't have relevant results for this metric. Bootstrap: 505.309s -> 525.313s (3.96%) |
|
24.83% on helloworld is something that we clearly need to investigate. |
|
20% on helloworld is 5ms |
|
A fixed 5ms cost is still expensive if you have a ton of small deoendencies to compile. |
|
If you only do maybe, otherwise it will be amortized by the first big crate that takes advantage of parallelism. |
|
I wonder if sharding is the cause of the helloworld slowdown. Probably should test helloworld on #153826. |
|
From a quick look |
View all comments