DynamicPPL’s current benchmarks are fairly noisy: ratios vary substantially across runs. The current setup also compares the same set of models on both main and a PR branch, which makes benchmarks hard to run locally.
It would be useful to improve the benchmark setup in two ways:
- Reduce run-to-run noise (i.e., run all experiments in a single CI job)
- Introduce an external baseline, such as Stan, and report benchmark ratios relative to Stan’s log density and gradient evaluations rather than relative to main.
Concrete suggestions
- Report the primal (log density evaluation) in absolute time
- Report the gradient evaluation as a ratio to the primal (gradient / primal)
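A minimal sketch of what reporting these two numbers could look like, using BenchmarkTools.jl against the LogDensityProblems.jl interface. The `Gauss` target is a toy stand-in for a compiled DynamicPPL model, and the choice of ForwardDiff as the AD backend is purely illustrative:

```julia
using BenchmarkTools
using LogDensityProblems, LogDensityProblemsAD, ForwardDiff

# Toy log density standing in for a real model (illustrative only).
struct Gauss end
LogDensityProblems.logdensity(::Gauss, x) = -sum(abs2, x) / 2
LogDensityProblems.dimension(::Gauss) = 10
LogDensityProblems.capabilities(::Type{Gauss}) =
    LogDensityProblems.LogDensityOrder{0}()

ldf = ADgradient(:ForwardDiff, Gauss())
x = randn(LogDensityProblems.dimension(Gauss()))

# Primal: reported in absolute time.
t_primal = @belapsed LogDensityProblems.logdensity($ldf, $x)
# Gradient: reported as a ratio to the primal.
t_grad = @belapsed LogDensityProblems.logdensity_and_gradient($ldf, $x)

println("primal: ", t_primal, " s;  gradient/primal: ", t_grad / t_primal)
```

Reporting the gradient as a ratio to the primal (rather than to main) gives a number that is stable across machines and reproducible locally without checking out two branches.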
Reference: chalk-lab/Mooncake.jl#1163 (comment)