
Improve DynamicPPL benchmarks #1374

@yebai

Description


DynamicPPL’s current benchmarks are fairly noisy: the reported ratios vary substantially from run to run. The current setup also runs the same set of models on both main and a PR branch and compares the two, which makes the benchmarks hard to run locally.

It would be useful to improve the benchmark setup in two ways:

  1. Reduce run-to-run noise (i.e., run all experiments in a single CI job)
  2. Introduce an external baseline, such as Stan, and report benchmark ratios relative to Stan’s log density and gradient evaluations rather than relative to main.

Concrete suggestions

  • Report primal in absolute time
  • Report gradient / primal
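
As a rough, language-agnostic sketch of the arithmetic behind these suggestions, the Python snippet below reduces raw timings to the quantities proposed above: absolute primal time, gradient / primal, and primal time relative to an external baseline such as Stan. The names `BenchmarkRow` and `summarize`, the use of the median as the summary statistic, and all the timings are assumptions made for illustration; none of them come from the DynamicPPL benchmark suite.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class BenchmarkRow:
    model: str
    primal_seconds: float        # median absolute time of one log-density evaluation
    gradient_over_primal: float  # median gradient time / median primal time
    primal_vs_baseline: float    # median primal time / baseline's median primal time


def summarize(model, primal_times, gradient_times, baseline_primal_times):
    """Reduce raw per-evaluation timings (in seconds) to the reported quantities.

    Using the median is an assumption of this sketch; it is simply less
    sensitive to outlier runs than the mean.
    """
    if not (primal_times and gradient_times and baseline_primal_times):
        raise ValueError("each timing list needs at least one sample")
    primal = median(primal_times)
    gradient = median(gradient_times)
    baseline = median(baseline_primal_times)
    return BenchmarkRow(model, primal, gradient / primal, primal / baseline)


# Made-up timings, for illustration only.
row = summarize(
    "hypothetical_model",
    primal_times=[2.0e-6, 2.2e-6, 2.1e-6],
    gradient_times=[8.0e-6, 8.4e-6, 9.0e-6],
    baseline_primal_times=[1.0e-6, 1.1e-6, 1.05e-6],
)
print(
    f"{row.model}: primal={row.primal_seconds:.2e}s, "
    f"gradient/primal={row.gradient_over_primal:.2f}, "
    f"primal/baseline={row.primal_vs_baseline:.2f}"
)
```

Because the baseline timings would be collected in the same job as the DynamicPPL timings, both the absolute time and the ratios can be reproduced locally without checking out main.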

Reference: chalk-lab/Mooncake.jl#1163 (comment)
