Skip to content

Dk/add distributed benchmark#5591

Open
vopikamm wants to merge 8 commits into
mainfrom
dk/add-distributed-benchmark
Open

Dk/add distributed benchmark#5591
vopikamm wants to merge 8 commits into
mainfrom
dk/add-distributed-benchmark

Conversation

@vopikamm
Copy link
Copy Markdown
Collaborator

Add option to run benchmarks distributed. Is invoked by adding --distributed flag and providing the desired partition.

@vopikamm vopikamm requested a review from jackdfranklin May 15, 2026 10:13
@vopikamm vopikamm linked an issue May 15, 2026 that may be closed by this pull request
Comment thread benchmarking/run_benchmarks.jl Outdated
Comment thread benchmarking/run_benchmarks.jl Outdated
Comment thread benchmarking/run_benchmarks.jl Outdated
Comment thread benchmarking/run_benchmarks.jl Outdated
MPI is already a dependency of Oceananigans so we can just load it
normally
Since the macros return the expression in the sequential case, there is
no need to duplicate the output code
@jackdfranklin
Copy link
Copy Markdown
Collaborator

@simone-silvestri I've gone through and implemented the changes you suggested. I also took a lot of the output handling code out of the if statement, since we now use the macros - could you check that the logic is correct/unchanged?

@simone-silvestri
Copy link
Copy Markdown
Collaborator

Seems to be correct to me. The only caution with using the macros is not to nest them (otherwise we hit a deadlock since all the macros have a barrier at the end)

"--partition"
help = "Partition for distributed architecture as Rx x Ry x Rz (e.g., 2x2x1). Ignored unless --distributed is set."
arg_type = String
default = "2x2x1"
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is a bit loose. More of an example than a robust default

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we can improve on it as we work on load balancing and add new partitions. I think it is probably ok for the time being?

@simone-silvestri
Copy link
Copy Markdown
Collaborator

simone-silvestri commented May 15, 2026

At the moment we have the benchmark pipeline running on two reserved tartarus GPUS (GPU 0 and GPU 2), we could maybe add one mpi test to the pipeline with just 2 ranks to verify it works and keep it running.

@giordano giordano added GPU 👾 Where Oceananigans gets its powers from distributed 🕸️ Our plan for total cluster domination labels May 15, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

distributed 🕸️ Our plan for total cluster domination GPU 👾 Where Oceananigans gets its powers from

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add option for distributed computation to benchmark

4 participants