Commit dbf3379
committed
ci: Make OpenMPI yield when idle for the MPI example notebooks
The data-initialization notebook's point-to-point routing (NBX exchange)
hung the 4-engine ipyparallel cluster under OpenMPI, while IntelMPI ran
fine. OpenMPI defaults to an aggressive busy-wait: with the engines
contending for cores, the probe/barrier loop spins at 100% and starves the
peers it waits on, so MPI never progresses. IntelMPI yields by default.
Set OMPI_MCA_mpi_yield_when_idle=1 in the examples-mpi job so OpenMPI yields
the CPU while waiting. Also revert the earlier Python-level sleep(0) in
nbx_exchange: sched_yield at the wrong layer (the spin is in OpenMPI's C
progress engine), which did not help.1 parent 6d1a540 commit dbf3379
2 files changed
Lines changed: 7 additions & 11 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
44 | 44 | | |
45 | 45 | | |
46 | 46 | | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
47 | 52 | | |
48 | 53 | | |
49 | 54 | | |
| |||
71 | 76 | | |
72 | 77 | | |
73 | 78 | | |
74 | | - | |
| 79 | + | |
75 | 80 | | |
76 | 81 | | |
77 | 82 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
8 | 8 | | |
9 | 9 | | |
10 | 10 | | |
11 | | - | |
12 | | - | |
13 | 11 | | |
14 | 12 | | |
15 | 13 | | |
| |||
85 | 83 | | |
86 | 84 | | |
87 | 85 | | |
88 | | - | |
89 | | - | |
90 | | - | |
| 86 | + | |
91 | 87 | | |
92 | 88 | | |
93 | 89 | | |
94 | 90 | | |
95 | 91 | | |
96 | 92 | | |
97 | | - | |
98 | | - | |
99 | | - | |
100 | | - | |
101 | | - | |
102 | 93 | | |
103 | 94 | | |
0 commit comments