File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -288,6 +288,27 @@ mpirun -n <ranks> -hosts <node1>,<node2> ./simulation
288288Nodes must have passwordless SSH from the launch node and no ` pam_slurm_adopt `
289289blocking. Suppress the SSH login banner on remote nodes with ` touch ~/.hushlogin ` .
290290
291+ ** OFI provider path** : Intel MPI 2021.x ships its own libfabric providers in
292+ ` $I_MPI_ROOT/libfabric/lib/prov/ ` . The system libfabric may not include the tcp
293+ or shm providers. Always set:
294+
295+ ``` bash
296+ export FI_PROVIDER_PATH=$I_MPI_ROOT /libfabric/lib/prov
297+ ```
298+
299+ Without this, ` PMPI_Init ` aborts with ` OFI fi_getinfo() failed: No data available ` .
300+ This is handled automatically by ` source ./mfc.sh load -c crnch -m gpu ` .
301+
302+ ** SLURM GPU access** : on SLURM-managed Intel GPU nodes, processes outside a SLURM
303+ allocation cannot open ` /dev/dri/renderD128 ` . Always request the GPU resource:
304+
305+ ``` bash
306+ # SBATCH --gres=gpu:max_1100:1 # Intel GPU Max 1100
307+ ```
308+
309+ Without ` --gres ` , ` omp_get_num_devices() ` returns 0 and the process aborts with
310+ integer divide-by-zero in ` s_initialize_mpi_domain ` (rank % num_devices with 0 devices).
311+
291312### ` libumf.so.1 ` not found at runtime
292313The 2026.0 Level Zero and OpenCL UR adapters link against ` libumf.so.1 ` .
293314If not in ` LD_LIBRARY_PATH ` , all adapters fail silently and sycl-ls reports
Original file line number Diff line number Diff line change @@ -122,7 +122,8 @@ crnch-gpu FC=/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mpi/2021.14/b
122122crnch-gpu PATH=/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/compiler/2025.0/bin:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mpi/2021.14/bin:${PATH}
123123crnch-gpu MKLROOT=/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mkl/2025.0
124124crnch-gpu I_MPI_ROOT=/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mpi/2021.14
125- crnch-gpu LD_LIBRARY_PATH=/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mkl/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/compiler/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mpi/2021.14/lib:${LD_LIBRARY_PATH}
125+ crnch-gpu LD_LIBRARY_PATH=/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mkl/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/compiler/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mpi/2021.14/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mpi/2021.14/libfabric/lib: ${LD_LIBRARY_PATH}
126126crnch-gpu LIBRARY_PATH=/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mkl/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/compiler/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/2025.0/lib:/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mpi/2021.14/lib:${LIBRARY_PATH}
127127crnch-gpu I_MPI_FABRICS=shm:ofi
128128crnch-gpu FI_PROVIDER=tcp
129+ crnch-gpu FI_PROVIDER_PATH=/net/projects/tools/x86_64/rhel-8/intel-oneapi/2025.1/mpi/2021.14/libfabric/lib/prov
You can’t perform that action at this time.
0 commit comments