also change 10.0f to 10.0 for CUDA 12.8.0 #209

Merged
casparvl merged 3 commits into EESSI:main from bedroge:cuda_12.8.0_100f
Apr 21, 2026

Conversation

Contributor

bedroge commented Apr 21, 2026

Follow-up for #200: it turned out that the same change is required for 10.0f as well; see EESSI/software-layer#1462 (comment).

This also fixes a bug: there was no CUDA version check in the hook. It now checks specifically for CUDA 12.8.0; the issue doesn't affect 12.9.1.
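For readers who want a concrete picture of the version-gated rewrite described above, here is a minimal sketch. The function name `normalize_compute_capabilities`, its arguments, and the plain string comparison on the version are assumptions made for illustration; this is not the code in `eb_hooks.py`, which is not shown in this conversation.

```python
# Hypothetical helper, NOT the actual hook code from eb_hooks.py.
# Assumption: compute capabilities arrive as strings such as "9.0" or "10.0f".
AFFECTED_CUDA_VERSION = "12.8.0"  # per the PR text, 12.9.1 is not affected


def normalize_compute_capabilities(cuda_version, compute_capabilities):
    """Return a new list in which '10.0f' is replaced by '10.0' for CUDA 12.8.0.

    For any other CUDA version the entries are returned unchanged (as a copy).
    """
    if cuda_version != AFFECTED_CUDA_VERSION:
        return list(compute_capabilities)
    return ["10.0" if cc == "10.0f" else cc for cc in compute_capabilities]


if __name__ == "__main__":
    print(normalize_compute_capabilities("12.8.0", ["9.0", "10.0f"]))
    print(normalize_compute_capabilities("12.9.1", ["9.0", "10.0f"]))
```

The explicit version check is the point of the bug fix mentioned above: without it, the rewrite would also be applied to CUDA versions that do not need it.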

Contributor Author

bedroge commented Apr 21, 2026

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc100
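The build command above is a sequence of `key:value` arguments after a `bot:` prefix and an action. A rough sketch of how such a line could be taken apart follows; `parse_bot_command` and `split_filter` are hypothetical names, and this is not the bot's own parser.

```python
# Hypothetical parser for illustration only; the real bot may parse differently.
def parse_bot_command(line):
    """Split a 'bot: <action> key:value ...' line into (action, args dict)."""
    prefix = "bot:"
    if not line.startswith(prefix):
        raise ValueError(f"not a bot command: {line!r}")
    tokens = line[len(prefix):].split()
    if not tokens:
        raise ValueError("bot command has no action")
    action = tokens[0]
    args = {}
    for token in tokens[1:]:
        # Split on the first colon only; values may contain '=', '/' and ','.
        key, sep, value = token.partition(":")
        if not sep:
            raise ValueError(f"expected key:value, got {token!r}")
        args[key] = value
    return action, args


def split_filter(value):
    """Split 'arch=x86_64/amd/zen2,accel=nvidia/cc100' into a dict."""
    return dict(part.split("=", 1) for part in value.split(","))


if __name__ == "__main__":
    cmd = ("bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws "
           "on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc100")
    action, args = parse_bot_command(cmd)
    print(action, args)
    print(split_filter(args["for"]))
```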


eessi-bot-aws Bot commented Apr 21, 2026

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2026.04/pr_209/149982

date | job status | comment
Apr 21 08:04:46 UTC 2026 | submitted | job id 149982 awaits release by job manager
Apr 21 08:04:54 UTC 2026 | released | job awaits launch by Slurm scheduler
Apr 21 08:10:57 UTC 2026 | running | job 149982 is running
Apr 21 08:25:16 UTC 2026 | finished | 😁 SUCCESS
Details
✅ job output file slurm-149982.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc100-17767597900.tar.zst
size: 78 MiB (82671460 bytes)
entries: 95
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/modules/all
NCCL/2.27.7-GCCcore-14.2.0-CUDA-12.8.0.lua
UCX-CUDA/1.18.0-GCCcore-14.2.0-CUDA-12.8.0.lua
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/software
NCCL/2.27.7-GCCcore-14.2.0-CUDA-12.8.0
UCX-CUDA/1.18.0-GCCcore-14.2.0-CUDA-12.8.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/reprod
NCCL/2.27.7-GCCcore-14.2.0-CUDA-12.8.0/20260421_082301UTC
UCX-CUDA/1.18.0-GCCcore-14.2.0-CUDA-12.8.0/20260421_081522UTC
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100
2025.06/init/easybuild/eb_hooks.py
Apr 21 08:25:16 UTC 2026 | test result | 😁 SUCCESS
ReFrame Summary
[ OK ] (1/5) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/22Jul2025-foss-2024a-kokkos %scale=1_node /ade8cad7 @BotBuildTests:x86-64-zen2+default
P: perf: 435.785 timesteps/s (r:0, l:None, u:None)
[ OK ] (2/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen2+default
P: latency: 1.32 us (r:0, l:None, u:None)
[ OK ] (3/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen2+default
P: latency: 2.04 us (r:0, l:None, u:None)
[ OK ] (4/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen2+default
P: latency: 0.18 us (r:0, l:None, u:None)
[ OK ] (5/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen2+default
P: bandwidth: 7976.78 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 5/5 test case(s) from 5 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-149982.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
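The build report above lists patterns that must be absent from the job output and patterns that must be present. A minimal sketch of that kind of check is given below. It assumes the listed strings are treated as regular expressions and applied to the whole output text; `check_job_output` is a hypothetical name, not a function of the bot.

```python
import re

# Patterns copied from the build report above; treating them as regular
# expressions is an assumption made for this sketch.
MUST_NOT_MATCH = ["FATAL:", "ERROR:", "FAILED:", "required modules missing:"]
MUST_MATCH = ["No missing installations", ".tar.* created!"]


def check_job_output(text):
    """Return a dict mapping each check description to True (passed) or False."""
    results = {}
    for pattern in MUST_NOT_MATCH:
        results["no message matching " + pattern] = re.search(pattern, text) is None
    for pattern in MUST_MATCH:
        results["found message matching " + pattern] = re.search(pattern, text) is not None
    return results


if __name__ == "__main__":
    sample = "No missing installations\n/tmp/example.tar.zst created!\n"
    for description, passed in check_job_output(sample).items():
        print("OK  " if passed else "FAIL", description)
```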

Contributor Author

bedroge commented Apr 21, 2026

        make  -j 16  NVCC_GENCODE="-gencode=arch=compute_100,code=sm_100"
== Summary:
   * [SUCCESS] UCX-CUDA/1.18.0-GCCcore-14.2.0-CUDA-12.8.0
   * [SUCCESS] NCCL/2.27.7-GCCcore-14.2.0-CUDA-12.8.0
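The `NVCC_GENCODE` value in the build log above corresponds to compute capability 10.0 with the dot dropped. A small sketch of that mapping follows; `gencode_flag` is a hypothetical helper, and it only covers plain `major.minor` values, not suffixed ones such as `10.0f`.

```python
# Hypothetical helper for illustration; not taken from NCCL or EasyBuild.
def gencode_flag(compute_capability):
    """Turn a 'major.minor' compute capability into an nvcc -gencode flag."""
    major, sep, minor = compute_capability.partition(".")
    if not (sep and major.isdigit() and minor.isdigit()):
        raise ValueError(f"expected 'major.minor', got {compute_capability!r}")
    arch = major + minor  # "10.0" becomes "100"
    return f"-gencode=arch=compute_{arch},code=sm_{arch}"


if __name__ == "__main__":
    print(gencode_flag("10.0"))
```

Because the helper rejects anything that is not purely numeric, a value like `10.0f` would raise an error here rather than silently producing a flag.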

Contributor Author

bedroge commented Apr 21, 2026

bot: build repo:eessi.io-2023.06-software instance:eessi-bot-mc-aws for:arch=x86_64/amd/zen2
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-mc-aws on:arch=zen2 for:arch=x86_64/amd/zen2,accel=nvidia/cc100


eessi-bot-aws Bot commented Apr 21, 2026

New job on instance eessi-bot-mc-aws for repository eessi.io-2023.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2
Job dir: /project/def-users/SHARED/jobs/2026.04/pr_209/149983

date | job status | comment
Apr 21 08:26:05 UTC 2026 | submitted | job id 149983 awaits release by job manager
Apr 21 08:26:22 UTC 2026 | released | job awaits launch by Slurm scheduler
Apr 21 08:31:29 UTC 2026 | running | job 149983 is running
Apr 21 08:35:43 UTC 2026 | finished | 😁 SUCCESS
Details
✅ job output file slurm-149983.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-17767602940.tar.zst
size: 0 MiB (27717 bytes)
entries: 1
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
reprod directories under 2023.06/software/linux/x86_64/amd/zen2/reprod
no reprod directories in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
2023.06/init/easybuild/eb_hooks.py
Apr 21 08:35:43 UTC 2026 | test result | 😁 SUCCESS
ReFrame Summary
[ OK ] ( 1/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/29Aug2024-foss-2023b-kokkos %scale=1_node /aeb2d9df @BotBuildTests:x86-64-zen2+default
P: perf: 443.19 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 2/10) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/2Aug2023_update2-foss-2023a-kokkos %scale=1_node /04ff9ece @BotBuildTests:x86-64-zen2+default
P: perf: 453.7 timesteps/s (r:0, l:None, u:None)
[ OK ] ( 3/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /775175bf @BotBuildTests:x86-64-zen2+default
P: latency: 2.64 us (r:0, l:None, u:None)
[ OK ] ( 4/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /52707c40 @BotBuildTests:x86-64-zen2+default
P: latency: 2.6 us (r:0, l:None, u:None)
[ OK ] ( 5/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node %device_type=cpu /b1aacda9 @BotBuildTests:x86-64-zen2+default
P: latency: 5.9 us (r:0, l:None, u:None)
[ OK ] ( 6/10) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node %device_type=cpu /c6bad193 @BotBuildTests:x86-64-zen2+default
P: latency: 5.9 us (r:0, l:None, u:None)
[ OK ] ( 7/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /15cad6c4 @BotBuildTests:x86-64-zen2+default
P: latency: 0.8 us (r:0, l:None, u:None)
[ OK ] ( 8/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /6672deda @BotBuildTests:x86-64-zen2+default
P: latency: 0.9 us (r:0, l:None, u:None)
[ OK ] ( 9/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %scale=1_node /2a9a47b1 @BotBuildTests:x86-64-zen2+default
P: bandwidth: 6363.19 MB/s (r:0, l:None, u:None)
[ OK ] (10/10) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %scale=1_node /1b24ab8e @BotBuildTests:x86-64-zen2+default
P: bandwidth: 6408.04 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 10/10 test case(s) from 10 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-149983.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Apr 21 08:59:51 UTC 2026 | uploaded | transfer of eessi-2023.06-software-linux-x86_64-amd-zen2-17767602940.tar.zst to S3 bucket succeeded


eessi-bot-aws Bot commented Apr 21, 2026

New job on instance eessi-bot-mc-aws for repository eessi.io-2025.06-software
Building on: amd-zen2
Building for: x86_64/amd/zen2 and accelerator nvidia/cc100
Job dir: /project/def-users/SHARED/jobs/2026.04/pr_209/149984

date | job status | comment
Apr 21 08:26:10 UTC 2026 | submitted | job id 149984 awaits release by job manager
Apr 21 08:26:20 UTC 2026 | released | job awaits launch by Slurm scheduler
Apr 21 08:27:25 UTC 2026 | running | job 149984 is running
Apr 21 08:28:26 UTC 2026 | finished | 😁 SUCCESS
Details
✅ job output file slurm-149984.out
✅ no message matching FATAL:
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc100-17767600100.tar.zst
size: 0 MiB (27721 bytes)
entries: 1
modules under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/modules/all
no module files in tarball
software under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/software
no software packages in tarball
reprod directories under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100/reprod
no reprod directories in tarball
other under 2025.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc100
2025.06/init/easybuild/eb_hooks.py
Apr 21 08:28:26 UTC 2026 | test result | 😁 SUCCESS
ReFrame Summary
[ OK ] (1/5) EESSI_LAMMPS_lj %device_type=cpu %module_name=LAMMPS/22Jul2025-foss-2024a-kokkos %scale=1_node /ade8cad7 @BotBuildTests:x86-64-zen2+default
P: perf: 435.376 timesteps/s (r:0, l:None, u:None)
[ OK ] (2/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_allreduce %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /e4bf9965 @BotBuildTests:x86-64-zen2+default
P: latency: 1.36 us (r:0, l:None, u:None)
[ OK ] (3/5) EESSI_OSU_coll %benchmark_info=mpi.collective.osu_alltoall %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node %device_type=cpu /3da4890b @BotBuildTests:x86-64-zen2+default
P: latency: 2.03 us (r:0, l:None, u:None)
[ OK ] (4/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_latency %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /3255009a @BotBuildTests:x86-64-zen2+default
P: latency: 0.18 us (r:0, l:None, u:None)
[ OK ] (5/5) EESSI_OSU_pt2pt_CPU %benchmark_info=mpi.pt2pt.osu_bw %module_name=OSU-Micro-Benchmarks/7.5-gompi-2025a %scale=1_node /59f4b331 @BotBuildTests:x86-64-zen2+default
P: bandwidth: 7889.95 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 5/5 test case(s) from 5 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-149984.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Apr 21 08:59:43 UTC 2026 | uploaded | transfer of eessi-2025.06-software-linux-x86_64-amd-zen2-accel-nvidia-cc100-17767600100.tar.zst to S3 bucket succeeded

casparvl merged commit c4ed80c into EESSI:main on Apr 21, 2026
73 of 79 checks passed
bedroge deleted the cuda_12.8.0_100f branch on April 21, 2026 09:59

2 participants