================================================== End of ROCm SMI Log ===================================================
```
## Enroot and Pyxis with GPU partition

Partitioned GPUs can be used with Enroot and Pyxis just like unpartitioned GPUs. For this to work, Slurm must first identify the partitioned GPUs as generic resources (GRES).

This requires the following config changes:

1. Add the following line to the `/etc/slurm/gres.conf` file so that Slurm automatically detects the number of GRES resources whenever the GPUs are partitioned:

   `AutoDetect=rsmi`

Example `gres.conf` file:

```bash
AutoDetect=rsmi
Name=gpu File=/dev/dri/renderD128
Name=gpu File=/dev/dri/renderD136
Name=gpu File=/dev/dri/renderD144
Name=gpu File=/dev/dri/renderD152
Name=gpu File=/dev/dri/renderD160
Name=gpu File=/dev/dri/renderD168
Name=gpu File=/dev/dri/renderD176
Name=gpu File=/dev/dri/renderD184
```
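
2. If `Gres` is specified in the node definition in the `/etc/slurm/slurm.conf` file, make sure it specifies the correct number of GPUs for that node. Once the GPUs are partitioned, this count should match the number of GPU devices Slurm detects, not the number of physical GPUs.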
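
As a rough illustration of step 2, a matching `slurm.conf` fragment might look like the sketch below. The node name `node01` is hypothetical, and the count of 8 is an assumption chosen only to match the eight device files in the example `gres.conf` above; use the count that applies to your own partition mode.

```bash
# /etc/slurm/slurm.conf (fragment, other node parameters omitted)
# GresTypes must list "gpu" for Slurm to track GPUs as generic resources.
GresTypes=gpu
# "node01" is a placeholder node name; gpu:8 assumes eight GPU devices on the node.
NodeName=node01 Gres=gpu:8
```

After restarting the Slurm daemons, `scontrol show node node01` should report the `Gres` value that Slurm registered for the node. A containerized check through Pyxis could then look something like `srun --gres=gpu:1 --container-image=<image> rocm-smi`, where `<image>` is a placeholder for a ROCm-enabled container image.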