
Commit 290f432

Move nel-ci-guide.md to Model-Optimizer-Internal per review feedback
Reviewer @shengliangxu flagged that the NEL CI evaluation guide contains NVIDIA-internal infrastructure (JET clusters, svc-jet service account, gitlab-master NEL CI triggers, COMPEVAL_HF_TOKEN, internal lustre paths) and should not ship in the public repo.

The file has been moved to Model-Optimizer-Internal:agent/nel-ci-guide.md (see internal MR: zhiyu/add-nel-ci-guide-to-agent).

This commit removes the public copy and the "NEL CI and Cluster-Specific Notes" section from evaluation/SKILL.md that referenced it.

Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
1 parent 9cb309b commit 290f432

2 files changed

Lines changed: 0 additions & 289 deletions


.claude/skills/evaluation/SKILL.md

Lines changed: 0 additions & 13 deletions
@@ -319,19 +319,6 @@ After job submission, you can monitor progress using:
 
 ---
 
-### NEL CI and Cluster-Specific Notes
-
-For running evaluations on NVIDIA JET clusters (oci-hsg, cw, oci-nrt) or SLURM clusters like dlcluster, read `references/nel-ci-guide.md`. It covers:
-- NEL CI GitLab trigger pattern vs NEL SLURM executor
-- Cluster-specific GPU counts and storage paths
-- Checkpoint availability (compute nodes may not share login node filesystems)
-- Environment variable prefixes (`host:`, `lit:`) for SLURM executor
-- SGLang must bind `--host 0.0.0.0` for health checks
-- Directory setup and `chmod 777` for JET service account access
-- Common issues (NGC auth, gated datasets, walltime, `NEL_OTHER_OVERRIDES` space-splitting)
-
----
-
 Direct users with issues to:
 
 - **GitHub Issues:** <https://github.com/NVIDIA-NeMo/Evaluator/issues>

.claude/skills/evaluation/references/nel-ci-guide.md

Lines changed: 0 additions & 276 deletions
This file was deleted.
