Skip to content

Commit 419df99

Browse files
authored
feat: update sglang_runner (#36)
* feat: update sglang_runner and meta.json * Update sglang runner to c43a8309 * chore: remove deprecated runner nvidia_sglang_bbeb34f6 - Delete old runner implementation - Keep only nvidia_sglang_c43a8309
1 parent 3ae358a commit 419df99

4 files changed

Lines changed: 452 additions & 1 deletion

File tree

.gitignore

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,4 +21,7 @@ configs/runner_configs/*.yaml
2121
# Local-only benchmark artifacts (not needed for submission)
2222
accuracy_outputs.jsonl
2323
run.log
24-
samples.jsonl
24+
samples.jsonl.ipynb_checkpoints/
25+
*_backup/
26+
backup/
27+
.ipynb_checkpoints/
Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,11 @@
1+
{
2+
"id": "nvidia_sglang_c43a8309",
3+
"platform": "nvidia",
4+
"name": "SGLang on NVIDIA",
5+
"framework": "SGLang",
6+
"submitted_by": "Gong-K",
7+
"description": "AccelMark runner for NVIDIA GPUs using SGLang. Supports all suites (A–G). Enables direct throughput and latency comparison between SGLang and vLLM on the same hardware.",
8+
"supersedes_chain": [],
9+
"notes": "Decouple runners from suite and scenario knowledge — load_model() uses use_async from parallelism dict instead of checking scenario name.",
10+
"created": "2026-04-03"
11+
}
Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,38 @@
1+
# AccelMark — NVIDIA SGLang runner dependencies
2+
# Tested combination: torch 2.9.1 + sglang 0.4.x + CUDA 12.1
3+
#
4+
# SGLang install — use the official release wheel:
5+
# pip install sglang[all]
6+
# or for a pinned version:
7+
# pip install "sglang[all]==0.5.6"
8+
#
9+
# See https://sgl-project.github.io/start/install.html for full options.
10+
11+
# Core
12+
torch==2.9.1
13+
torchvision==0.24.1
14+
torchaudio==2.9.1
15+
16+
# LLM inference
17+
sglang[all]>=0.5.6
18+
19+
# Transformers (for tokenizer)
20+
transformers==4.57.1
21+
tokenizers>=0.22.2
22+
huggingface-hub>=0.36.2
23+
accelerate>=1.13.0
24+
25+
# AccelMark dependencies
26+
numpy>=2.2.6
27+
jsonschema>=4.26.0
28+
psutil>=7.2.2
29+
tqdm>=4.67.3
30+
31+
# NVIDIA monitoring
32+
nvidia-ml-py>=13.595.45
33+
34+
# Async support
35+
aiohttp>=3.13.5
36+
37+
# Config file parsing
38+
pyyaml>=6.0

0 commit comments

Comments
 (0)