lazy import for vllm benchmark by RuBing-Yang · Pull Request #129 · Tencent/AngelSlim

RuBing-Yang · 2025-11-05T02:54:24Z

This pull request refactors how external dependencies are imported in the benchmarking code for speculative decoding, focusing on lazy imports for improved modularity and startup performance. The main changes involve switching direct imports of fastchat, shortuuid, and vllm to use the new angelslim.utils.lazy_imports module, and updating code to reference these modules accordingly. This change helps avoid unnecessary imports when modules are not used, and sets up the codebase for easier dependency management.

Dependency import refactoring:

Replaced direct imports of fastchat, shortuuid, and vllm in generate_baseline_answer.py and generate_eagle_answer.py with lazy imports from angelslim.utils.lazy_imports, and updated all usages to reference these lazy-loaded modules. [1] [2] [3]
Updated all usages of LLM and SamplingParams from vllm to use vllm.LLM and vllm.SamplingParams via the lazy import, including type hints and instantiations. [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12]

Benchmark engine updates:

Changed direct import and usage of load_questions from fastchat.llm_judge.common to reference fastchat.llm_judge.common.load_questions via the lazy import in benchmark_engine.py, generate_baseline_answer.py, and generate_eagle_answer.py. [1] [2] [3] [4] [5]

Requirements update:

Added vllm>=0.11.0 to requirements/requirements_speculative.txt to ensure the required version is installed for lazy loading.

Miscellaneous:

Added from __future__ import annotations at the top of generate_baseline_answer.py and generate_eagle_answer.py to support postponed evaluation of type annotations, which helps with forward references and lazy imports. [1] [2]
Removed unused direct imports of shortuuid and vllm from the affected files, cleaning up the codebase. [1] [2]

Let me know if you'd like to discuss how lazy imports work or why this change improves the codebase!

RuBing-Yang added 3 commits November 5, 2025 10:52

lazy import for vllm benchmark

a441d52

add vllm requirement

9c88f47

remove mp from lazy imports

382c1ab

liusong1222 approved these changes Nov 5, 2025

View reviewed changes

RuBing-Yang merged commit 9bfa6e5 into Tencent:main Nov 5, 2025
5 checks passed

dawnranger pushed a commit to dawnranger/AngelSlim that referenced this pull request Mar 11, 2026

lazy import for vllm benchmark (Tencent#129)

f14ff15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lazy import for vllm benchmark#129

lazy import for vllm benchmark#129
RuBing-Yang merged 3 commits into
Tencent:mainfrom
RuBing-Yang:spec_decode

RuBing-Yang commented Nov 5, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

RuBing-Yang commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

RuBing-Yang commented Nov 5, 2025 •

edited

Loading