perf(whisper): lazy-import timing module to avoid loading scipy/numba by YashD5291 · Pull Request #1413 · ml-explore/mlx-examples

YashD5291 · 2026-04-02T19:34:16Z

Summary

- Move `from .timing import add_word_timestamps` from module level to inside the `if word_timestamps:` guard in `transcribe()`
- Makes scipy and numba truly optional when `word_timestamps=False` (the default)

Problem

timing.py imports scipy and numba at module level:

import numba           # timing.py:8
from scipy import signal  # timing.py:10

These are loaded unconditionally via transcribe.py's top-level `from .timing import add_word_timestamps`, even when `word_timestamps=False`. This eagerly loads ~620 Python modules and ~212 MB of native libraries that are never used.
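
One way to arrive at a module count like the one above is to diff `sys.modules` around an import. The helper below is a minimal sketch, not necessarily how the figures in this PR were measured. `modules_added_by` is a hypothetical name, and the stdlib `statistics` module is only a stand-in, since scipy and numba may not be installed.

```python
import importlib
import sys


def modules_added_by(name):
    """Import `name` and return the module names that the import newly loaded."""
    before = set(sys.modules)
    importlib.import_module(name)
    return sorted(set(sys.modules) - before)


# Stand-in target. With mlx_whisper installed, "mlx_whisper.timing" would be
# the interesting argument (an assumption about the package layout).
added = modules_added_by("statistics")
print(f"{len(added)} modules newly loaded")
```

The result depends on what the process has already imported, so an absolute count is only meaningful in a fresh interpreter.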

Fix

One-line change: move the import inside the if word_timestamps: conditional where it's actually needed.

- from .timing import add_word_timestamps

  if word_timestamps:
+     from .timing import add_word_timestamps
+
      add_word_timestamps(...)
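
The deferred-import pattern in the diff can be sketched in isolation. This is a toy illustration under stated assumptions: the `transcribe` below is a hypothetical stand-in, not the real mlx_whisper function or its signature, and the stdlib `statistics` module plays the role of the heavy `.timing` dependency.

```python
import sys


def transcribe(audio, word_timestamps=False):
    """Toy stand-in for transcribe(); not the real mlx_whisper signature."""
    result = {"text": f"<{len(audio)} samples>"}
    if word_timestamps:
        # Deferred import: stands in for `from .timing import add_word_timestamps`.
        # The module is loaded only the first time this branch runs.
        import statistics

        result["words"] = [{"word": "<placeholder>", "score": statistics.median(audio)}]
    return result


plain = transcribe([0.1, 0.2, 0.3])
timed = transcribe([0.1, 0.2, 0.3], word_timestamps=True)
print(sorted(plain), sorted(timed))
```

After the first call that takes the branch, Python serves the module from `sys.modules`, so repeated calls pay only a dictionary lookup for the import statement.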

Impact

- Import time: `import mlx_whisper` no longer loads scipy/numba/llvmlite
- Packaging: Downstream apps using PyInstaller/cx_Freeze can exclude scipy (~71 MB), numba (~31 MB), and llvmlite (~110 MB) from bundles when word timestamps aren't needed
- Zero behavior change: When `word_timestamps=True`, the import fires on first use and everything works identically
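
For the packaging point, a downstream app might build its PyInstaller command along these lines. This is a sketch, assuming PyInstaller's `--exclude-module` option is the mechanism used. `pyinstaller_args` and the entry script `app.py` are hypothetical names for illustration.

```python
import shlex

# Packages that only the word-timestamp path needs, per the list above.
OPTIONAL_PACKAGES = ["scipy", "numba", "llvmlite"]


def pyinstaller_args(entry_script, excluded=OPTIONAL_PACKAGES):
    """Build a PyInstaller argument list that leaves `excluded` out of the bundle."""
    args = ["pyinstaller"]
    for package in excluded:
        args += ["--exclude-module", package]
    args.append(entry_script)
    return args


# "app.py" is a hypothetical entry script.
print(shlex.join(pyinstaller_args("app.py")))
```

A bundle built this way would fail at the deferred import if a caller later passes `word_timestamps=True`, so the exclusion only suits apps that never request word timestamps.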

Testing

- `word_timestamps=False` (default): scipy/numba never imported, verified via `sys.modules` inspection
- `word_timestamps=True`: import fires inside the conditional, all word timestamp functionality works as before
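
A `sys.modules` check of this kind is most reliable in a fresh interpreter, where nothing else has had a chance to import the packages. The helper below is one possible sketch, not the exact check used for this PR. `loaded_after` is a hypothetical name, and the demo uses stdlib modules so it runs without mlx_whisper installed.

```python
import subprocess
import sys


def loaded_after(statement, names):
    """Run `statement` in a fresh interpreter and return which of `names` got loaded."""
    probe = (
        f"{statement}\n"
        "import sys\n"
        f"print('LOADED:' + ','.join(n for n in {list(names)!r} if n in sys.modules))\n"
    )
    proc = subprocess.run(
        [sys.executable, "-c", probe],
        check=True,
        capture_output=True,
        text=True,
    )
    # Read only the marker line, in case the statement itself prints something.
    last_line = proc.stdout.strip().splitlines()[-1]
    loaded = last_line[len("LOADED:"):]
    return {name for name in loaded.split(",") if name}


# Stdlib demo: importing json should load json but not colorsys.
print(loaded_after("import json", ["json", "colorsys"]))
```

With mlx_whisper installed, `loaded_after("import mlx_whisper", ["scipy", "numba", "llvmlite"])` should come back empty if the change behaves as described.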

Move `from .timing import add_word_timestamps` from module-level to inside the `if word_timestamps:` guard in transcribe().

timing.py imports scipy and numba at module level for word-level timestamp alignment (DTW + median filter). These are heavy dependencies (~212 MB combined) that load ~620 Python modules on import. When word_timestamps=False (the default), none of this code is ever called, yet it all gets loaded eagerly.

This change makes scipy and numba truly optional for the common case of transcription without word timestamps, which:

- Reduces import time significantly
- Enables downstream packagers (PyInstaller, cx_Freeze) to exclude scipy/numba/llvmlite from bundles when word timestamps aren't needed
- Saves ~212 MB in frozen/packaged applications

YashD5291 mentioned this pull request Apr 2, 2026

chore: reduce bundle size ~41 MB YashD5291/Esper#2

Merged

6 tasks

YashD5291 wants to merge 1 commit into ml-explore:main from YashD5291:lazy-timing-imports
