Skip to content

Commit d6ae47c

Browse files
committed
README: add Footprint & Performance section (34MB app, 1.27GB models, 533 tok/s)
1 parent 90f7581 commit d6ae47c

1 file changed

Lines changed: 20 additions & 0 deletions

File tree

execuwhisper/macos/README.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -38,6 +38,26 @@ https://github.com/user-attachments/assets/b840bf99-e221-4c19-ba2e-771903fa357b
3838

3939
(Recording outline lives in [`docs/DEMO_SCRIPT.md`](docs/DEMO_SCRIPT.md).)
4040

41+
## Footprint & Performance
42+
43+
ExecuWhisper is one of the smaller fully-on-device dictation stacks shippable today. Measurements taken on an Apple Silicon Mac during active dictation:
44+
45+
| What | Size |
46+
|---|---:|
47+
| App bundle (`ExecuWhisper.app`) — Swift UI + 2 helper binaries + `mlx.metallib` + `libomp.dylib` | **34 MB** |
48+
| Models on disk (downloaded once on first launch) | **1.27 GB** |
49+
|    Parakeet-TDT ASR (`model.pte` + tokenizer) | 800 MB |
50+
|    LFM2.5-350M formatter (`lfm2_5_350m_mlx_4w.pte` + tokenizer) | 472 MB |
51+
| Peak working-set memory during inference (sum of UI + both helpers) | **~4.8 GB** |
52+
|    ExecuWhisper UI (SwiftUI) | 106 MB |
53+
|    `parakeet_helper` (ASR, Metal) | 1.4 GB |
54+
|    `lfm25_formatter_helper` (LFM2.5, MLX) | 3.3 GB |
55+
| LFM2.5 formatter throughput (mean over 100-row AMI eval) | **533 tok/s** |
56+
57+
> **For context:** the 34 MB app bundle is roughly 5–10× smaller than a typical Electron-based dictation app, and the 1.27 GB on-disk model footprint is well under what a single 7B-class chat LLM would occupy. Peak memory of ~4.8 GB is the cost of keeping both helper processes warm with their KV caches resident on the Metal GPU; idle steady-state RSS sits closer to ~1.7 GB.
58+
59+
Throughput numbers from `eval/eval_ami_mlx_4w_g32.json` on the [formatter HF repo](https://huggingface.co/younghan-meta/LFM2.5-350M-ExecuWhisper-Formatter); footprint numbers from `vmmap --summary` ("Physical footprint (peak)") on the running app.
60+
4161
## Architecture
4262

4363
<p align="center">

0 commit comments

Comments
 (0)