Commit aec1d11

README: trim Footprint section to throughput-only
1 parent 46a0191 commit aec1d11

1 file changed

Lines changed: 2 additions & 14 deletions

execuwhisper/macos/README.md

@@ -44,23 +44,11 @@ At a glance: microphone → `AudioRecorder` → `parakeet_helper` (Metal, [pytor
 
 ## Footprint & Performance
 
-ExecuWhisper is one of the smaller fully-on-device dictation stacks shippable today. Measurements taken on an Apple Silicon Mac during active dictation:
-
-| What | Size |
+| What | Value |
 |---|---:|
-| App bundle (`ExecuWhisper.app`) — Swift UI + 2 helper binaries + `mlx.metallib` + `libomp.dylib` | **34 MB** |
-| Models on disk (downloaded once on first launch) | **1.27 GB** |
-|    Parakeet-TDT ASR (`model.pte` + tokenizer) | 800 MB |
-|    LFM2.5-350M formatter (`lfm2_5_350m_mlx_4w.pte` + tokenizer) | 472 MB |
-| Peak working-set memory during inference (sum of UI + both helpers) | **~4.8 GB** |
-|    ExecuWhisper UI (SwiftUI) | 106 MB |
-|    `parakeet_helper` (ASR, Metal) | 1.4 GB |
-|    `lfm25_formatter_helper` (LFM2.5, MLX) | 3.3 GB |
 | LFM2.5 formatter throughput (mean over 100-row AMI eval) | **533 tok/s** |
 
-> **For context:** the 34 MB app bundle is roughly 5–10× smaller than a typical Electron-based dictation app, and the 1.27 GB on-disk model footprint is well under what a single 7B-class chat LLM would occupy. Peak memory of ~4.8 GB is the cost of keeping both helper processes warm with their KV caches resident on the Metal GPU; idle steady-state RSS sits closer to ~1.7 GB.
-
-Throughput numbers from `eval/eval_ami_mlx_4w_g32.json` on the [formatter HF repo](https://huggingface.co/younghan-meta/LFM2.5-350M-ExecuWhisper-Formatter); footprint numbers from `vmmap --summary` ("Physical footprint (peak)") on the running app.
+Throughput numbers from `eval/eval_ami_mlx_4w_g32.json` on the [formatter HF repo](https://huggingface.co/younghan-meta/LFM2.5-350M-ExecuWhisper-Formatter).
 
 ## Features
 