Skip to content

Commit 83a76f2

Browse files
authored
docs: update speech-to-text benchmarks (#1014)
## Description Updated Whisper model (speech-to-text) inference time docs section. ### Introduces a breaking change? - [ ] Yes - [x] No ### Type of change - [ ] Bug fix (change which fixes an issue) - [ ] New feature (change which adds functionality) - [x] Documentation update (improves or adds clarity to existing documentation) - [ ] Other (chores, tests, code style improvements etc.) ### Tested on - [ ] iOS - [ ] Android ### Testing instructions No functional changes - no need for testing. ### Screenshots <!-- Add screenshots here, if applicable --> ### Related issues <!-- Link related issues here using #issue-number --> ### Checklist - [ ] I have performed a self-review of my code - [ ] I have commented my code, particularly in hard-to-understand areas - [x] I have updated the documentation accordingly - [ ] My changes generate no new warnings ### Additional notes <!-- Include any additional information, assumptions, or context that reviewers might need to understand this PR. -->
1 parent 81af692 commit 83a76f2

File tree

2 files changed

+4
-4
lines changed

2 files changed

+4
-4
lines changed

docs/docs/02-benchmarks/inference-time.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -139,15 +139,15 @@ Average time for encoding audio of given length over 10 runs. For `Whisper` mode
139139

140140
| Model | iPhone 17 Pro (XNNPACK) [ms] | iPhone 16 Pro (XNNPACK) [ms] | iPhone SE 3 (XNNPACK) [ms] | Samsung Galaxy S24 (XNNPACK) [ms] | OnePlus 12 (XNNPACK) [ms] |
141141
| ------------------ | :--------------------------: | :--------------------------: | :------------------------: | :-------------------------------: | :-----------------------: |
142-
| Whisper-tiny (30s) | 248 | 254 | 1145 | 435 | 526 |
142+
| Whisper-tiny (30s) | 89 | 93 | 403 | 277 | 260 |
143143

144144
### Decoding
145145

146146
Average time for decoding one token in sequence of approximately 100 tokens, with encoding context is obtained from audio of noted length.
147147

148148
| Model | iPhone 17 Pro (XNNPACK) [ms] | iPhone 16 Pro (XNNPACK) [ms] | iPhone SE 3 (XNNPACK) [ms] | Samsung Galaxy S24 (XNNPACK) [ms] | OnePlus 12 (XNNPACK) [ms] |
149149
| ------------------ | :--------------------------: | :--------------------------: | :------------------------: | :-------------------------------: | :-----------------------: |
150-
| Whisper-tiny (30s) | 23 | 25 | 121 | 92 | 115 |
150+
| Whisper-tiny (30s) | 6 | 6 | 40 | 28 | 25 |
151151

152152
## Text to Speech
153153

docs/versioned_docs/version-0.8.x/02-benchmarks/inference-time.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -139,15 +139,15 @@ Average time for encoding audio of given length over 10 runs. For `Whisper` mode
139139

140140
| Model | iPhone 17 Pro (XNNPACK) [ms] | iPhone 16 Pro (XNNPACK) [ms] | iPhone SE 3 (XNNPACK) [ms] | Samsung Galaxy S24 (XNNPACK) [ms] | OnePlus 12 (XNNPACK) [ms] |
141141
| ------------------ | :--------------------------: | :--------------------------: | :------------------------: | :-------------------------------: | :-----------------------: |
142-
| Whisper-tiny (30s) | 248 | 254 | 1145 | 435 | 526 |
142+
| Whisper-tiny (30s) | 89 | 93 | 403 | 277 | 260 |
143143

144144
### Decoding
145145

146146
Average time for decoding one token in sequence of approximately 100 tokens, with encoding context is obtained from audio of noted length.
147147

148148
| Model | iPhone 17 Pro (XNNPACK) [ms] | iPhone 16 Pro (XNNPACK) [ms] | iPhone SE 3 (XNNPACK) [ms] | Samsung Galaxy S24 (XNNPACK) [ms] | OnePlus 12 (XNNPACK) [ms] |
149149
| ------------------ | :--------------------------: | :--------------------------: | :------------------------: | :-------------------------------: | :-----------------------: |
150-
| Whisper-tiny (30s) | 23 | 25 | 121 | 92 | 115 |
150+
| Whisper-tiny (30s) | 6 | 6 | 40 | 28 | 25 |
151151

152152
## Text to Speech
153153

0 commit comments

Comments
 (0)