fix!: speech to text live transcription by IgorSwat · Pull Request #816 · software-mansion/react-native-executorch

IgorSwat · 2026-02-17T14:26:43Z

Description

Various improvements & adjustments in Speech-to-Text module. The list of changes includes:

Adjusting native implementation to the new format of Whisper models (single file, bundled encode & decode methods)
Refactoring native implementation in order to support multiple STT models in the future
Fixing an impropriate behavior of Whisper streaming

Introduces a breaking change?

Yes
No

Type of change

Bug fix (change which fixes an issue)
New feature (change which adds functionality)
Documentation update (improves or adds clarity to existing documentation)
Other (chores, tests, code style improvements etc.)

Tested on

iOS
Android

Testing instructions

You can run the tests defined for Speech-to-Text module, as well as test it manually with the 'speech' demo app (SpeechToText screen).

Screenshots

Related issues

Checklist

I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have updated the documentation accordingly
My changes generate no new warnings

Additional notes

msluszniak

Some comments are not needed imo

chmjkb

Overall solid work, thanks 👏🏻
Left a couple of nits

chmjkb

Two more things:

I wasn't able to compile the app for Android (due to Norbert bumping minSdkVersion in RNET). You have to bump the minSdkVersion in the example app.
Once compiled, it doesn't ask for mic permissions (im using a Pixel 10) and silently fails.

chmjkb

I think you should change the TS side as you're returnign a different thing from C++, for example:

  public async encode(waveform: Float32Array): Promise<Float32Array> {
    return new Float32Array(await this.nativeModule.encode(waveform));
  }

Also, why did you switch back to type in SpeechToTextModelConfig?

msluszniak

And rebase

msluszniak · 2026-03-09T17:14:18Z

@IgorSwat could you fix the lint and reference API, then I will run test and speech demo app.

chmjkb

Please update the docs (not the API reference) as they're outdated now.

IgorSwat · 2026-03-10T10:38:11Z

Hot take: can we just remove the API reference from the repo, please?

msluszniak · 2026-03-10T10:47:52Z

Hot take: can we just remove the API reference from the repo, please?

No, why would we do that?

IgorSwat · 2026-03-10T11:11:14Z

Hot take: can we just remove the API reference from the repo, please?

No, why would we do that?

Okay, nevermind, It passed after just 2 rebases. Could be worse :)

msluszniak · 2026-03-10T11:14:52Z

Okay, nevermind, It passed after just 2 rebases. Could be worse :)

It will always pass after rebase, I can guarantee you. So really it's not that bad, you can handle it once at the very end of the review process and it's just one command, but big benefit of having complete API. First try didn't work because signature of updated package Zod changed, and then you changed code so lines had mismatches ;)

msluszniak

Brilliant work on this one 🚀

chmjkb

One minor comment, overall great work 👏🏻

chmjkb · 2026-03-11T10:01:14Z

Also please get rid of the API reference files

IgorSwat · 2026-03-11T13:25:32Z

Also please get rid of the API reference files

Done.

IgorSwat requested review from chmjkb and msluszniak February 17, 2026 14:26

msluszniak reviewed Feb 17, 2026

View reviewed changes

Comment thread .../react-native-executorch/common/rnexecutorch/models/speech_to_text/common/schema/OnlineASR.h Outdated

Comment thread packages/react-native-executorch/common/rnexecutorch/models/speech_to_text/whisper/ASR.h Outdated

IgorSwat force-pushed the @is/speech-to-text branch from 3ca7f15 to ea943e4 Compare February 17, 2026 14:56

msluszniak reviewed Feb 18, 2026

View reviewed changes

Comment thread packages/react-native-executorch/common/rnexecutorch/models/speech_to_text/SpeechToText.h Outdated

Comment thread packages/react-native-executorch/src/constants/modelUrls.ts Outdated

msluszniak assigned IgorSwat Feb 20, 2026

msluszniak added the bug fix PRs that are fixing bugs label Feb 20, 2026

msluszniak linked an issue Feb 20, 2026 that may be closed by this pull request

Fix Speech to Text streaming mode #741

Closed

msluszniak changed the title ~~@is/speech to text~~ fix: speech to text live transcription Feb 20, 2026

IgorSwat force-pushed the @is/speech-to-text branch from 7b1e6ff to 2ee6d1d Compare March 2, 2026 09:21

msluszniak reviewed Mar 2, 2026

View reviewed changes

msluszniak reviewed Mar 3, 2026

View reviewed changes

Comment thread packages/react-native-executorch/common/rnexecutorch/models/BaseModel.h

chmjkb requested changes Mar 4, 2026

View reviewed changes

chmjkb requested changes Mar 5, 2026

View reviewed changes

IgorSwat force-pushed the @is/speech-to-text branch 3 times, most recently from 816c75a to ae017ef Compare March 6, 2026 16:12

chmjkb requested changes Mar 6, 2026

View reviewed changes

Comment thread packages/react-native-executorch/src/modules/natural_language_processing/SpeechToTextModule.ts Outdated

chmjkb requested changes Mar 6, 2026

View reviewed changes

msluszniak reviewed Mar 9, 2026

View reviewed changes

msluszniak mentioned this pull request Mar 9, 2026

fix: Protect acquiring buffer by different workers #805

Closed

12 tasks

IgorSwat force-pushed the @is/speech-to-text branch from a2e04e0 to 081aea0 Compare March 9, 2026 16:09

chmjkb requested changes Mar 10, 2026

View reviewed changes

IgorSwat force-pushed the @is/speech-to-text branch 2 times, most recently from acd7682 to 04a71b4 Compare March 10, 2026 11:06

IgorSwat added 16 commits March 10, 2026 16:47

Fix punctation comparision issue

9b1fb41

Final timestamp fix: silence estimation

bbe87ac

Remove special tokens

0715570

Add pause to streaming mode

d59b6e8

Apply review suggestions

9bc0eee

Set up url's

51dbba2

Final fixes

b2a6897

Apply review suggestions

4fd4fc0

Enable multilingual models

6d73887

Fix demo app crash

e1fdf4f

Pretty good heuristic that fixes everything :)

9e5bf5a

Optimize by removing unnecessary copies

9243543

Final nits

2047e6d

Apply review suggestions

ba73116

Update docs & API reference

52f29c1

Fix S2T tests

43cf499

IgorSwat force-pushed the @is/speech-to-text branch from 1e9d17e to 3874dae Compare March 10, 2026 15:49

Add quantized .en models & fix test URLs

b75f178

IgorSwat force-pushed the @is/speech-to-text branch from 3874dae to b75f178 Compare March 10, 2026 16:23

IgorSwat changed the title ~~fix: speech to text live transcription~~ fix!: speech to text live transcription Mar 10, 2026

test: fix one test case in STT

bdffb21

msluszniak approved these changes Mar 10, 2026

View reviewed changes

msluszniak mentioned this pull request Mar 10, 2026

Do not wait 100ms in the streaming loop #587

Closed

chmjkb requested changes Mar 11, 2026

View reviewed changes

Comment thread packages/react-native-executorch/src/index.ts Outdated

Remove API reference files

66e8e85

chmjkb approved these changes Mar 11, 2026

View reviewed changes

IgorSwat merged commit 6dd8fb6 into main Mar 11, 2026
5 checks passed

IgorSwat deleted the @is/speech-to-text branch March 11, 2026 13:41

Conversation

IgorSwat commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Introduces a breaking change?

Type of change

Tested on

Testing instructions

Screenshots

Related issues

Checklist

Additional notes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

msluszniak left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chmjkb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chmjkb left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

chmjkb left a comment

Choose a reason for hiding this comment

Uh oh!

msluszniak left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

msluszniak commented Mar 9, 2026

Uh oh!

chmjkb left a comment

Choose a reason for hiding this comment

Uh oh!

IgorSwat commented Mar 10, 2026

Uh oh!

msluszniak commented Mar 10, 2026

Uh oh!

IgorSwat commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

msluszniak commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

msluszniak left a comment

Choose a reason for hiding this comment

Uh oh!

chmjkb left a comment

Choose a reason for hiding this comment

Uh oh!

IgorSwat commented Feb 17, 2026 •

edited

Loading

IgorSwat commented Mar 10, 2026 •

edited

Loading

msluszniak commented Mar 10, 2026 •

edited

Loading