Update docs with info on event interleaving by kidq330 · Pull Request #4 · membraneframework/membrane_whisper_plugin

kidq330 · 2026-04-22T12:08:55Z

No description provided.

kidq330 · 2026-04-22T12:11:46Z

Waiting for elixir-nx/bumblebee#454, adding a try block when the GenServer crashes would be a possible workaround

varsill · 2026-04-28T14:04:36Z


    assert_end_of_stream(pipeline_pid, :sink, :input, 20_000)

    [


You are not using this list anywhere, are you? :D

No, I only used it to group the assertions together visually

But I suppose we should assert on receiving these events and I don't see it hapenning 🤔

varsill · 2026-04-28T14:08:06Z

  @impl true
  def init(opts) do
-    GenServer.cast(self(), :serving_start)
+    Process.flag(:trap_exit, true)


Why do we need this one?

Nx.Serving.run starts the serving in a separate process internally using spawn_link, and sends an EXIT signal to the ModelServer when it crashes. Without the trap, the ModelServer process gets killed before executing the code from the catch block.

I don't know if there's any machinations to avoid the trap, and I've had an LLM search through Bumblebee and Nx but it looks like the linking behaviour is not configurable

How about spawning it under a dedicated Agent which would by monitored by the ModelServer?

varsill · 2026-04-28T14:09:47Z


+  The transcripts are sent via the `:output` pad along with the audio buffers, as `Membrane.Whisper.TranscriptEvent` events.
+  A sequence of audio buffers is followed by an event containing the transcript for said sequence, e.g.:
+  `<audio frames 0s - 10s> <event with transciption of 0s-10s> <audio frames 10s-20s> <event with transcription of 10s-20s> <audio frames 20s-30s> ...`


It's not clear to me what happens when e.g. there is a chunk of audio (0s-10s) but the voice starts at 5s - will the TranscriptionEvent be sent with 0s-10s timestamp or 5s-10s?

We don't have a mechanism for timestamping the events by default, hence the interleaving. The only way for TranscriptEvent.start_timestamp_seconds etc. to be non-nil is to enable timestamp generation in the Whisper serving via timestamps: :segments.

Unfortunately, enabling timestamps in Whisper seems to make the serving output transcripts correspond to audio of arbitrary length (the model doesn't respect the chunk duration anymore), violating the interleaving invariant. I'm working on something that might fix this by queueing buffers inside the filter and push them downstream based on the timestamps returned by Whisper when enabled, but that's outside the scope of this PR.

Update docs with info on event interleaving

d143395

kidq330 self-assigned this Apr 22, 2026

kidq330 added this to Smackore Apr 22, 2026

kidq330 moved this to In Progress in Smackore Apr 22, 2026

kidq330 added 3 commits April 28, 2026 11:28

Catch Bumblebee crash

e5bf977

Refactor test

c254308

Fix credo unnamed variable

1d9d26b

kidq330 marked this pull request as ready for review April 28, 2026 09:52

varsill self-requested a review April 28, 2026 14:02

varsill requested changes Apr 28, 2026

View reviewed changes

kidq330 requested a review from varsill April 30, 2026 07:35

kidq330 added 2 commits May 13, 2026 12:23

Monitor the ModelServer

4a60b8a

monitor in filter instead of ModelServer

c36cae6

kidq330 moved this from In Progress to In Review in Smackore May 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update docs with info on event interleaving#4

Update docs with info on event interleaving#4
kidq330 wants to merge 6 commits into
masterfrom
context_update_docs

kidq330 commented Apr 22, 2026

Uh oh!

kidq330 commented Apr 22, 2026

Uh oh!

varsill Apr 28, 2026

Uh oh!

kidq330 Apr 29, 2026

Uh oh!

varsill May 6, 2026 •

edited

Loading

Uh oh!

varsill Apr 28, 2026

Uh oh!

kidq330 Apr 30, 2026

Uh oh!

varsill May 6, 2026

Uh oh!

varsill Apr 28, 2026

Uh oh!

kidq330 Apr 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

kidq330 commented Apr 22, 2026

Uh oh!

kidq330 commented Apr 22, 2026

Uh oh!

varsill Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

kidq330 Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

varsill May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

varsill Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

kidq330 Apr 30, 2026

Choose a reason for hiding this comment

Uh oh!

varsill May 6, 2026

Choose a reason for hiding this comment

Uh oh!

varsill Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

kidq330 Apr 29, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

varsill May 6, 2026 •

edited

Loading