Fix memory leaks and possible use-after-free #2024

Open

uvlad7 wants to merge 5 commits into alphacep:master from
Conversation
- **Free string returned by `vosk_text_processor_itn`** - the C function transfers ownership of the returned string, but the Python binding was not freeing it, causing a leak on every `Processor.process()` call. `void free(void *)` is now exposed via cffi and called after copying the string.
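The ownership contract can be sketched as follows. This is a minimal illustration, not the real binding code: `itn_stub` is a hypothetical stand-in for `vosk_text_processor_itn`, and the actual fix does the equivalent copy-then-free on the Python side via cffi.

```cpp
#include <cassert>
#include <cstdlib>
#include <cstring>
#include <string>

// Hypothetical stand-in for vosk_text_processor_itn: returns a malloc'd
// C string whose ownership transfers to the caller.
static char *itn_stub(const char *input) {
    char *out = static_cast<char *>(std::malloc(std::strlen(input) + 1));
    std::strcpy(out, input);
    return out;
}

// The pattern the fixed binding follows: copy the result into a managed
// string first, then free the C allocation so nothing leaks per call.
static std::string process(const char *input) {
    char *c_str = itn_stub(input);
    std::string result(c_str);  // copy before releasing the C buffer
    std::free(c_str);           // ownership was transferred to us, so we free
    return result;
}
```

Without the `free`, every call would leak one allocation of the result's length, which is exactly the per-`process()` leak described above.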
- **Fix `set_spk_model` leak** - calling `vosk_recognizer_set_spk_model` twice on the same recognizer - or once on a recognizer created with `SpkModel` - was leaking both the old speaker model reference and the old `spk_feature_` object. The old model is now unref'd and the old feature object is deleted before reassigning.
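A minimal sketch of that fix, using simplified stand-in types and hypothetical member names (`spk_model_`, `spk_feature_`) modeled on the description above, not the actual Vosk classes:

```cpp
#include <cassert>

// Simplified stand-ins; the real types live in the Vosk core.
struct SpkModel {
    int ref_count = 1;  // the creator holds the initial reference
    void Ref()   { ++ref_count; }
    void Unref() { if (--ref_count == 0) delete this; }
};
struct SpkFeature {};   // placeholder for the speaker feature pipeline

struct Recognizer {
    SpkModel   *spk_model_   = nullptr;
    SpkFeature *spk_feature_ = nullptr;

    void SetSpkModel(SpkModel *model) {
        model->Ref();                          // take the new ref first
        if (spk_model_) spk_model_->Unref();   // previously leaked
        delete spk_feature_;                   // previously leaked
        spk_model_   = model;
        spk_feature_ = new SpkFeature();
    }

    ~Recognizer() {
        if (spk_model_) spk_model_->Unref();
        delete spk_feature_;
    }
};
```

Taking the new reference before dropping the old one also makes a redundant `SetSpkModel` call with the same model safe.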
- **Reference counting for `BatchModel`** - `BatchModel` now tracks how many `BatchRecognizer` instances hold a reference to it. Each `BatchRecognizer` increments the count on construction and decrements it on destruction; `vosk_batch_model_free` also decrements instead of deleting directly. The object is only deleted when the count reaches zero. The implementation uses the same mechanism as the regular `Model` and ensures that differing object lifetimes in the Python/Ruby bindings don't cause a use-after-free. This can be a breaking change, but it simplifies the bindings; otherwise every binding would need to ensure that `BatchRecognizer` keeps a reference to its `BatchModel`.
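The new lifetime rule can be illustrated like this; the function name mirrors the C API, but the types are simplified stand-ins (with a `live_batch_models` counter added purely as a test hook), not the real implementation:

```cpp
#include <cassert>

static int live_batch_models = 0;  // test hook: counts live model objects

struct BatchModel {
    int ref_count = 1;             // the creator holds the initial reference
    BatchModel()  { ++live_batch_models; }
    ~BatchModel() { --live_batch_models; }
    void Ref()    { ++ref_count; }
    void Unref()  { if (--ref_count == 0) delete this; }
};

// vosk_batch_model_free now drops one reference instead of deleting outright.
static void vosk_batch_model_free(BatchModel *model) { model->Unref(); }

struct BatchRecognizer {
    BatchModel *model_;
    explicit BatchRecognizer(BatchModel *model) : model_(model) {
        model_->Ref();             // keep the model alive while we exist
    }
    ~BatchRecognizer() {
        model_->Unref();           // may delete the model if we were last
    }
};
```

The key behavioral change: a binding may free the model while a recognizer still exists, and the model survives until the last holder releases it.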
- **Safe `BatchRecognizer` teardown** - the destructor now ensures any in-progress stream is finished and all pending chunks are drained before the recognizer is destroyed, and before it releases its reference to the model. This is intended to address the heap-use-after-free crashes reported in Crash with Node.js acceptWaveformAsync (heap-use-after-free) #1189, where a recognizer was freed while an async waveform call was still running. Note: this commit was generated with AI assistance. The general approach looks reasonable to me, but I have not fully reviewed or tested this code, and I'm not certain it fully resolves the race condition described in #1189. Extra scrutiny on `BatchRecognizer::~BatchRecognizer()` and the `finished_` logic would be appreciated.
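A toy model of the hazard and the drain-before-teardown pattern; this is not the Vosk implementation (which runs on a GPU batch pipeline), just a self-contained illustration where a worker thread consumes queued chunks and the destructor refuses to complete until the queue is empty:

```cpp
#include <atomic>
#include <cassert>
#include <condition_variable>
#include <mutex>
#include <queue>
#include <thread>
#include <vector>

class ToyBatchRecognizer {
 public:
    // processed: external counter so callers can observe work completed
    // even after the recognizer is destroyed.
    explicit ToyBatchRecognizer(std::atomic<int> *processed)
        : processed_(processed), worker_(&ToyBatchRecognizer::Run, this) {}

    void AcceptWaveformAsync(std::vector<short> chunk) {
        std::lock_guard<std::mutex> lk(mu_);
        chunks_.push(std::move(chunk));
        cv_.notify_one();
    }

    ~ToyBatchRecognizer() {
        {   // signal shutdown; the worker drains remaining chunks first
            std::lock_guard<std::mutex> lk(mu_);
            finished_ = true;
        }
        cv_.notify_one();
        worker_.join();  // nothing is in flight past this point, so it is
                         // now safe to release shared state (e.g. the model)
    }

 private:
    void Run() {
        std::unique_lock<std::mutex> lk(mu_);
        for (;;) {
            cv_.wait(lk, [this] { return finished_ || !chunks_.empty(); });
            if (chunks_.empty() && finished_) return;  // drained and done
            chunks_.pop();        // "decode" the chunk (elided in this toy)
            ++*processed_;
        }
    }

    std::mutex mu_;
    std::condition_variable cv_;
    std::queue<std::vector<short>> chunks_;
    bool finished_ = false;
    std::atomic<int> *processed_;
    std::thread worker_;         // declared last so it starts fully initialized
};
```

The bug in #1189 corresponds to a destructor that releases shared state without the `join`: the async worker then touches freed memory. Whether the real `finished_` handshake closes every window of that race is exactly what needs review.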