Tokens in VLMPipeline output by dkalinowski · Pull Request #3808 · openvinotoolkit/openvino.genai

dkalinowski · 2026-05-06T09:31:21Z

Description

Extend VLMDecodedResults (and DecodedResults) with additional field: tokens.
OVMS requires tokens for new tool parsers to work, for example by LFM or gemma models.

CVS-184756

Checklist:

This PR follows GenAI Contributing guidelines.
Tests have been updated or added to cover the new code.
This PR fully addresses the ticket. - PR in OVMS with LFM/gemma parsers will follow this topic
I have made corresponding changes to the documentation.

Copilot

Pull request overview

This PR extends generation outputs to include raw token IDs alongside decoded texts, making DecodedResults / VLMDecodedResults usable for downstream token-based tooling (e.g., OVMS tool parsers).

Changes:

Added tokens to DecodedResults (C++), and populated it across LLM/VLM pipelines (including speculative decoding and continuous batching).
Exposed tokens through Python (pybind + .pyi) and JavaScript (N-API helper + TS wrappers/classes).
Added Python tests validating token availability and round-trip tokenizer.decode(tokens) == text.

Reviewed changes

Copilot reviewed 15 out of 15 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
tests/python_tests/test_vlm_pipeline.py	Adds a VLM test validating `tokens` presence/type and decode round-trip.
tests/python_tests/test_llm_pipeline.py	Adds an LLM test validating `DecodedResults.tokens` and decode round-trip.
src/python/py_openvino_genai.cpp	Exposes `DecodedResults.tokens` via pybind11.
src/python/openvino_genai/py_openvino_genai.pyi	Updates Python typing stubs to include `DecodedResults.tokens`.
src/js/src/helper.cpp	Adds `tokens` field serialization to JS objects for LLM/VLM decoded results.
src/js/lib/pipelines/vlmPipeline.ts	Threads `tokens` through VLM pipeline callback result into `VLMDecodedResults`.
src/js/lib/pipelines/llmPipeline.ts	Threads `tokens` through LLM pipeline callback result into `DecodedResults`.
src/js/lib/decodedResults.ts	Adds `tokens` field/constructor arg to `DecodedResults` and `VLMDecodedResults`.
src/cpp/src/visual_language/pipeline.cpp	Populates `decoded.tokens` from encoded results in VLM pipeline.
src/cpp/src/visual_language/continuous_batching_adapter.hpp	Propagates `tokens` in continuous batching adapter results.
src/cpp/src/speculative_decoding/stateful/stateful_pipeline_base.cpp	Populates `DecodedResults.tokens` for stateful speculative decoding.
src/cpp/src/llm/pipeline_static.cpp	Populates `DecodedResults.tokens` for LLM generation.
src/cpp/src/llm/pipeline_stateful.cpp	Populates `DecodedResults.tokens` in stateful LLM decoded-result assembly.
src/cpp/src/continuous_batching/pipeline_base.cpp	Adds `tokens` to `VLMDecodedResults` produced by continuous batching path.
src/cpp/include/openvino/genai/llm_pipeline.hpp	Adds `tokens` member to the public `DecodedResults` C++ type.

    std::vector<std::string> texts;
    std::vector<float> scores;
    std::vector<GenerationFinishReason> finish_reasons;
+    /// @brief Generated token ids per sequence (parallels @ref texts).
+    std::vector<std::vector<int64_t>> tokens;
    PerfMetrics perf_metrics;


Copilot

Pull request overview

Copilot reviewed 17 out of 17 changed files in this pull request and generated 3 comments.

dkalinowski · 2026-05-13T13:53:52Z

+    Napi::Array js_array = Napi::Array::New(env, value.size());
+    for (size_t i = 0; i < value.size(); ++i) {
+        const auto& sequence = value[i];
+        Napi::BigInt64Array sequence_array = Napi::BigInt64Array::New(env, sequence.size());
+        std::copy(sequence.begin(), sequence.end(), sequence_array.Data());
+        js_array[i] = sequence_array;


sgonorov

Make sure to polish copilot comments first.

yatarkan

@Retribution98 Could you please review JS part

dkalinowski · 2026-05-13T13:56:09Z

@sgonorov I cleaned up copilot comment except one: #3808 (comment)

It has valid point that it will break ABI for C++ users, for those who link against a prebuilt shared library.
We need to either accept that or hide getters behind accessors + pimpl pattern.

## Description From openvinotoolkit#3808 (comment) ## Checklist: - [x] This PR follows [GenAI Contributing guidelines](https://github.com/openvinotoolkit/openvino.genai?tab=contributing-ov-file#contributing).  N/A Tests have been updated or added to cover the new code.  N/A This PR fully addresses the ticket.  N/A I have made corresponding changes to the documentation.

Retribution98

JS part looks good

dkalinowski added 2 commits May 6, 2026 10:46

v1

9e0ea39

v2

35cd38a

dkalinowski requested review from Copilot, dtrawins, mzegla and przepeck May 6, 2026 09:31

dkalinowski requested review from Retribution98, Wovchena, as-suvorov, pavel-esir, popovaan, sbalandi, sgonorov and yatarkan as code owners May 6, 2026 09:31

Copilot started reviewing on behalf of dkalinowski May 6, 2026 09:36 View session

Copilot AI reviewed May 6, 2026

View reviewed changes

przepeck approved these changes May 6, 2026

View reviewed changes

dkalinowski added 3 commits May 7, 2026 12:52

copilot review

7f8ceb2

fix tests

0d30f1f

Merge remote-tracking branch 'origin/master' into tokens-in-vlm-output

6fbd064

Copilot AI review requested due to automatic review settings May 7, 2026 12:45

Copilot started reviewing on behalf of dkalinowski May 7, 2026 12:46 View session

Copilot AI reviewed May 7, 2026

View reviewed changes

sgonorov approved these changes May 11, 2026

View reviewed changes

yatarkan approved these changes May 13, 2026

View reviewed changes

add algorithm include

bd5eb41

Wovchena mentioned this pull request May 14, 2026

Don't require ABI stability #3861

Merged

1 task

Retribution98 approved these changes May 27, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tokens in VLMPipeline output#3808

Tokens in VLMPipeline output#3808
dkalinowski wants to merge 6 commits into
openvinotoolkit:masterfrom
dkalinowski:tokens-in-vlm-output

dkalinowski commented May 6, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

dkalinowski May 13, 2026

Uh oh!

Uh oh!

Uh oh!

sgonorov left a comment

Uh oh!

yatarkan left a comment

Uh oh!

dkalinowski commented May 13, 2026

Uh oh!

Retribution98 left a comment •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

dkalinowski commented May 6, 2026

Description

Checklist:

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

dkalinowski May 13, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sgonorov left a comment

Choose a reason for hiding this comment

Uh oh!

yatarkan left a comment

Choose a reason for hiding this comment

Uh oh!

dkalinowski commented May 13, 2026

Uh oh!

Retribution98 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Retribution98 left a comment •

edited

Loading