fix(workflows): normalize two-digit years in GetDOBTask by rosetta-livekit-bot[bot] · Pull Request #1837 · livekit/agents-js

rosetta-livekit-bot · 2026-06-19T07:02:30Z

Summary

Port GetDOBTask from livekit/agents with two-digit year normalization from fix(workflows): normalize two-digit years in GetDOBTask agents#6124
Normalize years like 90 to 1990 and 05 to 2005 before date validation
Export the beta DOB workflow and add a changeset

Testing

pnpm --filter @livekit/agents typecheck
pnpm --filter @livekit/agents lint (passes with existing warnings)
pnpm --filter @livekit/agents build

No tests added per porting request.

Ported from livekit/agents#6124

Original PR description

Closes #6067

GetDOBTask's prompt tells the model to normalize two-digit years ("90" likely means 1990), but _update_dob_impl takes year as a raw int with no lower bound. Smaller/faster models often pass the spoken value through literally, and date(90, 5, 15) is a valid Python date (year 90 AD): the future-date check passes, no ToolError is raised, and the task completes with a corrupted birthdate. Because this workflow tends to feed identity/healthcare/fintech intake, it's silent data corruption rather than a visible error.

Fix: normalize year < 100 at the top of _update_dob_impl with a pivot window keyed on the current year, which is exactly what the prompt already promises. 90 -> 1990, 05 -> 2005, 26 -> 2026, 27 -> 1927. Four-digit years are left untouched, so there's no regression. The issue floated a hard floor (raise on year < 1900) as an alternative; I went with normalization since it matches the documented prompt contract and the smaller behavioral change.

Verified:

Added tests/test_dob.py (unit, no LLM). It fails on main (date(90, 5, 15) != date(1990, 5, 15)) and passes with the fix.
pytest tests/test_dob.py --unit -> 2 passed.
ruff check and ruff format --check clean on both files.

…1517)

…1525) Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com> Co-authored-by: u9g <jason.lernerman@livekit.io>

Agent.llmNode now returns ReadableStream<ChatChunk | string | FlushSentinel>, but the agent_v2 hook overrides and AgentHookAdapter still declared the narrower ChatChunk | string union, so passing super.llmNode as the fallback failed to type-check. Widen the override return types and the adapter's fallback/return signatures to include FlushSentinel. Co-authored-by: Cursor <cursoragent@cursor.com>

Co-authored-by: Brian Yin <brian.yin@livekit.io> Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com> Co-authored-by: u9g <jason.lernerman@livekit.io>

Catch end-call close listener errors to avoid unhandled rejections during shutdown, and make public tool type guards return false for null inputs.

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

…egment (#1760) Co-authored-by: Cursor <cursoragent@cursor.com>

…#1698) Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com> Co-authored-by: Toubat <brian.yin@livekit.io>

# Conflicts: # agents/src/voice/agent_activity.test.ts # agents/src/voice/generation.ts # agents/src/voice/generation_tts_timeout.test.ts

The test asserted an exact tick count ([0,1,2]) against real timers with a 5ms margin, which flakes on loaded CI runners. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

Co-authored-by: Long Chen <longch1024@gmail.com>

Co-authored-by: Cursor <cursoragent@cursor.com> Co-authored-by: u9g <jason.lernerman@livekit.io>

Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Cursor <cursoragent@cursor.com>

Co-authored-by: u9g <jason.lernerman@livekit.io>

changeset-bot · 2026-06-19T07:02:41Z

🦋 Changeset detected

Latest commit: 2a64eae

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 35 packages

Name	Type
@livekit/agents	Major
@livekit/agents-plugin-anam	Major
@livekit/agents-plugin-assemblyai	Major
@livekit/agents-plugin-baseten	Major
@livekit/agents-plugin-bey	Major
@livekit/agents-plugin-cartesia	Major
@livekit/agents-plugin-cerebras	Major
@livekit/agents-plugin-deepgram	Major
@livekit/agents-plugin-did	Major
@livekit/agents-plugin-elevenlabs	Major
@livekit/agents-plugin-fishaudio	Major
@livekit/agents-plugin-google	Major
@livekit/agents-plugin-hedra	Major
@livekit/agents-plugin-hume	Major
@livekit/agents-plugin-inworld	Major
@livekit/agents-plugin-lemonslice	Major
@livekit/agents-plugin-liveavatar	Major
@livekit/agents-plugin-livekit	Major
@livekit/agents-plugin-minimax	Major
@livekit/agents-plugin-mistral	Major
@livekit/agents-plugin-mistralai	Major
@livekit/agents-plugin-neuphonic	Major
@livekit/agents-plugin-openai	Major
@livekit/agents-plugin-perplexity	Major
@livekit/agents-plugin-phonic	Major
@livekit/agents-plugin-resemble	Major
@livekit/agents-plugin-rime	Major
@livekit/agents-plugin-runway	Major
@livekit/agents-plugin-sarvam	Major
@livekit/agents-plugin-silero	Major
@livekit/agents-plugin-soniox	Major
@livekit/agents-plugin-tavus	Major
@livekit/agents-plugins-test	Major
@livekit/agents-plugin-trugen	Major
@livekit/agents-plugin-xai	Major

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no bugs or issues to report.

devin-ai-integration

Devin Review found 2 new potential issues.

devin-ai-integration · 2026-07-02T13:54:07Z

+  private async deliverReply(session: AgentSession): Promise<void> {
+    try {
+      if (this.owningActivity) {
+        await this.owningActivity.waitForIdle();
+      } else if ('waitForIdle' in session && typeof session.waitForIdle === 'function') {
+        await session.waitForIdle();
+      }
+
+      const updates = [...this.pendingUpdates];
+      this.pendingUpdates = [];
+      const pendingItems = updates.flatMap((update) => update.items);
+      if (pendingItems.length === 0) return;
+
+      const targetAgent = this.owningActivity?.agent ?? getCurrentAgent(session);
+
+      const itemsToInsert = updates
+        .filter((update) => update.target !== targetAgent)
+        .flatMap((update) => update.items);
+
+      let chatCtx: ChatContext | undefined;
+      if (itemsToInsert.length > 0) {
+        chatCtx = targetAgent.chatCtx.copy();
+        chatCtx.insert(itemsToInsert);
+      }
+
+      const lastItem = pendingItems[pendingItems.length - 1]!;
+      const targetItems = targetAgent.chatCtx.items;
+      const atTail =
+        targetItems.length > 0 && targetItems[targetItems.length - 1]!.id === lastItem.id;
+      const callIds = pendingItems
+        .filter((item): item is FunctionCallOutput => item.type === 'function_call_output')
+        .map((item) => item.callId);
+      const instructions = renderTemplate(
+        atTail ? this.toolOptions.replyAtTailTemplate : this.toolOptions.replyMaybeCoveredTemplate,
+        { callIds },
+      );
+
+      session.generateReply({
+        instructions,
+        toolChoice: 'none',
+        chatCtx,
+      });
+    } finally {
+      this._replyTaskDone = true;
+    }
+  }


🔴 Async tool's final result can be permanently lost when a delivery is already in progress

A background tool's final result is silently dropped (enqueueReply at agents/src/voice/tool_executor.ts:529) when the delivery task is still running from a prior update, so the agent never speaks the completed result to the user.

Impact: The user never receives the final answer from a long-running background tool, even though the tool completed successfully.

Race between enqueueReply and deliverReply draining pendingUpdates

The race window:

Tool calls ctx.update('started') → first enqueueReply starts deliverReply (sets _replyTaskDone = false).

deliverReply copies and clears pendingUpdates (agents/src/voice/tool_executor.ts:590-591), then awaits waitForIdle() + generateReply().

While deliverReply is still awaiting, the tool finishes and runTool calls enqueueReply with the final result (agents/src/voice/tool_executor.ts:529).

Inside enqueueReply, the guard at agents/src/voice/tool_executor.ts:426 checks this._replyTaskDone — it's still false (deliverReply hasn't finished), so no new delivery task is started. The final result is pushed to pendingUpdates.

deliverReply finishes and sets _replyTaskDone = true at agents/src/voice/tool_executor.ts:625, but never re-checks pendingUpdates.

No subsequent enqueueReply call ever arrives → the final result sits in pendingUpdates forever.

Prompt for agents

The deliverReply method drains pendingUpdates once and then exits, setting _replyTaskDone = true. But new updates can arrive (via enqueueReply) after the drain but before _replyTaskDone is set. Since _replyTaskDone is still false at that point, enqueueReply does not start a new delivery task, and the update is stranded. Fix: After setting _replyTaskDone = true in the finally block, check if pendingUpdates has new items. If so, recursively start a new deliverReply (or loop). Alternatively, use a loop inside deliverReply that keeps draining until pendingUpdates is empty, only setting _replyTaskDone = true when there's truly nothing left. The key invariant is: if pendingUpdates is non-empty after _replyTaskDone becomes true, a new delivery must be scheduled. Relevant code: ToolExecutor.enqueueReply (line 415-441), ToolExecutor.deliverReply (line 582-627), and ToolExecutor.runTool final enqueueReply call (line 529).

Was this helpful? React with 👍 or 👎 to provide feedback.

devin-ai-integration · 2026-07-02T13:54:08Z

+    const unlock = await this.duplicateLock.lock();
+    try {
+      const duplicateResult = await this.checkDuplicate(functionName, {
+        onDuplicate: tool.onDuplicate,
+        confirmDuplicate,
+      });
+      if (duplicateResult !== undefined) return duplicateResult;
+
+      if (this.runningTasks.has(callId)) {
+        throw new Error(`Task already running for call_id: ${callId}`);
+      }
+
+      const firstUpdateFuture = new Future<unknown>();
+      runCtx._attachExecutor(this, firstUpdateFuture);
+
+      const controller = new AbortController();
+      const abort = () => {
+        queueMicrotask(() => {
+          controller.abort();
+          if (!firstUpdateFuture.done) {
+            firstUpdateFuture.reject(new Error('tool call was aborted'));
+          }
+        });
+      };
+      abortSignal?.addEventListener('abort', abort, { once: true });
+
+      // Once a tool goes non-blocking (it called ctx.update and detached from its
+      // owning speech), a speech interruption must NOT abort it — async tools are
+      // meant to survive interruptions and deliver their result later (matches
+      // Python, where the exe_task is independent and only cancel()/drain() stop it).
+      // Stop forwarding the speech abort to this tool; explicit cancel()/drain()/
+      // aclose() still abort it directly via task.controller.
+      void firstUpdateFuture.await
+        .then(() => {
+          if (runCtx.functionCall.extra.__livekit_agents_tool_non_blocking === true) {
+            abortSignal?.removeEventListener('abort', abort);
+          }
+        })
+        .catch(() => {});
+
+      const toolPromiseRef: { promise?: Promise<unknown> } = {};
+      const promise = this.runTool({
+        tool,
+        runCtx,
+        rawArguments: args as Parameters,
+        firstUpdateFuture,
+        controller,
+        onUserToolStarted,
+        toolPromiseRef,
+      }).finally(() => {
+        this.runningTasks.delete(callId);
+        runningTasks.get(runCtx.session)?.delete(callId);
+        abortSignal?.removeEventListener('abort', abort);
+        runCtx._detachExecutor();
+      });
+
+      const task: RunningTask = {
+        ctx: runCtx,
+        promise,
+        controller,
+        firstUpdateFuture,
+        executor: this,
+        allowCancellation: Boolean(tool.flags & ToolFlag.CANCELLABLE),
+        toolPromiseRef,
+      };
+      this.runningTasks.set(callId, task);
+      let sessionTasks = runningTasks.get(runCtx.session);
+      if (!sessionTasks) {
+        sessionTasks = new Map();
+        runningTasks.set(runCtx.session, sessionTasks);
+      }
+      sessionTasks.set(callId, task);
+
+      return firstUpdateFuture.await;
+    } finally {
+      unlock();
+    }


🚩 Duplicate lock serializes all concurrent tool executions, not just duplicates

The duplicateLock mutex in ToolExecutor.execute (agents/src/voice/tool_executor.ts:214) is acquired before the duplicate check and held until firstUpdateFuture.await resolves at line 287. For blocking tools (those that never call ctx.update()), firstUpdateFuture only resolves when the tool's execute() completes (line 504-506). This means the lock is held for the entire duration of the tool execution, serializing ALL concurrent tool calls through a single mutex — even calls to completely different tools that have no duplicate concern. For example, if the LLM emits two parallel tool calls (getWeather and playMusic), the second one cannot even begin its duplicate check until the first tool finishes. This may be intentional to match Python's behavior, but it's a significant performance constraint for parallel tool calls.

Was this helpful? React with 👍 or 👎 to provide feedback.

toubatbrian and others added 30 commits May 28, 2026 14:43

Refactor ToolContext to parity class taking a list of Tool | Toolset (#…

244d8bc

…1517)

feat(agents): add Toolset support to ToolContext and AgentActivity (#…

6b85bfe

…1525) Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com> Co-authored-by: u9g <jason.lernerman@livekit.io>

Merge branch 'main' into 1.5.0

57c1634

Merge branch 'main' into 1.5.0

07648b0

feat(agents): add beta end call tool (#1474)

c8b56bd

Co-authored-by: Brian Yin <brian.yin@livekit.io> Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com> Co-authored-by: u9g <jason.lernerman@livekit.io>

arden end-call shutdown and tool guards

f089da4

Catch end-call close listener errors to avoid unhandled rejections during shutdown, and make public tool type guards return false for null inputs.

fix(elevenlabs): end server vad turns (#1745)

0daf4ef

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

Add Inworld delivery mode inference TTS option (#1749)

d423f3f

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

Don't retain recorded events when recording is disabled (#1750)

9d86bc5

ci: speed up type checking (#1742)

03429b8

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

fix(voice): scope forwardAudio playback-started listener to its own s…

ec765ea

…egment (#1760) Co-authored-by: Cursor <cursoragent@cursor.com>

feat(barge-in): add default threshold support and drop http transport (…

416871a

…#1698) Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

feat(voice): add agent instruction updates (#1783)

eae6074

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com> Co-authored-by: Toubat <brian.yin@livekit.io>

Merge remote-tracking branch 'origin/main' into 1.5.0

882491c

# Conflicts: # agents/src/voice/agent_activity.test.ts # agents/src/voice/generation.ts # agents/src/voice/generation_tts_timeout.test.ts

test: use fake timers in manual-abort Task test to deflake CI

b579216

The test asserted an exact tick count ([0,1,2]) against real timers with a 5ms margin, which flakes on loaded CI runners. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

fix(elevenlabs): stop recognize from mutating instance language (#1789)

2cd85da

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

fix(phonic): add client header to conversations (#1781)

d9f56db

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

fix(inference): remove stale Cartesia STT model type (#1794)

20fcec7

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

feat(llm): list tools for unknown functions

2404611

fix(voice): skip stale end-of-turn metrics (#1803)

2765bf0

Co-authored-by: rosetta-livekit-bot[bot] <282703043+rosetta-livekit-bot[bot]@users.noreply.github.com>

Merge branch 'main' into 1.5.0

73a575f

Async Toolsets (#1736)

2aa44aa

Co-authored-by: Long Chen <longch1024@gmail.com>

fix formatting and build

d5456f0

feat(llm): list tools for unknown functions (#1800)

000f05a

Add object tool syntax compatibility (#1819)

2998c52

Co-authored-by: Cursor <cursoragent@cursor.com> Co-authored-by: u9g <jason.lernerman@livekit.io>

feat(llm): add LLMStream.collect() to await full response (#1568)

d130e27

Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: Cursor <cursoragent@cursor.com>

feat(testing): add withMockTools utility for mocking agent tools (#1549)

039dd45

Co-authored-by: u9g <jason.lernerman@livekit.io>

Update turbo.json

762a167

Add scoped filler support to RunContext (#1818)

8a91a0b

fix(workflows): normalize two-digit years in GetDOBTask

2a64eae

rosetta-livekit-bot Bot requested a review from tinalenguyen June 19, 2026 07:02

devin-ai-integration Bot reviewed Jun 19, 2026

View reviewed changes

Base automatically changed from 1.5.0 to main July 2, 2026 13:48

An error occurred while trying to automatically change base from 1.5.0 to main July 2, 2026 13:48

devin-ai-integration Bot reviewed Jul 2, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(workflows): normalize two-digit years in GetDOBTask#1837

fix(workflows): normalize two-digit years in GetDOBTask#1837
rosetta-livekit-bot[bot] wants to merge 31 commits into
mainfrom
port-dob-two-digit-years

rosetta-livekit-bot Bot commented Jun 19, 2026 •

edited

Loading

Uh oh!

changeset-bot Bot commented Jun 19, 2026

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot Jul 2, 2026

Uh oh!

devin-ai-integration Bot Jul 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

rosetta-livekit-bot Bot commented Jun 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

changeset-bot Bot commented Jun 19, 2026

🦋 Changeset detected

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

✅ Devin Review: No Issues Found

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rosetta-livekit-bot Bot commented Jun 19, 2026 •

edited

Loading