fix(hook): PendingToolRecoveryHook false positive on historical ToolResultBlocks in multi-turn (#1555) by Sparkle6979 · Pull Request #1566 · agentscope-ai/agentscope-java

Sparkle6979 · 2026-06-01T14:34:36Z

Summary

PendingToolRecoveryHook is silently non-functional in multi-turn conversations.

Root Cause

AgentBase.notifyPreCall() (line 672-678) merges the full memory snapshot with new callArgs into PreCallEvent.inputMessages. The hook's guard condition at line 110 checks this merged list for any ToolResultBlock:

boolean userProvidedResults =
    inputMessages.stream().anyMatch(m -> m.hasContentBlocks(ToolResultBlock.class));
if (userProvidedResults) return;  // skip recovery

In multi-turn scenarios, memory always contains ToolResultBlocks from earlier successful tool calls, so userProvidedResults is always true — the hook always skips recovery, and ReActAgent.doCall() throws IllegalStateException.

Fix

Narrow the check to only match ToolResultBlock IDs present in the pendingIds set:

boolean userProvidedResults =
    inputMessages.stream()
        .flatMap(m -> m.getContentBlocks(ToolResultBlock.class).stream())
        .anyMatch(tr -> pendingIds.contains(tr.getId()));

Historical ToolResultBlocks (e.g. call_1) are for already-resolved tool calls, so their IDs are never in pendingIds — no false positive. HITL results (user-provided ToolResultBlock for a pending call) do match pendingIds, so the hook correctly defers to doCall().

Why existing tests pass

HookStopAgentTest.testNewMsgWithPendingToolUseContinuesActing only covers single-turn: memory has no historical ToolResultBlock, so the false positive never triggers.

Closes #1555

Test plan

Core bug: multi-turn plain text recovery with historical ToolResultBlocks
Accumulated history: 3+ successful tool turns → interrupted → recovery
HITL single-turn and multi-turn: user provides matching result
Unrelated ToolResultBlock ID does not block auto-recovery
Single-turn regression: classic recovery still works
Normal multi-turn without pending calls unaffected
Multiple pending calls: all auto-recovered (single-turn and multi-turn)
Existing HookStopAgentTest passes (no regression)

…ingIds

…i-turn

codecov · 2026-06-01T14:47:09Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Sparkle6979 · 2026-06-01T14:50:45Z

Design note: why this approach vs suggested alternatives

The issue description proposed two options:

Option A (filter by role): filter out ASSISTANT/TOOL messages, then check for ToolResultBlock. This is fragile — in HITL scenarios, user-provided ToolResultBlocks also carry TOOL roles, so the filter does not reliably distinguish historical results from user-provided results.

Option B (expose snapshotSize on PreCallEvent): requires an API change to PreCallEvent, which is more invasive.

This PR approach (match pendingIds): directly checks whether any ToolResultBlock ID matches a pending tool call. Historical results are never in pendingIds, so no false positive. HITL results for a pending call DO match, so they correctly defer to doCall(). No API change, 3 lines modified.

AgentScopeJavaBot

🤖 AI Review

The fix is precise and correctly addresses the multi-turn false positive described in #1555. By narrowing the HITL guard from "any ToolResultBlock present" to "ToolResultBlock id ∈ pendingIds", the hook now correctly distinguishes user-supplied results for the currently pending tool call from historical results already resolved in earlier turns. Since pendingIds is built from ToolUseBlock ids on the last assistant message that have no matching result, historical (already-resolved) ids cannot collide with it, so the new condition is logically tight. The HITL deferral path remains intact: when the user sends a matching id the hook still returns the event unmodified for ReActAgent.doCall() to consume. No new null-safety risk of practical concern — pendingIds is sourced from non-null ToolUseBlock::getId, and getContentBlocks always returns a non-null list. Test coverage is thorough and well-partitioned (core bug, HITL, unrelated id, single-turn regression, multi-pending). No issues found that would block merge.

AgentScopeJavaBot · 2026-06-02T02:30:14Z

+            this.executed = executed;
+        }
+
+        @Tool(name = "test_tool", description = "A test tool")


[nit] createConditionalStopHook(counter, stopAt) couples the test to internal PostReasoningEvent firing order (the stopAt=3/stopAt=7 magic numbers). It works today but will be brittle if reasoning-event semantics evolve. Consider stopping based on a structural signal (e.g. once a specific ToolUseBlock id appears) in a follow-up to make these tests more refactor-resilient. Non-blocking.

AgentScopeJavaBot · 2026-06-02T02:30:14Z

-                inputMessages.stream().anyMatch(m -> m.hasContentBlocks(ToolResultBlock.class));
+                inputMessages.stream()
+                        .flatMap(m -> m.getContentBlocks(ToolResultBlock.class).stream())
+                        .anyMatch(tr -> pendingIds.contains(tr.getId()));


[nit] Defensive consideration only (not a real bug today): ToolResultBlock.getId() is documented as nullable, and Set#contains(null) is a legal lookup. In normal flows pendingIds is sourced from non-null ToolUseBlock ids so a null result-id cannot match, but if a malformed user message ever ships a null-id ToolResultBlock the hook will silently ignore it (treat as not-HITL and auto-patch). That is arguably the safer behavior, so no change required — flagging only so the assumption is explicit.

LearningGp · 2026-06-04T08:55:36Z

FYI #1409

Sparkle6979 · 2026-06-04T14:21:23Z

Noticed that PendingToolRecoveryHook.java has been removed from main and replaced by LegacyHookDispatcher. Closing this PR as the target file no longer exists.

Sparkle6979 · 2026-06-04T14:23:00Z

Checked the main branch — PendingToolRecoveryHook.java no longer exists under agentscope-core/src/main/java/io/agentscope/core/hook/. The hook directory now contains LegacyHookDispatcher.java instead.

If this file is still used somewhere (e.g. on a release branch), happy to rebase. Otherwise feel free to close.

LearningGp · 2026-06-08T10:50:00Z

PTAL @chickenlj

Sparkle6979 added 3 commits June 1, 2026 22:13

：fix: PendingToolRecoveryHook checks ToolResultBlock IDs against pend…

41372bd

…ingIds

Merge branch 'agentscope-ai:main' into fix/pending-tool-recovery-mult…

ea2f342

…i-turn

test: add multi-turn tests for PendingToolRecoveryHook fix

283718a

Sparkle6979 requested a review from a team June 1, 2026 14:34

Sparkle6979 mentioned this pull request Jun 1, 2026

[Bug]: PendingToolRecoveryHook always skips recovery for multi-turn ReActAgent (Supervisor) due to false positive userProvidedResults check #1555

Open

Sparkle6979 changed the title ~~fix: PendingToolRecoveryHook false positive on historical ToolResultBlocks in multi-turn (#1555)~~ fix(hook): PendingToolRecoveryHook false positive on historical ToolResultBlocks in multi-turn (#1555) Jun 1, 2026

AgentScopeJavaBot added bug Something isn't working area/core/agent Agent runtime, pipeline, hooks, plan labels Jun 2, 2026

AgentScopeJavaBot approved these changes Jun 2, 2026

View reviewed changes

sunwg2 mentioned this pull request Jun 4, 2026

fix(hook): skip auto-patch only when ToolResultBlock IDs match pendin… Fixes #42 #1409

Closed

5 tasks

Sparkle6979 closed this Jun 4, 2026

Sparkle6979 reopened this Jun 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(hook): PendingToolRecoveryHook false positive on historical ToolResultBlocks in multi-turn (#1555)#1566

fix(hook): PendingToolRecoveryHook false positive on historical ToolResultBlocks in multi-turn (#1555)#1566
Sparkle6979 wants to merge 3 commits into
agentscope-ai:mainfrom
Sparkle6979:fix/pending-tool-recovery-multi-turn

Sparkle6979 commented Jun 1, 2026

Uh oh!

codecov Bot commented Jun 1, 2026

Uh oh!

Sparkle6979 commented Jun 1, 2026

Uh oh!

AgentScopeJavaBot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AgentScopeJavaBot Jun 2, 2026

Uh oh!

AgentScopeJavaBot Jun 2, 2026

Uh oh!

LearningGp commented Jun 4, 2026

Uh oh!

Sparkle6979 commented Jun 4, 2026

Uh oh!

Sparkle6979 commented Jun 4, 2026

Uh oh!

LearningGp commented Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Sparkle6979 commented Jun 1, 2026

Summary

Root Cause

Fix

Why existing tests pass

Test plan

Uh oh!

codecov Bot commented Jun 1, 2026

Codecov Report

Uh oh!

Sparkle6979 commented Jun 1, 2026

Design note: why this approach vs suggested alternatives

Uh oh!

AgentScopeJavaBot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AgentScopeJavaBot Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

AgentScopeJavaBot Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

LearningGp commented Jun 4, 2026

Uh oh!

Sparkle6979 commented Jun 4, 2026

Uh oh!

Sparkle6979 commented Jun 4, 2026

Uh oh!

LearningGp commented Jun 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants