Harden WeCom pull streaming and add compatible Dify snapshot refresh configuration by bjhx2003 · Pull Request #2053 · langbot-app/LangBot

bjhx2003 · 2026-03-11T14:06:35Z

概述 / Overview

本次 PR 主要增强了企业微信智能机器人 Pull 模式下的流式消息处理能力，并补齐 Dify 流式快照刷新配置，同时保持默认行为尽量兼容 master。

This PR mainly improves pull-mode streaming for the WeCom AI bot, adds Dify snapshot refresh configuration, and keeps default behavior as compatible with master as possible.

主要改动 / Key changes

增强企微 Pull 流式生命周期处理
- 为 WeCom pull 模式增加轮询等待时间、最大生命周期、首字等待占位文案等配置
- 在 follow-up 轮询超时、异常、无新 chunk 等场景下，使用最后快照或错误提示强制 finish，避免企微持续轮询后展示官方兜底文案
- 队列消费时仅保留最新快照，避免旧片段堆积导致显示滞后
Harden WeCom pull streaming lifecycle
- add polling timeout, max stream lifetime, and pending placeholder related settings
- force-finish stale or failed streams with the latest snapshot or an error fallback
- keep only the latest snapshot in the queue to avoid stale chunk buildup
改进企微回复与流水线流式处理
- respback 在流式模式下跳过强制延迟，提升首字响应速度
- 流式回复时使用最新 chunk 的 is_final 标记，避免 finish 状态错误
Improve streaming response handling in pipeline stages
- skip forced delay in streaming mode to reduce first-token latency
- use the latest chunk final flag when replying stream snapshots
优化企微消息转换
- WecomBotMessageConverter 在转文本时保留引用内容
- 非纯文本组件尽可能转换为可读文本，避免回复内容丢失
Improve WeCom message conversion
- preserve quote content when converting messages to text
- convert non-plain components to readable text where possible
补齐 Dify 流式快照刷新策略
- 新增 pipeline output 配置：
  - output.dify-stream.chunk-batch-size
  - output.dify-stream.flush-window-enabled
  - output.dify-stream.flush-window-ms
- 处理 workflow_finished 场景，保证流式输出能正确 finish
- 恢复 Dify workflow answer 提取兼容逻辑，避免回归
Add Dify snapshot refresh strategy
- add pipeline output config:
  - output.dify-stream.chunk-batch-size
  - output.dify-stream.flush-window-enabled
  - output.dify-stream.flush-window-ms
- handle workflow_finished correctly to ensure stream completion
- restore Dify workflow answer extraction compatibility logic
配置迁移与兼容默认值
- 将 Dify 专属的流式快照配置迁移到 pipeline output 配置
- 前端仅在选择 dify-service-api 作为 runner 时显示 dify-stream 配置项
- 默认值尽量兼容 master：
  - WeCom poll timeout 默认 500ms
  - placeholder 默认关闭
  - Dify chunk batch size 默认 8
  - flush window 默认关闭
Config migration and compatibility defaults
- move Dify-specific stream snapshot settings to pipeline output config
- show the dify-stream section only when the selected runner is dify-service-api
- keep defaults compatible with master where possible:
  - WeCom poll timeout defaults to 500ms
  - placeholder disabled by default
  - Dify chunk batch size defaults to 8
  - flush window disabled by default
Docker 构建优化
- 优化 .dockerignore
- 调整 Dockerfile 复制顺序与依赖安装顺序，尽量利用 Docker 层缓存
- 启动命令保持与 master 一致
Docker build optimization
- optimize .dockerignore
- improve Dockerfile layer usage for better build caching
- keep the startup command aligned with master
测试
- 补充并更新定向测试，覆盖：
  - final 收口
  - flush window 行为
  - placeholder fallback
  - latest snapshot
  - 配置来源迁移
  - 默认值兼容行为
Tests
- add/update targeted tests covering:
  - final stream completion
  - flush window behavior
  - pending placeholder fallback
  - latest snapshot consumption
  - config source migration
  - compatibility defaults behavior

更改前后对比截图 / Screenshots

请在此部分粘贴更改前后对比截图（可以是界面截图、控制台输出、对话截图等）:
Please paste the screenshots of changes before and after here (can be interface screenshots, console output, conversation screenshots, etc.):

修改前 / Before:

修改后 / After:
机器人配置，当选择企微智能机器人时增加如下配置，首字等待占位功能默认关闭，开启后可在n秒未收到首字时先回复消息占位，后续真实回复时会替换掉消息。

流水线配置，当选择dify服务api时，在输出处理tab，可以调整chunk大小，和开启时间窗口结合策略，默认关闭兼容

检查清单 / Checklist

已通过以下测试：

uv run pytest -q tests/unit_tests/pipeline/test_wecombot_dify_minfix.py -q

结果：

- 13 项测试通过

Validated with:

uv run pytest -q tests/unit_tests/pipeline/test_wecombot_dify_minfix.py -q

Result:

- 13 tests passed

### 更改前后对比截图 / Screenshots

> 请在此部分粘贴更改前后对比截图（可以是界面截图、控制台输出、对话截图等）:
> Please paste the screenshots of changes before and after here (can be interface screenshots, console output, conversation screenshots, etc.):
>
> 修改前 / Before:
>
> - 企业微信 Pull 模式下，流式消息在部分场景中不能正确 finish，可能持续 follow-up 并最终展示官方兜底文案
> - Dify 流式快照刷新策略参数缺少清晰归属，且默认行为与旧逻辑不完全兼容
> - 流式模式下仍会走强制延迟，首字体验较差
>
> 修改后 / After:
>
> - 企业微信 Pull 模式可在超时、异常、无新 chunk 等场景下正确收口
> - Dify 流式快照刷新参数迁移到 `output.dify-stream`，并通过默认值保持旧行为兼容
> - 流式模式跳过强制延迟，首字响应更快
> - 队列仅保留最新快照，减少旧片段堆积导致的显示滞后
>
> （如需要，我会在提交前补充控制台日志截图 / 对话截图）
> (Screenshots can be added before submission if needed.)


### PR 作者完成 / For PR author

*请在方括号间写`x`以打勾 / Please tick the box with `x`*

- [ ] 阅读仓库[贡献指引](https://github.com/langbot-app/LangBot/blob/master/CONTRIBUTING.md)了吗？ / Have you read the [contribution guide](https://github.com/langbot-app/LangBot/blob/master/CONTRIBUTING.md)?
- [ ] 与项目所有者沟通过了吗？ / Have you communicated with the project maintainer?
- [x] 我确定已自行测试所作的更改，确保功能符合预期。 / I have tested the changes and ensured they work as expected.

### 项目维护者完成 / For project maintainer

- [ ] 相关 issues 链接了吗？ / Have you linked the related issues?
- [x] 配置项写好了吗？迁移写好了吗？生效了吗？ / Have you written the configuration items? Have you written the migration? Has it taken effect?
- [ ] 依赖加到 pyproject.toml 和 core/bootutils/deps.py 了吗 / Have you added the dependencies to pyproject.toml and core/bootutils/deps.py?
- [ ] 文档编写了吗？ / Have you written the documentation?

- Reduce stream_poll_timeout from 0.15s to 0.05s for faster response - Change yield condition from % 8 to % 2 for higher push frequency - Skip plugin events for intermediate streaming chunks to reduce latency - Fix yield condition to ensure is_final is always sent - Add msg_id_map cleanup to prevent memory bloat - Update version to 4.9.0-wecom.1 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Move dependency installation before source code copy to leverage Docker layer caching. Code changes no longer trigger dependency reinstallation, reducing build time from 2-3 minutes to 10-20 seconds. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Use `uv sync --no-dev` to exclude dev dependencies - Remove gcc after build with apt-get purge - Clean apt cache with rm -rf /var/lib/apt/lists/* - Expand .dockerignore to exclude docs, tests, and other non-runtime files Image size reduced from 2.07GB to 1.67GB (19% reduction). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- 对齐版本号到 4.9.0.post2，并同步 pyproject、包版本与 uv.lock\n- 将企微通道层配置保留在机器人适配器中，新增占位文案延迟配置\n- 将 chunk 批大小与时间窗口下沉到流水线输出配置，补齐默认值与 UI 元数据\n- 将 Dify 流式输出改为 chunk 阈值或时间窗口双触发，并保证 final 立即收口\n- 优化 pull 模式占位文案逻辑，仅在首字超时后返回占位文案\n- 补充企微与 Dify 定向单元测试，覆盖双阈值 flush、占位延迟与最终收口场景

- Add workflow_finished event handling to set is_final=True - Add yielded_final flag to prevent duplicate final chunk yield - Fix WeCom continuous polling issue when Dify workflow ends Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Add PullPendingPlaceholderEnabled config option (default: true) - Show delay and content fields only when enabled via visibleOn - Pass effective values to WecomBotClient based on switch state Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Add _summarize_stream_text helper for content logging - Add publish/consume action logs with seq, content stats - Track cleared queue items and publish sequence - Help troubleshoot WeCom pull mode polling issues Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Add FULL_TEXT and HYBRID search types - Implement RRF (Reciprocal Rank Fusion) for hybrid search - Change add to upsert for idempotent document insertion - Upgrade chromadb dependency to >=1.0.0,<2.0.0 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Replace hide-exception and block-failed-request-output with exception-handling - Add three strategies: hide, show-hint, show-error - Add failure-hint field for customizable error message - Add database migration dbm021 for config conversion - Remove wecom-stream stage (moved to adapter config) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…tibility - Move filters out of retrieval_settings to avoid empty results - Pass sender_id and session_name in retrieval settings - Some plugins (e.g. LangRAG) pass filters directly to vector_search Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Update tests to use adapter config instead of pipeline wecom-stream - Add test for workflow_finished event handling - Add test for ignoring empty message chunks Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…tput

codecov · 2026-03-11T16:58:44Z

Codecov Report

❌ Patch coverage is 53.79939% with 152 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/langbot/pkg/provider/runners/difysvapi.py	38.12%	99 Missing ⚠️
src/langbot/libs/wecom_ai_bot_api/api.py	56.81%	19 Missing ⚠️
...angbot/libs/wecom_ai_bot_api/pull_stream_policy.py	76.11%	16 Missing ⚠️
src/langbot/pkg/plugin/handler.py	8.33%	11 Missing ⚠️
src/langbot/pkg/pipeline/respback/respback.py	55.55%	4 Missing ⚠️
src/langbot/pkg/platform/sources/wecombot.py	91.89%	3 Missing ⚠️

📢 Thoughts on this report? Let us know!

RockChinQ · 2026-03-11T16:59:31Z

感谢 PR！可否提供一下修改前后的截图，例如配置页面截图、运行效果截图？我们将更快开始 review

…sessions.publish` call.

bjhx2003 · 2026-03-12T02:17:25Z

感谢 PR！可否提供一下修改前后的截图，例如配置页面截图、运行效果截图？我们将更快开始 review
@RockChinQ 已经提交了修改后的截图

…ility

…ompatibility

# Conflicts: # src/langbot/pkg/platform/sources/wecombot.py

fdc310 · 2026-03-14T10:05:47Z

        inputs.update(query.variables)
        messsage_idx = 0
        is_final = False
+        stream_completed = False


这个参数每一处都是和is_final对应的，为什么不直接用is_final呢

fdc310 · 2026-03-14T17:38:28Z

+            if rendered_text:
+                content_parts.append(rendered_text)
+
+        return ''.join(content_parts)


这里直接把引用消息和消息拼接在一起有点不妥吧？

fdc310 · 2026-03-14T17:58:55Z

+
+        # 如果未开启首字等待占位，则将延迟设为0且占位文案设为空
+        effective_placeholder_delay = pending_placeholder_delay_ms / 1000 if pending_placeholder_enabled else 0
+        effective_placeholder = pending_placeholder if pending_placeholder_enabled else ''


这里首字占位，其实我觉着放在creat_message_card这里（主要当时我写钉钉和飞书的时候以为只有卡片流式来着所以函数名就这样了）这样创建首字在进入模型前就能开始，个人感觉比较合理。

fdc310 · 2026-03-14T18:00:57Z

+        is_stream_mode = await query.adapter.is_stream_output_supported() and has_chunks

-        random_delay = random.uniform(*random_range)
+        # 流式模式下跳过强制延迟，确保首字快速响应


就像wecom那边我说的，如果是在creat_message_card中创建首字消息，这里就不用这么处理了

fdc310 · 2026-03-14T18:07:59Z


+    def __init__(self, config: dict, logger: EventLogger):
+        enable_webhook = config.get('enable-webhook', True)
        if not enable_webhook:


这边方便的话把websocket给一起合入就更好了

fdc310 · 2026-03-27T08:43:38Z

你好，请问改pr中dify相关的修改的能不能单独提一个pr

bjhx2003 · 2026-03-30T08:42:03Z

你好，请问改pr中dify相关的修改的能不能单独提一个pr

因为好久没有review，我以为你们不搞了，我这边又做了很多处理，所以就合到了同一个分支处理了 @fdc310

fdc310 · 2026-03-30T08:50:58Z

你好，请问改pr中dify相关的修改的能不能单独提一个pr

因为好久没有review，我以为你们不搞了，我这边又做了很多处理，所以就合到了同一个分支处理了 @fdc310

我的锅，我其实很早就review了并留言，但是忘了推出去 (*꒦ິ⌓꒦ີ)，直到前两天

bjhx2003 · 2026-03-30T08:57:11Z

你好，请问改pr中dify相关的修改的能不能单独提一个pr

因为好久没有review，我以为你们不搞了，我这边又做了很多处理，所以就合到了同一个分支处理了 @fdc310

我的锅，我其实很早就review了并留言，但是忘了推出去 (*꒦ິ⌓꒦ີ)，直到前两天

@fdc310 我尝试看看能不能拆开吧，因为后面我对微信客服接入和企微应用接管微信客服处理方面都做了改造，还引入了redis，整体是一个比较大的动作。

fdc310 · 2026-03-30T09:06:19Z

你好，请问改pr中dify相关的修改的能不能单独提一个pr

因为好久没有review，我以为你们不搞了，我这边又做了很多处理，所以就合到了同一个分支处理了 @fdc310

我的锅，我其实很早就review了并留言，但是忘了推出去 (*꒦ິ⌓꒦ີ)，直到前两天

@fdc310 我尝试看看能不能拆开吧，因为后面我对微信客服接入和企微应用接管微信客服处理方面都做了改造，还引入了redis，整体是一个比较大的动作。

好的，感谢。如果可以的话尽量每一块开一个分支，包括您这边提到的位置客服，企微应用和redis相关的都单独pr，然后我们这边的话暂时还没有使用redis的规划，可能暂时不会引入

bjhx2003 · 2026-03-30T09:54:48Z

你好，请问改pr中dify相关的修改的能不能单独提一个pr

因为好久没有review，我以为你们不搞了，我这边又做了很多处理，所以就合到了同一个分支处理了 @fdc310

我的锅，我其实很早就review了并留言，但是忘了推出去 (*꒦ິ⌓꒦ີ)，直到前两天

@fdc310 我尝试看看能不能拆开吧，因为后面我对微信客服接入和企微应用接管微信客服处理方面都做了改造，还引入了redis，整体是一个比较大的动作。

好的，感谢。如果可以的话尽量每一块开一个分支，包括您这边提到的位置客服，企微应用和redis相关的都单独pr，然后我们这边的话暂时还没有使用redis的规划，可能暂时不会引入

@fdc310 了解，不过目前都是纯内存的，对于企业级应用来说没有redis + 单点还是不够稳定

RockChinQ · 2026-03-30T10:12:11Z

这块儿我们可以先看看代码再讨论，企业级的功能确实是我们目前方向

…

------------------ Original ------------------ From: 夜雨 ***@***.***> Date: Mon,Mar 30,2026 5:55 PM To: langbot-app/LangBot ***@***.***> Cc: Junyan Chin ***@***.***>, Mention ***@***.***> Subject: Re: [langbot-app/LangBot] Harden WeCom pull streaming and addcompatible Dify snapshot refresh configuration (PR #2053) bjhx2003 left a comment (langbot-app/LangBot#2053) 你好，请问改pr中dify相关的修改的能不能单独提一个pr 因为好久没有review，我以为你们不搞了，我这边又做了很多处理，所以就合到了同一个分支处理了 @fdc310 我的锅，我其实很早就review了并留言，但是忘了推出去 (*꒦ິ⌓꒦ີ)，直到前两天 @fdc310 我尝试看看能不能拆开吧，因为后面我对微信客服接入和企微应用接管微信客服处理方面都做了改造，还引入了redis，整体是一个比较大的动作。好的，感谢。如果可以的话尽量每一块开一个分支，包括您这边提到的位置客服，企微应用和redis相关的都单独pr，然后我们这边的话暂时还没有使用redis的规划，可能暂时不会引入 @fdc310 了解，不过目前都是纯内存的，对于企业级应用来说没有redis + 单点还是不够稳定 — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.Message ID: ***@***.***>

update:合并群聊模式下userid的生成

RockChinQ · 2026-04-25T14:45:11Z

PR 内容不太适合广泛利用，如果需要合入，请拆分为独立功能再提pr～感谢

蒋金龙（夜雨） and others added 19 commits March 10, 2026 10:16

fix(wecombot): improve dify streaming reply handling

824145f

fix(wecombot): harden pull stream lifecycle

7d3190f

fix(docker): set python path for src layout

039a6d4

fix(release): align version and compose image source

2d5ef8d

fix(dify,wecombot): move Dify stream throttling config to pipeline ou…

c54d9fc

…tput

chore(wecombot): align packaging files and trim debug logs

66d47f6

fix(wecombot): restore compatibility defaults and wrapper behavior

aed7c54

fix(difysvapi): restore workflow output compatibility in streaming

7c0b865

chore: Remove unused assignment of published variable from `stream_…

4786eee

…sessions.publish` call.

bjhx2003 added 2 commits March 13, 2026 18:53

refactor(wecombot): isolate pull stream policy and webhook-only visib…

7d580cf

…ility

merge(master): rehearse merge with wecom webhook-only strategy

6dd1ffc

bjhx2003 added 3 commits March 16, 2026 16:24

refactor(wecombot): minimize drift from master config behavior

5cbd604

fix(plugin): tolerate older runtime sdk without vector list

7791dd1

merge(master): resolve plugin handler conflict and keep runtime sdk c…

aa957ed

…ompatibility

RockChinQ force-pushed the master branch from 2947b25 to 4d6f109 Compare March 25, 2026 13:11

bjhx2003 added 5 commits March 26, 2026 18:38

feat(wecomcs): add redis streams scheduler and kf app support

5e4f21b

Merge remote-tracking branch 'origin/master' into feat/jjl

81835de

# Conflicts: # src/langbot/pkg/platform/sources/wecombot.py

fix(wecomcs): isolate bot streams and pipeline sessions

93e4c58

feat(dify): 持久化 Redis 会话绑定并补充 README 增强说明

ed51cce

merge(feat/wecom): 合并 feat/jjl 的增强改动

b47c5fa

fdc310 reviewed Mar 27, 2026

View reviewed changes

bjhx2003 added 2 commits March 27, 2026 17:00

docs(readme): 补充分支相对官方版本的增强能力说明

fcd036c

fix(wecomcs): prevent history replay and duplicate outbound msgids

86ac50b

bjhx2003 and others added 8 commits March 31, 2026 10:33

feat(pipeline): add dify session settings and recovery flow

f26e6cc

fix(web): show dify session settings in create and edit flows

c156439

chore(gitignore): ignore local agent workflow artifacts

a9c585a

fix(wecomcs): restore sharding config and prevent history replay

168360f

fix(web): prevent pipeline AI tab crash on create

e2673e2

fix(pipeline): preserve submitted config when creating pipeline

4369466

update:合并群聊模式下userid的生成

84121bc

Merge pull request #1 from Saiyuekin/feat/wecom

117dd6f

update:合并群聊模式下userid的生成

dosubot Bot mentioned this pull request Apr 18, 2026

[Bug]: 企微智能机器人，收不全dify agent的数据 #2136

Open

RockChinQ closed this Apr 25, 2026

Conversation

bjhx2003 commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

概述 / Overview

主要改动 / Key changes

更改前后对比截图 / Screenshots

检查清单 / Checklist

Uh oh!

codecov Bot commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

RockChinQ commented Mar 11, 2026

Uh oh!

bjhx2003 commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fdc310 Mar 14, 2026

Choose a reason for hiding this comment

Uh oh!

fdc310 Mar 14, 2026

Choose a reason for hiding this comment

Uh oh!

fdc310 Mar 14, 2026

Choose a reason for hiding this comment

Uh oh!

fdc310 Mar 14, 2026

Choose a reason for hiding this comment

Uh oh!

fdc310 Mar 14, 2026

Choose a reason for hiding this comment

Uh oh!

fdc310 commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bjhx2003 commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fdc310 commented Mar 30, 2026

Uh oh!

bjhx2003 commented Mar 30, 2026 • edited by fdc310 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fdc310 commented Mar 30, 2026

Uh oh!

bjhx2003 commented Mar 30, 2026

Uh oh!

RockChinQ commented Mar 30, 2026 via email • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RockChinQ commented Apr 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

bjhx2003 commented Mar 11, 2026 •

edited

Loading

codecov Bot commented Mar 11, 2026 •

edited

Loading

bjhx2003 commented Mar 12, 2026 •

edited

Loading

fdc310 commented Mar 27, 2026 •

edited

Loading

bjhx2003 commented Mar 30, 2026 •

edited

Loading

bjhx2003 commented Mar 30, 2026 •

edited by fdc310

Loading

RockChinQ commented Mar 30, 2026 via email •

edited

Loading