Skip to content

[Cherry-Pick][CI] Sync dev optimizations to 2.4(#7335)#7346

Merged
EmmonsCurse merged 5 commits intoPaddlePaddle:release/2.4from
EmmonsCurse:ci_optimize_24_0412
Apr 12, 2026
Merged

[Cherry-Pick][CI] Sync dev optimizations to 2.4(#7335)#7346
EmmonsCurse merged 5 commits intoPaddlePaddle:release/2.4from
EmmonsCurse:ci_optimize_24_0412

Conversation

@EmmonsCurse
Copy link
Copy Markdown
Collaborator

Motivation

The CI pipeline may leave behind running containers or uncleaned workspaces when jobs are canceled or fail unexpectedly. This can cause resource leakage, workspace conflicts, and instability in subsequent jobs.

Modifications

Cherry-pick of #7198 #7227 #7283 #7268 #7315 #7335 to release/2.4.

Usage or Command

N/A

Accuracy Tests

N/A

Checklist

  • Add at least a tag in the PR title.
    • Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
    • You can add new tags based on the PR content, but the semantics must be clear.
  • Format your code, run pre-commit before commit.
  • Add unit tests. Please write the reason in this PR if no unit tests.
  • Provide accuracy results.
  • If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

@paddle-bot
Copy link
Copy Markdown

paddle-bot bot commented Apr 12, 2026

Thanks for your contribution!

fastdeploy-bot

This comment was marked as outdated.

@EmmonsCurse
Copy link
Copy Markdown
Collaborator Author

EmmonsCurse commented Apr 12, 2026

/skip-ci gpu_4cards_test
/skip-ci stable_test
/skip-ci base_test
/skip-ci logprob_test

fastdeploy-bot

This comment was marked as outdated.

@codecov-commenter
Copy link
Copy Markdown

codecov-commenter commented Apr 12, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
⚠️ Please upload report for BASE (release/2.4@cdc5fce). Learn more about missing BASE report.

Additional details and impacted files
@@              Coverage Diff               @@
##             release/2.4    #7346   +/-   ##
==============================================
  Coverage               ?   56.43%           
==============================================
  Files                  ?      333           
  Lines                  ?    42621           
  Branches               ?     6478           
==============================================
  Hits                   ?    24052           
  Misses                 ?    16699           
  Partials               ?     1870           
Flag Coverage Δ
GPU 56.43% <ø> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

fastdeploy-bot

This comment was marked as outdated.

Copy link
Copy Markdown

@fastdeploy-bot fastdeploy-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤖 AI Code Review | 2026-04-12

📋 Review 摘要

PR 概述:将 dev 分支的 CI 优化(容器清理、环境变量配置、pip 安装优化)cherry-pick 到 release/2.4 分支。

变更范围.github/workflows/(多个 CI 配置文件)、scripts/run_pre_ce.sh、测试文件

影响面 Tag[CI]

📝 PR 规范检查

问题:PR 标题使用了 [Cherry-Pick] 标签,但这不是官方标签列表中的标签。

根据官方标签列表,有效的 CI 相关标签是 [CI]。虽然 Cherry-Pick 是特殊的合并流程,但建议标题只使用官方标签,可以在描述中说明这是 cherry-pick 的变更。

标题建议(可直接复制):

[CI] Sync dev optimizations to 2.4 (#7335)

问题

级别 文件 概述
🟡 建议 tests/ce/server/core/utils.py:29 send_request 默认 timeout 从 600 秒改为 60 秒,影响范围较大

总体评价

CI 优化整体合理,增强了容器清理机制防止资源泄漏,添加了必要的环境变量和内核参数配置。但 timeout 默认值的变更需要评估其对现有测试的影响。



def send_request(url, payload, timeout=600, stream=False):
def send_request(url, payload, timeout=60, stream=False):
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🟡 建议send_request 的默认 timeout 从 600 秒改为 60 秒,这是 10 倍的减少。

影响分析:

  • 多个测试文件(如 test_stream.pytest_base_chat.py 等)调用了 send_request 而没有指定 timeout 参数,都会使用这个默认值
  • 60 秒可能不足以处理某些需要长时间响应的请求

建议:

  1. 评估现有测试的超时需求
  2. 如需调整,建议逐步调整或在关键测试中显式指定 timeout

@EmmonsCurse EmmonsCurse merged commit 19b0038 into PaddlePaddle:release/2.4 Apr 12, 2026
23 of 25 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants