chore(deps): update nvidia-dcgm (patch) #8659

Open
renovate[bot] wants to merge 1 commit into main from renovate/patch-nvidia-dcgm
@renovate renovate Bot commented Jun 8, 2026

Contributor

This PR contains the following updates:

Package        Update  Change
dcgm-exporter  patch   4.8.2-ubuntu24.04u1 → 4.8.2-ubuntu24.04u2
dcgm-exporter  patch   4.8.2-ubuntu22.04u1 → 4.8.2-ubuntu22.04u2
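
For reviewers who want to confirm that both rows change only the trailing revision, the following is a minimal Go sketch. It assumes the version strings follow the shape `<upstream>-ubuntu<release>u<N>` seen in the table; the type `dcgmVersion` and the functions `parseDCGMVersion` and `revisionOnlyBump` are hypothetical names for illustration and are not part of AgentBaker or Renovate.

```go
package main

import (
	"fmt"
	"regexp"
	"strconv"
)

// dcgmVersion is a hypothetical breakdown of a tag such as
// "4.8.2-ubuntu24.04u2". The field meanings are assumptions
// inferred from the strings in the table above.
type dcgmVersion struct {
	Upstream string // e.g. "4.8.2"
	Distro   string // e.g. "ubuntu24.04"
	Revision int    // trailing "uN" counter
}

var versionRe = regexp.MustCompile(`^(\d+\.\d+\.\d+)-(ubuntu\d+\.\d+)u(\d+)$`)

// parseDCGMVersion splits a version string into its three parts.
func parseDCGMVersion(s string) (dcgmVersion, error) {
	m := versionRe.FindStringSubmatch(s)
	if m == nil {
		return dcgmVersion{}, fmt.Errorf("unrecognized version %q", s)
	}
	rev, err := strconv.Atoi(m[3])
	if err != nil {
		return dcgmVersion{}, err
	}
	return dcgmVersion{Upstream: m[1], Distro: m[2], Revision: rev}, nil
}

// revisionOnlyBump reports whether the upstream version and distro are
// unchanged and only the trailing revision counter increased.
func revisionOnlyBump(from, to string) (bool, error) {
	a, err := parseDCGMVersion(from)
	if err != nil {
		return false, err
	}
	b, err := parseDCGMVersion(to)
	if err != nil {
		return false, err
	}
	same := a.Upstream == b.Upstream && a.Distro == b.Distro
	return same && b.Revision > a.Revision, nil
}

func main() {
	pairs := [][2]string{
		{"4.8.2-ubuntu24.04u1", "4.8.2-ubuntu24.04u2"},
		{"4.8.2-ubuntu22.04u1", "4.8.2-ubuntu22.04u2"},
	}
	for _, p := range pairs {
		ok, err := revisionOnlyBump(p[0], p[1])
		fmt.Println(p[0], "->", p[1], "revision-only:", ok, "err:", err)
	}
}
```

Under these assumptions both rows report a revision-only change, which matches the "patch" label Renovate assigned.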

Warning

Some dependencies could not be looked up. Check the Dependency Dashboard for more information.


Configuration

📅 Schedule: (UTC)

  • Branch creation
    • At any time (no schedule defined)
  • Automerge
    • At any time (no schedule defined)

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about these updates again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

Copilot AI review requested due to automatic review settings June 8, 2026 14:54
@renovate renovate Bot added the renovate label (This pull request was created by renovate) Jun 8, 2026
@renovate renovate Bot assigned djsly Jun 8, 2026

Copilot AI left a comment

Contributor

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

@renovate renovate Bot requested review from djsly, ganeshkumarashok and surajssd June 8, 2026 14:54
@github-actions github-actions Bot added the components label (This pull request updates cached components on Linux or Windows VHDs) Jun 8, 2026
@renovate renovate Bot changed the title from "chore(deps): update nvidia-dcgm to v4.8.2-ubuntu22.04u2" to "chore(deps): update nvidia-dcgm (patch)" Jun 8, 2026
@renovate renovate Bot force-pushed the renovate/patch-nvidia-dcgm branch from 1c34d5c to 51e8bdd Compare June 8, 2026 15:14
djsly commented Jun 8, 2026

Collaborator

AgentBaker Linux PR gate — E2E failure (mixed: 3 leaves shared infra; 1 ACL leaf likely on main)

  • Run: 167166471 (failed)
  • Failed: Run AgentBaker E2E → AzureCLI exit 1 (DONE 457 tests, 95 skipped, 5 failures in 1646.77s)

Group A — shared infra/test-fixture issue, NOT this PR (3 leaves):

  • Test_Ubuntu2204Gen2_ImagePullIdentityBinding_NetworkIsolated/{default (6.57s), scriptless_nbc (0.00s)} at test_helpers.go:227 🔴 empty error, plus the parent container.
  • Same sub-7s empty-error shape has now hit 5 unrelated PRs in 48h (this PR, #8600, #8330, #8654, #8653). Confirmed systemic — needs NodeSIG-dev / E2E-infra triage of the ImagePullIdentityBinding_NetworkIsolated private-cluster/ACR-private-endpoint precondition.

Group B — ACL FIPS TL leaf, very likely existing main regression (2 leaves):

  • Test_ACLGen2FIPSTL/scriptless_nbc (265.83s) — validation.go:345 🔴: wireserver check "wireserver port 80 goalstate": unexpected curl exit code "0" (want 28 timeout or 7 refused) (plus root container).
  • The test expects WireServer port 80 to be blocked (curl exit 28=timeout or 7=refused) but got 0 (HTTP 200 reachable). That's an ACL FIPS TL firewall/network policy assertion. This PR (nvidia-dcgm patch bump in parts/common/components.json) touches GPU package versions only and has no path to ACL networking. Strongly suggests an existing ACL FIPS TL regression on main, not caused by this PR.
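
The assertion described above can be sketched as follows. This is not the code at validation.go:345, which is not shown in this thread; `checkWireServerBlocked` and the constant names are hypothetical. The exit-code meanings are curl's documented ones: 0 is success, 7 is failure to connect to the host, and 28 is an operation timeout.

```go
package main

import "fmt"

// Exit codes documented by curl.
const (
	curlExitOK              = 0  // request succeeded
	curlExitCouldNotConnect = 7  // failed to connect to host
	curlExitTimeout         = 28 // operation timed out
)

// checkWireServerBlocked returns nil when the curl exit code shows the
// endpoint was unreachable, and an error when the request got through
// or failed in some other way.
func checkWireServerBlocked(exitCode int) error {
	switch exitCode {
	case curlExitTimeout, curlExitCouldNotConnect:
		return nil
	default:
		return fmt.Errorf("unexpected curl exit code \"%d\" (want 28 timeout or 7 refused)", exitCode)
	}
}

func main() {
	for _, code := range []int{curlExitOK, curlExitCouldNotConnect, curlExitTimeout} {
		fmt.Printf("exit %d -> %v\n", code, checkWireServerBlocked(code))
	}
}
```

An exit code of 0 is treated as a failure here because a successful request means port 80 was reachable, which is the condition reported for Test_ACLGen2FIPSTL/scriptless_nbc.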

Confidence: HIGH that this PR is not the cause of either failure group.

Recommended next action:

  1. Rerun the failing job; do not block this PR on Group A.
  2. NodeSIG-dev: file a tracker on the ACL FIPS TL wireserver-block regression in validation.go:345 (the test expectation flipped, or the ACL network policy unit shipped in the latest VHD no longer blocks WireServer); investigate against main head independently of any specific PR.
  3. NodeSIG-dev / E2E-infra: triage the ImagePullIdentityBinding_NetworkIsolated fixture (sub-7s empty failures across multiple unrelated PRs).

Strongest alternative (less likely): transient ACR-private-endpoint outage for Group A + intermittent ACL firewall rule timing for Group B — refuted because each pattern is now reproducing deterministically on every recent PR build.

Posted by Clawpilot AgentBaker gate detective.

@surajssd surajssd left a comment

Member

Do not merge until Renovate also adds support for Azure Linux. Once #8660 is merged, I won't have to say manually that this should not be merged.

Labels

components: This pull request updates cached components on Linux or Windows VHDs
renovate: This pull request was created by renovate

4 participants