qwen3_5_moe: guard new cudaMemGetInfo blocks behind EXECUTORCH_BUILD_CUDA by rascani · Pull Request #19265 · pytorch/executorch

rascani · 2026-05-01T21:07:20Z

Summary

#19228 added structured GPU memory tracking to the qwen3_5_moe runner but did not wrap the new cudaMemGetInfo blocks in the existing EXECUTORCH_BUILD_CUDA guard that the rest of the file uses for CUDA-only APIs. The same main.cpp is built for the Metal target where the CUDA runtime headers are not available, so the new blocks failed to compile on macOS:

error: use of undeclared identifier 'cudaMemGetInfo'
    if (cudaMemGetInfo(&free, &total) == cudaSuccess) {

Wrap the three new scoped blocks in #ifdef EXECUTORCH_BUILD_CUDA, matching the existing guard pattern at lines 27, 68, 113, 168, and 184. The stats struct fields they would have populated (gpu_free_before_load_bytes, gpu_free_after_load_bytes, gpu_free_after_generate_bytes, gpu_peak_usage_mb) default to their sentinel values on non-CUDA builds, so the rest of the runner's stats reporting tolerates their absence.
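
For readers unfamiliar with the pattern, here is a minimal sketch of the guard, not the actual main.cpp code. The struct name GpuMemStats, the helper query_gpu_free_bytes, the int64_t field types, and the -1 sentinel are all assumptions made for illustration; only the field names, cudaMemGetInfo, and the EXECUTORCH_BUILD_CUDA macro come from the description above.

```cpp
#include <cstddef>
#include <cstdint>

#ifdef EXECUTORCH_BUILD_CUDA
#include <cuda_runtime.h> // only available when building the CUDA target
#endif

// Hypothetical stand-in for the runner's stats struct. Field names are taken
// from the PR description; the types and the -1 sentinel are assumptions.
struct GpuMemStats {
  int64_t gpu_free_before_load_bytes = -1;
  int64_t gpu_free_after_load_bytes = -1;
  int64_t gpu_free_after_generate_bytes = -1;
};

// Hypothetical helper: returns free GPU memory in bytes, or the sentinel -1
// when the build has no CUDA support or the query fails.
inline int64_t query_gpu_free_bytes() {
#ifdef EXECUTORCH_BUILD_CUDA
  size_t free_bytes = 0;
  size_t total_bytes = 0;
  if (cudaMemGetInfo(&free_bytes, &total_bytes) == cudaSuccess) {
    return static_cast<int64_t>(free_bytes);
  }
#endif
  // Non-CUDA builds (e.g. Metal) never reference cudaMemGetInfo, so the
  // translation unit compiles without the CUDA runtime headers.
  return -1;
}
```

A caller could then write `stats.gpu_free_before_load_bytes = query_gpu_free_bytes();` on any target. On a Metal build the preprocessor removes the CUDA call entirely and the field keeps its sentinel, which is the behavior the description relies on.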

Authored with Claude Code.

Test plan

CI

pytorch-bot · 2026-05-01T21:07:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19265

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Cancelled Job, 14 Pending, 4 Unrelated Failures

As of commit da9af5b with merge base a7e44bf:

NEW FAILURE - The following job has failed:

MLX / test-mlx-stories110m / test-mlx-stories110m (gh)
RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 1

CANCELLED JOB - The following job was cancelled. Please retry:

Check Labels / Check labels (gh)

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / test-models-linux (mv2, portable, linux.2xlarge) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
Test Metal Backend / export-model-metal-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)
Test Metal Backend / export-model-metal-artifact (nvidia, parakeet-tdt, non-quantized) / macos-job (gh) (matched macos rule in flaky-rules.json)
File doesn't exist
Test Metal Backend / export-model-metal-artifact (nvidia, parakeet-tdt, quantized-int4-metal) / macos-job (gh) (detected as infra flaky with no log or failing log classifier)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-05-01T21:08:23Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

rascani requested a review from Gasoonjia May 1, 2026 21:07

rascani requested a review from lucylq as a code owner May 1, 2026 21:07

meta-cla Bot added the CLA Signed label May 1, 2026

Gasoonjia approved these changes May 1, 2026

Gasoonjia added ciflow/metal ciflow/mlx labels May 1, 2026

rascani merged commit a3dd0fa into pytorch:main May 1, 2026
218 of 239 checks passed

rascani deleted the fix-qwen3_5_moe-cuda-guards branch May 1, 2026 22:18
