Fix issue w/ DML batched readback size by adrastogi · Pull Request #28680 · microsoft/onnxruntime

adrastogi · 2026-05-26T21:17:17Z

Description

Fixes DML batched GPU readback sizing to avoid uint32_t aggregate wraparound, which could undersize the D3D12 readback. Adds checked size_t accumulation, safer readback validation, and regression coverage for overflow cases.

Motivation and Context

DML batched readback previously accumulated output tensor byte sizes in uint32_t, so multiple individually valid outputs could wrap the aggregate size used to allocate the D3D12 readback heap. This change tightens the sizing path and keeps the readback allocation consistent with the per-tensor copy sizes.

Copilot

Copilot encountered an error: Your billing is not configured or you have Copilot licenses from multiple standalone organizations or enterprises. To use premium requests, select a billing entity via the GitHub site, under Settings > Copilot > Features.

Copilot

Copilot's findings

Files reviewed: 4/4 changed files
Comments generated: 5

        // Map the readback heap and copy it into the destination
        void* readbackHeapData = nullptr;
        ORT_THROW_IF_FAILED(m_readbackHeap->Map(0, nullptr, &readbackHeapData));
+        auto unmapReadbackHeap = gsl::finally([this]() { m_readbackHeap->Unmap(0, nullptr); });



+    static Status HResultToStatus(HRESULT hr, const char* operation, const char* details)
+    {
+        const StatusCode status_code = hr == E_INVALIDARG ? INVALID_ARGUMENT : FAIL;
+        return Status(ONNXRUNTIME, status_code,
+                      onnxruntime::MakeString(operation, " failed with HRESULT ", hr, ": ", details));
+    }


+#include <d3d12.h>
+#include <gsl/gsl>
+#include <wrl/client.h>
+
+using Microsoft::WRL::ComPtr;
+


+TEST(DmlReadbackHeapTest, ComputeTotalReadbackSizeRejectsSizeTOverflow) {
+  const std::array<size_t, 2> sizes = {std::numeric_limits<size_t>::max(), 1};
+
+  EXPECT_ANY_THROW((void)Dml::detail::ComputeTotalReadbackSize(gsl::make_span(sizes.data(), sizes.size())));
+}


+
+  EXPECT_ANY_THROW((void)Dml::detail::ComputeTotalReadbackSize(gsl::make_span(sizes.data(), sizes.size())));
+}
+
+TEST(DmlReadbackHeapTest, ComputeTotalReadbackSizeRejectsMidBatchOverflow) {
+  const size_t half_max = std::numeric_limits<size_t>::max() / 2;
+  const std::array<size_t, 2> sizes = {half_max + 1, half_max + 1};
+
+  EXPECT_ANY_THROW((void)Dml::detail::ComputeTotalReadbackSize(gsl::make_span(sizes.data(), sizes.size())));


Initial draft

fa4dbdd

adrastogi requested a review from Copilot May 26, 2026 21:17

Copilot AI reviewed May 26, 2026

View reviewed changes

adrastogi requested a review from Copilot May 26, 2026 21:20

Copilot started reviewing on behalf of adrastogi May 26, 2026 21:20 View session

Copilot AI reviewed May 26, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix issue w/ DML batched readback size#28680

Fix issue w/ DML batched readback size#28680
adrastogi wants to merge 1 commit into
mainfrom
adrastogi/dml-overflow

adrastogi commented May 26, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

adrastogi commented May 26, 2026

Description

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Copilot's findings

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants