Python API for HOST_ACCESSIBLE OrtValue allocation#28038
ericcraw wants to merge 1 commit into microsoft:main
Conversation
Adds a memory_info= parameter to OrtValue.ortvalue_from_shape_and_type(), backed by two new C-level factory methods that look up the registered shared allocator via the full OrtMemoryInfo (including mem_type). This is required because the current shared allocator query doesn't include the memory type, making HOST_ACCESSIBLE allocators invisible to Python. UsesCpuMemory() is now used in GetPyObjFromTensor so that tensors in HOST_ACCESSIBLE memory are returned as zero-copy numpy views.
```python
    :param memory_info: An OrtMemoryInfo from an OrtEpDevice (e.g. via
        ep_device.memory_info(OrtDeviceMemoryType.HOST_ACCESSIBLE)). When provided,
        the allocator matching this memory info is used directly, which allows
        allocating HOST_ACCESSIBLE memory for zero-copy numpy interop. The
        device_type, device_id, and vendor_id parameters are ignored when
        memory_info is provided.
    """

    if memory_info is not None:
```
No Python test exercises the new memory_info= parameter or verifies that HOST_ACCESSIBLE OrtValues produce zero-copy numpy views.
Pull request overview
Adds Python-level support for allocating OrtValue tensors using an explicit OrtMemoryInfo (including mem_type) so plugin EP HOST_ACCESSIBLE shared allocators can be selected, enabling zero-copy numpy interop for those tensors.
Changes:
- Update tensor-to-numpy conversion to treat HOST_ACCESSIBLE tensors as CPU-memory-compatible via OrtDevice::UsesCpuMemory().
- Add new pybind factory methods to allocate OrtValue from shape/type using a full OrtMemoryInfo lookup.
- Extend the OrtValue.ortvalue_from_shape_and_type() Python API with an optional memory_info= parameter to route allocations through those new factories.
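The Python-side dispatch can be sketched in plain Python. This is a hedged illustration of the routing only: the returned tuples stand in for the real `C.OrtValue` pybind calls, and the fallback name `legacy_path` is hypothetical.

```python
# Sketch of the dispatch added to OrtValue.ortvalue_from_shape_and_type().
# The string "factories" returned below are stand-ins for the real C.OrtValue
# pybind methods; only the routing logic is illustrated.

def ortvalue_from_shape_and_type(shape, element_type, memory_info=None):
    if memory_info is not None:
        # Integer element types are ONNX TensorProto enum values.
        if isinstance(element_type, int):
            return ("ortvalue_from_shape_and_onnx_type_for_memory_info",
                    shape, element_type, memory_info)
        # Otherwise element_type is a numpy dtype.
        return ("ortvalue_from_shape_and_type_for_memory_info",
                shape, element_type, memory_info)
    # No memory_info: fall through to the pre-existing allocation path.
    return ("legacy_path", shape, element_type)
```

When memory_info is supplied, the element type alone decides which of the two new factories is called; device_type, device_id, and vendor_id play no part in that branch.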
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| onnxruntime/python/onnxruntime_pybind_state.cc | Enables zero-copy numpy views for HOST_ACCESSIBLE tensors via UsesCpuMemory(). |
| onnxruntime/python/onnxruntime_pybind_ortvalue.cc | Adds OrtMemoryInfo-based OrtValue allocation factories using shared allocator lookup. |
| onnxruntime/python/onnxruntime_inference_collection.py | Exposes memory_info= on OrtValue.ortvalue_from_shape_and_type() and dispatches to new C++ factories. |
```cpp
auto& env = GetOrtEnv()->GetEnvironment();
AllocatorPtr allocator = env.GetRegisteredSharedAllocator(memory_info);

if (!allocator) {
  throw std::runtime_error("No shared allocator found for the given OrtMemoryInfo.");
}
```
The new OrtValueFromShapeAndTypeWithMemoryInfo throws a generic error when no shared allocator is found. This can be hard to diagnose (e.g., mem_type mismatch between DEFAULT vs HOST_ACCESSIBLE). Consider including key details from the requested memory_info (device type/vendor/id, device mem type, and OrtMemType) in the exception message so callers can see what was looked up.
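The suggested improvement could look roughly like the following. This is a self-contained sketch: the `MemInfo` struct and its field names are hypothetical stand-ins for the fields a real OrtMemoryInfo exposes, and only the message-building pattern is the point.

```cpp
#include <cstdint>
#include <sstream>
#include <string>

// Hypothetical mirror of the fields looked up from the requested OrtMemoryInfo.
struct MemInfo {
  std::string device_type;      // e.g. "GPU"
  uint32_t vendor_id;           // e.g. 0x10de
  int device_id;                // e.g. 0
  std::string device_mem_type;  // e.g. "HOST_ACCESSIBLE" vs "DEFAULT"
  std::string ort_mem_type;     // e.g. "Default"
};

// Builds an exception message that echoes what was looked up, so a
// DEFAULT-vs-HOST_ACCESSIBLE mem_type mismatch is visible to the caller.
std::string NoAllocatorMessage(const MemInfo& mi) {
  std::ostringstream oss;
  oss << "No shared allocator found for OrtMemoryInfo (device_type="
      << mi.device_type << ", vendor_id=" << mi.vendor_id
      << ", device_id=" << mi.device_id
      << ", device_mem_type=" << mi.device_mem_type
      << ", ort_mem_type=" << mi.ort_mem_type << ").";
  return oss.str();
}
```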
```python
        if memory_info is not None:
            if isinstance(element_type, int):
                return cls(
                    C.OrtValue.ortvalue_from_shape_and_onnx_type_for_memory_info(
                        shape,
                        element_type,
                        memory_info,
                    )
                )
            return cls(
                C.OrtValue.ortvalue_from_shape_and_type_for_memory_info(
                    shape,
                    element_type,
                    memory_info,
                )
            )
```
The new memory_info allocation path and the UsesCpuMemory() zero-copy numpy conversion path don’t appear to have test coverage. Adding a Python test that allocates an OrtValue using memory_info=ep_device.memory_info(OrtDeviceMemoryType.HOST_ACCESSIBLE) and validates ort_value.numpy() works (and ideally is zero-copy) would protect this behavior and prevent regressions.
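A sketch of how such a test could check the zero-copy property. The onnxruntime calls are assumptions and appear only as comments; the actual aliasing check uses plain numpy (`np.shares_memory`).

```python
# Sketch of a zero-copy check a future test could use. The onnxruntime API
# usage below is hypothetical and shown only in comments; the check itself
# is plain numpy.
import numpy as np

def views_share_memory(view: np.ndarray, base: np.ndarray) -> bool:
    """True when 'view' aliases the same buffer as 'base' (i.e. zero-copy)."""
    return np.shares_memory(view, base)

# A real test would do roughly (hypothetical API usage):
#   mi = ep_device.memory_info(OrtDeviceMemoryType.HOST_ACCESSIBLE)
#   ov = OrtValue.ortvalue_from_shape_and_type([2, 3], np.float32, memory_info=mi)
#   arr = ov.numpy()
#   arr[0, 0] = 42.0  # a mutation through the view should be visible in ov

# Demonstrate the check on a plain numpy view vs. a copy:
base = np.zeros((2, 3), dtype=np.float32)
view = base[:]
copy = base.copy()
```

`np.shares_memory(view, base)` is True for a genuine view and False for a copy, which is exactly the regression a test here would want to catch.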
```cpp
const auto device_type = device.Type();
// Create an numpy array on top of the OrtValue memory, no copy.
if (device_type == OrtDevice::CPU) {
```
Grammar: use "a numpy array" (not "an numpy array").
```diff
- // Create an numpy array on top of the OrtValue memory, no copy.
+ // Create a numpy array on top of the OrtValue memory, no copy.
```
Motivation and Context
Enables zero-copy interop between numpy and OrtValue.
This is a follow-up to #28037.