How to allocate an `OrtValue` array directly on GPU (to emulate low-memory conditions) #24363

vadimkantorov · 2025-04-09T18:59:22Z

vadimkantorov
Apr 9, 2025

I assume something like this would work: onnxruntime.OrtValue.ortvalue_from_numpy(np.zeros((40_000_000_000, ), 'uint8'), device_type='cuda', device_id=0)

But is it possible to bypass the np.zeros / np.empty and directly allocate a GPU tensor using ORT-only API (without resorting to torch / cupy / cuda_driver)?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to allocate an `OrtValue` array directly on GPU (to emulate low-memory conditions) #24363

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

How to allocate an OrtValue array directly on GPU (to emulate low-memory conditions) #24363

Uh oh!

vadimkantorov Apr 9, 2025

Replies: 0 comments

How to allocate an `OrtValue` array directly on GPU (to emulate low-memory conditions) #24363

vadimkantorov
Apr 9, 2025