Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Video memory leak #4354

Open
XL634663985 opened this issue Feb 10, 2025 · 1 comment
Open

Video memory leak #4354

XL634663985 opened this issue Feb 10, 2025 · 1 comment
Assignees
Labels
Module:Performance General performance issues Module:Polygraphy Issues with Polygraphy triaged Issue has been triaged by maintainers waiting for feedback Requires more information from user to make progress on the issue.

Comments

@XL634663985
Copy link

XL634663985 commented Feb 10, 2025

Description

When I use "with TrtRunner(self.onnx_session) as runner" for inference, there is no video memory leak, but the inference speed is very slow (because it has to be reloaded every time). If the model is set as a global variable, and the model is activated using "runner.activate()" and remains unchanged, it will cause video memory leak. How to solve it?

Environment

TensorRT Version:10.2.0.19

NVIDIA GPU:3090

NVIDIA Driver Version:560.94

CUDA Version:cuda_12.1.0_531.14_windows

CUDNN Version:cudnn-windows-x86_64-9.2.1.18_cuda12-archive

Operating System:windows10

Python Version (if applicable):3.11.9

Tensorflow Version (if applicable):

PyTorch Version (if applicable):

Baremetal or Container (if so, version):

Relevant Files

Model link:

Steps To Reproduce

Commands or scripts:

Have you tried the latest release?:

Can this model run on other frameworks? For example run ONNX model with ONNXRuntime (polygraphy run <model.onnx> --onnxrt):

@poweiw
Copy link
Collaborator

poweiw commented Feb 10, 2025

Hello! We'll need a few more information to reproduce:

  1. Link to your onnx model
  2. A script to reproduce the error
  3. The source you saw the vram memory leak, whether it's from cuda-memcheck, cuda-gdb

@LeoZDong LeoZDong added triaged Issue has been triaged by maintainers waiting for feedback Requires more information from user to make progress on the issue. labels Feb 10, 2025
@LeoZDong LeoZDong added the Module:Performance General performance issues label Feb 10, 2025
@poweiw poweiw assigned LeoZDong and unassigned poweiw Feb 11, 2025
@poweiw poweiw added the Module:Polygraphy Issues with Polygraphy label Feb 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Module:Performance General performance issues Module:Polygraphy Issues with Polygraphy triaged Issue has been triaged by maintainers waiting for feedback Requires more information from user to make progress on the issue.
Projects
None yet
Development

No branches or pull requests

3 participants