Issues: NVIDIA/TensorRT

Issues list

Global tensors with dynamic slice using python
Labels: question, triaged
#4229 opened Oct 30, 2024 by zengrh3

Engine built failure of TensorRT 10.5 when running setWeightStreamingBudgetV2 on GPU A10
Labels: question, triaged
#4228 opened Oct 29, 2024 by tp-nan

How to support referenceNet+unet?
Labels: question, triaged
#4226 opened Oct 29, 2024 by songh11

TensorRT 10.5 Flux-dev torch.float16 precision
Labels: Demo: Diffusion, Precision: FP16, triaged
#4223 opened Oct 25, 2024 by algorithmconquer

bf16 convert failed
Labels: triaged
#4221 opened Oct 24, 2024 by cillayue

Can TensorRT calculate the number of Params and FLOPs for the model?
Labels: question, triaged
#4219 opened Oct 23, 2024 by demuxin

TensorRT 10.5 Flux Dit BF16 precision
Labels: Accuracy, Demo: Diffusion, triaged
#4215 opened Oct 21, 2024 by QZH-eng

out of memory failure of TensorRT 10.5 when running flux dit on GPU L40S
Labels: Demo: Diffusion, triaged
#4214 opened Oct 21, 2024 by QZH-eng

stable diffusion quantization in inpainting task is poor
Labels: Demo: Diffusion, Quantization: PTQ, triaged
#4212 opened Oct 20, 2024 by worhar

How to strictly limiting the maximum GPU memory usage and clear GPU memory cache?
Labels: Memory Usage, question, triaged
#4211 opened Oct 19, 2024 by EmmaThompson123

Different versions of TensorRT get different model inference results
Labels: Accuracy, triaged
#4209 opened Oct 18, 2024 by demuxin

Does Flux not support int8?
Labels: Demo: Diffusion, Precision: INT8, triaged
#4208 opened Oct 17, 2024 by algorithmconquer

flux model engine_from_bytes(bytes_from_path(self.engine_path)) OutOfMemory
Labels: Demo: Diffusion, triaged
#4207 opened Oct 17, 2024 by algorithmconquer

flux-demo failure of TensorRT 10.5 when running a single L40 GPU, how to implement 2-GPUs with L40
Labels: Demo: Diffusion, triaged
#4205 opened Oct 17, 2024 by algorithmconquer

optimization profile is missing values for shape input
Labels: triaged
#4204 opened Oct 16, 2024 by OswaldoBornemann

Deploy DeBERTa to Triton Inference Server
Labels: triaged
#4202 opened Oct 16, 2024 by nbroad1881