ONNX and Torch inference outputs are not the same #108

Open
nhanerc opened this issue Jul 24, 2023 · 2 comments
nhanerc commented Jul 24, 2023

Hi,

I ran the script below, and the outputs from the two models do not match.

# Text recognition
import torch
from PIL import Image
from torchvision import transforms as T
import onnxruntime as rt

# Load the pretrained PARSeq model and build the matching preprocessing pipeline
parseq = torch.hub.load("baudm/parseq", "parseq", pretrained=True).eval()
img_transform = T.Compose(
    [T.Resize(parseq.hparams.img_size, T.InterpolationMode.BICUBIC), T.ToTensor(), T.Normalize(0.5, 0.5)]
)

# Export to ONNX using a random dummy input of the expected shape
x = torch.rand(1, 3, *parseq.hparams.img_size)
parseq.to_onnx("/tmp/parseq.onnx", x, opset_version=14, do_constant_folding=True)

options = rt.SessionOptions()
options.enable_profiling = False
rec_session = rt.InferenceSession("/tmp/parseq.onnx", options=options)

# Preprocess a real image
inp = Image.open("cute-184.jpg").convert("RGB")
inp = img_transform(inp).unsqueeze(0)

# ONNX inference
logits = rec_session.run(None, {rec_session.get_inputs()[0].name: inp.numpy()})[0]
logits = torch.tensor(logits)
pred = logits.softmax(-1)
label, confidence = parseq.tokenizer.decode(pred)
print("onnx:", label)

# Torch inference
logits = parseq(inp)
pred = logits.softmax(-1)
label, confidence = parseq.tokenizer.decode(pred)
print("torch:", label)

The output and the torch/onnxruntime versions:
[screenshot: the ONNX and Torch predictions differ; installed torch/onnxruntime versions shown]

I have no idea why this happens.
I also tried the following, as others have suggested, but it still fails to produce the same output.

parseq = torch.hub.load("baudm/parseq", "parseq", pretrained=True, refine_iters=False).eval()
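
For anyone reproducing this, a quick way to quantify the gap numerically rather than just comparing decoded labels (a minimal check, reusing parseq, rec_session, and inp from the script above):

import numpy as np

# Torch logits (no grad needed for inference)
with torch.no_grad():
    torch_logits = parseq(inp).numpy()

# ONNX logits on the same input
onnx_logits = rec_session.run(None, {rec_session.get_inputs()[0].name: inp.numpy()})[0]

print("max abs diff:", np.abs(torch_logits - onnx_logits).max())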
YenYunn commented Mar 11, 2024

My code is almost the same as yours. After converting it to ONNX, did you encounter this error during ONNX inference?
Traceback (most recent call last):
  File "D:\PyCharmProject\parseq-main\inference.py", line 65, in <module>
    session = onnxruntime.InferenceSession('parseq.onnx',
  File "D:\PyCharmProject\parseq-main\.venv\lib\site-packages\onnxruntime\capi\onnxruntime_inference_collection.py", line 383, in __init__
    self._create_inference_session(providers, provider_options, disabled_optimizers)
  File "D:\PyCharmProject\parseq-main\.venv\lib\site-packages\onnxruntime\capi\onnxruntime_inference_collection.py", line 435, in _create_inference_session
    sess.initialize_session(providers, provider_options, disabled_optimizers)
onnxruntime.capi.onnxruntime_pybind11_state.NotImplemented: [ONNXRuntimeError] : 9 : NOT_IMPLEMENTED : Could not find an implementation for Where(9) node with name '/Where_19'
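
In case it helps with debugging: one way to validate the exported graph and inspect the failing node (a minimal sketch using the onnx package; the node name is taken from the traceback above):

import onnx

model = onnx.load("parseq.onnx")
onnx.checker.check_model(model)  # validates the graph structure

# Look at the node onnxruntime could not find an implementation for
for node in model.graph.node:
    if node.name == "/Where_19":
        print(node.op_type, node.input, node.output)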

felixdittrich92 commented May 7, 2024

CC @baudm
We have hit the same issue: the break in the AR decoding loop isn't translated correctly to ONNX
-> with AR only, this results in a shape mismatch on the max_length dimension
-> with AR + RI, the shapes look fine but the logits differ by roughly ±0.5

Not breaking out of the AR loop fixes the issue (only needed if you want to export):
https://github.com/mindee/doctr/pull/1585/files
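
The gist of the change is the following pattern (a self-contained toy sketch, not the actual parseq code; decode_step, eos_id, etc. are made-up names):

import torch

def decode_step(tokens: torch.Tensor, i: int) -> torch.Tensor:
    # Stand-in for one AR decoder step (hypothetical); returns per-class logits.
    return torch.randn(tokens.size(0), 10)

max_length = 8
eos_id = 9
tokens = torch.zeros(2, max_length + 1, dtype=torch.long)

# Export-unfriendly: the break depends on runtime tensor values, so tracing
# records only the iterations taken for the example input, which causes the
# max_length shape mismatch on other inputs at inference time.
for i in range(max_length):
    logits = decode_step(tokens, i)
    tokens[:, i + 1] = logits.argmax(-1)
    if (tokens == eos_id).any(-1).all():
        break

# Export-friendly: always run all max_length steps so the traced graph has a
# fixed shape; positions after EOS can be masked out afterwards.
for i in range(max_length):
    logits = decode_step(tokens, i)
    tokens[:, i + 1] = logits.argmax(-1)

With the loop always running to max_length, export succeeds and the outputs line up: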

Time for base model: 0.04333
Time for onnx model: 0.02047
parseq --> mean diff: 1.8520279354561353e-06

This can certainly be solved in a nicer way, but it should fix the problem for now.
