Replies: 9 comments
-
@sty1992928, may I ask, are you using this project?
-
Onnxruntime has some unsolved performance problems with conv1d: microsoft/onnxruntime#7212, microsoft/onnxruntime#8513. We found that only version 1.8.0 is somewhat performant (usually these issues come down to bad configuration of the cuDNN benchmarking workspace size). Please also report this to onnxruntime so these issues get bumped in priority.
-
I use CPU only; is this expected?
-
I use CPU only; does CPU mode have the same performance problems?
-
Those issues are unrelated to CPU, so likely not...
-
Thank you anyway! The int8 quantization works, so my problem is gone for now.
-
Hi, may I ask how you exported the model to ONNX?
-
Hi, you can use torch.onnx.export to export a torch module to ONNX.
-
Dear @xIaott-s, could you please share your code for converting your model to ONNX and quantizing it?
-
Hi, I exported the ECAPA-TDNN model to ONNX and use onnxruntime for inference, but the inference speed is slower than PyTorch. Does anybody know why?