GPU Coder vs. ONNXRuntime, is there a difference in inference speed?

2 visualizaciones (últimos 30 días)
David
David el 1 de Abr. de 2021
Editada: Joss Knight el 7 de En. de 2022
Since I can export from Matlab to ONNX format, why can't I just import my model into TensorRT etc.? Will I get significant speed increases or is the benefit of GPU Coder more about being able to compile all my other Matlab code into optimized Cuda?
Thanks in advance.

Respuestas (1)

Joss Knight
Joss Knight el 2 de Abr. de 2021
You can compile your network for TensorRT using GPU Coder if that's your intended target, no need to go through ONNX.
I don't believe MathWorks have any published benchmarks against ONNX runtime specifically. GPU Coder on the whole outperforms other frameworks, although it does depend on the network.
  2 comentarios
Matti Kaupenjohann
Matti Kaupenjohann el 7 de En. de 2022
Could you show/link the benchmark which includes the performance of gpucoder against other frameworks (which one?).
Joss Knight
Joss Knight el 7 de En. de 2022
Editada: Joss Knight el 7 de En. de 2022
We don't publish the competitive benchmarks, you'll have to make a request through your sales agent. we can provide some numbers for MATLAB.

Iniciar sesión para comentar.

Categorías

Más información sobre Deep Learning Toolbox en Help Center y File Exchange.

Etiquetas

Productos


Versión

R2021a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by