Rules of thumb on GPU usage?

3 visualizaciones (últimos 30 días)
David Short
David Short el 21 de Sept. de 2015
Editada: Joss Knight el 14 de Oct. de 2015
I've converted several algorithms to using a gpu and I've always seen a tremendous improvement in execution time. For the first time, this particular trick has failed me. I'm seeing execution time lengthen when I use the GPU.
Is there a better way to determine the performance of a code snippet in a gpu than to alter the code and try it out?
Specifically when the target code may be executed on different classes of gpu's are there rules of thumb to predict the improvement/degradation that will result?

Respuestas (1)

Joss Knight
Joss Knight el 12 de Oct. de 2015
You ought to provide some examples so that we know the kind of thing you're getting at.
The main rule of thumb is that the GPU will generally perform well when your code is highly data-parallel. If you get a speed-up from vectorizing your code, you'll probably get a speed-up on the GPU. This means the same sort of operations are taking place in multiple places on a large dataset. If however, you have small pieces of data, a lot of disparate tasks, dependent operations, and loops, you probably don't have something that will parallelize well.
  2 comentarios
David Short
David Short el 13 de Oct. de 2015
Thank you Joss, With this particular problem which we can think of as a large linear algebra problem with conditionals at the end, I did see improvement through vectorizing the code, but the code relies on logical indexing to reduce the space of the computation. In practice with the GPU that meant huge data transfers that ate any times gained by going to the GPU for calculation.
I may be able to restructure the problem so that other than logical indexes the data transfers much less often, but I have not got there just yet.
I'll update when I have more.
Joss Knight
Joss Knight el 14 de Oct. de 2015
Editada: Joss Knight el 14 de Oct. de 2015
gpuArray supports logical indexing so I see no reason why you would need any data transfers (see the blog article I linked above for examples). Can you explain?

Iniciar sesión para comentar.

Categorías

Más información sobre GPU Computing en Help Center y File Exchange.

Etiquetas

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by