Arrayfun GPU in "Game of Life" works slower than CPU
    3 visualizaciones (últimos 30 días)
  
       Mostrar comentarios más antiguos
    
    Uladzislau
 el 18 de Mzo. de 2020
  
    
    
    
    
    Editada: Joss Knight
    
 el 28 de Mzo. de 2020
            Hello! I've run the demo "paralleldemo_gpu_stencil" and have such a result:
CPU:          2.815ms per generation.
Simple GPU:   2.650ms per generation (1.1x faster).
Arrayfun GPU: 13.253ms per generation (0.2x faster).
I've used ThinkPad P50 with Quadro M1000M and Matlab R2019b with appropriate drivers. Why Arrayfun works such slow ?? 
Uladzislau. 
0 comentarios
Respuesta aceptada
  Joss Knight
    
 el 28 de Mzo. de 2020
        
      Editada: Joss Knight
    
 el 28 de Mzo. de 2020
  
      Check out this Answer. The arrayfun version is rather dependent on good memory performance since the kernel is accessing global GPU memory in a non-coalesced way (multiple threads accessing overlapping regions of memory that aren't contiguous). For your chip, the version that runs multiple kernels on multiple shifted copies of the grid is actually more efficient, despite the kernel launch overhead and the extra memory allocation needed.
0 comentarios
Más respuestas (0)
Ver también
Categorías
				Más información sobre GPU Computing en Help Center y File Exchange.
			
	Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!

