The Flop (Floating Point Operations per Second) Rate of MATLAB Code

9 visualizaciones (últimos 30 días)
Hello, I know Intel MKL / IPP libraries performance in simple operations (Multiplication, Summation, Matrix Multiplication, Vector Multiplication) gets something like 80-95% of the theoretical performance of the CPU (Measured in FLOPS).
Yet, doing so using MATLAB I get much worse results.
I have this simple script:
numElements = 2 ^ 16;
numIter = 100;
vecX = randn(numElements, 1, 'single');
vecY = randn(numElements, 1, 'single');
initTime = tic();
for ii = 1:numIter
vecX .* vecY;
end
stopTime = toc(initTime);
gFlops = (numElements * numIter) / stopTime
Yet I get only 1.1 GFLOPS on my i7-860 Which should be closer to 2.8GHz (Frequency) * 4 (Cores) * 4 (Single Precisio Operations per Cycle as SSE Vector - 128 Bit) = 44.8 GFLOPS.
Yet I get something like 1.4 GFLOPS. Which is only 3% of the theoretical performance.
How can MATLAB be so inefficient?
  2 comentarios
Amit
Amit el 28 de En. de 2014
BTW, MATLAB is only using 1 core, I'd believe. And for a benchmark, is there anything else running besides MATLAB?
Royi Avital
Royi Avital el 19 de Feb. de 2014
MATLAB use more than one core.

Iniciar sesión para comentar.

Respuesta aceptada

Amit
Amit el 28 de En. de 2014
Editada: Amit el 28 de En. de 2014
I don't think the way you're trying to calculate flops here is right. Even if one assumes that you can calculate Flops like this, you're missing out many overheads that matlab is doing. For example, try something like this:
numElements = 2 ^ 18;
numIter = 100;
vecX = randn(numElements, 1, 'single');
vecY = randn(numElements, 1, 'single');
initTime = tic();
% for ii = 1:numIter
% vecX .* vecY;
% end
vecX + vecY; % I used +, but you can switch to .* as well
stopTime = toc(initTime);
gFlops = (numElements * numIter) / stopTime
And see if you see any difference. I am pretty sure you will. Remember, for loop is slow.
  4 comentarios
Salma Hassan
Salma Hassan el 16 de Jun. de 2019
can i use tic and toc only to compue the time elpase
Walter Roberson
Walter Roberson el 16 de Jun. de 2019
tic and toc only provide elapsed time information, which is not the same as the amount of computation done, as elapsed time can include time that the operating system suspended MATLAB in order to work on something else.

Iniciar sesión para comentar.

Más respuestas (1)

Walter Roberson
Walter Roberson el 28 de En. de 2014
single() is often slower than double()
Your arrays are not that big; I am not sure that it is kicking in calls to the libraries.
  2 comentarios
Royi Avital
Royi Avital el 28 de En. de 2014
I tried with doubles and larger vectors. Same result.
Feel free to try yourself and show results with better utilization of the CPU.
Such a pity this software isn't close to take a real advantage of the resources.
Walter Roberson
Walter Roberson el 29 de En. de 2014
Try with timeit. Or if you have an older MATLAB that does not have that built-in, you can get timeit from the File Exchange.

Iniciar sesión para comentar.

Categorías

Más información sobre Logical en Help Center y File Exchange.

Productos

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by