Get the diagonal without calculating the explicit matrix

Dear all:
I am trying to calculate a diagonal of a matrix (denoted A), which is formed by multiplying two large-dimensional matrices (denoted as B*C).
A naive way to do it is: first, calculating explicitly A = B*C, then get diagonal out from A. However, the first step takes forever to run due to the high-dimension of B and C. But the only thing I need is the diagonal of A.
Another straightforward way in my mind is: I could create a loop by calculating each element of the diagonal of A one by one. It will surely save a lot of time, but I am not sure if this is the most efficient way.
I am wondering if anyone knows a faster/smarter way to calculate it.
Thank you very much in advance!
Best,
Long

1 comentario

Matt J
Matt J el 26 de Mzo. de 2020
Editada: Matt J el 26 de Mzo. de 2020
The best approach will depend on the dimensions of the matrices, and whether they are of sparse-type or not.

Iniciar sesión para comentar.

 Respuesta aceptada

Matt J
Matt J el 26 de Mzo. de 2020
Editada: Matt J el 26 de Mzo. de 2020
Assuming B*C results in a square matrix,
diagonal=sum(B.' .* C, 1);

8 comentarios

Matt J
Matt J el 26 de Mzo. de 2020
Editada: Matt J el 26 de Mzo. de 2020
If B*C is not square, the same concept applies, just a bit messier:
N=min(size(B,1), size(C,2));
diagonal=sum(B(1:N,:).' .* C(:,1:N), 1);
Thank you Matt - The algorithm is simple! However, it does not seem faster than doing the loop. Maybe it is because making a transpose of a high-dimensional matrix takes a lot of time. What do you think?
Matt J
Matt J el 26 de Mzo. de 2020
Editada: Matt J el 26 de Mzo. de 2020
Or perhaps the outer dimensions of your matrices are much smaller than the inner dimensions.
Matt J
Matt J el 26 de Mzo. de 2020
Editada: Matt J el 26 de Mzo. de 2020
Note that if your matrices are sparse, the looping method will be much slower.
N=7000;
B=sprand(N,10*N,10/N); C=B.';
tic; sum(B.'.*C,1); toc %Elapsed time is 0.021923 seconds.
tic;
for i=1:N
B(i,:)*C(:,i);
end
toc; %Elapsed time is 11.467097 seconds.
If the transpose in Matt's algorithm is really the time-consuming step, would it be possible to go upstream in your code and do all prior calculations in a way that the B you end up with is the transpose of the current one?
Thank you Matt! I have tested it in a relatively large subset of my original data, your algorithm is indeed faster. Thank you for the valuable advice!
Thank you the cyclist! Do you have any insight in doing the transpose quicker? I am a bit confused here.
Matt J
Matt J el 26 de Mzo. de 2020
Editada: Matt J el 26 de Mzo. de 2020
the cyclist means you might avoid the transpose by loading data column-wise instead of row-wise when you first build B.

Iniciar sesión para comentar.

Más respuestas (1)

Here is one way:
% Make up some inputs
N = 4;
B = rand(N);
C = rand(N);
% Calculate the diagonal
A_diag = 0;
for nr = 1:N
A_diag = A_diag + B(:,nr).*C(nr,:)';
end

1 comentario

Thanks the cyclist! This is a method I have applied currently.

Iniciar sesión para comentar.

Categorías

Más información sobre Creating and Concatenating Matrices en Centro de ayuda y File Exchange.

Preguntada:

el 26 de Mzo. de 2020

Editada:

el 26 de Mzo. de 2020

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by