The size of my PCA isn't correct

18 visualizaciones (últimos 30 días)
Héctor
Héctor el 18 de En. de 2016
Comentada: Frank Nylander el 4 de Feb. de 2024
I am trying to perform principal component analysis using pca and not princomp. My dataset consists of 303 samples each containing 3904 dimensions, which explains why I want to perform PCA. My data is in a matrix called Data which has a size of 303 (rows) by 3904 (columns). When I execute the PCA command with
[coeff,score] = pca(Data);
I am returned a matrix coeff of size 3904*302 and the matrix score is of size 303*302 which doesn't seem to be correct right? Online documentation suggests that coeff should be 3904*3904 (i.e. square with the dimension size of the variables, here the columns). What is going wrong here?
ps: I also tried transposing the data and this results in a coeff matrix of 303*303

Respuestas (1)

Roger Wohlwend
Roger Wohlwend el 19 de En. de 2016
It seems that Matlab expects a Matrix with more observations than variables (i.e. more rows than columns). As you violate that rule (only 303 observations for 3904 variables), Matlab transposes the Matrix in order to obtain a Matrix with more observations than variables. If you transpose the Matrix, Matlab interprets your variables as observations and the other way around. So the result is not valid. You cannot solve this Problem. If you don't have more than 303 observations, your Matrix must not have mor than 303 columns.
  1 comentario
Frank Nylander
Frank Nylander el 4 de Feb. de 2024
Has this been fixed? I'm getting the same problem.

Iniciar sesión para comentar.

Categorías

Más información sobre Dimensionality Reduction and Feature Extraction en Help Center y File Exchange.

Productos

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by