Graph analysis question
Mostrar comentarios más antiguos
I have this data:
X = 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21
With
Y = 0.5 1 1.5 2 2.5 3 3.5 4 1 1.5 3 3.5 4.5 5.5 6 2 4 4.5 5.5 6.5 8
This may be more of a math question rather than a matlab question. If you plot the above data you will see three distinct sets of data, is there any way I can get matlab to automatically split this data into the three seperate variables. The number of variables may change depending on the data set, the data I have provided is indicative for the real data the X axis is actually dates.
In short I need Matlab to detect how many separate data sets there are and split the data into different variables.
I've tried using a max and min point script but I didn't get very far with it.
Thanks in advance
3 comentarios
Robert Cumming
el 3 de Ag. de 2011
in the above example do you want to split X into 21 seperate variables? Or do I not follow.
Loop at the functions unique and find
the cyclist
el 3 de Ag. de 2011
Robert, if you do plot(X,Y,'.'), you will understand better what he means. The data lie in three well defined linear groups.
Robert Cumming
el 3 de Ag. de 2011
yes I see now that I actually plotted it... Misinterpreted the question
Respuesta aceptada
Más respuestas (2)
the cyclist
el 3 de Ag. de 2011
0 votos
What you are asking about is generically known as "cluster analysis", and MATLAB does have some functionality for it, at least in the Statistics Toolbox. If you have that toolbox, look at "doc kmeans" as a possible starting point.
1 comentario
the cyclist
el 3 de Ag. de 2011
I put this answer up before I plotted out your data. Not sure that kmeans() is going to help you, because I think it is always searching for a "nearest neighbors" sort of clustering, rather than the linear groups you have. It seems to me that you have a fairly specialized problem here, and you'll likely need to write custom code for it.
Wolfgang Schwanghart
el 3 de Ag. de 2011
I just tried your example. While results of a kmeans clustering don't look too promising, the function clusterdata works quite well. What you should know a priori is the number of clusters in your data.
x = [1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21]';
y = [0.5 1 1.5 2 2.5 3 3.5 4 1 1.5 3 3.5 4.5 5.5 6 2 4 4.5 5.5 6.5 8]';
IDX = clusterdata([x y],'distance','chebychev','maxclust',3);
gscatter(x,y,IDX)
Regards, W.
2 comentarios
the cyclist
el 3 de Ag. de 2011
kmeans() seems to do better than clusterdata() to me, although neither very acceptably well. kmeans() at least gets most of the points attributed to the correct lines. clusterdata() combines almost all of the middle and right-hand lines into one cluster (at least with the syntax you provided).
Matt
el 3 de Ag. de 2011
Categorías
Más información sobre k-Means and k-Medoids Clustering en Centro de ayuda y File Exchange.
Productos
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!