How to use Parallel Coordinates Plot for Predictor selection?
2 visualizaciones (últimos 30 días)
Mostrar comentarios más antiguos
Sahil Bajaj
el 11 de Feb. de 2019
Respondida: Perry Gogas
el 13 de Nov. de 2019
Hi,
I have a question about Parallel Coordinates Plot from Classifier app (Machine Learning).
I have Parallel Coordinates Plot just like the one shown in the figure on this page: https://www.mathworks.com/help/stats/feature-selection-and-feature-transformation.html#buwh6hc-1
In the description of this page (on point 5), its mentioned that "If you identify predictors that are not useful for separating out classes, use Feature Selection to remove them and train classifiers including only the most useful predictors."
Its not clear to me how can I use this plot to figure out which predictior are not useful for separating out the classes? In my plot I have 35 features for 2 classes, I want to remove the features which are not helpful for disntnigushing my classes, so I want to reduce the dimensionality of my data and remove the unuseful features. But I have to idea how this figure can be helpful me in removing those features.
Any help would be really appreciated.
Thanks !
Sahil
0 comentarios
Respuesta aceptada
Patel Mounika
el 20 de Feb. de 2019
Let’s look at the parallel coordinate plot shown in the figure you sent: https://www.mathworks.com/help/stats/feature-selection-and-feature-transformation.html#buwh6hc-1
In this plot take look at the comparison of the sepal widths of the different flowers, the values for setosa, versicolor and virginica are overlapping because of which it will be difficult to classify based on this feature and on the other hand the values of petal width are distinct for different flowers (or not overlapping) which will help in classification of the flowers. So, based on this petal length and petal width are the features that separate the classes best compared to sepal width and sepal length.
Hope this helps.
Más respuestas (1)
Perry Gogas
el 13 de Nov. de 2019
But I think that you also have to look at the missclassified cases marked with the dashed lines. These too provide information on the importance of each variable.
0 comentarios
Ver también
Categorías
Más información sobre Classification Ensembles en Help Center y File Exchange.
Productos
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!