How to use Parallel Coordinates Plot for Predictor selection?

Sahil Bajaj on 11 Feb 2019
Answered: Perry Gogas on 13 Nov 2019
Hi,
I have a question about the parallel coordinates plot in the Classification Learner app (Machine Learning).
I have a parallel coordinates plot just like the one shown in the figure on this page: https://www.mathworks.com/help/stats/feature-selection-and-feature-transformation.html#buwh6hc-1
The description on that page (point 5) says: "If you identify predictors that are not useful for separating out classes, use Feature Selection to remove them and train classifiers including only the most useful predictors."
It's not clear to me how I can use this plot to figure out which predictors are not useful for separating the classes. My plot has 35 features for 2 classes. I want to reduce the dimensionality of my data by removing the features that do not help distinguish my classes, but I have no idea how this figure can help me identify those features.
Any help would be really appreciated.
Thanks !
Sahil

Accepted Answer

Patel Mounika on 20 Feb 2019
Let’s look at the parallel coordinate plot shown in the figure you sent: https://www.mathworks.com/help/stats/feature-selection-and-feature-transformation.html#buwh6hc-1
In this plot, compare the sepal widths of the different flowers: the values for setosa, versicolor and virginica overlap, so it will be difficult to classify based on this feature. The values of petal width, on the other hand, are distinct for the different flowers (not overlapping), which helps in classifying them. So, based on this plot, petal length and petal width are the features that separate the classes best, compared to sepal width and sepal length. You can read your own plot the same way: predictors where the lines of your two classes overlap heavily are candidates to remove.
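With 35 predictors it can be hard to judge overlap by eye, so a rough numeric check may help alongside the plot. The snippet below is only a minimal sketch in plain Python, not something the Classification Learner app does: for each feature it divides the squared gap between the two class means by the summed within-class variances, so a feature whose classes overlap heavily scores near zero. The function name `separation_scores`, the feature names `f1` and `f2`, and the data values are all made up for illustration.

```python
from statistics import mean, pvariance


def separation_scores(rows, labels, feature_names):
    """Score each feature by the gap between class means relative to within-class spread."""
    classes = sorted(set(labels))
    if len(classes) != 2:
        raise ValueError("this sketch handles exactly two classes")
    scores = {}
    for j, name in enumerate(feature_names):
        # Split the values of feature j by class.
        a = [r[j] for r, y in zip(rows, labels) if y == classes[0]]
        b = [r[j] for r, y in zip(rows, labels) if y == classes[1]]
        gap = (mean(a) - mean(b)) ** 2
        spread = pvariance(a) + pvariance(b)
        if spread > 0:
            scores[name] = gap / spread
        else:
            # No spread within either class: perfectly separated or identical.
            scores[name] = float("inf") if gap > 0 else 0.0
    return scores


# Made-up toy data: f1 separates classes A and B, f2 overlaps completely.
feature_names = ["f1", "f2"]
rows = [
    [1.0, 5.0],
    [1.2, 3.0],
    [0.8, 4.0],
    [3.0, 4.5],
    [3.2, 3.5],
    [2.8, 4.0],
]
labels = ["A", "A", "A", "B", "B", "B"]

scores = separation_scores(rows, labels, feature_names)
# Highest score first: the best-separating features come out on top.
ranked = sorted(scores, key=scores.get, reverse=True)
for name in ranked:
    print(f"{name}: {scores[name]:.2f}")
```

Features at the bottom of the ranking correspond to the axes where the two classes overlap in the plot. This score looks at one feature at a time, so it can miss predictors that are only useful in combination with others.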
Hope this helps.
1 Comment
Sahil Bajaj on 20 Feb 2019
Thank you so much Mounika, that really helps.


More Answers (1)

Perry Gogas on 13 Nov 2019
But I think that you also have to look at the misclassified cases, which are marked with dashed lines. These also provide information on the importance of each variable.
