Is it possible to index which variables to extract after Principle Component Analysis?

1 回表示 (過去 30 日間)
Hi All,
I am a little confused as to how I can determine which variables I want after running my data matrix X through the pca funtion.
[coeff,score,latent,tsquared,explained,mu] = pca(X);
I have a 1000x16 dataset. I know from the 'explained' output that 95% of my feature variability can be explained by 7 variables.
Is there a way of determining which 7 variables (which columns numbers from the original X matrix) these are through indexing from the original X matrix,coeff, scores matrix?
Any help would be greatly appreciated here.

採用された回答

the cyclist
the cyclist 2017 年 1 月 13 日
編集済み: the cyclist 2017 年 1 月 14 日
It will not be 7 of your original variable that explain 95% of the variation. This is an important concept to understand.
Rather, it will be 7 of the new variables, each of which is a linear combination of your original variables, that will explain the variation. Each column of coeff is the weight of the original variable. The first 7 columns of coeff are what you want.
  1 件のコメント
sh10101
sh10101 2017 年 1 月 14 日
Thanks for clearing this up!
So are the first 7 columns of coeff what I can now build my model using?

サインインしてコメントする。

その他の回答 (0 件)

カテゴリ

Help Center および File ExchangeDimensionality Reduction and Feature Extraction についてさらに検索

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by