Mvregress and choice of the estimation algorithm

1 回表示 (過去 30 日間)
Maria445
Maria445 2018 年 9 月 23 日
回答済み: Ayush Aniket 2024 年 12 月 20 日
I am performing a multivariate regression analysis using mvregress:
[beta,Sigma] = mvregress(X,Y,'algorithm','cwls');
However, I noticed that, although the estimates are identical, the standard errors completely change if I use the cwls algorithm as opposed to the mvn one. In particular, with cwls, almost all coefficient appear as significant, while, using mvn, almost none of them is.
There are no missing responses in my data, so the default would be mvn.
Can a simple algorithm choice be so influential on the result? Is there a way I can choose wisely between the two?

採用された回答

Ayush Aniket
Ayush Aniket 2024 年 12 月 20 日
The choice of algorithm in multivariate regression can indeed influence the results, particularly the estimation of standard errors becuase of their methods of computing as described below:
  • CWLS (Component-Wise Least Squares) approach estimates the regression coefficients for each response variable separately, which can lead to smaller standard errors if the responses are not highly correlated. It does not account for the covariance between response variables in the error terms, potentially underestimating standard errors if there is significant correlation.
  • MVN (Multivariate Normal): Assumes a multivariate normal distribution for the errors and estimates the coefficients considering the covariance structure between responses This method generally provides more robust standard error estimates when responses are correlated, as it accounts for the covariance.
Some basic suggestions for choosing the right algorithm:
  • Correlation Between Responses: If your response variables are correlated, mvn is usually more appropriate as it accounts for this correlation in the estimation process.
  • Model Assumptions: Consider the assumptions of each method. mvn assumes multivariate normality of errors, which might not hold in all cases.
  • Data Characteristics: If you have a large dataset with complex covariance structures, mvn might be better suited. For simpler cases or when computational efficiency is a concern, cwls might suffice.

その他の回答 (0 件)

カテゴリ

Help Center および File ExchangeStatistics and Machine Learning Toolbox についてさらに検索

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by