Harrogath wrote:
tickersu wrote:
Harrogath wrote:
contradictory. Multicollinearity is a violation of the OLS assumptions and must be corrected dropping one variable.
Regards
Only perfect collinearity is a violation of an OLS assumption. Any other level of multicollinearity is not a violation of any OLS assumption.
You also do not necessarily need to drop one or more of the correlated indep. variables. It is a proposed remedy, but it depends on the purpose and intent of your research.
Yes, if you have perfect multicollinearity you are, in deed, having no regression calculation. The result is &####!, nothing. Thats why it is a violation.
Right, I stated that perfect collinearity is a violation. As you mentioned, you can’t estimate the regression.
But a severe multicollinearity is a violation in a practical way.
I would avoid calling it a violation, since there actually are assumptions that can be violated. Besides, you wouldn’t call a low R-squared a “practical violation”; it’s just something that might cause you to adjust your model. If you are fitting the regression equation for prediction purposes, then it’s okay to have multicollinearity (all you want to do is predict the DV). If you want to make inferences and examine relationships from your model, then you need to address the multicollinearity issue.
High M (around r > 0.7 for many variables) is just not healthy for your regression, and not even healthy, it turns it just useless. We already explained the problems of having high M.
Not true that the model becomes “useless” (depends on your intent for the model). Go consult the literature on this. There is no threshold for multicollinearity being a problem. You are right that higher bivariate and “group” correlations are more likely to pose an issue, but there isn’t a guaranteed cutoff. In other words, you can’t look at the correlation and say, “Ah, it’s higher than X, so we have multicollinearity problems”…The severity of multicollinearity is diagnosed by looking at many factors [signs and magnitude of beta estimates, variance inflation factors (VIFs– rule of thumb: if VIF is at least 10, you probably have an issue), etc.]. Multicollinearity can pose a problem with a bivariate correlation of 0.2 or it could not. It’s just something that needs to be diagnosed with a few tools.
If your research purpose needs a X matrix with high M, and you need that X matrix at all cost; well, you can not conclude nothing so. That will be your conclussion.
I’m not sure what you’re saying here. The only time multicollinearity is an issue with the matrices is when you have perfect collinearity (singular matrix), as we both said earlier.