KungFuPanda wrote:
In the Wiley review book under Multicollinearity, it states that “Multicollinearity does not affect the consistency of OLS estimates and regression coefficients, but makes them inaccurate and unreliable.”
Question: what is definition of ‘consistency’ of estimates vs. accuracy and reliability?
Thanks!
Consistency is when the probability distribution of a parameter’s estimator collapses to a single point as the sample size increases. In other words, the standard error for the sampling distribution (of the given estimator) shrinks towards zero as the sample size grows. Therefore, it’s less likely that the value of our estimator is far away from the true parameter value when we increase the sample size (if the estimator is consistent).
Reliability refers to the magnitude and sign of the estimate being correct, and accuracy refers to the standard error being small (more or less, precision of our estimates).
When multicollinearity is a problem, the consistency property of OLS still holds (there’s some math that can be used to show why, but you won’t need it for the exam).
However, the correlation between the (group) of independent variables makes it hard to determine the partial effect of any one variable on the dependent variable. The coefficient estimates are likely to be incorrect in terms of magnitude and possibly in direction.
For example, in a regression of salary

on years of education (x1) and age (x2), it is likely that x1 and x2 are correlated. For our purposes, let’s assume the correlation is high enough to pose the issue of multicollinearity (there’s no clear cutoff for “too high” of a correlation, by the way). We would guess that x1 and x2 each have a positive effect on y and both should be significant in a regression (and research tends to show this in some capacity). However, our (hypothetical) estimated regression shows that x1 is actually negative, small, and not significant. This would tip you off that something isn’t right (since the estimate is different than the common logic and other research). This is an example of how multicollinearity could affect the reliability of estimated coefficients (small effect of education and it’s estimated as negative). This also shows how the coefficients are not “accurate”– the standard error was inflated, meaning we had less precision in our estimates (insignificant t-test).
Hope this helps!