tickersu wrote:
Harrogath wrote:
I don’t believe much of this. Empirically, R2 and R2adj calculated from a parsimonious model won’t differ much, practically nothing.
I don’t need you to believe it. I have seen plenty of cases where the two differ quite a bit in absolute terms, even with a parsimonious model. I have literally graded hundreds of exams by hand to see students different projects and analyses with no more than five terms and an intercept in a model. There are plenty of cases where the two values won’t differ much, and there are plenty of cases where they differ quite a bit. One of the main reasons the adjustment calculation was made to was to penalize for overfitting a model to data (now there are many more options for that). The usual R-squared will necessarily increase with more terms in the model, all else constant; the penalty was developed to avoid optimism in the predictive ability of the model that might arise from including more terms, even junk. Hence, the adjustment to r-squared which. This is also the point to encourage parsimony unless there is dramatic improvement from the terms,
relative to the sample size (so a sample size of 1000 with 3 terms will have closer values than 3 terms in a sample of 50, again penalization due to relative number of terms to sample size; in short, more degrees of freedom, less penalty). Other penalized statistics include Akaike’s Information Criterion (AIC) and Bayesian Information Criterion (BIC).
R2 is the relevant measure for explanatory power. On the other hand, R2 adjusted is meant for comparing models with different quantity of independent variables. In a sole regression (whatever the quantity of variables) we look at R2. If I want to compare two or more models’ R2, then, I look at R2 adjusted. I see you are misleading the relevance of R2 or thinking that an adjusted measure is superior by definition. Remember that R2 adjusted is derived from R2 and will always be lower than R2, no matter what.
Also, wouldn’t see never a big difference between R2 and R2 adjusted in a parsimonious model with a good sample data (size). If you are talking about models built in the edge of assumptions, then you may be right. Also, I don’t know what is for you a big difference in R2 and R2 adjusted. As you saw in my simulation, changing from 3 to 50 variables R2 is dropped 15 percent points when adjusted. That difference is bad.
On the other hand, what kind of models are we talking about.
tickersu wrote:
Harrogath wrote:
The utility in calculating R2adj resides in that we are now able to compare R2 of different models where the difference is the quantity of variables inside.
That is one benefit of calculating the adjusted (penalized) R-squared. However, this is not the only benefit. The big purpose is that we can look at a model’s explanatory power with penalization for junk in finite samples, which is really only relevant for multiple regression cases. Standard textbooks (from real statisticians) don’t even introduce adjusted R-squared until multiple regression. A large difference between the two values for a single model indicates overfitting a model to data (too many terms relative to the sample size and or fitting noise, essentially). I suggest you read up on these two traditional calculations; nearly every source will tell you that this is what “adjustment” means and why it is done.
Again, I don’t know why you assume R2 adjusted is better than R2 because it penalizes for “junk”. This is not true, R2 captures junk through SSE. Junk is detected when T-cal’s are not statistically significant, when F-cal is not statistically significant, etc.
If you introduce junk into your model, both R2 and R2 adjusted will be lower. And the effect of penalizing for adding 1 extra variable will be the 1% explanation of why R2 fell. This is because R2 adjusted is used to make R2s comparable. It is not an absolute measure, it is a relative one.
tickersu wrote:
Harrogath wrote:
In the trade-off between adding a new explanatory variable vs reducing (possibly) R2 can only be noticed comparing R2adj of the two models.
Not true. This can also be witnessed in the F-statistic or other model-based statistics that are sensitive to the degrees of freedom; so, not only through the adjusted R-squared.
I was talking about in the scenario of R2, not other measures. Otherwise, you would be right.
tickersu wrote:
Harrogath wrote: So, S2000 was right about the use of R2adj in his comment above.
I agreed there is utility for comparing different sets of right-hand-side terms for the same dependent variable.
Yes, yes. My principal motivation to criticize your comment is that S2000 was right, but instead, you added some other explanations that may be considered misleading, so wanted to clarify.
tickersu wrote:
Harrogath wrote:
You can run a sensitivity analysis using R2adj formula. Suppose:
SST = 28,280
SSE =
5,060 5,400
Data points

= 120 (large enough sample)
Number of variables (k) = 3
R2 = 80.91% R2adj = 80.58%
For k = 5
R2 = 80.91% R2adj = 80.24%
For k = 9
R2 = 80.91% R2adj = 79.53%
For k = 50
R2 = 80.91% R2adj = 67.54%
How did you run these simulations? There are good ways and bad ways to do this. Providing a detailed explanation of your simulation is generally good so people can evaluate what you did.
Just apply the below formulas:
I share with you my little model you can replicate in excel. Sorry, it seems I typed SSE = 5,060 in the forum when in fact I was running the model with 5,400. Sorry for that. Fixed above.
tickersu wrote:
Harrogath wrote:
Nobody uses 50 variables in a single model, nor even 9.
There are plenty of models that use 9+ terms in the model. It heavily depends on the field and the quantity of data available. There is a world beyond your line of sight.
Well, we are in a finance forum, and we suppose you do too. However, I would accept disciplines like medicine and other researches could use a lot of variables without falling in the field of increasing variance of errors in the search of “good fit”.
At least, in economy and finance, the data is not infinite and a parsimonious model will always be preferred. 9 variables for an econ model is a crime. Sorry.
tickersu wrote:
Harrogath wrote:
The figure changes a lot for small samples, of course.
Right, this is part of the penalization for too few degrees of freedom (too many terms relative to sample size). This is clearly why there is utility in using both values to look at a single multiple regression model. If I have a sample size of 20 with 6 predictors and an r-squared of 80% the adjustment may well leave me at 50% which would indicate that the benefit (reduced error sum of squares) of all the predictors, relative to the sample size, is overstated by the usual r-squared.
There is the problem, if you do talk about a model in the edge of assumptions or even violated assumptions, then your comments are correct. Otherwise they are not. A sample size of 30 is the bottom possible.
tickersu wrote:
I’m going to be good, and avoid a big back and forth on the rest of this, because we tend to disagree a lot. The narrow scope in the CFA/econ realm is far from what’s out there and what’s even common in other fields.
Not narrow, you are just working on fields different from finance. You are always fighting against the CFA program because the curriculum does not teach in detail regressions for other disciplines. Sorry, but 99% of financial analysts, or investment managers in the entire world will never run a regression estimate in their lives. At most, they will interpret an ANOVA table, if so. CFAI has made its job well enough.
tickersu wrote:
Instead of taking my experiences or text for it, check out some of the literature on it and some of the common regression texts.
Should I consider your selective set of approved books? You may publish the list somewhere.
tickersu wrote:
I am interested to continue discussing how you did the simulation, though. It looks as if you only increased the number of terms each time?
See above. Yes, the only variable controlled was “k”. My intention was to demonstrate that R2 and R2 adjustment are not much different in a parsimonious model with a reasonable sample size.