tickersu wrote:
philstinnet wrote:
My statement was not a blanket statement. The excerpt below has my response -
Book - Understandable Statistics by Brase & Brase, 11th Edition, Page 432, Last Paragraph -
“For example, if the sample size is small and the sample shows extreme outliers or extreme lack of symmetry, use of the Student’s t distribution is inappropriate.”
Thanks
Phil
There are some key words in there, “extreme”, that make the statement safer, but it’s still a blanket statement. It also looks like it’s an introductory (undergraduate) text. Introductory texts don’t tend to cover the details of actually using the methods taught, but rather, they go for the black and white situations to give students a foundation. Yes, it might be inappropriate from a black and white stance without considering just how “violated” some of the assumptions are. For example, in practice, people (who don’t have a solid stats background) often use normality hypothesis tests to see if they need a parametric or nonparametric technique, but these tests are extremely sensitive to nonnormality and many parametric techniques are robust enough to handle some moderate nonnormality (in other words, they take a very black and white approach to something that isn’t clear cut). These people use the cookbook approach of “not normal, nonparametric” when really it isn’t the best way of doing things. A person with experience probably wouldn’t use the formal hypothesis test because of how sensitive it is (i.e. how easily it picks up the slightest nonnormality in the data) because this often doesn’t have a practical consequence. However, many introductory books will tell you to use these normality tests because they’re trying to build a basic foundation (which is different from real world practice where you use judgment and less sensitive methods in assessing normality, keeping with the example).(Keep in mind, I’m not saying to disregard the issue of normality, but it’s important to recognize that statistics isn’t a cookbook as many beginner texts can imply.)
To the original point: you can always do a nonparametric technique, but if you get the same general picture with a nonparametric technique as you get with the parametric, it would indicate that the “extreme” outliers or “extreme” asymmetry isn’t as big of a problem as you would expect .
I’ve seen this all first hand in graduate courses and from a PhD statistician, so I’m not pulling this from thin air. Hopefully that illustrates where I’m coming from– you probably could find this on this internet with searching, but I don’t think it’s a real important thing to find. It’s a logical result that the parametric technique is probably okay to use when it’s giving the same information as the nonparametric. If the parametric were truly inappropriate, you would get pretty different results (that’s why you could do both to see how it shakes out).