I liked the article, but I just want to mention th...

2005-11-10T22:48:00.000-07:00

I liked the article, but I just want to mention that you aren't allowed (statistically) to do this many 95% significance tests at a time. Basically, you have a random chance of making the wrong conclusion 5% of the time, and when you do thousands of tests, you expect to find extreme examples among them. For instance, with the ANderson/Anderson matchup, you expect to find results that extreme or more (about) 1 in 10000 times. Given that you did 30,000 tests, you might expect one like this anyway.

It would be possible to determine how many matchups you tested could even give a result this low, like you demonstrated in the 5 AB example. For instance, if only 1500 matchups could achieve a p-value this low, the extremeness of the result is more noteworthy than if 15,000 could. This context for the tests is important when drawing conclusions.

Again though, I liked the article. I agree that it would be better to do with a different measure than BA. I remember Carlos Delgado hitting, I think, 5 straight HRs of Sosa(on the Braves this year). That sort of dominance is missed by using BA.

Comments on Dan Agonistes: Searching for Significance

I liked the article, but I just want to mention th...