Here’s one for the probability experts that’s always stumped me:
One of the most famous numbers in baseball is 56 - the major league record for consecutive game hitting streak, set by Joe DiMaggio. The next highest is 44 (by Pete Rose, if I recall correctly).
However, it’s rarely, if ever, that a season has ended without a hitter amassing 200 hits - an average of more than one per game.
Now, obviously there’s some statistical variation…there will be games that a good hitter (say, Wade Boggs or Tony Gwynn in their primes) will get no hits, and some in which they get two or three.
But why should hitting streaks greater than 30 games in length be so rare? If we accept as a given that at least one hitter will get 200 hits in a year, assuming he plays each of his team’s games (162), and assuming he gets the same number of hittable at-bats each game (I suppose that this assumption could explain why in real life, it doesn’t happen), what’s the statistical probability of that hitter stringing together a hitting streak of, say, 100 games in a row? 80? 60 (yes, I know Joe D. did that in the minors)?
Basically, does raw probability support the unlikelihood of long hitting streaks, or does it make more sense to attribute the rarity to things such as change in opponent strategy, increase in pressure, etc.?
Chaim Mattis Keller
ckeller@kozmo.com
“Sherlock Holmes once said that once you have eliminated the
impossible, whatever remains, however improbable, must be
the answer. I, however, do not like to eliminate the impossible.
The impossible often has a kind of integrity to it that the merely improbable lacks.”
– Douglas Adams’s Dirk Gently, Holistic Detective