Okay, so here we go. We assume:
(1) A batter’s season-to-date average for hits per game is representative of how he will continue to play, and how he will continue to be pitched to. With 80+ games gone by, we’re probably not yet on solid ground, but I like it better than using a batter’s career data. denquixote might be able to make a good argument for using career data instead, but for now let’s use current season hits-per-game as the mean.
(2) The number of hits per batter per game is a Poisson process and can be modeled as in Punoqllads’s post above: a game is a “trial” and a hit is an “arrival” or “event”, so average hits-per-game corresponds to the Poisson term lambda (average). Notice that this is a leap from what he proved above, which is that on a Friday night, all batters in all games matched the Poisson distribution. I’m willing to take some flak from smarter math-heads on this assumption.
(3) We can model the output of any three batters by assuming a Poisson distribution with lambda equal to the sum of their hits-per-game averages. For any number of combined hits “N or more” we can subtract {P(N-1)+P(N-2)+…+P(0)} from 100% to get the probability that these three hitters will combine for N-or-more hits, if each is given one trial. I’ve read up on the Poisson distribution and I think this assumption proceeds from (2), but again, point out if you think I’ve gone astray.
With those three assumptions, we have the probability of getting seeing N hits (N across the top) given a group of three batters with a given HPG total (HPG down the left). The winning case, for N >= 6, is the last column.
HPG 0 1 2 3 4 5 6 7 8 9 6 or more
3 4.98% 14.94% 22.40% 22.40% 16.80% 10.08% 5.04% 2.16% 0.81% 0.27% 8.39%
3.05 4.74% 14.44% 22.03% 22.39% 17.08% 10.42% 5.30% 2.31% 0.88% 0.30% 8.90%
3.1 4.50% 13.97% 21.65% 22.37% 17.33% 10.75% 5.55% 2.46% 0.95% 0.33% 9.43%
3.15 4.29% 13.50% 21.26% 22.32% 17.58% 11.08% 5.81% 2.62% 1.03% 0.36% 9.98%
3.2 4.08% 13.04% 20.87% 22.26% 17.81% 11.40% 6.08% 2.78% 1.11% 0.40% 10.54%
3.25 3.88% 12.60% 20.48% 22.18% 18.02% 11.72% 6.35% 2.95% 1.20% 0.43% 11.12%
3.3 3.69% 12.17% 20.08% 22.09% 18.23% 12.03% 6.62% 3.12% 1.29% 0.47% 11.71%
3.35 3.51% 11.75% 19.69% 21.98% 18.41% 12.34% 6.89% 3.30% 1.38% 0.51% 12.32%
3.4 3.34% 11.35% 19.29% 21.86% 18.58% 12.64% 7.16% 3.48% 1.48% 0.56% 12.95%
3.45 3.17% 10.95% 18.89% 21.73% 18.74% 12.93% 7.43% 3.66% 1.58% 0.61% 13.58%
3.5 3.02% 10.57% 18.50% 21.58% 18.88% 13.22% 7.71% 3.85% 1.69% 0.66% 14.24%
3.55 2.87% 10.20% 18.10% 21.42% 19.01% 13.50% 7.99% 4.05% 1.80% 0.71% 14.91%
3.6 2.73% 9.84% 17.71% 21.25% 19.12% 13.77% 8.26% 4.25% 1.91% 0.76% 15.59%
3.65 2.60% 9.49% 17.31% 21.06% 19.22% 14.03% 8.54% 4.45% 2.03% 0.82% 16.28%
3.7 2.47% 9.15% 16.92% 20.87% 19.31% 14.29% 8.81% 4.66% 2.15% 0.89% 16.99%
3.75 2.35% 8.82% 16.54% 20.67% 19.38% 14.53% 9.08% 4.87% 2.28% 0.95% 17.71%
3.8 2.24% 8.50% 16.15% 20.46% 19.44% 14.77% 9.36% 5.08% 2.41% 1.02% 18.44%
3.85 2.13% 8.19% 15.77% 20.24% 19.48% 15.00% 9.62% 5.29% 2.55% 1.09% 19.19%
3.9 2.02% 7.89% 15.39% 20.01% 19.51% 15.22% 9.89% 5.51% 2.69% 1.16% 19.94%
3.95 1.93% 7.61% 15.02% 19.78% 19.53% 15.43% 10.16% 5.73% 2.83% 1.24% 20.71%
4 1.83% 7.33% 14.65% 19.54% 19.54% 15.63% 10.42% 5.95% 2.98% 1.32% 21.49%
4.05 1.74% 7.06% 14.29% 19.29% 19.53% 15.82% 10.68% 6.18% 3.13% 1.41% 22.27%
4.1 1.66% 6.79% 13.93% 19.04% 19.51% 16.00% 10.93% 6.40% 3.28% 1.50% 23.07%
4.15 1.58% 6.54% 13.58% 18.78% 19.48% 16.17% 11.18% 6.63% 3.44% 1.59% 23.87%
4.2 1.50% 6.30% 13.23% 18.52% 19.44% 16.33% 11.43% 6.86% 3.60% 1.68% 24.69%
4.25 1.43% 6.06% 12.88% 18.25% 19.39% 16.48% 11.67% 7.09% 3.77% 1.78% 25.51%
4.3 1.36% 5.83% 12.54% 17.98% 19.33% 16.62% 11.91% 7.32% 3.93% 1.88% 26.33%
4.35 1.29% 5.61% 12.21% 17.71% 19.26% 16.75% 12.15% 7.55% 4.10% 1.98% 27.17%
4.4 1.23% 5.40% 11.88% 17.43% 19.17% 16.87% 12.37% 7.78% 4.28% 2.09% 28.01%
4.45 1.17% 5.20% 11.56% 17.15% 19.08% 16.98% 12.60% 8.01% 4.45% 2.20% 28.86%
4.5 1.11% 5.00% 11.25% 16.87% 18.98% 17.08% 12.81% 8.24% 4.63% 2.32% 29.71%
It shows that if you pick the three best hitters (per game) in the league-- Suzuki, Jeter, Young (combined avg = 4.12) – you have a 23% of getting your payout and a 77% chance of losing your money. It looks like the break-even point (12.5% expectation of success) is when the batters have a combined average of 3.35 hits per game, and there’s a pretty steep drop-off even within the top twenty hitters, so if you’re arbitrarily picking batters you’re practically guaranteed to lose.
I was surprised to see this result because it implies that either the bookies are setting the odds wrong or the punters pick their batters badly. I’d be interested to have someone with a stronger math background examine my third assumption - that we can sum the averages and count the three parallel trials as a single trial of the collective.