Statistics question about Nate Silver's predictions

El_Zagna · November 12, 2012, 3:24am

A lot has been and is being said about Nate Silver’s accuracy, but there is one thing about his model that I don’t think I understand.

Silver didn’t make predictions per se but rather expressed the likelihood that something would happen in terms of probability, so it seems to me that for his model to be truly accurate he needed some “misses”. Even if you can say that something has a 90% chance of happening, that means that one out of ten times it won’t happen. If you say that two separate events each has a 90% chance of happening then for both of them to happen you’re looking at 90% * 90% = 81% that at least one will not happen.

If you make this kind of prediction across all 50 states then the likelihood that you will get them all right drops to .515% or about one in 200 tries (.9 ^ 50 if I remember this stuff correctly).

Even if all your state probabilities are at 99% you still only have a 60% of getting them all right. Throw in the sub 60% probabilities like Florida and your odds drop dramatically.

So it seems that as a predictive tool, Nate’s model is excellent, but as a probabilistic tool, it is much too pessimistic. Although it sounds counter intuitive, shouldn’t Nate have been hoping for some misses?

colonial · November 12, 2012, 3:39am

There has been some talk that Silver needs to recalibrate the percentages
because he did so well. Maybe something like upping the 2012 50% to 70%.

On the other hand, when you go 50 for 50 you might be excused for standing pat.

zombywoof · November 12, 2012, 3:47am

I’m no statistics expert, but in this case aren’t you’re looking at an 81% chance that both will happen, which works out to a 19% chance of at least one not happening?

OldGuy · November 12, 2012, 4:17am

Yes, you’re correct, provided the events are independent. But in this case they almost certainly are not. They are positively correlated which means the probably of both happening is between 90% (for perfect correlation) and 81% (for zero correlation). I don’t know what the correlation is.

This of course applies to the OP as well.

MikeS · November 12, 2012, 4:35am

Another thing to point out is that Silver’s model was pretty damn clear about how the vast majority of the states (i.e., the non-swing states) were going to go. By the time the election rolled around, he only had 13 states with forecast probabilities between 0.5% and 99.5%; and of these, only six (FL, NC, NH, VA, IA, CO) had forecast probabilities between 10% and 90%. You’ll notice, by the way, that Ohio is not on that list, which is why Silver thought Obama was the strong favorite in the week before the election.

FYI, if you calculate the probability of Silver getting all of the states right given the probabilities he was giving the day before the election, you get about 14%. Not impossible, but on the unlikely side.

Gorsnak · November 12, 2012, 4:39am

According to Silver’s blog post a day or two before the election, almost all of the remaining chance for Romney wins was based on polls being systematically skewed. Now, it turns out that the polls were skewed, but they were skewed Republican. I haven’t done the math, but I’m pretty sure if they’d been skewed Dem as much as they were actually skewed Rep, Romney would have taken Florida at least and likely Virginia and Ohio and who knows? So I’m not sure the probability of an Obama win was understated by Silver.

ultrafilter · November 12, 2012, 5:09am

Is this based on assuming independent events? Cause that’s not really plausible here.

But to answer the OP, yes. If I tell you that something will happen 95% of the time and it actually always happens, I’ve made a bad prediction. Nate got lucky this time, but he will “miss” eventually by the simple law of averages. I put “miss” in quotes because he’s not calling the states; he’s actually giving estimates of the probabilities that they’ll break each way. Not enough people understand that distinction, so it’s nice to see this question popping up.

drewtwo99 · November 12, 2012, 5:15am

He also missed 2 senate races. The one in North Dakota he missed by quite a bit in the sense that he assigned a high percentage to the republican, but the democrat won narrowly.

El_Zagna · November 12, 2012, 5:27am

Thanks for catching my error, zombywoof.

The correlation/independence factor makes sense. It’s been 40 years since my last statistics class, so **ultrafilter **would you be kind enough to give me an example of events that would be, say, moderately independent?

OldGuy · November 12, 2012, 6:58am

“Moderately independent” is technically impossible. Independent is an absolute. Things are either independent or not. “Correlated” however is a scale running from -1 to 1. So you can be moderately correlated. The simplest way to think about it, I guess is the sum of faces on dice. If you roll two dice, the numbers will be independent (correlation of zero though that’s not quite the same as independent). However, the sum of the numbers on the two dice will be correlated with the number on either one of them. The correlation is 0.707 (or more precisely 1/sqrt(2)). The correlation of sum of three dice will any one of them is 1/sqrt(3) = 0.577. Etc.*

As you add more and more dice, the relation between the sum and any one of them gets weaker and weaker. You’ll have to pick your own idea of when the correlation is moderate.
The basic formula for corr[x,y] = covariance[x,y]/(stddev[x] stddev[y]).
So let x be the number on one die, z be the sum of the numbers on the other dice and y be the sum of the numbers on all the dice, y = x+z. Since the dice are independent var[y] = nvar. The covariance of x and y is cov[x, x+z] = cov[x,x] + cov[x,z] = var + 0. So corr[x,y] = var/sqrt(varnvar) = 1/sqrt(n).

SayTwo · November 12, 2012, 7:12am

Wow, OldGuy, talk about obfuscating a question with some stats jumbo that doesn’t at all help to answer the question at hand.

The simple answer to the OP’s question is that, yes, you would expect some misses…and in an ideal world, you would expect them equally in opposite directions.

It is not at all more complex than that.

OldGuy · November 12, 2012, 7:29am

But I wasn’t answering the original post. You can see from the quote, I was answering the OP’s later question for “an example” of things that are “moderately independent”.

SayTwo · November 12, 2012, 7:53am

I fell asleep while reading that.

septimus · November 12, 2012, 8:09am

I don’t think you’d be able to calibrate such predictions well without much more data. Silver’s early numbers had to reflect the uncertainty of events during (especially) October.

For example, some pundits seemed to think that Hurricane Sandy had a significant effect on the vote. What if a key external event been instead “pro-Romney”?

hibernicus · November 12, 2012, 11:40am

Why not?

(Asking because I’m interested, not because I think you’re wrong)

DSeid · November 12, 2012, 11:51am

For another statistical take on various aggregators performances see here. They suggest using the “Brier’s score” to take into account the factors the op mentions.

Whenever a candidate led in pre-election polls, he won. This was true even for a margin of Romney +1% (NC). Evidently state polls have a systematic error of less than 1% - as good as 2008! (Also, like 2008, pre-election polls substantially underestimated actual margins, this year by a factor 0f 0.8 +/- 0.3. Majority-party voters in nonswing states like to vote – or minority-party voters don’t.)

Since Florida was a coin toss, it is better to examine our state win probabilities, as suggested at Science 2.0. The closer the probabilities are to 1.00, the more confident they are. Probability should also measure the true frequency of an event. If I say a probability is 0.80, I expect to be wrong 1 out of 5 times. Our record of 50 out of 51 (counting Florida as a loss) means that our average probability should have been about 0.98. It was 0.97.

This can be quantified using the Brier score, as described by Simon Jackman of Pollster.com. This score is the average of the squared deviations from a perfect prediction. For example, if Obama won a race that we said was 90% probable, that’s a score of (1.0-0.9)^2 = 0.01. If we were only 70% sure, the score is (1.0-0.7)^2 = 0.09. The average score for all 51 races is the Brier score. The Brier score rewards being correct – and rewards high confidence.

Botom line there was that Nate was very good and some simpler models were even better.

Evil_Economist · November 12, 2012, 12:26pm

Correlation is a measure only of the linear relationship between variables. Variables can be related without having a linear relationship. E.g., the classic example is y=x^2. y and x have a correlation of zero, but y is completely determined by x.

El_Zagna · November 12, 2012, 12:37pm

I shot myself in the head. Actually I appreciate OldGuy’s answer.

I guess the reason that the state numbers are correlated is because if there had been a scandal (Benghazi maybe) that hurt the President, that would drive *all *the states toward Romney, whereas a drop in the unemployment numbers would drive *all *the states towards Obama.

MikeS · November 12, 2012, 12:53pm

Fair point — yes, my calculation was based on all probabilities being independent.

El_Zagna · November 12, 2012, 12:54pm

It’s the external events variable that adds to the uncertainty as you get further from the election. So a poll two months out that shows Obama ahead 60/40 might have a lower probability than a poll two days out that shows a 51/49 split.

Topic		Replies	Views
Is Nate Silver just in an "I can never be wrong" position. Politics & Elections	186	30489	November 8, 2012
Nate Silver / 538 Was Right Politics & Elections	142	29946	November 15, 2012
538's %'s vs everybody else's "Tossup" rating Politics & Elections	41	11762	September 15, 2012
Oddly, I trust FiveThirtyEight more after hearing this. Do you? Great Debates	29	2586	October 22, 2008
In 2018, which political prognosticators will you pay attention to? Politics & Elections	69	6655	August 26, 2017

Statistics question about Nate Silver's predictions

Related topics