But isn’t that exactly what happened? Nebraska split its EVs 4-1 in 2008. A model could easily have just allocated all 5 of Nebraska’s EVs to McCain. I don’t know if that is what happened, but it sounds like the most plausible explanation.
Looking at the election scorecard I linked to earlier, that’s exactly what happened. All the states were predicted correctly by Sam’s model (including Indiana). His model just awarded all 5 EVs to McCain in Nebraska, when in reality, it was 4 for McCain and 1 for Obama.
So, Sam was 50-for-50 (statewise) in 2008, missing only that one EV from Nebraska. Nate missed Indiana as well as that Nebraska EV, putting him off by 12 EVs.
Mea culpa.
Actually, it’s a bit more confusing than that.
Here’s Sam’s final map. So he did miss Indiana.
But in his EV estimate he has:
So that note was added after the fact, it seems. His original prediction, then, was 352, and that should be the number we go by, I think.
However, he also adds this:
So, a little fudging going on there, it seems. In his scorecard, he calls the “364-174” prediction the official one, but in his state-by-state rundown he says he missed Indiana, which makes no sense given a 364-174 EV prediction.
Oh, bloody hell.
Ezra Klein at Wonkblog compiled all of the final predictions he could find in one thread on Monday. An interesting read…
Saw a tweet yesterday saying, “Nate Silver is drunk on the subway telling people the exact day they’ll die.”
Brad Plumer, actually.
The aggregators compared:
Wang is only incompletely evaluated in that list (and I cannot understand why), but it does look like he may have done about as well.
I love 538. I love the informed analysis. But when two models both perform well, the simpler of the two is to be preferred. Wang rocks (too).
There’s a bunch of them. “Drunk Nate Silver gets home 30 seconds before the Comcast installer shows up #drunknatesilver”
More scorekeeping by Wang. He quantifies the predictive value of the various models’ probability estimates by means of something called a normalized Brier score.
The actual scorecard shows the PEC model’s Senate score up at 0.844 and 538’s down at 0.118. Not too surprising, given that Wang got all ten of the closest races correct and Nate missed two.
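(For anyone following along: the plain Brier score is just the mean squared error between stated win probabilities and the 0/1 outcomes, so lower is better; a normalized version flips it so higher is better. A minimal Python sketch – the 1 − BS/0.25 rescaling below is my own illustrative assumption, not necessarily Wang’s exact formula:)

```python
# Minimal sketch of Brier scoring for election forecasts. Inputs are
# per-race win probabilities for one side and 0/1 actual outcomes.
# The normalization (1.0 = perfect, 0.0 = always saying 50-50) is an
# assumed convention, not necessarily the one Wang uses.

def brier_score(probs, outcomes):
    """Mean squared error of the forecasts: 0 is perfect, 0.25 is coin flips."""
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)

def normalized_brier(probs, outcomes):
    """Rescale so that 1.0 = perfect and 0.0 = pure coin-flipping."""
    return 1.0 - brier_score(probs, outcomes) / 0.25

# Toy example: three races, Dem win probabilities vs. actual results.
probs = [0.92, 0.60, 0.15]
outcomes = [1, 1, 0]
print(brier_score(probs, outcomes))       # ~0.063
print(normalized_brier(probs, outcomes))  # ~0.748
```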
If Nate’s a witch then what is Wang?
More seriously, going by this election, the aggregators’ performance as a group documents the superiority of meta-analysis of state polling over national polling and over fundamental factors (like that now-infamous University of Colorado model). And of course over puffed-up punditry. But as good as 538 is, it does not appear that Nate’s secret sauce (house-effect adjustments, etc.) adds anything except possibly a small amount of additional noise.
Which may be blasphemy in these parts!
DSeid, I’m pretty sure Nate’s added ingredients (especially, economic forecasts – not current data, but forecasts) improve his value early on, say, four or five months before an election. That’s when he has more of a chance of (marginally) “knowing more about who you will be three months into the future than you know yourself”.
Closer to the election, I agree, that stuff doesn’t add much, if anything. The model accounts for this by phasing out those extra ingredients in the final weeks – something like the toy sketch below.
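This is only my guess at the general shape of that phase-out, not Nate’s actual formula; the 150-day horizon and the linear weighting are both assumptions:

```python
# Toy illustration (NOT Nate's actual model): blend a poll-based
# estimate with an economic-fundamentals estimate, with the weight on
# fundamentals shrinking linearly to zero by election day. The 150-day
# horizon is an arbitrary assumption.

def blended_forecast(poll_est, fundamentals_est, days_to_election, horizon=150):
    w = max(0.0, min(1.0, days_to_election / horizon))  # fundamentals weight
    return w * fundamentals_est + (1.0 - w) * poll_est

# Far out, fundamentals matter; near election day, it's nearly all polls.
print(blended_forecast(0.51, 0.55, days_to_election=120))  # ~0.542
print(blended_forecast(0.51, 0.55, days_to_election=5))    # ~0.511
```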
That leaves the “figuring out which pollsters are good and which are crap” thing as probably the best thing Nate does, even in the last weeks. But there are others that do this well also – sounds like that Wang guy is great at it.
Synopsis of conversation with a coworker Thursday:
Coworker: sad about election
Me: I understand you’re disappointed with the outcome but you didn’t really think Romney was going to win did you?
Coworker: Yes! Yes, I did!
Me: But the polls indicated Obama was leading in the battleground states.
Coworker: Not ALL the polls.
Me: Yeah, just the honest ones.
Coworker: No, they were rigged in the Democrats’ favor.
Me: So the polls that correctly predicted the outcome were RIGGED and the ones that didn’t were HONEST?
Coworker: more sadness
I suspect that what you are saying is correct, but I do not know that. Maybe someone can go back and look at what he was predicting early on, not just this time but in 2008, and see how predictive it was compared to other models. But the value of his early posts using things like economic forecasts is more in showing that there really is a political science, as opposed to the puffery of the pundits, and less in having a system that outperforms.
Drunk Nate Silver.
I agree that’s the big lesson here. Another reason why his blog posts (mini-articles, really) were the best thing about Nate, IMHO. Some of them should be required reading for high schoolers.
A title.
First of all, in the Presidential races the Brier scores are very close. Secondly, Wang may make house-effect adjustments as well (not sure). What Wang does not like is throwing economic variables in willy-nilly. Nate thinks it best to use them in June-July and phase them out as Nov 6 nears. I leaned towards Nate’s view – but Wang might be correct. The issue deserves careful analysis.
More generally, probabilistic forecasts can be evaluated by the three criteria proposed by Allan Murphy. The first is quality, or accuracy: the Brier score gives a proxy for this. I’m wondering whether some of Nate’s forecasts were closer to 50% than warranted: it is asserted that Intrade suffers from this problem, especially for low-probability events. The second is consistency, or honesty: does the forecast have internal consistency? Is it the best possible forecast that could have been presented given what the forecaster knew, or was it massaged to fit the audience? cough Barone, George Will cough The weather service takes this issue very seriously and intentionally segregates its forecast professionals from those who interact with the media or even public officials. The third is economic value: does the forecast help policy makers, whether in government or business, make better decisions?
I adapted that presentation of Murphy’s work from Silver (2012), The Signal and the Noise. Heh. Now #2 at Amazon!
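Incidentally, that “closer to 50% than warranted” worry is really a calibration question: across many forecasts, do events called at 70% actually happen about 70% of the time? A quick way to eyeball it – just a sketch with an assumed data layout, not any outfit’s real numbers:

```python
# Rough calibration check: bin forecasts by stated probability and
# compare to the observed frequency in each bin. An over-hedged
# forecaster (everything pulled toward 50%) will show observed rates
# more extreme than the stated probabilities. Data layout is assumed.

from collections import defaultdict

def calibration_table(probs, outcomes, n_bins=10):
    bins = defaultdict(list)
    for p, o in zip(probs, outcomes):
        b = min(int(p * n_bins), n_bins - 1)  # bin index 0..n_bins-1
        bins[b].append(o)
    for b in sorted(bins):
        hits = bins[b]
        lo, hi = b / n_bins, (b + 1) / n_bins
        print(f"stated {lo:.1f}-{hi:.1f}: n={len(hits):3d}, "
              f"observed rate = {sum(hits) / len(hits):.2f}")
```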
Geoff Berg: But the problem with people like Morris and the networks that give him airtime is not just that they’re fools or liars, it’s that the conservative movement is now dominated by people who believe that kind of obvious nonsense without thinking. Just before the election, Twitter was flooded with Fox/Limbaugh/NewsMax/WND types laughing about what a certainty it was that Barack Obama was about to retire early.
Nate Silver, whose model correctly predicted both the 2008 presidential election and the 2010 midterms? Just a liberal hack in the tank for Obama. Companies like Public Policy Polling which consistently showed Obama ahead in swing states? It sometimes does polling for DailyKos and therefore must be untrustworthy.
A Fordham University study released today shows that PPP and DailyKos/SEIU/PPP were the number one and number two most accurate polling organizations this cycle. Conservative comfort blanket Rasmussen was 24th out of 28. It’s pretty sad the way conservatives, like (IIRC) OMG!, thought that PPP was biased and Rasmussen was accurate. They could have looked up these groups’ past accuracy. Instead they used the rule of thumb “on my team/off my team” to estimate it. Those who use such methods really should be kept away from our nation’s policy levers, as well as from sharp objects, for that matter.
Measure for Measure… you listen to Geoff Berg too???
Er, not really: I just use Google News. First I’ve ever heard of the guy. :o
ETA: You know what I want to see? A time series of Brier scores for the various forecasting outfits. Who cares what they thought on Nov 4? I’d like to know how their forecast accuracy tracked over time.
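Given an archive of each outfit’s daily probabilities, that would be easy to compute – a sketch, with an entirely made-up data layout:

```python
# Sketch of a Brier-score time series for one forecaster. Assumes a
# (hypothetical) archive of daily win probabilities per race, scored
# against the eventual 0/1 outcomes. All numbers here are made up.

def brier(probs, outcomes):
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)

daily_probs = {  # {date: [p_race1, p_race2, p_race3]}
    "2012-09-01": [0.65, 0.55, 0.40],
    "2012-10-01": [0.72, 0.58, 0.35],
    "2012-11-05": [0.85, 0.61, 0.20],
}
outcomes = [1, 1, 0]  # eventual results, same race order

for date in sorted(daily_probs):
    print(date, round(brier(daily_probs[date], outcomes), 3))
```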