Predicting the winner of the US presidential election

Polerius · July 11, 2012, 4:56pm

I have put up a simple website with estimates of the probabilities of Obama or Romney winning the election: prespredict.com

Basically, in 2008, I did some calculations to estimate the probability of Obama or McCain winning the election, using state poll results from electoral-vote.com, and plotted the probabilities vs time, right up to the election. The results were quite interesting. So, this year, I’m doing the same for the 2012 elections, and have put it all up on a quick website I put together.

I include both 2012 and 2008 results, and make an attempt to see how they correlate with campaign events and national news stories. There is also a section where I describe the methodology, in case you’re interested.

So far, the 2012 results are somewhat uninteresting (except for the fact that Obama’s estimated probability of winning is higher than I would have thought). For a more interesting “roller-coaster” graph see the 2008 results.

I’d be interested to know what you guys think.

Note: I have received mod approval for this post.

septimus · July 12, 2012, 7:58am

The biggest problem I see is that state probabilities are treated as independent.

Suppose an external event causes Romney to get a higher vote percentage in Pennsylvania than present polls predict. That event will cause Romney’s vote percentage to increase in Ohio and other states as well. Does your model consider this?

(ETA: The variance of the sum of independent variables is much less than that of the sum of positively correlated variables.)
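To spell out the point about variance, here is the standard identity, written for per-state outcomes $X_i$ (the symbols are introduced here for illustration and are not taken from the site):

```latex
\operatorname{Var}\Big(\sum_i X_i\Big)
  = \sum_i \operatorname{Var}(X_i)
  + 2 \sum_{i<j} \operatorname{Cov}(X_i, X_j)
```

If the states are independent, every covariance term is zero. If they tend to move together, the covariances are positive and the spread of the electoral-vote total grows accordingly.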

Fiddle_Peghead · July 12, 2012, 4:56pm

As an example of a potential problem, I see where you wrote “…on May 9th, Obama came out in support of gay marriage, and this seems to have caused a strong decline in his chance of winning, going from 95% on May 9th to 82% on June 16.” Other than the fact that these events happened at about the same time, you offer no evidence that his gay marriage support is what resulted in his chance of winning going down. I’d be careful with broad statements such as this.

Simplicio · July 12, 2012, 5:07pm

Your method of calculating the probability of a state going one way or another is kind of weird. Where did you get it from? Why didn’t you use the usual method of assuming the actual probabilities were normally distributed around the poll results?

Polerius · July 12, 2012, 6:37pm

The dependence between states is something that I definitely want to add to the model, but as it stands I think the main plots are not affected by it much.

That is, if the polls stay as they are, and if we assume some error in the way the polls estimate the underlying percentages in different states, then this poll error is likely independent from state to state. So my model of translating the existing poll difference to a probability of winning a given state (shown in the second figure on the “Methodology” page), and then taking that and calculating the probability of a candidate having N electoral votes, should hold.

So it should be OK as an estimate of the probability of winning, if the election were held today.
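For anyone who wants to see the "probability of having N electoral votes" step concretely, here is a minimal sketch under the independence assumption. It is not the site's actual code. The function names and the three-state toy map are made up for illustration.

```python
def electoral_vote_distribution(states):
    """states: list of (electoral_votes, p_win) pairs, treated as independent.

    Returns a list dist where dist[n] is the probability that the
    candidate ends up with exactly n electoral votes.
    """
    total = sum(ev for ev, _ in states)
    dist = [0.0] * (total + 1)
    dist[0] = 1.0  # before any state is counted, the candidate has 0 votes
    for ev, p in states:
        new = [0.0] * (total + 1)
        for n, prob in enumerate(dist):
            if prob == 0.0:
                continue
            new[n] += prob * (1.0 - p)  # candidate loses this state
            new[n + ev] += prob * p     # candidate wins this state
        dist = new
    return dist


def win_probability(states, needed=None):
    """Probability of reaching at least `needed` electoral votes.

    By default `needed` is a strict majority of the votes in `states`.
    """
    dist = electoral_vote_distribution(states)
    total = len(dist) - 1
    if needed is None:
        needed = total // 2 + 1
    return sum(dist[needed:])


# Hypothetical toy map: (electoral votes, probability of winning the state)
toy_states = [(10, 0.9), (20, 0.5), (5, 0.2)]
toy_dist = electoral_vote_distribution(toy_states)
toy_win = win_probability(toy_states)
```

With the real map you would pass one pair per state (plus DC), and `needed` would be 270. Each state is folded in one at a time, so the cost grows with the number of states times the number of electoral votes, rather than with the number of possible win/loss combinations.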

It’s only if we want to use it in building an estimate of the probability of winning on election day that we have to take into account potential changes in the per-state percentages, and when we do that, it is more accurate to take into account correlations between state results.

As I said, this is something that I am planning on adding to the model, but the current estimate should be quite accurate as a present-day estimate, and hopefully decent as an election-day estimate.
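One simple way to add correlation between states, sketched here purely as an illustration and not as what the site does or will do, is to simulate a national shift shared by every state on top of independent per-state errors. The function name, the standard deviations, and the ten-state toy map are all assumptions.

```python
import random


def simulate_win_probability(states, national_sd, state_sd,
                             trials=20000, seed=0):
    """Monte Carlo estimate of the chance of winning a majority.

    states: list of (electoral_votes, poll_margin_in_points).
    Each trial draws one national shift applied to every state, plus an
    independent error for each state. Both are assumed to be normal.
    """
    rng = random.Random(seed)
    total = sum(ev for ev, _ in states)
    needed = total // 2 + 1
    wins = 0
    for _ in range(trials):
        shift = rng.gauss(0.0, national_sd)  # shared by all states
        ev_won = 0
        for ev, margin in states:
            if margin + shift + rng.gauss(0.0, state_sd) > 0:
                ev_won += ev
        if ev_won >= needed:
            wins += 1
    return wins / trials


# Hypothetical map: ten states, 10 electoral votes each, all polling at +2
toy_states = [(10, 2.0)] * 10

# Same per-state uncertainty (3 points), first independent, then fully shared
independent = simulate_win_probability(toy_states, national_sd=0.0, state_sd=3.0)
correlated = simulate_win_probability(toy_states, national_sd=3.0, state_sd=0.0)
```

In this toy setup each state has the same individual chance of being won in both runs, but the leader's overall chance is lower in the correlated run, because one bad national swing can flip every state at once.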

Polerius · July 12, 2012, 6:40pm

I agree, and that’s why I provided some caveats:
- In the “Observations” section I said “We have tried to find events that were turning points in the election campaigns. If there are other events that you think caused some of the turning points in the election, please let us know via the contact page.”
- Also, in the quote you mention, I did say “this **seems** to have caused”, that is, I’m not stating it as 100% fact, but that it’s a correlation I have noticed.

Polerius · July 12, 2012, 6:57pm

I didn’t use the usual method of assuming the actual probabilities were normally distributed around the poll results because I don’t agree that it accurately represents the odds of winning the state.

For a comparison, in this image I plot both the Gaussian model of predicting a state winner (i.e. normally distributed poll errors) and my model. They’re similar, but my model has a flat region in the middle to signify the fact that if the poll results are very close, either of the candidates could win.
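To make the comparison concrete, here is a rough sketch of the two shapes. The Gaussian function is the standard normal-error calculation. The flat-middle function is only a hypothetical stand-in for a curve with a flat region and is not the formula used on the site. The 3-point standard deviation and 1-point flat half-width are assumed values.

```python
import math


def gaussian_win_prob(margin, sd=3.0):
    """P(leader is truly ahead) if true margin ~ Normal(poll margin, sd)."""
    return 0.5 * (1.0 + math.erf(margin / (sd * math.sqrt(2.0))))


def flat_middle_win_prob(margin, sd=3.0, flat_half_width=1.0):
    """Hypothetical variant with a flat region around a tied poll.

    Any margin within +/- flat_half_width is treated as a coin flip.
    Outside that band, the Gaussian curve is applied to the excess margin,
    so the curve stays continuous at the edges of the flat region.
    """
    if abs(margin) <= flat_half_width:
        return 0.5
    excess = margin - math.copysign(flat_half_width, margin)
    return gaussian_win_prob(excess, sd)


# Compare the two curves at a few poll margins (in percentage points)
for m in (0.0, 0.5, 2.0, 5.0, 10.0):
    print(m, round(gaussian_win_prob(m), 3), round(flat_middle_win_prob(m), 3))
```

Under these assumptions the two curves agree at a tied poll and for large leads, and differ most for small leads, where the flat-middle version stays closer to 50%.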

Polerius · September 4, 2012, 4:25am

Small bump, since it’s been about two months since I posted this and quite a bit has happened since then (Romney picked Ryan as VP, the RNC, etc.).

Romney picking Ryan seems to have had a modest effect on his chances, and the RNC has had only a minor effect so far.

Obama seems to be holding steady around 73-74% chance of winning.

Interestingly, this prediction now seems to agree with the folks at 538, who have Obama at a 74.8% chance of winning.
