A final unskewing
Mother Nature is our best teacher and the only one who is always right. – Viktor Hamburger, biologist In yesterday’s Los Angeles Times...
Senate: 50 Dem | 50 Rep (range: 47-52)
Control: D+0.3% from toss-up
Generic polling: D+0.8%
Control: D+0.8%
Harris: 274 EV (D+0.3% from toss-up)
Moneyball states: President AK AZ NC
Click any tracker for analytics and data
In Princeton, we’re making sure we have bottled water and other supplies. It’s not clear how serious Hurricane Sandy will be. But we are grateful for the advance warning that is made possible by the National Weather Service.
Seeing as how predictions are so useful, Andrew Ferguson and I have decided to put the Obama re-elect probability in the topline. We give two probabilities, which are built on the same assumptions that went into calculating the “strike zones” in the history graph. The “Random Drift” number is a minimum (conservative) probability, and the “Bayesian Prediction” is my best shot at calculating the actual win probability. In the coming 10 days, the two numbers will converge.
I will list the assumptions again. This will make the most sense if you know a little about what we do here at the Princeton Election Consortium.
Starting from a snapshot of today
Both predictions (“random drift” and “prediction”) start from a current snapshot of polling conditions, the Meta-Analysis of State Polls which forms the core of this site. The snapshot is listed in the top line above. It is currently Obama 297 EV, Romney 241 EV, Meta-margin Obama +1.96%. This predicts what would happen in an election held today.
To calculate this snapshot, we (a) use recent polls for each state (3 polls or 7 days, whichever is greater) to calculate the probability that one candidate is ahead, (b) calculate the exact distribution of all 2^51 = 2.3 quadrillion outcomes, measured in terms of electoral votes (EV), and (c) take the median of the distribution to get an expected outcome.
In addition, we calculate the amount by which polls must swing overall to create a perfect toss-up. This quantity is just like a two-candidate margin that people are used to seeing in polls, so we call it the Meta-Margin. Both the EV estimator and Meta-Margin are extremely precise, and performed very well on Election Eve in 2004 and 2008.
Projecting into the future
Between now and Election Day, opinion may move toward Obama or toward Romney. But by how much? To turn the snapshot into a prediction, we have to estimate how much movement may occur. There are two ways to do so. I have explained these before (“The Presidential predictor sharpens,” September 29). Here is how they fit into what’s listed above.
Random drift. Using past races, I have estimated how much the Meta-Margin is likely to fluctuates over time. In this “random drift” model, I assume that opinion is equally likely to move in either direction. (Nerds: at N days before the election, the drift has a standard deviation of 0.4% * sqrt(N).) If the Meta-Margin stays above zero, then Obama wins. Today, the probability of an Obama win is 89%.
Bayesian prediction. Here I make an additional assumption, that the final outcome is likely to be drawn from the values that the Meta-Margin has explored this year. This is equivalent to the idea that the Meta-Margin is more likely to move towards its average (Obama +3.1+/-1.3%) than away from it. This has been the case in past elections. In statistics this is called a “Bayesian prior,” as in prior assumption. Today, including that prior gives a win probability of 97%.
The prior is also used to calculate the red and yellow “strike zones” in the history graph.
In my view, the prediction is the correct probability. But if the prior seems like an unwarranted assumption to you, then use the Random Drift probability instead. This is a more conservative estimate. Anyway, over the coming days these probabilities will converge to the same value.
What an amazing site this is. I cannot tell you the number of days I have flipped around the morning shows, wishing there was someplace to evaluate and discuss ACTUAL FREAKING DATA instead of spin, uncertainty, and doubt. 2/3 of what is discussed sails right over my head, which is awesome.
What I find most amazing is the tenuousness of Ohio. Romney has led in 9 of the 80 or so polls run since spring. I can’t figure out if this statistic, or the polls, or the % Silver gives it going to O, is comforting or meaningless.
Also, I cannot figure out the fascination with the National Tracking polls. As someone who knows nothing about the polling analysis business, if the consensus that this metric is inherently prone to noise, and therefore somewhat unreliable? In addition to being meaningless.
Still, nothing soothes frazzled nerves like cold, hard data. Thanks for the site, Sam.
It’s frustrating to see that “journalists” still believe newspaper endorsements matter.
Is there a way to predict how Sandy might effect the election when it hits one week before the final day to vote?
Ohio newspaper poll has the race tied in OH at 49-49. That’s a very good poll for the state, the gold standard. I think they may have punted, though. Anyways, it’s kindof stale (mostly pre-third debate) and will roll off Tuesday. We’ll see what PPP says tomorrow… yikes!
Mike..do you have a link to that poll and article? I’d sure like to see the stats….
Here’s one link to a news story. The polling memo is not available yet: http://www.13wmaz.com/news/article/201906/175/Ohio-Poll-Romney-Obama-Tied-Among-Likely-Voters
Why? The stats here utilize that poll and many many more, to come to +2 for the President. This is proven more accurate.
Robert Wright resurrects the “undecideds break to the challenger” canard, citing Bob Krumm’s analysis of 2004 as evidence. The difference between this version of the argument and the one that tripped up Prof. Wang is that Krumm claims the big break is visible over the course of the final week:
http://www.theatlantic.com/politics/archive/2012/10/will-undecided-voters-break-toward-romney/264196/
But in an addendum, he links to a new post by John Sides and an older one by Nate Silver expressing skepticism about the rule. Silver compiled data on all elections since 1968 and found no strong evidence for a consistent effect.
…though Silver’s data look spookier if you concentrate on true incumbent candidates, though he suggests it’s only because the challenger often trails by a lot, and there seems to be regression to the mean.
What I find more convincing is Silver’s recent article historically testing the reliability of state polling. That suggests no pattern of a break toward the challenger in the last weeks, though it does also suggest not placing much credence in state leads under about 1.5%.
My only complaint about the Princeton Election Consortium website is that it is addictive. I love the number- crunching results that Dr. Wang and his crew provide. Thank you.
All of that is factually true. However, that is probably not a great reason to discount that poll. There is little evidence that much has changed since the 3rd debate. The best way to think of it is this — it is one of many polls (even one of many credible polls). Even the best among them have many possible sources of error. That is why PEC takes the median, which as of today is Obama +2. Stay calm. Stick with the model (or at least a model).
A question:
It strikes me that there must be an upper and lower limit within which the MM moves under the random drift model, and in an election where the number of undecideds is low compared with previous elections, the range between the upper and lower limits would be very narrow and get narrower closer to the election.
If that’s correct then as the MM for the leading candidate approaches the upper limit, it seems that opinion would be more likely to move toward the lower limit than it is to rise to and above the upper limit. In other words, the “assum[ption] that opinion is equally likely to move in either direction” is incorrect.
96%?! Wow, either Wang or Gallup is full of it.
Here’s hoping it’s Gallup.
Four more years!
The probability distribution is starting to get that “NMR spectrum” quality again, associated with lots of state leads firming up. I’d like the centroid to be higher than it is, but at least the uncertainties are less crazy-making.
RAND looks good again today…
https://mmicdata.rand.org/alp/index.php?page=election#election-forecast
RAND is starting to look like an eerie replay of Obama’s post-convention rise.
On a comical note, the person who runs the site unskwewed polls, states Silver is wrong because he is thin and somewhat effeminate. http://www.rawstory.com/rs/2012/10/27/conservative-pollster-nate-silver-is-wrong-because-he-is-thin-and-effeminate/
Did the update include the new WaPo poll that is +4 for Obama? Or did that really horrible Gravis poll cancel it out?
The update included the WaPo O+4 but not the Gravis, at lest from what is posted on Pollster. But as far as the model is concerned, the VA results are perfectly balanced, not only for the pool of polls as a whole, but miraculously for each and every polling date in the last week:
10/26 Tie (Gravis)
10/24 Obama 1, Romney 1, 2 tied
10/23 Obama 1, Romney 1
10/19 Obama 1, Romney 1
10/18 Obama 1, Romney 1
When the Gravis poll gets into the data the 10/18 polls will drop out and we’ll have 3 polls showing Obama leading, 3 polls showing Romney leading, and 3 polls showing a tie. Get used to VA being tied.
Froggy, thanks. I see the MM ticked down a little today, as did the BP, yet the EV prediction ticked up 4. Was NH put back in the mix and why was there a downtick in the MM?
DrOrbit, MN went from O+8 to O+5, NC went from R+3 to R+1, and WI moved from O+3.5 to O+2.
I agree with bsk (below) that the WI move looks wrong. Rather than dropping, by my reckoning WI should have moved up to O+5. So unless I’m mistaken, the MM should have increased rather than decreased.
That said, I fully expect the MM to bounce around a lot in the coming days. If it gets below 1% and stays there I’ll start worrying, and if it gets above 2% and stays there I’ll breathe easier. But moves up and down in the current range are nothing to get excited about.
Bug in the Wisconsin data – or do I still not understand the meaning of “last 7 days”?
WI has been squirrely for the last few updates. I see (tie,O5,O6,O5) from 10/19-25. What Even if you include the 10/18-18 O2 and 10/17-18 O3, the median is O+3.5. The 10am update moved WI to O2.
These states’ medians changed from 8am to 10am:
CO: O+3 => O+2
WI: O+3.5 => O+2
NC: R+3.5 => R+1
MN: O+8 => O+5
******
LA: R13 => ???
SD: ??? => R8.5
SD: the last 3 polls have a median of R+11.
There’s no reason I can see for the change in CO. Nothing new on Pollster, and O+3 looks correct. I’ve seen glitches like this appear in one update, and disappear in the next, without any apparent reason. This is another reason not to get worried about each up and down move in the MM.
Good point about the MM, but as a retired actuary, I’m troubled a bit by glitches in some of the calculations.
I just heard George Stephanopoulos tout the “new” OH poll with O49, R49. It’s the UCinc one from 10/18-23. We have four more recent polls with O2,O5,tie,O2. Sigh.