A final snapshot and House prediction

November 5, 2024 by Sam Wang

It is inevitable that polls will have some error. Averaging does not solve the problem since it only reduces error arising from sampling a population. Averaging does not eliminate systematic error.

Recently FiveThirtyEight wrote about total overall error in polls. Unfortunately, they didn’t distinguish random from systematic error. Systematic error arises from differences between who pollsters think will vote, compared with who actually votes. Every pollster has their own view of this subject. In the aggregate their judgment is better than they are individually. But that average can differ from what actually happens. That difference is systematic error.

In 2016 for the Presidential race, systematic error was about 1.5 points, leading to a surprise Trump win. We can assume that an error that large could happen this year. However, we don’t know in which direction. For all we know it could favor Trump again. Or it could favor Harris.

Let me show you which way I think it will go.

The Presidential race: how close is it really?

The Princeton Election Consortium defines a virtual margin in terms of the Electoral College. The question asked there is: how far is the race is from a perfect electoral tie?

First, here is a sharp statistical snapshot of all state polls, using median-based statistics.

A statistical snapshot of state polls alone. This is the median of all 2 to the 56th (72 quadrillion) possibilities, as calculated at the Princeton Election Consortium.

The median of the distribution, tracked in black, very slightly favors Trump, but really by nothing at all. How “nothing”? We can translate it into how much the margins would all have to shift to create a perfect electoral tie. That looks like this:

The Presidential race is likely to be less close than polls indicate

Today, this “meta-margin” is Trump +0.3%. That’s much smaller than a hypothetical systematic error of 1.5 points. This has two implications:

We don’t know who’s going to win based on polls alone (duh).
The final outcome is likely to be less close…in one direction or the other.

So it might be helpful to have another stream of information.

First, here are two streams of information that won’t work…

Betting markets and early voting: even worse than polls

There are several streams of information one could think about using. However, they have problems.

Betting markets are no good at all. They basically reflect polling data. They also reflect the biases of individual bettors. In theory these biases cancel out because of the magic of markets. However, we know for a fact that bettors can distort such electronic markets. In 2012, a “Romney whale” disrupted the Iowa electronic market. And this year, some Frenchman has put nearly his whole net worth into Polymarket. These markets do not tell us much.

Early voting is, sadly, also hard to interpret. In some cases, heroic efforts by people like Jon Ralston in Nevada can shed some light. But early voting can lead one astray. For example, in 2020, Democrats voted by mail more than Republicans because of the pandemic. So if Republicans early-vote more now, that might only mean they are adapting to the new method.

In other words, early voting measures a combination of (a) enthusiasm for the vote-by-mail/early-voting, and (b) enthusiasm for voting immediately. And you can vote early, but you can’t vote harder. One vote is one vote. So early voting is not that informative.

Where can we turn for more insight? Let’s start with a simpler problem, control of the House of Representatives.

The generic Congressional ballot

Polling people on their generic partisan preference does well in predicting voter preference for the House of Representatives. The House of Representatives has little net overall bias, thanks to reductions in gerrymandering in the last decade (three cheers for anti-gerrymandering reformers!). Therefore we can take the national vote as a measure of who will control the House in 2025.

Here is the generic Congressional ballot. It shows a tie (link to PEC Congressional data page).

In the last six elections, the difference between generic Congressional measure and actual national vote has had a standard deviation of 2.4 points difference from actual results. That’s a pretty wide range.

But now…here is one other independent stream of evidence we can use: special elections.

House seat prediction: 224 D (range: 216-232), 211 R (range: 203-222)

Special elections, which occur when an official must be replaced in a one-off election, provide real voting data. Daniel Donner at Daily Kos Elections/the Downballot has found that these elections are predictive of the next Congressional election.

Since the Dobbs decision repealing Roe v. Wade, special elections have pointed to Democrats winning the national vote in November 2024 by 4.5 points. That’s the orange line above.

I used Bayesian inference to combine these two measures, polls and special elections, to get the red zone above. It shows a range of R+0.8% to D+5.8%, with a midpoint of D+2.5%. That corresponds to about an 8 in 9 probability that Democrats will win the popular vote – and take control of the House. This is consistent with Nancy Pelosi’s public statement that she expects Democrats to regain the majority.

From Electoral Innovation Lab / Vote Maximizer estimates, 1 point of vote margin translates to about 5 seats. That allows conversion of the above vote margins to an approximate seat margin. The midpoint of that range is 224 Democratic seats, 211 Republican seats. Even if this is wrong, whatever happens, the chamber will be very closely divided.

Presidency: probably, maybe, potentially Harris? (or not)

Generic Congressional and state Presidential surveys are done by a professional community of pollsters. For this reason, I suspect that the two data streams will show similar systematic errors. As I wrote, in 2016, Presidential state polls overestimated Hillary Clinton’s margins by 1.5 points. And generic Congressional polls that year overestimated Democratic national margin by 1.7 points.

Here is what final, unadjusted polls look like now, with margins less than 1 point shaded beige.

Last polling snapshot, not adjusted for anything.

If state Presidential margins are shifted by 1.5 points toward Harris, they look like this:

A 1.5-point polling error favoring Harris would produce this.

This latter condition is associated with a modal outcome of Harris 292 EV, Trump 246 EV (NC and GA split between the candidates, Arizona to Trump).

And yes, a 1.5-point error favoring Trump would move things in the other direction.

We’ll probably have to wait

A definitive answer on today’s federal election outcomes (President, Senate, House) will require…counting the votes. Some key states (Georgia, North Carolina, Michigan, Virginia, Florida, Ohio, and Colorado) will be fast. But others (Wisconsin, Pennsylvania, Nevada) will take days, at least.

Finally, what happens with a full dose of hopium?

What would happen if we made a full adjustment using the House analysis? That analysis suggested a systematic error of 2.5 points. That corresponds to 308 to 319 EV for Harris. In this circumstance, we would get an answer on the Presidential election tonight, with Kamala Harris winning both Georgia and North Carolina.

Barring such a large polling miss, learning the outcome will take a while. I’m not even getting into the lawsuits and disruptions by angry partisans. It could be a long night, and a long week. Hopefully not a long month.

Topics:

5 Comments

Mike DiMartino says:

November 5, 2024 at 6:58 pm

Hope you don’t have to eat any insects this year!
Thought of you a bunch of times. Hope you are doing well!
I have made a career in product development of hardware technology and am finishing my career developing medical devices with an emphasis on neurological products (currently EEG-Epilepsy, Parkinson neuroreceptor re-training and intracranial pressure monitors). Not sure if you remember my wife Lisa; she is faculty at Stanford Med School.

Quentin J. Tarantino says:

November 5, 2024 at 7:16 pm

In 2016 data shows ALL the last minute ‘XmasEveVoters (as comedian and friend Bill Maher accurately calls them) suddenly broke for Trump in a 7-8 pt.swing despite Hillary allegedly given an 80%.chance. You’re seeing that AGAIN,but for Kamala. She’ll win by a landslide. WHY? Easy call. Trump BLEW it.

He was at once too brazen AND too removed from actual real campaigning to be effective. You cant use Twitter to simply troll yourself into the White House when your opponent has come to play. Call it a competency/work-ethic deficit.

You guys are really overthinking this. If anything, this will be like Kerry v Bush where everyone expects a long night but will less dramatic than people realize, with an easy Harris win. We will know, if not at some point tonight, in the next couple of days. Only as a courtesy by the Harris campaign.

tl;dr version— The myth of Icarus, with Trump and his Elon/Rohan/RFKjr clique flying too close to the sun (that NY rally, ugh!)

Sal Bro says:

November 5, 2024 at 11:04 pm

Bummed that the hopium map got taken down, but I understand why that may have been necessary. It seems we’re not on that map tonight, anyhow. The way that the news media covers the election is terrible! This page is helping me stay grounded.

Quentin J. Tarantino says:

November 5, 2024 at 11:10 pm

Welp! I take it all back (see my post above). I’m not a big internet person to be honest. I put my marker down, and well, looks like I’m going to lose. It’s America that loses actually. Looks like fascist wing of the population is breaking for Trump. Maybe it happened weeks ago. Does it matter? F***. I only picked this pollster because his information was the easiest to follow. Again, I DON’T DO INTERNET. I’m like one of those insects that comes out once every 7 years. Well, we gave it a good run America. We tried. Now, we wait as our adversaries prepare for a global war, I’m guessing. To make sure this thing we called democracy never happens again in the superpower sense. Hold on tight, with big hugs, to those you live with closely, and those that are most vulnerable, hold onto them tighter. Peace

Leor g says:

November 6, 2024 at 12:19 pm

Hello,

First of ll, thanks for all the data and captivating analysis on the site.

I beg to differ about prediction markets, such as Polymarket or Kalshi.

It is true that the Iowa election market was inundated with biased money in the 1996 election, after its good performance in the 1988 and 1992 elections. But Iowa before 1996 was an academic market, that performed surprisingly well despite the strong democratic bias of its participants and the low volume. There are some sound theoretical reasons to support the informative value of “honest” prediction markets. Good results were also obtained (not universally, but often enough) in election markets conducted in Canada and Germany, where there are multi-party systems.

As for Polymarket, the sheer volume of the market makes it pretty expensive to influence it, and the benefit for the influencer is not at all clear, in terms of media coverage and the ability to create momentum for one candidate or the other.

It is interesting that both Polymarket and Kalshi gave Trump at least a 60% chance of winning, which was much more definitive than the polls – in this case, in the right direction.

So, I wouldn’t discount the possibility that prediction markets carry valuable information – at least as an additional factor beyond polls.

Princeton Election Consortium

Innovations in democracy since 2004

Highlights

Tuesday, November 5, 2024

Senate

House

Presidential