<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Princeton Election Consortium &#187; Politics</title>
	<atom:link href="http://election.princeton.edu/category/politics/feed/" rel="self" type="application/rss+xml" />
	<link>http://election.princeton.edu</link>
	<description>A first draft of electoral history</description>
	<lastBuildDate>Fri, 13 Nov 2009 17:00:24 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Results NY-23 (D upset) and NJ-Gov (R)</title>
		<link>http://election.princeton.edu/2009/11/04/results-ny-23-dem-upset-and-nj-gov-r/</link>
		<comments>http://election.princeton.edu/2009/11/04/results-ny-23-dem-upset-and-nj-gov-r/#comments</comments>
		<pubDate>Wed, 04 Nov 2009 04:29:48 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/2009/11/04/results-ny-23-dem-upset-and-nj-gov-r/</guid>
		<description><![CDATA[Christie (R) won by 4.2%, substantially more than the 1.0% I predicted. On the other hand, Corzine (D) received 44.6% of the vote, quite close to the 44.5% VM-optimized polling average. Looks like it might be Daggett (I) supporters reclaiming the power of their vote.
Was I wrong about NY-23? Only two polls were available. They [...]]]></description>
			<content:encoded><![CDATA[<p>Christie (R) <a href="http://elections.nytimes.com/2009/results/new-jersey.html">won by 4.2%</a>, substantially more than the 1.0% I predicted. On the other hand, Corzine (D) received 44.6% of the vote, quite close to the 44.5% <a href="2008/11/21/tiebreaking-secret-sauce-and-the-case-of-the-missing-cell-phones/">VM-optimized</a> polling average. Looks like it might be Daggett (I) supporters reclaiming the power of their vote.</p>
<p>Was I <a href="http://elections.nytimes.com/2009/results/other.html">wrong about NY-23</a>? Only two polls were available. They came before the full impact of Scozzafava (R)&#8217;s last-minute endorsement of Owens (D), which evidently worked. <del datetime="2009-11-04T08:21:37+00:00">It&#8217;s a squeaker.</del> Owens won by 4%. This is a lose-lose for Republicans: loss of the seat is a self-inflicted injury by the party&#8217;s unreconstructed right-wingers. Full results are <a href="http://elections.nytimes.com/2009/results/other.html">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/11/04/results-ny-23-dem-upset-and-nj-gov-r/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Home stretch analysis of 2009 special elections: NJ-Gov, NY-23</title>
		<link>http://election.princeton.edu/2009/11/03/home-stretch-analysis-of-nj-gov/</link>
		<comments>http://election.princeton.edu/2009/11/03/home-stretch-analysis-of-nj-gov/#comments</comments>
		<pubDate>Tue, 03 Nov 2009 08:02:38 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>
		<category><![CDATA[Jon Corzine]]></category>
		<category><![CDATA[NJ-Gov]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/2009/11/03/home-stretch-analysis-of-nj-gov/</guid>
		<description><![CDATA[The polls in the Corzine (D) &#8211; Christie (R) &#8211; Daggett (I) race have been very close. So it&#8217;s time for the variance minimization tool that I advocated at the end of the 2008 Presidential race. Using that, I come up with the following conclusions:
Since 10/23 (i.e. the last 10 days), the race has been [...]]]></description>
			<content:encoded><![CDATA[<p>The polls in the Corzine (D) &#8211; Christie (R) &#8211; Daggett (I) race have been very close. So it&#8217;s time for the <a href="2008/11/21/tiebreaking-secret-sauce-and-the-case-of-the-missing-cell-phones/">variance minimization tool</a> that I advocated at the end of the 2008 Presidential race. Using that, I come up with the following conclusions:</p>
<p>Since 10/23 (i.e. the last 10 days), the race has been static. Polls spanning that entire period  give<br />
Final margin: <strong>Christie over Corzine by 1.0+/-1.0%</strong> (68% CI, 16 polls).<br />
3-way outcome: <strong>Christie 45.5%, Corzine 44.5%, Daggett 10.0%.</strong><br />
Christie win probability: <strong>83%, i.e. 5:1 odds.</strong> </p>
<p>More details, and analysis of the NY-23 race, after the break.<span id="more-3175"></span></p>
<p>As usual, I use data from the fine obsessives over at Pollster.com: <a href="http://www.pollster.com/polls/nj/09-nj-gov-ge-cvc.php">NJ-Gov</a>. (Note that they omit a late update to a Fairleigh-Dickinson on the grounds that it pools data over 11 days, with 7 days previously reported. Including that poll does not change my bottom line.)</p>
<p>Last November, I suggested that variation in individual measurements arises from two sources: (1) sampling repeatedly from a fixed distribution, and (2) changes in the distribution. On average, sampling from a fixed distribution gives the same standard deviation, independent of the number of samples taken. If the distribution shifts or changes shape, then the standard deviation would be likely to change.</p>
<p>If we plot the standard deviation of a standard quantity (in this case, the Corzine-Christie margin) over various time windows from Date X until Election Eve, allowing X to vary, the standard deviation should stay approximately constant for periods during which the race did not shift – but rise steadily in periods of true change. The plot looks like this:<a href="http://election.princeton.edu/wp-content/uploads/2009/11/nj-gov-092.jpg"><img src="http://election.princeton.edu/wp-content/uploads/2009/11/nj-gov-092.jpg" alt="nj-gov-09" title="nj-gov-09" width="500" height="400" class="alignnone size-full wp-image-3184" /></a><br />
Please pardon the confusing x-axis label. It indicates the first poll # used. There were 48 polls, so that the leftmost point indicates the SD of polls #1-48, the next point is SD of #2-48, and so on.</p>
<p>The plot starts to take off at #44 (Quinnipiac, 10/27-11/1). It might be that polls after that, taken over the weekend, show more volatility. But there&#8217;s not enough data to tell for certain, and in any case these polls still give a median of Christie by 2 +/- 2%. So instead I will go back to the minimum SD at #33 (PPP, 10/23-26). That set of 16 polls gives the median and error bar (estimated using <a href="http://en.wikipedia.org/wiki/Median_absolute_deviation#Relation_to_standard_deviation">this method</a>) listed above.</p>
<p>It&#8217;s right down to the wire &#8211; the Christie-Corzine margin is not statistically distinguishable from zero. But if we believe that these polls are representative of voters, the odds do favor Christie.</p>
<p>Considering that Republican-leaning votes are being split by Chris Daggett, a Republican running as an Independent, it is clear that Governor Corzine is not a popular guy. If Corzine pulls this one out of the fire, it will be thanks in part to Daggett. Also, Christie was hurt by his weaknesses &#8211; leaving the scene of an accident where he hit a motorcyclist with his car, lack of compelling message, and abuse of his position as U.S. Attorney. We&#8217;ll see tomorrow.</p>
<p>For other statistical analysis and color commentary, I refer you to <a href="http://www.fivethirtyeight.com/2009/11/2009-elections-preview-new-jersey.html">Fivethirtyeight</a> by Nate Silver (Christie win probability 57%) and <a href="http://stochasticdemocracy.blogspot.com/2009/11/election-2009.html">Stochastic Democracy</a> by David Shor (Christie win probability 53%). They and I point in the same direction, toward Christie. But they are less certain, perhaps in part because they use a shorter time window. I consider this a test of my variance minimization (VM) method.</p>
<p><em>Postscript: The last 16 polls sampled 12,860 respondents. Binomial sampling suggests that from such a sample, it ought to be possible to get a 68% confidence interval of +/-0.4%. Clearly there are additional uncertainties having to do with methodological differences among pollsters and whether n=16 is the right number of polls to use. But it&#8217;s not crazy to imagine making an even more accurate estimate than what I have given.</em></p>
<p><strong>NY-23:</strong> The Conservative candidate, Doug Hoffman, is favored over the Democrat, Bill Owens. The right-wing push for Hoffman, who does not live in the district, has contributed much of the excitement here &#8211; as well as the fact that Dede Scozzafava (R) has dropped out and endorsed <em>Owens</em>. However, take a look at the <a href="http://www.pollster.com/polls/ny/09-ny-23-ge.php">polling data</a>, which show that Owens&#8217;s support has stayed in the 33-36% range. Basically, Republican voters are breaking toward the Palin-backed Hoffman, not a surprise. Considering that Scozzafava wants Owens to win, dropping out was a bad move. She would have been better off staying in the race. For more commentary, read <a href="http://www.pollster.com/blogs/ny23_watch_monday_morning.php">Mark Blumenthal</a>.</p>
<p>The fact that the Republican base has rejected the moderate Scozzafava would seem to be a sign of the Republican party being pushed further to the right by its base &#8211; seemingly not a good move for them, no matter what happens in this race. </p>
<p>For more on NY-23, see polls at <a href="http://www.pollster.com/polls/ny/09-ny-23-ge.php">Pollster.com</a> and reporting at <a href="http://tpmdc.talkingpointsmemo.com/ny-23/?ref=fpa">TalkingPointsMemo</a>.ons</p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/11/03/home-stretch-analysis-of-nj-gov/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Statistical analysis update from Mebane</title>
		<link>http://election.princeton.edu/2009/06/22/statistical-analysis-update-from-mebane/</link>
		<comments>http://election.princeton.edu/2009/06/22/statistical-analysis-update-from-mebane/#comments</comments>
		<pubDate>Mon, 22 Jun 2009 13:32:50 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>
		<category><![CDATA[Iran election 2009]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/?p=3159</guid>
		<description><![CDATA[Here&#8217;s the latest update from Walter Mebane of U. Michigan. At this point he is quite confident that fraud occurred. He has kindly made data and R scripts available for the curious.
]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s the <a href="http://www.umich.edu/~wmebane/note22jun2009.pdf">latest update</a> from Walter Mebane of U. Michigan. At this point he is quite confident that fraud occurred. He has kindly made <a href="http://www.umich.edu/~wmebane/Iran2009_22jun2009.zip">data and R scripts</a> available for the curious.</p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/06/22/statistical-analysis-update-from-mebane/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Voting early and often</title>
		<link>http://election.princeton.edu/2009/06/22/vote-early-and-often/</link>
		<comments>http://election.princeton.edu/2009/06/22/vote-early-and-often/#comments</comments>
		<pubDate>Mon, 22 Jun 2009 13:23:57 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/?p=3156</guid>
		<description><![CDATA[Iran&#8217;s Guardian Council has admitted that apparent turnout exceeded 100% in at least 50 cities. This confirms recent analysis at the University of St. Andrews as well as by David Shor.
Inflated voting is the crudest of fraud, cruder than anything implied by my previous post. At this point statistical analysis is interesting for filling in [...]]]></description>
			<content:encoded><![CDATA[<p>Iran&#8217;s Guardian Council has <a href="http://www.presstv.ir/detail/98711.htm?sectionid=351020101">admitted</a> that apparent turnout exceeded 100% in at least 50 cities. This confirms recent analysis at the <a href="http://www.chathamhouse.org.uk/files/14234_iranelection0609.pdf">University of St. Andrews</a> as well as by <a href="http://stochasticdemocracy.blogspot.com/2009/06/100-turnout.html">David Shor</a>.</p>
<p>Inflated voting is the crudest of fraud, cruder than anything implied by <a href="/2009/06/21/analyzing-iran-2009-part-2-the-official-returns/">my previous post</a>. At this point statistical analysis is interesting for filling in details and estimating the extent of the fraud.</p>
<p>In other news, Boud Roukema <a href="http://arxiv.org/abs/0906.2789v2">has now reported</a> three of the six most populous voting areas in his analysis have vote totals for Karroubi that start with 7, the anomaly he reported before. All three of these have greater proportions of  votes for Ahmadenijad than the other three voting areas.</p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/06/22/vote-early-and-often/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Analyzing Iran 2009: Part 2, The Official Returns</title>
		<link>http://election.princeton.edu/2009/06/21/analyzing-iran-2009-part-2-the-official-returns/</link>
		<comments>http://election.princeton.edu/2009/06/21/analyzing-iran-2009-part-2-the-official-returns/#comments</comments>
		<pubDate>Sun, 21 Jun 2009 06:56:04 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>
		<category><![CDATA[Iran election 2009]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/?p=3130</guid>
		<description><![CDATA[Current events have completely overwhelmed the relevance of any statistical analysis. But a critical look can still point us toward a better understanding of what happened on Election Day.
Analysis of fraud in Iran 2009 is an unfolding story. In this post I focus on Election Day returns themselves, which suggest: Votes may have been, in [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://andrewsullivan.theatlantic.com">Current events</a> have completely overwhelmed the relevance of any statistical analysis. But a critical look can still point us toward a better understanding of what happened on Election Day.</p>
<p>Analysis of fraud in Iran 2009 is an unfolding story. In this post I focus on Election Day returns themselves, which suggest: <strong>Votes may have been, in some sense, transferred from the minor candidates (Karroubi and Rezaee) to one of the major candidates. This could have been legitimate (that is, voters changing their minds when it came time to vote) or illegitimate (for example, minor-candidate votes being counted for Ahmadinejad). It is currently not known whether enough fraud occurred to flip the election.</strong></p>
<p>Obviously, fraud is not necessarily confined to that suggested by this analysis. For summaries and updates, see <a href="http://stochasticdemocracy.blogspot.com/2009/06/iranian-statistics-round-up.html">David Shor</a> and Fivethirtyeight.com (start with <a href="http://www.fivethirtyeight.com/2009/06/iranian-election-results-by-province.html">basic returns </a>and <a href="http://www.fivethirtyeight.com/2009/06/iran-does-have-some-fishy-numbers.html">poll analysis</a>).<span id="more-3130"></span></p>
<p>&gt; &gt; &gt;</p>
<p>Some crude types of fraud can be identified entirely from the numbers used to commit the fraud. This is a principle known in financial auditing, and relies on a phenomenon called <a href="http://en.wikipedia.org/wiki/Benford's_law">Benford&#8217;s Law</a>. The basic idea is that a wide variety of data have first-digit distributions that tilt toward low values. In other words, measurements such as 1345, 198, or 229 (first digits 1, 1, and 2, respectively) are much more likely to occur than 397, 745, or 9618 (first digits 3, 7, and 9). A similar observation even applies to second digits and so on, though the effect is much weaker.</p>
<p>This somewhat counterintuitive phenomenon arises in many situations: areas of islands, financial numbers, and physical constants, and more. An excellent <a href="http://people.math.gatech.edu/~hill/publications/BENFORD%20PAPERS/TheFirstDigitPhenomenonAmericanScientist1996.pdf">popular article</a> comes from Ted Hill, a mathematician who provided a <a href="http://www.tphill.net/publications/BENFORD%20PAPERS/statisticalDerivationSigDigitLaw1995.pdf">rigorous proof</a> for how Benford&#8217;s Law could arise when one takes random samples from distributions that themselves vary randomly.</p>
<p>A likely mathematical consequence is that observations can be uniformly distributed &#8211; on a log scale. To illustrate what logarithmic distribution implies, imagine an investment that doubles every ten years. It would take ten years to get from $1000 to $1999 (first digit = 1 all the way), but only ten years to get from $4000 to $7999 (first digit = 4, 5, 6, or 7). In this case 1 is the far more common leading digit. This is just one example of a logarithmic distribution &#8211; many financial quantities do the same thing. But when people cook books, they often don&#8217;t pick numbers that lead to Benford&#8217;s Law. Failure of numbers to match this expectation is often interpreted as <a href="http://www.ezrstats.com/Benford.htm">evidence for possible fraud</a>. A similar principle can be used to analyze elections.</p>
<p><strong>Roukema&#8217;s claim.</strong> Boudewijn Roukema, an astronomer at Nicolaus Copernicus University in Toruń, Poland, made waves last week when he analyzed data from the Iranian Ministry of the Interior for the four candidates (Ahmadinejad, Mousavi, Karroubi, and Rezaee) in 366 voting areas. A PDF of his report is <a href="http://arxiv.org/PS_cache/arxiv/pdf/0906/0906.2789v1.pdf">here</a>. In brief, he found several deviations, including too many leading 2&#8217;s and not enough leading 1&#8217;s in Ahmadinejad&#8217;s reported numbers. He also reports quite an odd phenomenon: too many leading 7&#8217;s for Karroubi&#8217;s totals.</p>
<p><strong>A problem with Roukema?</strong> A poltiical scientist at the University of Michigan, Walter Mebane, specializes in the application of Benford&#8217;s Law to the detection of electoral fraud. He has pointed out that first-digit distributions often don&#8217;t follow Benford&#8217;s Law well. Indeed, this was originally noticed by Benford himself (see Table IV of <a href="http://www.jstor.org/stable/984802">his classic 1938 paper</a>).</p>
<p>In the case of election returns, one big reason Benford&#8217;s Law can fail is that voting areas aren&#8217;t necessarly randomly sized (recall that Hill assumed that distributions varied randomly; here a &#8220;distribution&#8221; can be construed as being a voting area). For example, if voting areas are set up to be uniformly sized to contain 100,000 people each, then close races will produce lots of 4&#8217;s and 5&#8217;s as leading digits. So Roukema&#8217;s claim is <a href="http://www.fivethirtyeight.com/2009/06/karroubis-unlucky-7s.html">not yet overwhelmingly convincing</a>.</p>
<p><strong>Beyond first digits?</strong> Mebane advocates the use of second digits (for instance, the 5 in 4567; technical article <a href="http://www-personal.umich.edu/~wmebane/pm07.pdf">here</a>). Second digits don&#8217;t show as strong a trend and therefore require careful statistical analysis. But they are less likely to violate the ideal-case prediction of Benford&#8217;s Law for first digits. Mebane <a href="http://www.umich.edu/~wmebane/note19jun2009.pdf">has begun to analyze the 2009 returns</a>.</p>
<p>At the level of the 366 districts, he found that the results fell within expectations. However, he has more recently obtained data from 12 provinces including over 11,000 ballot boxes. At this much finer-grained level he has found deviations from expected second-digit distributions for Karroubi and Rezaei (see pages 21-22 of Mebane&#8217;s <a href="http://www.umich.edu/~wmebane/note19jun2009.pdf ">report</a>).</p>
<p><strong>What happened?</strong> So did something funny happen to Karroubi and Rezaei&#8217;s votes? Maybe, but not necessarily. Over 80% of ballot boxes showed single-digit numbers of votes (i.e. fewer than 10 votes) for Karroubi, and over 70% showed single-digit numbers for Rezaei. For these boxes there is no second digit to test. In such a situation one might imagine exceptions to the second-digit distribution should obey Benford&#8217;s Law.</p>
<p>In email with Prof. Mebane, he has expressed faith in the analysis&#8217;s applicability in this situation. My own inspection of <a href="http://election.princeton.edu/wp-content/uploads/2009/06/b2.zip">the ballot box-level data</a>, which he very kindly provided, is that the minor-candidate data appear to have a &#8220;scale-invariant&#8221; distribution, a condition that appears in proofs of Benford&#8217;s Law. I may have more thoughts on this later.</p>
<p>But another line of evidence can approach the question independently: pre-election polls.</p>
<p><strong>Pre-election polls.</strong> I <a href="/2009/06/18/analyzing-iran-2009-part-1-pre-election-polls/">previously pointed out</a> that in Tehran, Ahmadinejad outperformed all six pre-election polls by a median of 16 percentage points. Now let&#8217;s look at the minor candidates.</p>
<p>In <a href="http://en.wikipedia.org/wiki/Iranian_presidential_election,_2009#Opinion_polls">12 polls taken since May 1st</a>, polled support for minor candidates was <strong>Karroubi 7+/-1%</strong> and <strong>Rezaee 8+/-4%</strong> (median and SEM). They showed at least 3% each in Iranian polls (though less in a <a href="http://www.terrorfreetomorrow.org/upimagestft/TFT%20Iran%20Survey%20Report%200609.pdf">Western-commissioned poll</a> that showed a large fraction of undecided voters).</p>
<p>In official election returns, Karroubi obtained 0.9% of the vote and Rezaee obtained 1.7%. Where did their support go? Maybe it was never there, though that would require assuming a significant failure of Iranian polling. A second possibility is that minor-party candidates reverted to Ahmadinejad or Mousavi when it came time to vote. Certainly this is a common occurrence in U.S. politics. But it&#8217;s been pointed out elsewhere that such a &#8220;coming home&#8221; does not fit with Iranian politics, and notably it did not happen in Iran&#8217;s 2005 presidential race.</p>
<p>A third possibility is that minor-party votes were fraudulently transferred to a major candidate. This possibility is consistent with the statistical analysis of Prof. Mebane. Also, the total support in opinion polls for Karroubi and Rezaee is not far from the 16-point discrepancy in Tehran I <a href="/2009/06/18/analyzing-iran-2009-part-1-pre-election-polls/">identified earlier</a>.</p>
<p><strong>What does this mean?</strong> If minor-party candidate votes were transferred to Ahmadinejad, it&#8217;s conceivable that he did not originally get 50% of the vote, which would have forced a runoff. However, I would not go so far as to say that the election was stolen. My reading of this evidence is that fraud may have occurred, but not enough to flip the election.</p>
<p>That having been said, the threat of fraud undermines faith in free elections, whether or not it influences the outcome. In Iran, we&#8217;re now seeing the <a href="http://andrewsullivan.theatlantic.com">bloody aftermath</a> of that loss of faith.</p>
<p>A final note: none of this analysis rules out other forms of fraud that are harder to trace. For that one can turn to other sources of information such as the 2005 election&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/06/21/analyzing-iran-2009-part-2-the-official-returns/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Iranian presidential debates</title>
		<link>http://election.princeton.edu/2009/06/19/iranian-presidential-debates/</link>
		<comments>http://election.princeton.edu/2009/06/19/iranian-presidential-debates/#comments</comments>
		<pubDate>Fri, 19 Jun 2009 18:48:55 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>
		<category><![CDATA[Iran election 2009]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/?p=3120</guid>
		<description><![CDATA[On June 2-8, one-on-one debates were held between pairs of major candidates. As I posted earlier, these could have shifted opinion. One reader writes:
People watched all of the debates very carefully and the result was actually very bad for AhmadiNejad&#8230;[I] am saying that as an Iranian based on what I saw and heard from people [...]]]></description>
			<content:encoded><![CDATA[<p>On June 2-8, one-on-one debates were held between pairs of major candidates. As I <a href="/2009/06/18/analyzing-iran-2009-part-1-pre-election-polls/">posted earlier</a>, these could have shifted opinion. One reader writes:</p>
<blockquote><p>People watched all of the debates very carefully and the result was actually very bad for AhmadiNejad&#8230;[I] am saying that as an Iranian based on what I saw and heard from people during and after the debates. Rezaei did very well in the debates and got popularity (if not votes) specially for his debate with AhmadiNejad.</p></blockquote>
<p>I wonder if undecideds felt this way.</p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/06/19/iranian-presidential-debates/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Analyzing Iran 2009: Part 1, Pre-election polls</title>
		<link>http://election.princeton.edu/2009/06/18/analyzing-iran-2009-part-1-pre-election-polls/</link>
		<comments>http://election.princeton.edu/2009/06/18/analyzing-iran-2009-part-1-pre-election-polls/#comments</comments>
		<pubDate>Fri, 19 Jun 2009 03:13:59 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>
		<category><![CDATA[Iran 2009 election]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/?p=3095</guid>
		<description><![CDATA[Updated with a third possible reason for the discrepancy. -Sam
At a minimum, hundreds of statistically minded people are poring over Iranian election data (see my previous post and these posts). Sorry for my slow start&#8230;but let&#8217;s roll.
Three general categories of data are currently available for validating the Iranian election: (1) pre-election polls, (2) statistical methods [...]]]></description>
			<content:encoded><![CDATA[<p><em>Updated with a third possible reason for the discrepancy. -Sam</em></p>
<p>At a minimum, hundreds of statistically minded people are poring over <a href="http://andrewsullivan.theatlantic.com/">Iranian election</a> data (see <a href="/flash-statistical-evidence-for-iranian-election-fraud/">my previous post</a> and <a href="http://www.fivethirtyeight.com/2009/06/ahmadinejads-rural-votes.html">these</a> <a href="http://www.fivethirtyeight.com/2009/06/if-he-did-it.html">posts</a>). Sorry for my slow start&#8230;but let&#8217;s roll.</p>
<p>Three general categories of data are currently available for validating the Iranian election: (1) pre-election polls, (2) statistical methods for analyzing standalone voting data, and (3) statistical comparisons with past elections. Categories (2) and (3) are out on the web already, and I&#8217;ll comment soon on those efforts.</p>
<p>First, let&#8217;s look at some polls. A simple look at pre-election polls leads to the following assessment: <strong>National Iranian polls were highly variable and of suspect quality. But within Tehran, polls were more uniform and allow a comparison. Six Tehran polls gave a median </strong><strong>lead for </strong><strong>Moussavi by 4%. This differs notably from the official tally for the city, Ahmadinejad by 12%. The 16-point discrepancy suggests an anomaly in Tehran and opens the question of whether fraud occurred here &#8211; and elsewhere. However, it is also important to note several caveats, including polling uncertainty and possible shifts in opinion following the Ahmadinejad-Moussavi debate on </strong><strong>June 3rd</strong><strong>.<br />
</strong></p>
<p>More after the jump.<span id="more-3095"></span></p>
<p>In the US Presidential elections of 2000, 2004, and 2008, election-eve opinion polls were <a href="/2008/11/11/post-election-evaluation-part-2/">remarkably accurate predictors</a> of the electoral outcome. In each case, aggregated, unadjusted polling data successfully identify key states (Florida, Ohio, and Pennsylvania) and came quite close to the final outcome. It&#8217;s a testament to the power of polls.</p>
<p>The Iranian election presents a harder case. Polling is sparse, professional standards of reporting polls are absent, and respondents are potentially unwilling to answer questions or hard to reach. Still, let&#8217;s look at the <a href="http://en.wikipedia.org/wiki/Iranian_presidential_election,_2009#Opinion_polls">publicly available polls</a>.</p>
<p><em><strong>National polls</strong></em> are all over the place, even if we only take data after the <a href="http://en.wikipedia.org/wiki/Iranian_presidential_election,_2009#Televised_debates main debate, june 3">Ahmadinejad-Moussavi debate on June 3rd</a>, potentially a major decision point for Iranians. One post-June 3rd poll shows <strong>Ahmadinejad +16%</strong> (47% to 31%). <a href="http://translate.google.com/translate?prev=hp&amp;hl=en&amp;js=n&amp;u=http%3A%2F%2Fwww.roozonline.com%2Fpersian%2Fopinion%2Fopinion-article%2Farticle%2F2009%2Fjune%2F08%2F%2F-a2239d36f0.html&amp;sl=fa&amp;tl=en&amp;history_state0=">The other</a> shows <strong>Moussavi +32%</strong> (23% to 54-57%). Earlier polls range from Ahmadinejad +33% to Moussavi +30%. These data are so variable that they are unusable.</p>
<p><em><strong>Polls within Tehran</strong></em> may be a better source of data. This is plausible because urban areas are easier to sample. The last three Tehran surveys (<a href="http://translate.google.com/translate?prev=hp&amp;hl=en&amp;js=n&amp;u=http%3A%2F%2Falef.ir%2F1388%2Fcontent%2Fview%2F47404%2F&amp;sl=fa&amp;tl=en&amp;history_state0=">before June 8th</a>, <a href="http://translate.google.com/translate?prev=hp&amp;hl=en&amp;js=n&amp;u=http%3A%2F%2Falef.ir%2F1388%2Fcontent%2Fview%2F47404%2F&amp;sl=fa&amp;tl=en&amp;history_state0=">before June 7th</a>, and June 3rd) show <em>Moussavi +4%, Ahmadinejad +8%, and Moussavi +17%.</em> Before that, three surveys showed <em>Moussavi +12%, +4%, and +2%</em> (May 27, May 26, May 14). The averages are</p>
<p><strong>Last 3 polls: Moussavi +4 +/- 7% (median +/- <a href="http://en.wikipedia.org/wiki/Median_absolute_deviation#Relation_to_standard_deviation">MAD-based SEM</a>).<br />
All 6 polls: Moussavi +4 +/- 4%.</strong></p>
<p>The announced official result was <strong>Ahmadinejad +12%</strong> (51.6% to 39.4%), a discrepancy of 16 points. When all 6 polls are used, this discrepancy is highly significant (p=0.003).</p>
<p>For now, my interpretation is that the official returns in Tehran are unbelievable. However, I can think of <span style="text-decoration: line-through;">two</span> three alternate explanations.</p>
<p>(1) <em>Ahmadinejad really mopped the floor with Moussavi in the debate. </em>The experience in U.S. elections is that <a href="/2008/10/02/will-the-remaining-debates-matter/">debates provide a side-by-side comparison that can shift opinion substantially</a> (for a famous example see Carter-Reagan 1980). In the case of Iran 2009 there are only 2 or 3 post-debate polls. A comparison using just 3 polls does not quite reach statistical significance (p=0.07).</p>
<p>(2) <em>Tehran polls have a systematic overall pro-Moussavi bias that prevents a direct comparison with vote counts.</em> For example, as David Shor points out in comments, polls might have been restricted to the actual city of Tehran, which is not all of Tehran province.</p>
<p><em>(3) Last-minute minor party candidates broke in favor of Ahmadinejad.</em> There is plenty of precedent for third-party (and in this case, fourth-party) candidates to revert to one of the major candidates. U.S. readers, think of Nader supporters in the last two elections, who underperformed the opinion polls. And then there are undecided voters, who usually break against the incumbent in the U.S. but it&#8217;s not clear what would happen here.</p>
<p>I should emphasize that Tehran is not representative of the entire nation. It is notably more pro-Moussavi, which can account in part for the public anger there. In fact, if the 16-point discrepancy were corrected nationwide it would still not be enough to alter the overall outcome.</p>
<p>Iranians and other knowledgeable observers, please comment.</p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/06/18/analyzing-iran-2009-part-1-pre-election-polls/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Statistical evidence for Iranian election fraud?</title>
		<link>http://election.princeton.edu/2009/06/18/flash-statistical-evidence-for-iranian-election-fraud/</link>
		<comments>http://election.princeton.edu/2009/06/18/flash-statistical-evidence-for-iranian-election-fraud/#comments</comments>
		<pubDate>Thu, 18 Jun 2009 12:56:12 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/?p=3086</guid>
		<description><![CDATA[In the wake of continuing turmoil over the disputed Iranian election, enough information is available to do some validation of the results. There are several types of information &#8211; examples can be seen here and here.
This type of data can be used to look for fraud. Crude fraud can be found by applying Benford&#8217;s law, [...]]]></description>
			<content:encoded><![CDATA[<p>In the wake of <a href="http://andrewsullivan.theatlantic.com/">continuing turmoil over the disputed Iranian election</a>, enough information is available to do some validation of the results. There are several types of information &#8211; examples can be seen <a href="http://election.princeton.edu/wp-content/uploads/2009/06/iran_election_2009.xls">here</a> and <a href="http://election.princeton.edu/wp-content/uploads/2009/06/vote_iran.xls">here</a>.</p>
<p>This type of data can be used to look for fraud. Crude fraud can be found by applying <a href="http://mathworld.wolfram.com/BenfordsLaw.html">Benford&#8217;s law</a>, which states that for many categories of data, including vote counts, the first digits of lists of observations are not uniformly distributed (as one would naively expect) but instead skewed toward low values. There are more sophisticated tests as well. Prof. Walter Mebane at the University of Michigan is knowledgeable about such analysis and is applying the methods to the data from the recent Iranian election. In addition to intrinsic peculiarities such as Benford&#8217;s law, he is also using 2005 election data as a baseline to help discover unexpected anomalies.</p>
<p>He currently says &#8220;&#8221;I think the results give moderately strong support for a diagnosis that the 2009 election was affected by significant fraud.&#8221; I haven&#8217;t had time to go over his analysis, but <a href="http://www.umich.edu/~wmebane/note18jun2009.pdf">here it is</a> for interested readers, along with a <a href="http://www.umich.edu/~wmebane/Iran2009_18jun2009.zip">ZIP file</a> of source code and data. Note that he is still updating his analysis, so regard this as an interim report.</p>
<p>Later on I&#8217;ll take a critical look at this and other approaches to detecting fraud, with the idea of arriving at a synthesis &#8211; and possible conclusions.</p>
<p>Thanks to <a href="http://stochasticdemocracy.blogspot.com/">David Shor</a> and Andrew Gelman for making this connection.</p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/06/18/flash-statistical-evidence-for-iranian-election-fraud/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Presidential words</title>
		<link>http://election.princeton.edu/2009/02/25/presidential-words/</link>
		<comments>http://election.princeton.edu/2009/02/25/presidential-words/#comments</comments>
		<pubDate>Wed, 25 Feb 2009 13:48:03 +0000</pubDate>
		<dc:creator>Sam Wang</dc:creator>
				<category><![CDATA[Politics]]></category>

		<guid isPermaLink="false">http://election.princeton.edu/?p=3035</guid>
		<description><![CDATA[Via James Fallows, whose insights about presidential (and candidate) performances are worth following, here&#8217;s an interesting tool: SpeechWars, by Ben Reis. SpeechWars shows, over history, how often presidents have used a particular word in a State of the Union address. Although Obama&#8217;s address to the joint session of Congress is technically not a SOTU address, [...]]]></description>
			<content:encoded><![CDATA[<p>Via <a href="http://jamesfallows.theatlantic.com/archives/2009/02/interesting_little_tool_to_use.php">James Fallows</a>, whose insights about presidential (and candidate) performances are worth following, here&#8217;s an interesting tool: <a href="http://www.speechwars.com/sou/index.php">SpeechWars</a>, by Ben Reis. SpeechWars shows, over history, how often presidents have used a particular word in a State of the Union address. Although Obama&#8217;s address to the joint session of Congress is technically not a SOTU address, it&#8217;s counted here. For example, here are the relative frequencies of the word &#8220;war&#8221; and &#8220;banks&#8221; (click image for a high-resolution version):</p>
<p><a href="http://election.princeton.edu/wp-content/uploads/2009/02/sotu-war-banks.jpg"><img class="alignnone size-full wp-image-3038" title="sotu-war-banks-500px" src="http://election.princeton.edu/wp-content/uploads/2009/02/sotu-war-banks-500px.jpg" alt="sotu-war-banks-500px" width="500" height="130" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://election.princeton.edu/2009/02/25/presidential-words/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
