In Part I, we defined non-monotonicity, showed that Instant Runoff Voting is a non-monotonic voting system, and that the recent mayoral election in Burlington Vermont suffered from non-monotonicity.
Here in Part II, I'll talk about why monotonicity matters, and do my best to refute all arguments to the contrary.
What happened?
To understand why it's important, first we should try to understand what happen to make an IRV election display non-monotonicity; we'll use what happened in Burlington's IRV election to showcase. I mentioned that "monotonicity" is an idea borrowed from algebra, but let's make that more clear: rather than move just 8 votes from Wright to Kiss, as we did in the example in Part I, let's make a function out of it, so we can see what it's like to move any number of votes over, and graph it against the vote-tallies in the final IRV round. A picture should make this clear:
You can see the huge discontinuity at 8 votes, where the IRV algorithm changes which candidate is eliminated. You can also see that, at 14 votes, the election switches back into Kiss' favor. So there's a limited window where this sort of manipulation will work. Foremost, obviously, is that there must be at least three candidates in the election. As candidates are eliminated, when only three are left, they must each have more than 25% of the vote. Finally, the eliminated candidate (the one with the fewest first-place votes) must be the "beats-all" (or "Condorcet winner") candidate, meaning that, in any one-on-one race, they would win. We can determine this by, for each pair of candidates, removing all but that pair from every ballot, and seeing who would win. For the Burlington election, check, check, and check.
Under these circumstances, IRV exhibits non-monotonic behavior, and eliminates the Condorcet winner. In fact, you could say that because it eliminates the Condorcet winner, it has exhibited non-monotonic behavior. But so what? Well, there's a hidden implication in the elimination of a Condorcet winner, and that is that the Condorcet winner is the moderate candidate, the compromise candidate. Meaning that, by eliminating them, we've chosen an extremist candidate.
In the Burlington election, in addition to the nationally-familiar Democratic party (with candidate Andy Montroll) and Republican party (with candidate Kurt Wright), we also had the eventual IRV-winner, Bob Kiss of the Vermont Progressive Party, which is just a bit more left-leaning than the Democrats. There is a model of voting behavior, appropriately refered to as the "1D" model, where candidates and voters are arrayed on a line, left to right, and voters always more-prefer the candidate closer to them. In the 1D model then, we should expect that all Kiss-first or Wright-first voters would have had Montroll second (ignoring, for now, candidates other than these three). And that's almost the case: among those who expressed a preference, Wright voters prefered Montroll 3:1 over Kiss, and Kiss voters prefered Montrol 7:1 over Wright. 1D isnt' perfect, but it's often pretty close (how much of this is feedback from our two-party dominated political process is an article for another day), and in my "simplified" example of the election, I moved all the "wrong" votes to their "proper" 1D vote. If the purpose of an election is to choose an appropriate candidate to represent a body of people, it's logical to say that you want the winner to come from the center of the spread of voters' opinions; it's a compromise.
But IRV doesn't always pick the compromise candidate; instead, it picks one of the two extremes. The Center for Range Voting has some excellent graphical representations of this, using a 2D political spectrum and many more candidates (which is more realistic than our 1D, 3 candidate example). At a glance you can see how extremist candidates encroach on the territory of more moderate candidates.
On one hand, it's very strange for me to be damning this election, since it technically elected a third-party candidate, which I generally find to be a good thing. One of the reasons I find our current system so abhorrent is because third-party candidates actually damage their most closely-aligned major-party by acting as spoiler candidates. On the other hand, that's what happened in Burlington. Nearly two-thirds of Burlington residents are registered Democrats (click "voting" on that page to expose the data). And considering the Democratic wave that has been washing over the nation recently, in a vote between right, left, and lefter, the Republicans are the "third" party in Burlington. And as a third party, they operated as we would expect; they siphoned votes away from their nearest major-party candidate and gave the election to their most-hated opponent. Their nearest major-party candidate just happened to be the Democrats.
The "third party" Republicans opperated as a spoiler. Under the 1D model (and by a 7:1 margin in actuality) they would prefer the more-moderate Democrat, Montrol, over the further-left Progressive, Kiss. But their votes for Wright acted only as votes against Montroll in the final tally. This is exactly analogous to Nader spoiling Gore. Despite IRV proponents claims, IRV does not eliminate the effect of spoilers, it only delays their effect. Under our current plurality system, spoilers can have an effect with as little as one vote, if the other candidates are close enough. Under IRV, they only pop up when they have at least 25% of the vote. But the effect will ultimately be the same: fear of spoilers will tend to drive voters back towards the two major parties.
Both of these things—failure to elect a "beats all" winner and susceptibility to a "spoiler" candidate—are indicated by IRV's non-monotonic behavior. Non-monotonicity is the smoke to their fire, or the chink in the armor that lets them sneak in. As IRV advocates will claim, it doesn't matter that if some voters changed their votes they could change the outcome in a non-intuitive way; because they didn't. It matters because that possibility indicates that elections will tend toward extremism and towards two-party domination.
How often?
After being forced to admit that, yes, non-monotonicity is something to be concerned about, an IRV advocate's second round of defense will be that it doesn't matter because it doesn't happen that often. And I must admit, this one example is just one example; the plural of "anecdote" is not "data". Luckily, a host of very smart individuals have tried to calculate how often these things should happen, which CRV has taken the time to summarize. The first to make the attempt was Crispin Allard in 1996, and he calculated the chance as 1 in 40,000, and this is the number upon which IRV advocates have built their argument. Unfortunately, Allard's calculation was wrong. Very wrong. He only counted the more common of the two types of non-monotonicity, and he took advantage of a six-fold symmetry to simplify his calculations but forgot to multiply by six at the end, but worst of all made a serious arithmetic error. Over all, he was off by a factor of over 1,000.
Corrected calculations have found that, using Allard's model and his assumptions, the correct odds of an IRV election displaying non-monotonicity is closer to 1 in 7.
But, an IRV advocate might say, that's only one model. Yes, it is. Using a different model, the odds were calculated to be around 1 in 18, and in a third, a little less than 1 in 10. That, combined with the growing list of real-life IRV elections that have displayed non-monotonicity, it should be quite clear that these failures are predicted to happen often, and often happen in real life. Also, it's important to note that these values apply only when there are three strong candidates in a race. As the number of viable candidates increases, the odds only get worse, until the chaos is practically guaranteed.
What's it mean?
Ultimately, adoption of IRV will result in practically no change to the status-quo. Third parties will find they can gather more votes, but they still won't be able to win any elections, because of the fear of spoilers. Voters will still find themselves deciding between the least-evil of a pair of party-chosen politicians, and we'll continue to lurch back-and-forth between left-leaning policy and right-leaning policy, rather than pick a stable course. The divide between "red" and "blue" will continue to grow and our common ground will continue to wither away. And let's not ignore that this non-improvement comes at a high cost (another post for another day). Why not focus on a change that will actually change something?
I've come down hard on IRV here, but that's only because it's the loudest competitor. The truth is, every election method that uses ranked-order ballots (i.e., ballots of the type A > B > C) is susceptible to one of two failings, either "favorite-betrayal" (which IRV displays through its non-monotonicity) or something called "cloning" (which, briefly, involves proping up ideological copies of your opponent in order to split the votes of those who oppose you). IRV is immune to cloning, a fact that they proudly showcase in this video, while ignoring their non-monotonicity pitfall. ("Or something like that" indeed!) Now, cloning is a serious concern under our current plurality system; but why fix one problem by introducing another, when score voting is clone-proof and monotonic, not to mention cheaper to implement?
I should note that, for the graph, I interpolated between 6 and 8 to find the value at 7; at 7, W and M tie, and I'm not sure what tie-breaker is used in Burlington. But I ignored all the other candidates, moved large numbers of votes around to fit the 1D model, and divided the totals by a hundred, so really, Burlington's rules aren't that important. (Trust me: the end result still works out the same.)
ReplyDeleteAlthough, I should point out that, because of the elimination step in IRV, there are many more chances for ties, and non-monotonicity means the way you break those ties is very important; one changed ballout between the 19th and 20th ranked candidates might percolate up to be all the difference between 1st and 2nd. Which means more Coleman/Franken style lawsuits to decide close elections over small margins.
You're not describing the real problem with non-monotonic elections. Extremism is a problem with IRV, but it has nothing to do with monotonicity. Non-monotonic elections are bad because they represent a 50% or greater chance that the election in question had the wrong result. It's the same reason that independence of irrelevant alternatives is bad.
ReplyDeleteThat's a good point. I should have named these posts "Non-Monotonicity in IRV", since, as you mention, other non-monotonic methods may express their failure in different ways. I do, however, think it's fair to say that IRV expresses its non-monotonicity by deferring to more extremist candidates.
ReplyDeleteI should probably also have broken it into three parts; thank you for reading all the way to the end!