Satisficing and Selection in Electoral Competition
Transkript
Satisficing and Selection in Electoral Competition
Satisficing and Selection in Electoral Competition1 Jonathan Bendor2 Dilip Mookherjee3 Debraj Ray4 1 We thank Alberto Diaz-Cayeros, Jim Fearon, Tim Groseclose, Sunil Kumar, Dave Laitin, Adam Meirowitz, Ken Shotts, and seminar participants at Berkeley, Stanford, U.C. Irvine, UCLA, the University of Chicago, the 2001 APSA meetings and the 2002 MPSA meetings for their helpful comments, and our research assistants, past and present—Bill Hauk, Adam Meirowitz, Dave Siegel, Jon Woon and Muhamet Yildiz—for their work on the computational model. 2 3 4 Graduate School of Business, Stanford University, Stanford CA 94305-5015 (corresponding author). Professor, Department of Economics, Boston University, Boston MA 02215. Professor, Department of Economics, NYU, 269 Mercer St., New York City NY 10012. Abstract We model political parties as adaptive decision makers who compete in a sequence of elections. The key assumptions are that winners satisfice (the winning party in period t keeps its platform in t + 1) while losers search. Under fairly mild assumptions about losers’ search rules, we show that the sequence of winning platforms is absorbed into the top cycle of the (finite) set of feasible platforms with probability one. This implies that if there is a majority rule winner then ultimately the incumbent party must espouse it. However, we also establish, under weak assumptions about the out-party’s search, that the parties do not stabilize at the majority rule winner (should it exist). Thus full Downsian convergence is not predicted. We also show, both analytically and computationally, that the adaptive process is robust: in particular, if a majority rule winner “nearly” exists then the trajectory of winning platforms tends to be “close” to a trajectory of a process which actually has such a winner. 1 One of the most celebrated and cited5 books in political science is An Economic Theory of Democracy (Downs 1957). It routinely makes the discipline’s short lists of the most important works of the post-war era (e.g., Goodin and Klingemann 1996). And it is used as well as cited: An Economic Theory has spawned a great many papers and books on spatial models of party competition. This body of work clearly constitutes a research program. Yet even the most successful research programs have problems. In this paper we focus on three significant issues that confront the Downsian program. These pertain to its assumptions, robustness/power, and predictions. (1) Almost all Downsian theories assume that decision makers are fully rational; relatedly, game theoretic ones also make strong common knowledge assumptions (A knows that B is fully rational, B knows that A knows, ad infinitum). Criticisms of this general approach once came from outsiders (e.g., Simon 1955)—scholars working in neither the Downsian program nor its parent (rational choice theories of politics)—and many political scientists and economists working in this tradition were comfortable with Milton Friedman’s famous “as if” defense of perfect rationality premises. But experimental evidence (Kagel and Roth 1995; Camerer 2003) from the now-flourishing fields of behavioral and experimental economics indicates that in many choice situations the “as if” defense crumbles: boundedly rational agents do not always behave as if they were cognitively unconstrained. Hence, many economists are now willing to take objections to full-rationality seriously and to build alternative theories. The time is ripe to do likewise in political science. (2) Most Downsian models deliver predictions in a narrow class of situations: when preferences are single-peaked and the issue space is unidimensional. Despite the best efforts of several generations of talented researchers, getting Downsian models to yield testable predictions for multidimensional issue spaces has been difficult (even if one retains single-peakedness). Because we have no guarantee that all elections are one-dimensional contests—redistributive issues, e.g., are notoriously multidimensional—this is troubling. (3) Lastly, even in the unidimensional setting the accuracy of the best-known Downsian prediction— the median voter theorem for two-party competition—is questionable. In her review essay on political parties, Stokes says that “Recent research confirms that political parties in two- and multiparty systems occupy persistently different policy position...[hence] spatial theory’s prediction of programatic convergence is unsustainable” (1999, p.258). Ansolabehere, Snyder and Stewart state that “Competing candidates in congressional elections almost never converge” (2001, p.152; see also Chappell and Keech 1986, p.882 and references cited therein, and Levitt 1996, p.438). The last of these problems is the easiest for the Downsian program to handle. As several scholars have noted (e.g., Grofman 1993b), some Downsian models do predict divergence, by modifying several of the canonical premises. This underscores that what is central to a rational choice theory of electoral competition is not a particular prediction, even one as famous as the median voter prediction; instead, it is the rationality assumption itself. It is the latter that is central to the 5 “Wattenberg (1991) shows that An Economic Theory of Democracy is now the most widely cited research work in political science by a living scholar” (Grofman 1993a, p.2). See also the citation data noted by Almond (1993, p.206). 2 program’s Lakatosian “hard core”. In this paper we propose a behavioral model of electoral competition which addresses all three of the above issues. (1) Instead of making restrictive assumptions about rationality and common knowledge, our model assumes only that candidates are adaptively rational: in the spirit of Cyert and March (1963), they engage in problem-oriented search for new platforms in the face of electoral difficulties (losing). (2) The model is not forced to presume either a unidimensional policy space or single-peaked preferences in order to make testable predictions. (3) Under rather general assumptions about search, the model does not predict that the parties converge to the median position in a unidimensional policy environment. Finally, by making the parties’ search behavior a central feature, the model explicitly represents electoral dynamics. Our work follows that of Kramer (1977) Ferejohn, Fiorina and Packel (1980) and Ferejohn, McKelvey and Packel (1984; henceforth FFMP), and Kollman, Miller and Page (1992, 1998; henceforth KMP). Although Kramer, FFMP and KMP are rarely regarded as constituting a line of work, we believe they do: instead of investigating the equilibrium outcomes generated by perfectly rational decision makers, all studied how a series of electoral outcomes can be generated by myopic decision makers. Because this body of work is so important to our own, a section of the paper is devoted to discussing their work. The paper is organized as follows. The second section analyzes the related work of Kramer, FFMP and KMP. The third section presents the model, a stochastic analysis of a sequence of elections, and several implications. Proposition 2 shows that if winners satisfice, then experimentation by losers is necessary for a well-defined type of electoral “progress”. Proposition 3 demonstrates that if experimentation has certain rather weak properties then it and satisficing-by-winners together are sufficient to ensure that the sequence of winning policies reaches the policy space’s top cycle set with probability one, and stays there. Proposition 4 establishes that the sequence of winning policies is absorbed into the top cycle set if winners satisfice and the electoral dynamic satisfies a general stochastic property (ergodicity). Propositions 3 and 4 imply that if there is a majority rule winner, then ultimately the incumbent party must espouse it. However, Proposition 5 shows, given a weak assumption about the out-party’s search, that the parties do not stabilize at the median voter’s bliss point (when it exists). Thus full Downsian convergence is not predicted. The fourth section investigates alternative specifications of the challenger’s search behavior by endowing him with different degrees of sophistication and certain kinds of knowledge about the political terrain, following Kramer (1977), Miller (1980), or FFMP (1980, 1984). The fifth section analyzes whether our results are sensitive to “small” changes in key assumptions. Proposition 6 shows that they are, in fact, robust: e.g., if a majority rule winner “nearly” exists then the trajectory of winning platforms tends to be “close” to a trajectory of a process which actually has a generalized median. We then present a computational model that provides further results for ill-structured electoral environments. This goes beyond our analytical model by examining what happens when the profile of voters’ ideal points is not even close to having a generalized median. Computational results show that if challengers usually search incrementally (i.e., in the 3 neighborhood of their current platform) or blindly then in the steady state winning policies are centrally located. Moreover, their dispersion is strongly correlated with the size of the uncovered set. The last section concludes. RELATED WORK Despite their common orientation to dynamics and their departure from (Nash) equilibrium analysis, the work of Kramer, FFMP and KMP move away from standard rational choice models in different ways. Indeed, the relations among their models are complex. (E.g., although FFMP’s ideas are applicable to Downsian competition [Ferejohn 1995, p.258], their institutional setting is committees, not elections.) Hence it is useful to present a table that compares their attributes to each other and to the current model. [table 1 about here] It is especially worth noting that only KMP make bounded rationality a central theme (e.g., 1992, p.929). Kramer never raises it; FFMP merely touch on it (Ferejohn et al. 1980, p.141, 144). Yet in several ways all share a family resemblance on this dimension as well. First, all are what might be called status quo based processes. E.g., in Kramer’s model the winner in period t retains this winning platform in period t + 1. Although Kramer does not give a cognitive interpretation for this,6 we believe that there is a natural one, captured by the slogan that winners satisfice. Obviously, winning is satisfactory for office-oriented candidates, so a winning platform is the proverbial “needle that is sharp enough” (March and Simon 1958, p.141). Thus satisficing is a straightforward bounded rationality interpretation of status quo based electoral processes. Second, agents are assumed to be myopic. This is explicit in KMP and FFMP7 , but it is also a reasonable interpretation of Kramer’s model in two ways. (a) Satisficing-by-winners, though sensible, is myopic. (b) Kramer assumes that the out-party selects a platform that maximizes the vote-share against the status quo policy. This decision rule is an example of what is now called myopic best response: an alternative is chosen today without any regard for what will happen in the long run. (Indeed, Kramer’s model is a precursor to this line of theorizing, which is intermediate between classical full rationality, with its unlimited horizon, and decision rules such as satisficing that need not be even myopically optimal.) But these are mostly matters of interpretation. If pushed too far they produce a Procrustean reading. An inspection of Kramer and FFMP shows that their central concern is not to modify the rationality postulate of Downsian models; rather, their focus is the so-called “chaos problem” (Kramer, p.311-315; Ferejohn et al. 1980, p.140-41; Ferejohn et al. 1984, p.45-46). Only KMP make rationality a central issue.8 We regard this as an important project in its own right, and we 6 About this assumption he says only that “In each period one of the parties is elected, enacts the policy it advocated, and in the next election must defend this same policy” (p.317). 7 For readers who find this a surprising claim about FFMP, consider the following evidence: “The assumption that the committee decision process is myopic (3) may not be unreasonable, especially when N [the set of decision makers] and X [the policy set] are large enough to impede systematic enumeration of preferred alternatives and their supporting coalitions” (Ferejohn et al. 1980, p.144). 8 In KMP the parties are adaptively rational: they use search heuristics to discover new platforms, and poll data 4 view the current paper as contributing to that effort. That is, we see KMP as part of the larger bounded rationality research program in political science (Bendor, 2003): the extension of that program into work on party competition.9 Creating a coherent research program that shares the rational choice program’s focus on individual agents yet which uses more realistic premises about cognition is important to the discipline as a whole, not merely to the subfield of electoral politics. From this perspective, addressing the so-called chaos problem is far from the whole story. The larger goal is to create a competing research program that can address a wide aray of important topics on electoral competition—including what happens when there is a majority rule winner. Where to go from here Unfortunately, KMP has not had the impact on the field that it deserved. Discipline-wide factors may be at work—formal models of bounded rationality have diffused slowly in political science—but we believe that several specific factors also matter. First, in one sense KMP were too ambitious: they jumped into analyzing races in multidimensional policy spaces without first examining the unidimensional setting.10 Second, in another respect, however, KMP were not ambitious enough: their major papers (1992, 1998) settled for simulation results rather than analytical ones. Though this was surely done for pragmatic reasons—closed form results for dynamic processes in multidimensional spaces are often unattainable—it also made their work differ significantly from the Downsian mainstream, which emphasizes mathematical models and analytical results. (Even the substantively unorthodox papers of Kramer (1977) and FFMP (1980, 1984) produced analytical results.) So KMP differs from the mainstream both in terms of major substantive premises and the method of analysis. This combination may have marginalized it. Third, although Simon (1990) and others have listed empirical regularities about informationprocessing that can constrain our assumptions, there are still many plausible ways to model bounded rationality. Hence, for the sake of robustness it is desirable, ceteris paribus, to model the candidates’ adaptive behavior as generally as possible. This means positing general properties of adaptation (e.g., that successful strategies are more likely to be used in the future than unsuccessful ones) rather than a specific form of adaptation. Unfortunately, simulation is ill-suited to the general approach: a computer must be told exactly what kind of search heuristic to use. This problem can be alleviated by sensitivity testing, which KMP did, but it remains an issue. Indeed, because there are many types of bounded rationality (not only many kinds of search heuristics), an efficient form of sensitivity testing is to establish results analytically. The current paper thus complements KMP by using a different research strategy. First, we establish results for the canonical Downsian setting: a unidimensional policy space and voters is used to evaluate the merits of new platforms. 9 Some might regard the models in KMP and the present paper as belonging to a new research program, behavioral game theory, which merges game theory’s emphasis on strategic intereaction, and behavioral decision theory’s emphasis on the cognitive constraints of human beings. We have no quarrel with this alternative interpretation. 10 Perhaps the authors thought it was obvious how their search heuristics would perform in unidimensional spaces— apparently they induce the sequence of winning policies to converge to the median bliss point (Page, personal communication 1999)—but stating this explicitly in their first paper would have been sensible. 5 with single-peaked preferences. (More precisely, we establish results that include single-peaked preferences in a unidimensional space as a special case.) Only after we nail down results in that setting do we move on to the less well understood multidimensional environment. Second, we push analytical results as far as we can, turning to simulation only when necessary. (We do agree with KMP that when one cannot derive analytical results, resorting to computation beats giving up.) Third, in the interests of generality and robustness, we specify a few general properties of adaptation rather than relying on detailed heuristics. These features are interdependent—e.g., for modelling general properties of adaptation mathematics is better than simulation—and so they form a coherent approach with its own distinctive set of tradeoffs. THE MODEL AND ITS IMPLICATIONS We study a standard electoral game: a contest between two candidates or parties. A few modifications have been introduced to enhance analytical tractability. There are two candidates and a finite set of citizens, N = {1, . . . , n}, with n odd. There is a finite set of feasible policies or platforms, X = {x1 , . . . , xm }, with m > 1. (X may be huge, but finiteness is analytically useful for several results.) Each citizen has a strict preference ordering over the policies. Since n is odd, this ensures that the majority preference relation is also strict: for any two distinct policies, xi and xj , either a majority of citizens strictly prefer xi to xj (written xi xj ) or xj to xi . We do not assume spatial policies or preferences, using instead the above spare assumptions about individual preference orderings. But we can recover spatial settings (of different dimensionalities) when doing so is necessary or convenient. The following description of the majority preference relations induced by the above assumptions enables us to make this translation to a spatial framework. We partition the policy set X into z disjoint subsets, {L1 , . . . , Lz }, where 1 ≤ z ≤ m, by an interative application of the idea of the top cycle set.11 Let L1 denote X’s top cycle set. (It is known that L1 must be nonempty.) Now consider the reduced set, X/L1 . If this too is nonempty (it may not be), let L2 denote the top cycle of X/L1 . Proceed in this way until all x ∈ X have been assigned to an L. This procedure must terminate since X is finite. We call these subsets of X ‘levels’ to suggest a mental picture: one can see the electoral environment as a series of levels or plateaus (if z > 1). Each plateau electorally dominates all those below it in that each policy at a given level is majority-preferred to all policies at lower levels.12 Further, it follows that each policy at a given level covers (Miller 1980) all policies of lower elevation plateaus.13 Within a level, however, no policy beats all others, nor does any policy lose 11 The top cycle set can be defined constructively. In our setting (all majority preference relations are strict) a policy is in the top cycle set if and only if it is reachable from every other policy by a chain of strict majority preference. 12 This follows from two facts. First, no policy outside the top cycle set can beat anything in it. (Austen-Smith and Banks [1999, p.169] explain why.) Second, given our assumptions there are no ties, so any x in the top cycle beats any y not in it. 13 This implies that X’s uncovered set is a subset of L1 , the uncovered set of X/L1 is a subset of L2 , and so on. But some policies in a given Lr may cover other policies in Lr . See Bendor, Mookherjee and Ray 2001 for some facts about the relations among policies in different L’s. 6 to all others at its level. Thus in levels that aren’t singletons, any two policies are connected to each other by a majority rule cycle. Hence adaptive parties may get “hung up” on the cycles that pervade non-singleton levels. (Figure 1 gives an example of a policy set with three levels, where each level has a cycle.) [figure 1 about here] Partitioning the policy set into multiple levels turns out to be useful: it allows us to analyze concisely how the parties hill-climb—or “plateau-climb”—in the electoral landscape. The partioning easily allows us to describe different majority preference relations. Suppose we want to examine the classic Downsian context: a unidimensional policy space and citizens with single-peaked preferences. We obtain this by assuming z = m: each level is a singleton, with the median voter’s (strict) preference ordering being decisive. At the other extreme, we can represent the troublesome multidimensional policy space by assuming z = 1: all platforms are connected by majority rule cycles. Intermediate circumstances are captured by 1 < z < m.14 We assume that if xi xj then xi will defeat xj in an electoral contest. This is consistent with standard idealized assumptions about Downsian competition: in particular, that turnout is universal and voting is sincere and error-free. (Proposition 6 shows that the model is robust with respect to these idealized assumptions.) There is an indefinitely long sequence of elections, 1, 2, . . ., with one election per period. (At the start an incumbent party and its platform are randomly picked.) In every election the candidates simultaneously announce platforms and then citizens vote. The party that gets more votes wins the election and is the incumbent at the start of the next period. If the candidates adopt the same policy then a random, nondegenerate tie-breaking procedure picks the winner. (The “coin” need not be fair; it might, e.g., be biased toward the incumbent.) The state variables of the associated stochastic process are simply the platforms of the incumbent, It , and the challenger, Ct , in every election. Thus, It is the policy espoused by the winner of the election in t − 1. We are especially interested in how It —what Kramer (1977) called the trajectory of winning policies—evolves. (Because of It ’s importance we will sometimes abuse terminology and refer to it as if it were the state variable.) The parties’ adaptive behavior The heart of the model is how the candidates respond to winning and losing. Following KMP, we assume that winners satisfice (Simon 1955). (A1): the winner of the election in t stands for re-election in t + 1 by keeping the platform that won him office in t. In effect, we are assuming that candidates have an implicit aspiration level in-between the payoffs of winning and losing. Hence winning is satisfying, and incumbents don’t fix what isn’t broken. (We don’t explicitly model aspirations or their dynamics. For an overview of such models 14 A majority rule winner may exist even if z < m: one exists if and only if X’s top cycle, L1 , is a singleton. 7 see Bendor, Mookherjee and Ray (2001b).)15 Similarly, losing is dissatisfying, and we follow Simon, and Cyert and March (1963), in assuming that dissatisfaction triggers search. Hence our challengers (sometimes) search: in at least some elections they will try policies that differ from the one that they used—and lost with—in the previous election.16 Before examining how challengers search, we investigate an important implication of satisficing. This will help us better understand which assumptions in our model drive which conclusions. Proposition 1 refers to a stochastic process, produced by the combined behavior of incumbents and challengers in the electoral environment of our z levels and (A1). So far, only part of this process has been defined: incumbents’ (deterministic) behavior.17 The random aspect will be created by the challenger’s search, as we will see shortly. Proposition 1: higher. Suppose (A1) holds. If It is in Lr , then thereafter it must be at that level or Proposition 1 says that merely assuming that incumbents satisfice ensures (in the context of sincere voting, etc.) that electoral outcomes never slip “downhill” to lower plateaus.18 The policy of the party in power must either stay where it is or climb higher. The proof is simple. Suppose that It ∈ Lr . The challenger either proposes something from a lower level or not. If he proposes something lower, then the construction of the L’s plus sincere voting imply that the challenger must lose. Because winners satisfice, this implies It+1 = It ∈ Lr . If he proposes x ∈ Lr then no matter who wins today, It+1 ∈ Lr . Finally, if the challenger proposes a platform from a higher level then he wins and satisficing-by-winners implies It+1 ∈ Lq , where q < r. Induction does the rest. Note that proposition 1 presumes nothing about the challenger’s behavior. The result follows just from the combination of the static electoral stage described by the z levels and satisficing by winners. Recall, however, that we also assume that voters choose without error, or that error is rare enough so that the underlying majority preference relation is faithfully translated into electoral outcomes. Thus if we regard the parties as generating options that are collectively “tested” by the voters (as in Simon’s analysis (1964) of adaptive “generate-and-test” processes), then the “test” phase is flawless. This flawless collective testing plus satisficing-by-incumbents together imply that 15 Assuming that candidates have exogenously fixed implicit aspirations in-between the payoffs of winning and losing is equivalent to assuming explicit aspirations that adjust endogenously as follows. Let w be the payoff of winning and l, losing (w > l). Use the standard assumption (Cyert and March 1963) that tomorrow’s aspirations are a weighted average of today’s aspiration and today’s payoff: ai,t+1 = λt ai,t + (1 − λt )πi,t , where ai,t is party i’s aspiration in t, πi,t is its electoral payoff (w or l), and λt , the weighting parameter, is in (0, 1). (Note that the weights can vary over time.) If initially aspirations are “conventional”—winning is satisfying, losing isn’t—then the endogenous sequence of aspirations will always be in (l, w). Hence for all possible sequences of elections, winning will be satisfying but losing won’t. 16 This is such a weak premise that we rarely label it a formal assumption. (We use it explicitly only in proposition 5.) If it didn’t hold then, given that incumbents satisfice, both parties would always keep the platforms they championed in period one. This is neither realistic nor interesting. 17 Proposition 1 can therefore say something about specific sample paths; it is not confined to analyzing the probability distribution over a population of sample paths. 18 Proposition 1 has bite only when there are multiple policy levels (z > 1). 8 the process cannot slip downhill. So one half of the picture of adaptation, how incumbents behave, acts as a brake on the process. The other half, how challengers behave, is what supplies uphill momentum—not surprisingly, since only challengers search. A key aspect of search is discovering new alternatives, as defined below. Definitions. A party experiments in period t if it selects a platform that it has never used before t. It innovates in t if it espouses a platform that neither party has ever used. (A1) implies that incumbents don’t experiment; they satisfice, sticking with what got them elected. Hence in our model, challengers bear the responsibility for generating novel policies. The importance of this responsibility is underscored by the next result, which shows that the strong form of experimentation—innovation—is necessary for upward progress, i.e., for the trajectory of winning policies to move to higher L’s.19 Proposition 2: Suppose (A1) holds. For all periods s and t where s < t, if Is is in Lr and no challenger in [s, . . . , t] innovates, then with probability one It is also in Lr . The proof is simple. Proposition 1 says that no trajectory of winning policies ever slips downhill. Hence, if Is ∈ Lr , all prior winning platforms (and a fortiori, all losing ones) have come from level r or lower. Consequently, if no challenger innovates in [s, . . . , t], then all of them must have chosen platforms from level r or lower. But if that is true and no innovation occurs in [s, . . . , t], then the set of available policies in that era is in (Lr ∪ · · · ∪ Lz ). So then Is+1 ∈ (Lr ∪ · · · ∪ Lz ), whence by induction all of the winning policies in [s, . . . , t] must also be in (Lr ∪ · · · ∪ Lz ). Hence innovation is necessary for progress.20 The electoral dynamic need not grind to a halt if no one innovates: an incumbent can be beaten even if his opponent does not experiment, much less innovate. For example, suppose that Lr = {a, b, c}, with a b c a. Then the status quo policy can change endlessly without experimentation, as the two parties follow the cycle. But this behavior cannot kick the trajectory of winning policies up to a higher plateau. When does experimentation suffice for upward progress? Consider this condition. (A2): There is an > 0 such that for every history and in every election in which the challenger hasn’t already tried everything, the probability s/he experiments is at least . The challenger has good reason to try something other than his last platform: it lost in the last election and the incumbent has stayed put. Further, the challenger has reason to experiment because everything that he has tried before is electorally flawed. 19 As noted earlier, it is trivially true that search by challengers is necessary for upward progress. Proposition 2’s claim is much stronger: search that yields systemic novelty is required for progress. (By searching its institutional memory for policies that have worked in the past but which it hasn’t tried recently, a party could search without experimenting. And it could experiment without innovating.) 20 Neither proposition 1 nor 2 require that the set of policies be finite. 9 Remark 1: Consider the out-party at date t > 1. Every platform that this party has used before t has lost at least once. The remark is easily proven by contradiction. If it didn’t hold, then the challenger in t must have espoused a policy, x, that won in some earlier period s. (A1) implies that that party would have continued to uphold x in (s + 1, . . . , t − 1), triumphing in every one of those elections. But then in t that party would be the incumbent, not the challenger. If we consider platforms that have lost at least once to be electorally damaged goods, then it makes sense that (A2) would hold.21 However, (A2) does not commit us to a specific assumption about how much the challenger knows or how sophisticated he is. A challenger might, e.g., know the entire set X, have beliefs about voters’ preferences about platforms, and sophisticatedly update those beliefs. Or she might be blindly groping about. (A2) only amounts to a summary statement about the consequences of the out-party’s knowledge and strategic sophistication: it always has some chance of experimenting (when that is possible). Moreover, (A2) does not presume any specific stochastic model, or even any familiar class of stochastic models. For example, (A2) does not require a Markovian process: what happens today could depend on events in the distant past.22 Although (A2) is a weak assumption about platform-experimentation, it and satificing-byincumbents together ensure that the trajectory of winning policies converges to the policy space’s top cycle. Satisficing by incumbents and experimentation by challengers work well together: the former prevents the trajectory of winning policies from slipping downhill and the latter gives the process upward momentum. Proposition 3: If (A1) and (A2) hold, then the trajectory of winning policies converges to and is absorbed by L1 with probability one. The proof (in the appendix) relies on the fact that any level below L1 must be transitory: if the process ever goes to such a level, (A2) ensures that eventually it must leave that level and, by proposition 1, never return. Since all lower levels are transitory, long-run convergence to the top level is assured. And proposition 1 implies that L1 is absorbing: once a trajectory of winning policies reaches that level, it stays there. Note that if L1 is a singleton, then proposition 3 tells us that the trajectory of winning policies converges to that unique platform. (The challenger may continue to move around; more on this 21 In related work (Bendor, Mookherjee and Ray 2001c) we study adaptive behavior via an aspiration-based model of reinforcement learning: an agent is more likely to repeat satisfactory actions and less likely to repeat unsatisfactory ones. In that context, remark 1 is equivalent to saying that every alternative that today’s challenger has tried has received at least one negative feedback, and (A2) implies that the challenger will, with positive probability, try an alternative that has never received negative feedback (if one exists). 22 For example, the probability of experimenting could rise the more often the out-party loses, as desperation to (e.g.) reclaim the White House sets in. And the magnitude of the increases could depend on when the losses occurred: e.g., the more recent the losses, the more the challenger is willing to experiment. Because the party’s memory is not itself a state variable in our model, this process is not Markovian. 10 shortly.) Thus when a majority rule winner exists, we have a semi-Downsian result: the government’s policy must converge to the Condorcet winner.23 Convergence to the top cycle can be established without presuming that the electoral dynamic belongs to a particular class of stochastic processes. In particular, it is not necessary that the process be ergodic: it could have multiple limiting distributions.24 (For an example of a process— consistent with assumptions (A1) and (A2)—that has multiple limiting distributions, see Bendor, Mookherjee and Ray 2003.) However, although convergence to a unique long run distribution is not necessary for trajectories of winning policies to be absorbed into L1 , it turns out that ergodicity is sufficient for this convergence. Proposition 4 establishes that assuming that this process converges to a unique probabilistic steady state substitutes for positing that the challenger experiments. Proposition 4: If (A1) holds and the (It , Ct ) process is ergodic, then the trajectory of winning policies converges to and is absorbed by L1 with probability one. The proof is simple. Suppose that the first incumbent’s policy is in L1 . Then by proposition 1 the It ’s will remain in L1 thereafter, with certainty. But if a process is ergodic then it is independent of initial conditions, i.e., of the initial state values. Hence, that the It ’s of the above sample path are always in L1 is consistent with ergodicity only if the process is absorbed into L1 with probability one. Proposition 4 implies that if a Condorcet winner exists, then an ergodic process in which winners satisfice must converge to the majority rule winner. Thus deriving this semi-Downsian result requires no specific assumption about the challenger’s behavior. Instead, a more abstract approach—the stochastic properties induced by the parties’ behaviors—suffices. This is useful knowledge because, given satisficing-by-winners, many kinds of search rules make the electoral dynamic ergodic. We have called the convergence of the It ’s into L1 a semi-Downsian result; it is not a prediction of full convergence (not even when L1 is a singleton). In view of the facts—candidates in two-party systems rarely adopt identical platforms—that propositions 3 and 4 do not imply the median voter result is good for the model’s empirical standing. However, though these results do not imply full convergence, they do not preclude it either. Proposition 3 is silent on the matter because it uses (A2), which does not say what challengers do once experimenting becomes impossible (because the parties have tried all feasible platforms). True, 23 Determining the speed of convergence requires assumptions about search that are stronger than (A2). We will explore this issue in the next section. For now, note that even unsophisticated search can sometimes make the trajectory of winning platforms move uphill at a good clip. E.g., if the challenger searches completely blindly then the probability of moving “uphill” in one period equals the fraction of the policies that are in plateaus higher than the incumbent’s policy. Thus, if It is at a low level then the probability of hillclimbing in one election can be substantial. 24 In one important respect we need not worry about how many limiting distributions the process has. Under the hypotheses of proposition 3, the electoral dynamic goes into L1 for sure. Then no matter where in L1 it winds up, we know one significant fact: the government’s policy is ultimately electorally undominated, i.e., it beats all policies in all lower levels. 11 this is unlikely if X is large, but it is not ruled out. And proposition 4 makes no direct assumptions about challengers at all. To tie up these loose ends, consider the following weak formalization of Simon-Cyert-March’s idea that dissatisfaction triggers search. (It also incorporates the domainspecific assumption that losing an election is dissatisfying. For a justification of this via endogenous aspirations, see footnote 11.) (A3): There is an > 0 such that for every history, the challenger in t + 1 adopts a platform that differs from the one it lost with in t with a probability of at least . This weak specification of problem-driven search ensures that the electoral process will not settle down into a tweedledum-tweedledee pattern. Note that (A3) is sufficiently weak—e.g., it is not Markovian—so that the next result cannot pin down the properties of the process’s limiting distribution, or even if it has only one. But it can say what the process will not do. (Its proof, being obvious, is omitted.) Proposition 5: If (A3) holds then no state in which the parties adopt identical platforms is absorbing, nor is any set of such states. Proposition 5 implies that if the process does converge to a unique limiting distribution, then that distribution must put positive weight on states in which the parties adopt different platforms. And if it has multiple stationary distributions, then all of these must put positive weight on circumstances in which the parties offer distinct policies. This non-convergence of the parties emerges naturally from two weak behavioral premises: (1) losing is dissatisfying and (2) dissatisfaction triggers search.25 EXTENSIONS: MORE INFORMED AND SOPHISTICATED CHALLENGERS One can extend this model by making the challenger more informed and/or more sophisticated, hence sharpening his search for winning platforms. Regarding information, the extreme case would be to assume that the challenger knows the majority preference relation for each pair of platforms. Then, given varying degrees of strategic sophistication, the challenger might choose platforms that vote-maximize against the incumbent (Kramer 1977), or those that are in X’s uncovered set (Miller 1980). Or one might assume an intermediate degree of information—e.g., the challenger has some chance of knowing what vote-maximizes against the status quo—and combine that with a certain degree of sophistication. Obviously, many different extensions are possible. We can examine only a few prominent ones. Some are taken directly from the literature; others involve modifications. Kramer’s 1977 model Recall that incumbents in Kramer’s model satisfice (in effect) while challengers choose platforms that vote-maximize against the status quo. His main result, theorem 1, says that the trajectory of winning policies converges to the minmax set. It need not stay there, but “Theorem 1 does ensure that a trajectory which jumps outside must immediately return toward the minmax set...” (1977, 25 Note that X’s finiteness plays no essential role in proposition 5. 12 p.324). In contrast, our proposition 5 states that the trajectory converges to the top cycle, where it stays (by proposition 1). As noted earlier, our model’s setup bears some significant similarities to Kramer’s. But there are also some important differences. Most important, perhaps, is the information that the challenger has. Our (A2) presumes only that the challenger knows what he has tried in the past and has some chance of experimenting. Kramer’s challenger knows what vote-maximizes against the incumbent’s policy. One might object to the latter because it gives challengers an unrealistic amount of information: to know exactly what policy vote-maximizes against the status quo is asking a lot of any political party. And to presume that the out-party is so well-organized, so immune to internal squabbles, that it can always choose a vote-maximizing option may also be questioned. What, then, happens if one alters our model by assuming that challengers try to vote-maximize but may occasionally “tremble” and mistakenly pick a platform that is not vote-maximizing? (For simplicity we make the standard assumption that following a tremble the challenger plays a fixed and “totally mixed” strategy: anything in X can be picked with positive probability.) The following result, which follows from proposition 3, gives the answer. Remark 2: Suppose (A1) holds. With probability 1 − the challenger selects a platform that vote-maximizes against the incumbent’s policy; with probability > 0 he trembles and plays a strategy that is totally mixed over X. Then the trajectory of winning policies converges to and is absorbed by L1 with probability one. Hence, if the challenger might err in his effort to vote-maximize against the incumbent then we recover our convergence result, even if the challenger’s chance of erring is arbitrarily small.26 Thus, though the challenger is trying to vote-maximize against the incumbent and usually does so, the process is led toward the top cycle, not to the minmax set (unless the two happen to coincide). Why? The reason is that the challenger’s errors are filtered or tested by the electorate, making the trajectory of winning policies drift toward higher plateaus (higher L’s) in the short run and the top cycle in the long run. Proposition 1 tells us that this electoral testing means that no mistake of the out-party can shove the dynamic down to lower levels. Further, since any kind of mistake is possible, the challenger has some chance of stumbling onto policies on higher levels. The voters approve of such mistakes, thus pushing the trajectory uphill—whether or not the minmax set and the top cycle coincide. Thus, regardless of the challenger’s intentions, the electoral environment ensures that the dynamic is driven by winning per se rather than by (e.g.) the magnitude of victory. The selection environment trumps the agent’s intentions, which is consistent with Satz and Ferejohn’s argument that “[when] we are...interested in explaining...the general regularities that govern the behavior of 26 If one objects to the assumption that the challenger trembles to a strategy that is totally mixed over X, there are weaker assumptions that still yield convergence to L1 . For example, suppose the challenger vote-maximizes with probability 1 − and experiments (when this is possible) with some small probability > 0. Then (A2) holds, so It → L1 with probability one. 13 all agents...it is not the agents’ psychologies that primarily explain their behavior, but the environmental constraints they face” (1994, p.74). This trumping of the challenger’s intentions by the selection environment holds quite generally; the Kramerian challenger’s specific objective—to maximize votes against the status quo—played no essential role in remark 2. As long as the challenger has some chance of trembling and playing a strategy that is totally mixed over X, the conclusions of remark 2 hold, regardless of the challenger’s objective function. Thus the parties could have different goals. For example, when the Democrats challenge they could vote-maximize against the incumbent, but when the Republicans are the outparty they select a policy that maximizes some ideological criterion (as in, e.g., Chappell and Keech 1986, p.884). Or both parties could maximize an objective function based on a mix of office-seeking and ideology (Wittman 1983). In the long run those goals do not matter. One could even allow for parties that suffer from Arrovian problems and so lack coherent preferences. All that counts is that the pattern of error gives the electoral mill enough grist to work on.27 So long as this condition is met, one can make the challenger as sophisticated and informed as one likes, with any kind of preferences; the end result is the same. Miller (et al.) and the uncovered set Miller (1980, p.93) and others (McKelvey 1986; Cox 1987; Epstein 1998) have argued that if x covers y then x electorally dominates y. As Cox put it, If one accepts the extremely mild assumption that candidates will not adopt a spatial strategy y if there is another available strategy x which is at least as good as y against any strategy the opponent might take and is better against some of the opponent’s possible strategies, then one can conclude that candidates will confine themselves to strategies in the uncovered set (1987, p.420). As with most solution concepts, in order to use it a candidate must be both well-informed and reasonably sophisticated. But expecting candidates to invariably pick policies in the uncovered set may be unrealistic.28 Yet even a bit of information can help the challenger search, as the next result shows. (The proof is in the appendix.) Remark 3: If (A1) is satisfied and for every history the challenger alights on X’s uncovered set with a probability of at least > 0, then the following hold. • (i) It is in L1 with positive probability for all t > 1. • (ii) If Pr(It ∈ L1 ) is less than one then Pr(I1 ∈ L1 ) < · · · < Pr(It ∈ L1 ). • (iii) It → L1 with probability one as t → ∞. 27 Thus, one can regard this as an evolutionary theory: “blind” variation is produced by error; selection is the electoral environment. We thank John Padgett for this interpretation. 28 If they did then the process would jump to L1 in one period, since X’s uncovered set is a subset of L1 . 14 Thus even fragmentary information about the uncovered set’s location and even crude understanding about the strategic value of uncovered policies can have substantial impacts, both in the short-run (parts (i) and (ii)) and the long-run (part (iii)). Consider now a less demanding possibility: the challenger might not know the uncovered set’s location, but may know policies that cover what he must try to beat today—the incumbent’s platform. (This presumes, of course, that some alternative covers It . If not, then It is in X s uncovered set and so is already in L1 .) First we establish the importance of the challenger finding something that covers the incumbent’s platform. It is necessary: electoral hill-climbing cannot occur without it. Remark 4: Suppose (A1) holds. Consider any r = 1, . . . , z. If It is in Lr and the probability that Ct covers It is zero, then It+1 must also be in Lr . The logic is straightforward. Any policy at higher levels, say any xi ∈ L1 ∪ · · · ∪ Lr , covers any policy at lower ones, i.e., any x ∈ Lr+1 ∪ · · · ∪ Lz . (See Bendor et al. (2001) for the proof.) Hence if It ∈ Lr and today’s challenger has no chance of finding a platform that covers the incumbent’s, then he has no chance of finding anything in L1 ∪ · · · ∪ Lr−1 , since anything there would in fact cover It . Hence hillclimbing cannot occur. Since proposition 1 ensures that the process cannot slip downhill, it must stay at the same plateau. Hence, now consider elections in which the challenger does have some chance of finding platforms that cover the incumbent’s. The following assumption formalizes this idea. (A4): For every history and in any election in which It is covered by some x ∈ X, with probability of at least > 0 the challenger finds an option that covers the status quo. (A4) neither implies nor is implied by (A2), which stipulates the possibility of experimentation.29 But as the next result shows, their long-run effect is the same. (The proof is in the appendix.) Remark 5: If (A1) and (A4) hold, then the trajectory of winning policies converges to and is absorbed by L1 with probability one. Although the challenger is sophisticated enough to have a chance of finding a platform that covers the incumbent’s, the process is not guaranteed to be absorbed into X’s uncovered set (unless that set and L1 coincide), though it will visit that set infinitely often. The reason: L1 may have 29 (A2) does not imply (A4) because the challenger might experiment but his set of possible new policies may not include anything that covers the status quo. The following example shows why (A4) does not imply (A2). Let Lr = {w, x, y, z}, with r = 1. Suppose w beats x and y, x beats y and z, y beats z, and z beats w. Then a cycle connects any two policies in Lr , but x covers y since both criteria of the covering relation hold: (1) x beats y, and (2) x’s winset, {w} ∪ L1 ∪ · · · ∪ Lr−1 , is a subset of y’s winset, which is {w, x} ∪ L1 ∪ · · · ∪ Lr−1 . Suppose the process has been stuck in level r, and each party has tried all platforms at this level. Yet, if It = y, (A4) could be satisfied while (A2) isn’t, if the challenger could (re)discover x with probability greater than but with probability one would not experiment. Thus the possibility of experimenting and of generating a platform that covers the status quo are independent search assumptions. 15 policies that are not in the uncovered set, and because any two policies in the same level are connected by a cycle, the trajectory of winning policies can leave the uncovered set. ((A4) does not preclude this possibility.) FFMP (1980, 1984) In both papers FFMP assume that the challenges to the status quo, x, come from a uniform distribution over x’s win set. Assuming that new options come from the status quo’s win set involves an intermediate degree of information and sophistication: less demanding than assuming that new options must cover the status quo, but more demanding than assuming experimentation. However, positing a uniform distribution over the status quo’s win set was for computational reasons: FFMP (1984) calculated bounds on the limiting distribution of winning platforms. It is unnecessary for qualitative results, and we shall disregard it here. For our purposes, the key feature in FFMP’s premise is that when searching the challenger puts positive probability on anything that beats the incumbent’s platform. As usual, this assumption need not be forced into a Markovian mold. (A5): Following every history, the challenger’s search has a probability of at least > 0 of finding any option that beats the status quo. Because the set of policies that beat any x must include some in L1 , (A5) implies stochastic hill-climbing in the short-run and convergence to L1 in the long-run, just as remark 3 did. (The proof is similar to remark 3’s and so is omitted.) Remark 6: If (A1) and (A5) are satisfied then the following hold. • (i) It is in L1 with positive probability for all t > 1. • (ii) If Pr(It ∈ L1 ) is less than one then Pr(I1 ∈ L1 ) < · · · < Pr(It ∈ L1 ). • (iii) It → L1 with probability one as t → ∞. Thus if challengers are as informed and sophisticated as FFMP posit, then long-term convergence to the top level is ensured, as is short-term progress. In general, this section’s findings show that endowing the challenger with more information and/or more strategic sophistication has a quantitative effect (speeding up convergence to L1 ); it does not alter the model’s basic qualitative properties. ROBUSTNESS ISSUES A common price for analytical results is that they require stylized assumptions. This can entail reshaping a vague-but-plausible idea (e.g., incumbents are often content with the platforms that won them office) into crisp but less plausible ones (incumbents always satisfice). To be sure, theorizing requires simplification: as Jonathan Swift observed long ago, the most realistic model of a phenomenon is the phenomenon itself. But it would be troubling if our results turned out to be knife-edge findings — if changing a premise a little altered the conclusions a lot. 16 Several features of our model might cause concern in this regard. As noted, assuming that incumbents invariably stick to winning platforms exaggerates the plausible scenario it is meant to capture. Similarly, that x is majority-preferred to y may not guarantee that x will beat y: e.g., variations in turnout or voters’ errors may change the outcome. A more subtle concern, which is conceptually more serious than the simplifications mentioned above, is that our model appears to predict little in “ill-structured” environments in which the Plott conditions fail. In such situations the top cycle may be the entire policy space. (It is well-known that an n-dimensional (n > 1) spatial voting setting is particularly vulnerable to this problem.) But then Proposition 3 has no bite at all, and our model apparently loses all predictive power. Yet in an important sense this conclusion is too hasty: much depends on “how seriously” the Plott conditions are violated. For instance, consider the most drastic class of perturbations: a y from the lowest level Lz now beats an x from L1 , causing all levels to collapse into one giant set. Now the top cycle equals X, so technically our convergence results are vacuous. Yet the substantive bite of such a perturbation could be minimal: it will matter only when the incumbent’s platform is that particular x and only if the challenger can find that once-lowly y. Challengers could face a needle in the haystack problem: in the perturbed electoral environment they might not discover y very often, if X is large and search is crude.30 Then the probability that the sequence of winning policies will transit from x to y will also be low, so the process will spend “most” of its time at the top plateau of policies, even though the top cycle is the entire set of policies in this perturbed environment. Below we state a robustness proposition that handles all these (and possibly other perturbations) in the same general setup. This generality exacts a price—some additional abstraction—but we will try to link the abstract structure to the substantive robustness questions raised here. Consider a family of models, indexed by a parameter θ ∈ [0, 1]. In each model θ there is a set of policies X(θ), partitioned into level sets L1 (θ), L2 (θ), . . . , Lz (θ), where z may also depend on θ. For each model θ a typical state is defined exactly as before: it is a pair (I, C), called σ here for notational convenience. Abusing terminology slightly, we say that the state σ = (I, C) lies in some L if the associated incumbent’s platform I ∈ L. A state σ = (I, C) is in the top set if I ∈ L1 (θ). For each t ≥ 1, let ht be the t-history at date t: all states that transpired up to t, including the current state at t. Let σ(ht ) denote the state at date t under the t-history ht . For each t and each t-history, let π θ (ht , i) denote the probability that the system enters Li (θ) at the very next date (t + 1), starting from ht . This is information that a particular model would give us. For instance, Proposition 1 says that given our stylized assumptions, if the state at history ht lies in some Lj then this transition probability π(ht , i) is precisely zero whenever i > j; i.e., the process doesn’t move from electorally higher plateaus to lower ones. Here we do want to allow for these “perverse drifts”, but with low probability. (A7): 30 There exists a function ψ(θ), with ψ(θ) → 0 as θ → 0, such that whenever σ(ht ) lies in E.g., if search is completely blind then the chance that the challenger will land on y is 17 1 . m Lj (θ) for some j < i, π θ (ht , i) may be positive but is less than ψ(θ). Because this assumption is the heart of the exercise, it warrants further discussion. It acknowledges that movements to lower plateaus can occur in our family of models. This might happen if, e.g., voters occasionally punch the wrong spot on the ballot. But the most subtle interpretation of (A7) is that the multilayered structure of the L’s is “destroyed” by certain preference cycles. This interpretation is subtle because to accommodate it, L1 (θ), L2 (θ), . . . , Lz (θ) can no longer be viewed as the true plateaus of the electoral contest, but as some underlying (counterfactual) structure of plateaus were the “problematic” cycles to be artificially removed. Now the possible drift from higher to lower levels is interpreted not as a failure of the model’s assumptions but as a genuine majority-based defeat of some x in one level by some y in another. Thus, under this last interpretation the true ill-structured environment has only a few levels (perhaps just one), but our formulation strips away the ill-structure in a minimal way by placing the burden on “wrong” movements in the state under a well-structured environment. Return now to (A7). It states that for θ close to zero, the ill-structure “almost” vanishes (or equivalently, under the other interpretations, that the model’s other assumptions “almost” hold). This is captured by the bound function ψ(θ) on “wrong” movements, which is assumed to go to zero as θ → 0. A second assumption guarantees that the usual upward movements, such as those guaranteed by (A2), continue to exist throughout: (A8): The infimum value of π θ (ht , i), over every θ, every t-history, and every i is strictly positive, provided σ(ht ) ∈ Lj (θ) for some j > i. Although we need to state (A8) formally for this family of models, we have already seen that it is an implication of our other assumptions (e.g., experimentation). Since no further comment is needed regarding (A8), we may now state our robustness result. Proposition 6: Assume (A7) and (A8). • (i) For every > 0, there exist integers T () and θ() such that for every T ≥ T () and θ ≤ θ(), and every initial state, the system lies in the top set at date T with probability of at least 1 − . • (ii) Suppose that the system enters the top set at date t. Pick any date s ≥ t. Then, if θ ≥ θ(), the system continues to lie in the top set at date s with (conditional) probability of at least 1 − . As the formal statement of the result is unavoidably technical, a few informal remarks are appropriate. Part (i) of the proposition states that if the perturbations (represented by θ) are small enough, then after “sufficiently” many periods the system must be in the top set L1 (θ) with very high probability. 18 In particular, the proposition says that if the perturbations are due to ill-structured majoritarian preferences but the “destructive” cycles are sparse enough, then by artificially stripping away the destructive cycles one can significantly boost the framework’s predictive power. One interpretation, consistent with the family of models studied here, is that as θ becomes small the number of feasible policies grows without bound, while the policies that cause the ill-structure increase more slowly. If the challenger’s search is crude, the chances of alighting on the latter will become very small; then (A7) holds. Proposition 6 follows. Part (ii) states that for any sample path that enters the top set, the conditional probability of staying there thereafter is also high. Informally, the system enters the top set and stays there with high probability. This rules out certain perverse dynamics (e.g., entry into the top set being positively correlated with swift departure from that set). Note that this result used no Markovian assumptions. This is striking: it shows that the model’s robustness does not rest on any special stochastic features. Ill-structured majoritarian preferences: a computational model Proposition 6 studies only “small” changes in the model’s key assumptions. Hence it does not tell us what happens to the results when there are big changes in, e.g., preference-profiles. In particular, it does not say what happens to the trajectory of winning platforms when the preferenceprofile is far from having a generalized median. This is a difficult question; analytical results are hard to come by (though see proposition 7). Hence we resort to a computational model. (It can address other issues, but due to space constraints we confine ourselves here to the above question. Subsequent work will tap the computational model’s potential more fully.) We now briefly describe the computational model. (For a description of the computer program, see https://faculty-gsb.stanford.edu/bendor/.) Because the simulation is a special case of our mathematical model—the policy space is finite, (A1) and (A2) hold, etc.—we focus on its distinctive properties. The central ones are that policies are embedded in a two-dimensional space and voters have quadratic loss functions in this space.31 To ensure that the simulation results are meaningful and interpretable we make the search Markovian, which implies that the probability distribution of (It+1 , Ct+1 ) depends only the parties’ current platforms and the transition rules created by the incumbent’s satisficing and the challenger’s search. By stipulating time-homogeneous search rules, we ensure that the (It , Ct ) process is stationary. Hence we can invoke powerful theorems for finite-state stationary Markov chains (e.g., Kemeny and Snell 1960) which tell us when such processes are ergodic.32 All of our computational 31 The following ensures that, as the analytical model requires, each citizen has a strict preference ordering over the policies: if policies x and y are equidistant from voter i’s ideal point, then x and y are given randomly drawn distinct “valence” (nonspatial) values. (The draws are independent across voters.) Thus, the voters’ preferences are lexicographic, with the spatial component being primary. 32 Models of trial-and-error learning that are not ergodic may lack empirical content. See Bendor, Diermeier and Ting (2002) for several “folk theorems” for such models. 19 results arise from ergodic processes.33 Thus we will be scrutinizing the steady state distributions of the winning platforms. (More precisely, the output—for every set of parametric values, 1,000 different sample paths run for 1,000 periods—will closely approximate such steady states.) Results We examine two types of results: (1) how different profiles of voters’ ideal points affect the steady state distribution of winning platforms (holding the challenger’s search rule constant) and (2) how different search rules affect the long run distribution. (1) To measure the symmetry of a profile of ideal points, we use a standard metric: the relative size of the uncovered set. (At one extreme, if the uncovered set is a singleton then a generalized median exists; at the other extreme, the uncovered set is the entire policy space. So the measure 1 ranges from m to 1.) The challenger’s search rule is represented by a probability distribution over the policy space; here the distribution is single-peaked (a truncated normal). Thus in period t the challenger is more likely to choose a platform close to the one he espoused in t − 1 than something far away. Figure 2 reveals a strong relation between the size of the uncovered set and the steady-state dispersion of winning platforms. This pattern reflects how the strength of the centripetal forces varies across electoral environments. When the uncovered set is small these centripetal forces are strong, so in the steady state winning platforms are centrally located; when this set is big the centripetal forces are weak, so winning platforms are scattered throughout the policy space. [figure 2 about here] This pattern complements the analytical result of proposition 6, which showed that the process is well-behaved for “small” perturbations to voter profiles. The computational results of figure 2 suggest that the process is well-behaved globally: the dispersion of the winning platforms seems to increase steadily as the uncovered set expands. However, our explanation for this pattern was too terse: the electoral environment can mold the steady-state distribution of winning platforms only if the out-party’s search generates enough variety for the selective forces of the electoral environment to work on. So we now briefly examine the effect of different search rules. (2) The output in the first part of table 2 is based on naive search: challengers search blindly, 1 putting probability of m on every platform. Yet the size of the uncovered set and the long-run dispersion of winning platforms remain highly correlated: the main pattern—centrist platforms tend to win when the uncovered set is small—continues to hold. [table 2 about here] Hence, this main pattern does not require search to be prospectively attuned to winning. Instead, what suffices is that challengers generate enough grist (variety) for the electorate’s mill, per the evolutionary arguments proposed earlier. 33 Establishing that these processes are ergodic is straightforward, so the proofs are omitted. 20 The second part of table 2 shows that the relation between the uncovered set’s size and the steady-state dispersion of winning platforms is attenuated if the out-party’s search is keyed to its policy preferences. This makes sense: that search rule creates a centrifugal force that operates independently of the size of the uncovered set—the challenger always reverts to this distribution. However, although the basic pattern is weakened, it is still present, even in this extreme case when the challenging party is, in effect, completely dominated by ideologues. (We have also studied cases where the challenger’s search is a mix of the above pure types, i.e., search centers on a policy that is a weighted average of the party’s ideal point and its last platform. Results [unreported here due to space constraints] show that the system’s long run tendencies are then intermediate between those produced by the two pure search rules whose outcomes are reported by table 2.) Even ideological parties cannot completely ignore the strong electoral forces that are present when the uncovered set is small. The shaping power of the electoral environment: analytics once more Our computational results are consistent with the argument that even when the electorate’s preference-profile is far from having a generalized median, all hell does not break loose because the electoral environment still has centripetal forces. But computational models necessarily use quite specific assumptions: here, a two-dimensional policy space, quadratic utility, and a small electorate. So, because it would be helpful to supplement these findings with analytical ones, we return to deductive methods for a final result. Naturally, since we resorted to computation because we couldn’t derive general results analytically when the Plott conditions are far from holding, we must simplify our analytical model in some way. But the simplification should be consistent with our substantive objective, which is to examine the electoral environment’s centripetal forces. Therefore we should not impose simplifying assumptions on voters’ preferences. Instead, we make our mathematical model tractable by simplifying the challenger’s search: we assume that it is blind—uniform over X. This assumption not only helps to ensure tractability; it is also substantively useful: since challengers don’t learn, we know that conclusions about the steady-state probabilities of winning platforms depend only on the selective forces inherent in the electorate (and on satisficing-by-incumbents). Our last result shows that even when the top cycle is the entire policy space and even if no assumption is made about whether the Plott conditions “nearly” hold, the electoral environment plus satisficing-by-incumbents can still impart some stochastic order to outcomes. (The proof is in the appendix.) Proposition 7: Assume (A1) and blind search by challengers. If L1 = X and x covers y, then the steady-state probability that x is the incumbent’s platform exceeds y’s steady-state probability. Thus, whether or not a generalized median almost exists, if the electoral environment is characterized by long strings of covering relations (e.g., a covers b which covers c which...), then the steady state probabilities of winning policies will be nicely structured by monotonicity (e.g., in the 21 long-run a is more likely than b which is more likely than c which...).34 Proposition 7 also implies that if a platform’s steady-state probability of being the government’s policy is maximal, then it must be in the uncovered set. These properties buttress the claims of Miller et al. that if x covers y then x electorally dominates y. CONCLUSIONS This paper has examined the electoral dynamic generated by adaptively rational incumbents and challengers in two-party competition. The results emphasize how powerfully certain electoral landscapes shape the dynamic. In highly structured electoral environments—those with many levels—electoral competition constrains the process greatly, even if challengers are both ignorant and unsophisticated. Thus our paper is consistent with work in economics on zero-intelligence agents (e.g., Gode and Sunder 1993), which tries to ascertain how much of market performance is due to the market environment rather than to the intelligence of agents in those environments. Gode and Sunder concluded, “Adam Smith’s invisible hand may be more powerful than some may have thought: when embodied in market mechanisms such as a double auction, it may generate aggregate rationality not only from individual rationality but also from individual irrationality” (p.136). The median voter’s invisible hand, when it exists, is similarly powerful, in guiding the trajectory of winning platforms to what that pivotal citizen wants.35 At the other end of the spectrum, one can investigate the effects of endowing candidates (especially challengers) with more information and/or sophistication. The results in the fourth section show that incorporating such extensions is straightforward. In general, their effect is quantitative, not qualitative: they tend to increase the likelihood of hill-climbing in the short-run, making It converge faster to the top level. A concern regarding our basic model is that slight perturbations of voters’ preferences can create an ill-structured electoral environment—the top cycle is everything—making many of our results vacuous. But proposition 6 shows that small perturbations (of preferences and other parameters) have only a minor effect on winning platforms in the steady-state. And our computational results and proposition 7 indicate that the process is well-behaved even when the voters’ preference profile is far from having a Condorcet winner. 34 Proposition 7 cannot be generalized by replacing “x covers y” with “x beats more policies than y does”. (If x covers y then it beats more policies than y does, but the converse doesn’t hold.) Though the resulting conjecture— platforms that beat more rivals should be more likely to be the government’s policy in the steady state—is intuitively plausible, it is not true in general. This is so even if one rules out what would be, from a spatial perspective, bizarre “preferred-to” relations, e.g., x beats y even though the former loses to thousands of other platforms while the latter loses only to a handful. (We will provide, upon request, a counterexample to the conjecture. This counterexample is not bizarre: if x beats y then x loses to at most only one more policy than y does.) 35 The parallel with models of zero-intelligence agents is incomplete because throughout our paper winners satisfice, which is sensible. For a true zero-intelligence benchmark, one should posit that both candidates choose platforms blindly. Because we are interested in more realistic models of bounded rationality, we have not pursued the pure benchmark case. But even a cursory investigation of that benchmark would reveal that the electoral environment strongly shapes the trajectory of winning policies. 22 Appendix Proof of Proposition 3 If I1 ∈ L1 with certainty then the result is immediate (given proposition 1), so assume that Pr(I1 ∈ L1 ) < 1. Define Ut (i) as the number of platforms untried by party i (i = D, R) at the start of period t. Ut (i) is a weakly decreasing process, with U0 (i) = m. Note that if min(Ut (D), Ut (R)) = 0 then It ∈ L1 . (This holds because if one of the parties has tried all platforms then it must have tried those in L1 , and once something in L1 was tried, proposition 1 implies that the process never leaves L1 thereafter.) So we need to show only that the probability of the event that min(Ut (D), Ut (R)) > 0 goes to zero as t → ∞. Since there is a challenger in every period, at least one party (say D) must be a challenger infinitely often. Suppose, for date t, Ut (D) = n, where 0 < n. Consider the event that Ut (D) stays at n. Suppose D’s experimentation probability were exactly , whenever D challenges and Ut (D) > 0. Then the counting process Ut (D) is just a geometric process with probability , whence limt→∞ [Pr(Ut (D) = n)] = 0. Since D experiments with a probability of at least , this must continue to hold. Hence with probability one Ut (D) will hit n − 1 eventually. If n − 1 = 0 we are done; if not, just repeat the above argument. So with probability one Ut (D) = 0 eventually, implying that the probability of min(Ut (D), Ut (R)) > 0 goes to zero as t → ∞. QED. Proof of Remark 3 (i) Proposition 1 implies that if a sample path of the winning policies is in L1 in t then it stays in L1 thereafter. Hence if Pr(It ∈ L1 ) > 0 then Pr(Is ∈ L1 ) > 0, ∀s > t, and in particular if Pr(I1 ∈ L1 ) > 0 then Pr(Is ∈ L1 ) > 0, ∀s > 1. Alternatively, suppose that Pr(It ∈ L1 ) = 0. Let U C(X) denote X’s uncovered set. Since U C(X) ⊆ L1 it follows immediately that Pr(I2 ∈ L1 ) > 0, and we can then use the preceding argument. (ii) If Pr(It ∈ L1 ) < 1 then by proposition 1, Pr(Is ∈ L1 ) < 1, ∀s < t. Because U C(X) ⊆ L1 , it follows that Pr(Is+1 ∈ L1 |Is ∈ / L1 ) ≥ Pr(Is+1 ∈ U C(X)|Is ∈ / L1 ). Since Pr(Is+1 ∈ U C(X)|Is ∈ / L1 ) = Pr(Cs ∈ U C(X)|Is ∈ / L1 ), and the latter term is bounded away from zero, the result follows. (iii) Let the bound on hitting U C(X) be some > 0. If the challenger’s probability of hitting U C(X) were exactly , then the process would have a geometrically distributed waiting time, whence the probability that it stays out of L1 would go to 0 as t → ∞. Since the process’s probability hitting U C(X) is at least , the result follows. QED. Proof of Remark 5 Consider the event that the challenger fails to espouse an alternative that covers It . Since the probability of this event is at most 1 − < 1, the probability that it will recur infinitely often is zero. Hence with probability one the challenger will eventually find something that covers It . Because It is arbitrary here, this holds for every status quo platform. Hence, since the covering relation is transitive, the trajectory of winning policies must land in U C(X) with probability one. Because U C(X) ⊆ L1 , the result is established. QED. 23 As the proof of proposition 6 is rather technical, it is omitted. (It is available upon request, and has been provided to the referees.) Proof of Proposition 7 Given the proposition’s hypotheses, it is easily established that the process is ergodic. Hence the proposition’s conclusion is a meaningful claim. Given blind search, the steady state probability of policy i (denoted πi ) equals j∈l(i) πj + πi (1 − |w(i)|), where l(i) denotes the set of policies that lose to i and w(i) is the set that wins against i. Thus |w(i)| · πi = As 1 m 1 πj . m j∈l(i) is a constant in this set of simultaneous equations, it drops out; hence πi = 1 πj . |w(i)| j∈l(i) Because either x j or j x for all i = j, it follows that |l(i)| + |w(i)| + 1 = m. So w(i) ⊂ w(j) ⇒ l(j) ⊂ l(i) Hence, if x covers y then l(y) ⊂ l(x). And since πx = the result follows by simple algebra. 24 1 |w(x)| j∈l(x) πj whereas πy = 1 |w(y)| j∈l(y) πj , QED. References Almond, Gabriel. 1993. “The Early Impact of Downs’ An Economic Theory of Democracy.” In Bernard Grofman (ed.), Information, Participation, and Choice. Ann Arbor: University of Michigan Press. Ansolabehere, Stephen, James Snyder and Charles Stewart. 2001. “Candidate Positioning in U.S. House Elections.” American Journal of Political Science 45:136-59. Austen-Smith, David, and Jeffrey Banks. 1999. Positive Political Theory I: Collective Preference. Ann Arbor: University of Michigan Press. Bendor, Jonathan. 2003. “Herbert A. Simon: Political Scientist.” In Nelson Polsby (ed.), Annual Review of Political Science, vol. 6. Bendor, Jonathan, Daniel Diermeier and Michael Ting. 2002. “The Empirical Content of Adaptive Models.” Typescript, Stanford University. Bendor, Jonathan, Dilip Mookherjee and Debraj Ray. 2001a. “Adaptive Political Parties and Downsian Competition.” Presented at the 2001 Annual Meeting of the American Political Science Association. Bendor, Jonathan, Dilip Mookherjee and Debraj Ray. 2001b. “Aspiration-Based Reinforcement Learning in Repeated Games.” International Game Theory Review 3:159-74. Bendor, Jonathan, Dilip Mookherjee and Debraj Ray. 2001c. “Reinforcement Learning in Repeated Interaction Games.” Advances in Economic Theory. Vol. 1, No. 1, Article 3. Bendor, Jonathan, Dilip Mookherjee and Debraj Ray. 2003. “Satisficing and Selection in Electoral Competition.” Typescript, Stanford University. Camerer, Colin. Behavioral Game Theory. Princeton: Princeton University Press. Chappell, Henry, and William Keech. 1986. “Policy Motivation and Party Differences in a Dynamic Spatial Model of Party Competition.” American Political Science Review 80:881-99. Cox, Gary. 1987. “The Uncovered Set and the Core.” American Journal of Political Science 31:408-22. Cyert, Richard, and James March. 1963. A Behavioral Theory of the Firm. Englewood Cliffs NJ: Prentice-Hall. Downs, Anthony. 1957. An Economic Theory of Democracy. New York: Harper & Row. Epstein, David. 1998. “Uncovering Some Subtleties of the Uncovered Set: Social Choice Theory and Distributive Politics.” Social Choice and Welfare 15:81-93. Ferejohn, John. 1995. “The Development of the Spatial Theory of Elections.” In James Farr and John Dryzek (eds.), Political Science and History. Cambridge: Cambridge University Press. Ferejohn, John, Morris Fiorina and Edward Packel. 1980. “Nonequilibrium Solutions for Legislative Systems.” Behavioral Science 25:140-48. 25 Ferejohn, John, Richard McKelvey and Edward Packel. 1984. “Limiting Distributions for Continuous State Markov Voting Models.” Social Choice and Welfare 1:45-67. Gode, Dhananjay, and Shyam Sunder. 1993. “Allocative Efficiency of Markets with Zero-Intelligence Traders: Market as a Partial Substitute for Individual Rationality.” Journal of Political Economy 101:119-37. Goodin, Robert, and Hans-Dieter Klingemann. 1996. “Political Science: The Discipline.” In Goodin and Klingemann (eds.), A New Handbook of Political Science. Oxford: Oxford University Press. Grofman, Bernard. 1993a. “Introduction” in Grofman (ed.), Information, Participation, and Choice. Grofman, Bernard. 1993b. “Toward an Institution-Rich Theory of Political Competition with a Supply Side Component.” In Grofman (ed.), Information, Participation, and Choice. Kagel, John, and Alvin Roth. 1995. The Handbook of Experimental Economics. Princeton: Princeton University Press. Kemeny, John, and J. Laurie Snell. 1960. Finite Markov Chains. Princeton: Van Nostrand. Kollman, Ken, John Miller and Scott Page. 1992. “Adaptive Parties in Spatial Elections.” American Political Science Review 86:929-37. Kollman, Ken, John Miller and Scott Page. 1998. “Political Parties and Electoral Landscapes.” British Journal of Political Science 28:139-58. Kramer, Gerald. 1977. “A Dynamical Model of Political Equilibrium.” Journal of Economic Theory 16:310-34. Levitt, Steven. 1996. “How Do Senators Vote? Disentangling the Role of Voter Preferences, Party Affiliation, and Senator Ideology.” American Economic Review 86:425-41. March, James, and Herbert Simon. 1958. Organizations. New York: John Wiley. McKelvey, Richard. 1986. “Covering, Dominance, and Institution-free Properties of Social Choice.” American Journal of Political Science 30:283-315. Miller, Nicholas. 1980. “A New Solution Set for Tournaments and Majority Voting.” American Journal of Political Science 24:68-96. Satz, Debra, and John Ferejohn. 1994. “Rational Choice and Social Theory.” Journal of Philosophy 91:71-87. Simon, Herbert. 1955. “A Behavioral Model of Rational Choice.” Quarterly Journal of Economics 69:99-118. Simon, Herbert. 1964. “The Concept of Organizational Goal.” Administrative Sciences Quarterly 9:1-22. Simon, Herbert. 1990. “Invariants of Human Behavior.” In Mark Rosenzweig and Lyman Porter 26 (eds.), Annual Review of Psychology, volume 41. Palo Alto: Annual Reviews Inc. Stokes, Susan. 1999. “Political Parties and Democracy.” In Nelson Polsby (ed.), Annual Review of Political Science, volume 2. Palo Alto: Annual Reviews Inc. Wittman, Donald. 1983. “Candidate Motivation: A Synthesis of Alternative Theories.” American Political Science Review 77:142-57. 27 Figure 1: An electoral landscape with three plateaus. a c L 1 : top cycle of X b d f L 2 : top cycle of reduced set, X/L 1 . e x z L 3 : top cycle of reduced set, X/(L 1 U L 2 ). y Key: a Æ b : a beats b. a b c f d e : everything in {a,b,c} beats everything in {d, e, f}. Variance of Limiting Distribution Mean = 13.3335 St. Dev. = 12.1236 60 50 40 30 20 10 0 -10 0 0.05 0.15 0.2 Figure 2: Normal search rule around last policy 0.1 Ratio of Size of the Uncovered Set to Size of Top Cycle Mean = 0.1000 St. Dev. = 0.0594 2 Regression Statistics: Slope = 187.6 T-statistic = 16.17 R = 0.84 0.25 0.3 stochastic? Table 1 dynamic? no is model… parties myopic? yes key foci the “chaos” problem? yes bounded rationality? yes nonequilibrium solution concepts used? analytical solutions? yes no yes yes Kramer (1977) yes yes yes yes no yes yes (Markovian) FFMP (1980, 1984) yes no yes yes yes yes yes yes (Markovian) KMP (1992, 1998) no yes yes yes BMR (2004) yes (not restricted to Markovian) Table bl 2