Abstract
This paper proposes a model of statehood, defined as elite extraction of resources from a subject population. Different from most of the existing literature, the size of the subject population evolves endogenously in a Malthusian fashion, and the elite take into account the effects on future population levels when taxing the current population. The elite can spend extracted resources by investing in productive and extractive capacities. Productive capacity increases the size of the pie, while extractive capacity makes it easier for the elite to tax it. Together—but not each on its own—these two types of investment can give rise to multiple steady-state equilibria, such that one steady state has both a higher rate of extraction, and higher population density and output, than the other steady state. The model can also account for a positive empirical relationship between land productivity and state antiquity among countries with relatively late state development.
Supplementary Information
The online version contains supplementary material available at 10.1007/s10887-021-09188-9.
Keywords: Malthusian model, Statehood, Multiple steady states
Introduction
For most of its existence the human species has lived in small bands of hunters and gatherers. Organized, complex, and hierarchical social structures—what we often call states—are a relatively recent phenomenon. States emerged gradually from around 3500 BCE, starting in a few corners of the world, in particular Mesopotamia, China, the Nile and Indus River Valleys, Mesoamerica, and the Andes (e.g., Service, 1975, Ch. 1; Borcan et al., 2018). A few millennia earlier, these same regions were also the first to enter the Neolithic Revolution, i.e., develop agriculture.
Many have therefore hypothesized a causal link from the rise of agriculture to statehood. One proposed mechanism has been labelled the Surplus Theory. The idea is that agriculture caused, or allowed, the rise of states by raising output per unit of land, thus creating a “surplus” which could be stored, and then feed a ruling elite. By contrast, in human societies that rely on relatively low-yielding techniques to obtain food, no such elite population can be sustained, since everyone’s labor is needed for procuring food. Variations on this broad explanatory theme can be found in, e.g., Childe (1936, 1950), Allen (1997), Diamond (1997), Hibbs and Olsson (2004), Putterman (2008, Section IV), and Borcan et al. (2020).1
Another mechanism, proposed by Scott (2009, 2017), Mayshar et al. (2017, 2020), has been labelled the Appropriability Theory. This emphasizes the characteristics of new crops that arrived with the Neolithic Revolution, in particular cereals. These were easier to expropriate than foods obtained through gathering or horticulture, specifically tubers. In support of this theory, Mayshar et al. (2020) document that statehood did not arise earlier in locations with higher agricultural yields overall, when controlling for the relative productivity of cereals and tubers. They also make the theoretical point that the Surplus Theory is hard to reconcile with a Malthusian model. This relates to the standard Malthusian result that steady-state incomes per agent are independent of land productivity, implying that the rate of extraction chosen by the elite should also be independent of land productivity.
In this paper we propose a unified Malthusian framework that incorporates some elements of both of these theories. Decisions in this model are made by a ruler, representing an “embryonic” state, and by a continuum of subjects, whose incomes the ruler has some ability to expropriate. [The pre-existence of a ruler is not crucial. Prior to full-fledged statehood, we can think of this agent as a “chief,” or what Sahlins (1963) labelled a “big man.” This is discussed further in Sect. 3.6.] The size of the subject population evolves over time in a Malthusian fashion and depends on how much the (embryonic) ruler extracts.
The extracted resources can be used for the ruler’s own consumption, or for two types of investment. First, he can invest in public goods, or what we call productive capacity. This captures the observation that early states were often instrumental in providing, e.g., irrigation (cf. Wittfogel, 1957; Nissen & Heine, 2009) and external defense (cf. Dal Bó et al., 2016).
Second, the ruler can accumulate power, or capacity, to more easily extract resources in the future. We refer to this as investment in extractive capacity. One example of such investments could be the costly acquisition of knowledge about writing and record keeping, which have been important components of a state’s extractive apparatus (Scott, 2009, pp. 226–234; Stasavage, 2020, pp. 93–96). Another example could be the hiring of skilled administrators (Ertman, 1997, Ch. 1).
Extractive and productive capacities are complementary: expanding production is more valuable when extracting it is easy, and improving extraction is more valuable when there is more to extract. This can give rise to multiple steady-state equilibria: one has low extractive capacity, low rates of extraction, and low levels of land productivity, population density, and output; another has high extractive capacity, high rates of extraction, and high levels of productivity, population density, and output.
The way these steady states differ is a non-trivial insight. The population is denser in the very steady state where it is taxed more heavily, which is surprising given the Malthusian framework. It is the higher productive capacity in the high-extractive steady state that sustains that denser population.
Also, the higher rate of extraction does not follow trivially from a higher level of extractive capacity. Rather, the ruler extracts more to finance investment in future extractive capacity.
As in any model with multiple steady states, shocks can push the economy from one steady state to another. For example, a positive shock to extractive capacity, holding productive capacity constant, can push it from the low-extractive to the high-extractive steady state; a shock to productive capacity can cause the same type of transition, holding extractive capacity constant. In that sense, the workings of the model seem consistent with both the Appropriability and Surplus Theories.
Moreover, we show that multiplicity of steady states hinges on the ruler being able to invest in both extractive and productive capacities; removing either channel renders the steady state unique. In other words, investments in extractive and productive capacities produce richer results together than each of them can on its own.
To explore the empirical relevance of the model, we lean on the complementarity between productive and extractive capacities. This complementarity implies that land productivity should have a greater impact on state building when the return to investing in extractive capacity is higher. That return should arguably depend on how many existing states there are to copy from.
To illustrate this, we consider an extended setting with many societies, and assume that the return to investing in extractive capacity faced by each ruler is increasing with the average level of extractive capacity across all societies. We then simulate the model, and let a few societies experience a positive shock to extractive capacity at some point, which pushes these to the high-extractive steady state. This in turn raises the return to investing in extractive capacity for the remaining societies, among which those with higher land productivity transition into statehood earlier than those with lower land productivity. This generates a positive relationship between land productivity and statehood across societies with late state development, but not among those with early state development. This pattern is consistent with cross-country data for the Eurasian continent.
The rest of this paper is organized as follows. Next, Sect. 2 discusses some of the existing literature. Section 3 sets up the benchmark model, and arrives at its main prediction about multiplicity of steady states. Section 4 then shows how this result falls apart when dropping investment in either extractive or productive capacities. Section 5 presents a simulation and some empirical evidence. Section 6 ends with a concluding discussion.
Existing literature
This paper seeks to contribute to a strand of the economics literature studying early state development. One reason this topic matters to economists is that there seems to be long-lasting effects from early statehood on modern development. For example, Borcan et al. (2018) document that countries with very early and very late statehood tend to have lower GDP/capita levels than those with states of intermediate age. Other studies using earlier installments of the same state antiquity data (e.g., Bockstette et al., 2002; Chanda & Putterman, 2007; Chanda et al., 2014) find a mostly positive relationship. There are also some interesting correlations between early statehood and other modern outcome variables: Hariri (2012) documents that countries with older states are currently less democratic; Depetris-Chauvin (2016) finds links between early statehood and modern conflict in Africa. Theories linking the timing of statehood to democracy and other modern development outcomes include Lagerlöf (2016).
Empirical studies into the origins of statehood often focus on the natural environment as a deep-rooted factor. For example, Fenske (2014), Litina (2014), Depetris-Chauvin and Özak (2016) find that states emerge where ecological conditions promote trade and specialization. Heldring et al. (2019) link state development in the Fertile Crescent from 5000 BCE to shifts in rivers, which they argue induced provision of public goods.
One particularly influential theory of how the environment can induce state building is the so-called circumscription theory by Carneiro (1970), which holds that states tend to emerge where fertile lands are geographically delimited, e.g., by mountains. Recent research has found support for this theory. Schönholzer (2019) documents that states form at locations with locally high agricultural productivity, surrounded by areas with lower productivity. Looking at data from ancient Egypt, Mayoral and Olsson (2020) find that changes over time in the degree of circumscription—defined as the productivity gap between the taxable and non-taxable activity, and induced by variation in rainfall—seems to impact state stability. In our model, we may think of the parameters guiding the accumulation of extractive capacity as factors encompassing the degree of environmental circumscription.
Theories on the emergence of states also often focus on the environment. For example, Dal Bó et al. (2016) and Schönholzer (2019) present models where land productivity, and the degree of geographical circumscription, are drivers of state formation.2 Different from these models our setting is Malthusian, allowing us to study population density as an endogenous outcome.
Using a Malthusian framework should also help address some of the critique against theories linking land productivity to state formation, or what we here label the Surplus Theory. As discussed in Sect. 1, Mayshar et al. (2020) argue that such theories are hard to reconcile with Malthusian population dynamics. This poses a conundrum, given the broad consensus about the relevance of the Malthusian model for preindustrial development (see, e.g., Galor, 2010; Ashraf & Galor 2011). In the Malthusian model presented here, land productivity can indeed affect state building. This hinges on extractive capacity being endogenous: when closing down this channel agricultural productivity no longer has any effect on the rate of extraction, similar to the results of Mayshar et al. (2020, Online Appendices B); see Sect. 4.1 below. Our empirical findings suggest that endogenous extractive capacity may be most relevant when state building is done by copying and learning from existing states. This does not contradict that earlier state building could be better understood from a framework where extractive capacity is exogenous and a function of crop composition, as argued by Mayshar et al. (2020).
Finally, this paper leans on a theoretical literature, starting with Besley and Persson (2009, 2011), on investment in fiscal and legal state capacities; what we here call extractive capacity corresponds closest to fiscal capacity in their jargon. Again, one difference is that we use a Malthusian setting, where population density is endogenous.3
The model
Consider a world with two classes: subjects and what we for simplicity call a “ruler.” The term ruler, and many model assumptions, are discussed further in Sect. 3.6.
The subjects live in overlapping generations for two periods: as passive children and active adults. In the adult phase of life, a subject works, pays taxes, and produces offspring. This means that the size of the subject population evolves endogenously over time, as a function of the ruler’s extraction rate.
The ruler has one single offspring who replaces him in the next period. We refer to him by the singular male pronoun, but this can also be interpreted as a collective of agents (an elite, or proto-elite).4
The ruler decides on the rate at which subjects are taxed, denoted . A fraction of the taxed (extracted) resources are lost, where . We refer to as extractive capacity. The subjects thus get a fraction of total output, the ruler gets a fraction , while the remainder, , is lost. As discussed in Sect. 3.6, lost tax revenue can be interpreted as theft by a class of tax collectors.
Since the ruler’s income equals , we shall refer to as the ruler’s effective tax base.5
Production
Output in period t, denoted , is produced with the production function
| 1 |
where is the land share of output, is the size of the subject population, M denotes the size of land (below normalized to one, ), and B and are the two different land productivity factors. We refer to as just population, but since land is normalized to unity, it also measures population density.
The factor B is taken as given by the ruler, and captures time-invariant factors determined by geography, such as the caloric content of the crops that can be grown in a particular environment. By contrast, depends on productivity-enhancing investment undertaken by the ruler, representing public goods such as irrigation systems, or knowledge. We shall refer to as productive capacity.6
Extraction and population dynamics
Each subject earns the average product of labor, , which is taxed at rate . Each subject’s income after tax thus equals .
Subjects care about consumption, , and fertility, , and utility is given by
| 2 |
where . Each subject takes her income as given and maximizes (2) subject to the budget constraint
| 3 |
where is the cost per child. This gives optimal fertility as
| 4 |
where . Since each subject is replaced by offspring, the subject population in the next period equals . Applying (4) and gives
| 5 |
The subject population thus constitutes a capital stock to the ruler, in the sense that its size in the next period, , decreases with the ruler’s current rate of extraction, . Put another way, is the fraction of output that the ruler “invests” in the subject population.
Investment in extractive capacity
Let the ruler’s investment in next period’s extractive capacity be denoted , which builds extractive capacity in the next period, , at a rate . We let extractive capacity be bounded from above and below at levels and , respectively, such that (discussed further in Sect. 3.6 below). More precisely,
| 6 |
The parameter is a measure of how easy extractive capacity is to build. For now this is treated as exogenous. In Sect. 5 we are going to interpret as a function of extractive capacity among other societies, the idea being that state building is often done by copying existing states.7
Investment in productive capacity
Consider next investment in productive capacity. We let the cost of in terms of period-t consumption be , where and . Assuming ensures that output and population converge to constant non-growing levels. The ruler’s budget constraint can now be written
| 7 |
where is the ruler’s consumption.
Utility
The ruler’s preferences are defined over and the total effective tax base in the next period, , with utility function
| 8 |
where .8
Discussion
Before we set up the ruler’s maximization problem, it is helpful to scrutinize some of the (implicit and explicit) assumptions in the set-up so far.
Minimum extractive capacity
As mentioned, we assume upper and lower bounds for extractive capacity, denoted and , respectively. The upper bound is not critical and can be set to one, . The assumption that is more important. If , then the economy would under certain conditions converge to a steady state with zero population and output, a special case of what we will later call a low-extractive steady state. Intuitively, in that steady state the ruler would have no extractive capacity, and thus lack tax revenue with which to invest in productive capacity, which is necessary for production, and thus for the population to reproduce. Assuming a minimum level of extractive capacity ensures that this steady state has positive population.
There are other ways to avoid the outcome with a vanishing population. For example, one can impose an exogenous lower bound for productive capacity instead.9 However, that type of model would be mechanically similar to the one set up here, the main difference being that a non-negativity constraint on investment in productive capacity would replace that for extractive capacity in the current set-up.
Egalitarianism and the assumed pre-existence of a ruler
The model presumes that a so-called ruler exists, which might ostensibly contradict the idea of an egalitarian social structure from which statehood emerges. Again, this is mostly for simplicity and clarity, and not completely at odds with the stylized facts pertaining to many pre-state societies.
First of all, the ruler does not need to be richer than other agents. The Online Appendices shows that the ruler’s steady-state income can be lower than, or equal to, that of his subjects, if is sufficiently small. What distinguishes the ruler from the subjects is not his income, but rather that he chooses taxes and invests in extractive and productive capacities.
Second, in any economic model where variation in statehood is the endogenous result of a choice, that choice needs to be vested with some agent, whether we call that agent a “ruler” or something else, and whatever the exact choice is. When interpreting the model, we may think of the decision maker more abstractly, standing in for various mechanisms through which pre-state societies solve collective-action problems, e.g., processes involving collaboration and negotiation.
Third, the conjectured presence of some type of ruler may in fact hold true for many quasi-egalitarian and pre-agrarian societies. It is common to categorize the political organization of human societies on a gradient from egalitarian bands, via more unequal tribes and chiefdoms, to fully fledged and highly hierarchical states (Flannery, 1972; Service, 1975; Diamond, 1997). In our model, equilibrium outcomes with low extractive capacity could at least correspond to chiefdoms.
Moreover, some societies at the earlier political stages have also been described as having embryonic rulers, tasked with rudimentary forms of public goods provision. Read (1959) coined the term “big man” for such leader figures among pre-state societies in New Guinea. Sahlins (1963) used the same term to contrast leader figures in Melanesia to those in more politically advanced Polynesian chiefdoms; see Lindstrom (1981) for other terminology used in the literature, such as “head man” and “center man.” Different from rulers of states, these leaders were typically not bestowed their powers through office or inheritance, but rather personal traits (Service, 1975, pp. 49–53). This may correspond to in our model, applying when the preceding ruler did not invest in extractive capacity (by setting ).
Defense against external predators
The variable is referred to as productive capacity. This may also include defensive (or protective) capacity. Specifically, we could let some fraction of the output be stolen by external predators, and allow the ruler to undertake costly investments to limit that fraction. That setting is explored in the Online Appendices, and shown to boil down to the same one presented here. The main difference is that some of the variables that we here treat as exogenous, such as and , in that setting become functions of the “deep” parameters characterizing the costs of investing in productive and defensive capacities, respectively.
One insight from that model set-up is that land that is less costly to protect corresponds to more productive land in the current setting (i.e., a higher B). Intuitively, resources not needed for protection can be invested in productive capacity instead, which translates to more output at a given level of total investment in defensive and productive capacities. In that sense, we can think of B as a measure not only of land productivity, but also of how well protected output is.10
Tax collectors
We have conceptualized extractive capacity in this model as the fraction of the taxes collected that end up with the ruler, rather than being lost in the process of collecting them.
In order to not restrict ourselves to one single interpretation, we have not explicitly modelled how those tax revenues are lost. The Online Appendices proposes one way to capture that process more explicitly by introducing a new class of agents, called tax collectors. These can run off with the taxes they collect, and the ruler can invest in capacity to retrieve (some of) those lost revenues. The upshot is a model producing the same functional form for accumulation of extractive capacity as that in (6), but with , , and being functions of “deep” model parameters.
Alternative ways to model extractive capacity
There are other ways to model extractive capacity. We can let the ruler face a cost of levying taxes, incurred in the same period they are levied. Then extractive capacity, , could be a variable characterizing that cost function, such that a higher implies a lower cost of tax collection. This formulation resembles that of Mayshar et al. (2020, Online Appendices B).
Specifically, let the cost of levying a tax rate of on total output equal , where is increasing in the tax rate, , and decreasing in . Then the ruler’s budget constraint, corresponding to that in (7), becomes
| 9 |
Our setting can be seen as a special case of this formulation, where , which makes (9 ) identical to (7). Similarly, what we can call the net tax (or extraction) rate, , then equals just , which corresponds more closely to the variable used to measure statehood in Mayshar et al. (2020, Online Appendices B). In our benchmark model both and are endogenous, while they treat the latter as exogenous.
The ruler’s optimization problem
We are now ready to set up the ruler’s optimization problem. Recall that he chooses , , and to maximize (8), subject to (5), (6), (7), (1) forwarded one period, and a non-negativity constraint on . More compactly, the problem can be written as follows:
| 10 |
subject to
| 11 |
We refer to this as the benchmark model. Its results can be understood from three different trade-offs that the ruler faces. First, higher investment in productive capacity, , generates a larger tax base in the next period (higher ), at the cost of less consumption for the ruler today (lower ).
Second, a higher extraction rate, , gives higher income and consumption today (by raising more tax revenue, ); this comes at the cost of a smaller future tax base (lower ), in turn due to the Malthusian way in which more extraction reduces the future population size ().
Third, investment in future extractive capacity, , is costly in terms of current consumption.
Due to the assumed linear functional form, and the upper and lower bounds on , this last trade-off can be seen to generate corner solutions: by setting , and thus , the ruler invests nothing in extractive capacity, keeping it at its minimum level; by setting , and thus , the ruler chooses maximum extractive capacity.
The ruler’s investment in future extractive capacity depends on his current effective tax base, . If this is small, then a marginal increase in generates relatively little revenue, thus making it costly to finance investment in extractive capacity. If the effective tax base is small enough it is optimal to set ; if it is sufficiently large, then it is optimal to set . In that sense, a currently strong and rich state is more likely to remain strong also in the next period. The next section derives explicit expressions for the ruler’s choice variables as functions of the effective tax base and exogenous parameters (with details deferred to Sect. 1 of the Appendices).
The ruler’s optimal choices
Let and denote the thresholds for , above and below which the two constraints on in (6) bind. That is, and if ; and and if . A weak ruler, with a low effective tax base (), finds current extraction costly, making it optimal not to build any future extractive capacity, thus preserving the weak state. A strong ruler, with a large effective tax base (), finds it easy to extract resources, and chooses to maintain a strong state by investing enough to keep extractive capacity to its maximum, .
As shown in Sect. 1 of the Appendices, these thresholds are given by
| 12 |
and
| 13 |
It is straightforward to show that follows from .
The ruler’s choices thus depend on how the effective tax base falls relative to these thresholds. Consider first how the ruler sets the rate of extraction. Section 1 of the Appendices shows that the ruler’s optimal extraction rate can be written:
| 14 |
It can be see from (14) that the relationship between and is inversely U-shaped. First, is constant for , i.e., when investment in extractive capacity is not operative. This constant rate is the same as in the corresponding model without any investment in extractive capacity (see Sect. 4.1).
We also see that is increasing in for . Over this interval, rulers respond to marginal increases in the effective tax base () by extracting more resources, in order to fund more investment in future extractive capacity. Finally, we see that decreases with for . Intuitively, the cost of maintaining maximum extractive capacity falls relative to income as the effective tax base grows.
As approaches infinity, approaches the same level as when . However, for any finite level of , the extraction rate is always higher when the ruler invests the maximum amount in future extractive capacity ( and ) than when he invests the minimum amount ( and ). That is, the top row of (14) is always greater than the bottom row, for finite . This means that any steady state with maximum investment in extractive capacity must have a higher extraction rate than one with no such investment. Below we explore if two such steady states can coexist.
Dynamics
Since the optimal extraction rate in (14) depends on the effective tax base, , the dynamics of the economy are most easily described in terms of the two state variables and .
Dynamics of
As shown in Sect. 1 of the Appendices, the ruler’s optimal choice of (as implied by the choice of ) can be written
| 15 |
That is, binds when , and binds when . When these constraints are non-binding (i.e., when ) the next period’s extractive capacity () increases linearly with the current period’s effective tax base (). It is also easy to verify that the respective corner solutions coincide with the interior solution when and .
Dynamics of
From (1) we see that , and from (5) we recall that . Once we have the ruler’s optimal and in terms of and we can thus derive an expression for in terms of the same state variables. Section 2 of the Appendices shows that
| 16 |
where , and where and depend only on the exogenous and time-invariant variables , , , , , and [see (47) and (54 ) in the Appendices], and play no role for the dynamics.
Note that depends on B, i.e., the land productivity factor that is independent of the ruler’s investment. This has interesting implications for how changes in B impact the dynamic configuration, as discussed below.
Multiple steady states
Now (15) and (16) define a two-dimensional dynamical system for and , which is illustrated in the phase diagram in Fig. 1. It shows the loci along which and are constant (derived in Sect. 3 of the Appendices), and the regions where the constraints on extractive-capacity investment bind: binds when , and binds when .
Fig. 1.

Phase diagram illustrating the dynamics. The loci along which and are constant are indicated by the red and blue solid curves. The green dashed curves indicate the loci above and below which the constraints and bind. In this configuration, there exist two stable steady states (Color figure online)
Generally, the configuration depends on exogenous variables, in particular B. Figure 1 illustrates a case where there are two locally stable steady-state equilibria, and one unstable. (Exact conditions for this type of configuration are stated in Proposition 1 below.) One stable steady-state equilibrium can be labelled a low-extractive steady state. Here the ruler undertakes no investment in extractive capacity, so , and output can be written
| 17 |
which is illustrated in Fig. 1, and derived by setting and in the bottom row of ( 16). The associated extraction rate, which we can denote , is given by the bottom row of (14), i.e., . Population is given by (5) as .
The other stable steady state, at which , can be labelled the high-extractive steady state. Here output equals , defined from ; cf. the top row of (16). The extraction rate in this steady state, , is given by the top row of (14 ), setting . From (5), population can be written .
A saddle path separates the phase diagram into two basins of attraction, each associated with one of the two steady states.11 An economy starting off above the saddle path (i.e., with a large initial effective tax base, ) will converge over time to the high-extractive steady state. An economy starting off below the saddle path converges to the low-extractive steady state.
A trajectory leading to the high-extractive steady state eventually enters a region where , at which point the upper bound on extractive capacity investment starts to bind. From there, stays constant at , while continues to grow, stabilizing at , as illustrated in Fig. 1. Similarly, a trajectory leading to the low-extractive steady state eventually enters a region where , after which stays constant at , while declines, approaching .
We can also compare levels of population, output, extractive capacity, and rates of extraction in the two steady states. This is a nontrivial exercise, since these are all endogenous and jointly determined. The following proposition summarizes these results, and provides conditions for the existence and uniqueness of each steady state, respectively.
Proposition 1
Consider the model with investment in both productive and extractive capacities, as described by (10) and (11). In this model, there exist and , such that:
If, and only if, does there exist a low-extractive steady state, , such that .
If, and only if, does there exist a high-extractive steady state, , such that .
For small enough, it holds that . That is, the low- and the high-extractive steady states coexist for .
- Assume that , so that both steady states exist. Then the following holds:
-
(i)The low-extractive steady state has a lower extraction rate than the high-extractive steady state, i.e., ;
-
(ii)The low-extractive steady state has lower output than the high-extractive steady state, i.e., ;
-
(iii)The low-extractive steady state has lower population than the high-extractive steady state, i.e., .
-
(i)
All proofs are in Sect. 5 of the Appendices.
The possibility of multiple steady states is quite intuitive, and has to do with how current extraction affects future extraction. A larger initial level of the effective tax base—i.e., a larger —induces the ruler to invest more in both and , leading to a larger effective tax base in the next period. This can sustain high levels of extractive and productive capacities across generations of rulers. As we shall see in Sect. 4 below, investment in productive and extractive capacities are both needed for multiplicity of steady-state equilibria to arise.
The claims in part (d) in Proposition 1, comparing the properties of these steady states, are far less obvious.
For example, part (d) (iii) states that the high-extractive steady state has larger population (density) than the low-extractive one (). This may seem counter-intuitive, since a higher rate of extraction [see (d) (i)] would imply a smaller population for a given level of output; to see this one can impose steady state on (5). The result still holds because output is higher in the high-extractive steady state [see (d) (ii)], in turn due to higher investment in productive capacity, which is sustained by the ruler’s larger tax revenues.
Part (d) (i) of Proposition 1 is not obvious either (despite the ostensibly self-explanatory labels). We gleaned some of the intuition from ( 14). It is not merely about higher extractive capacity inducing a higher rate of extraction. In fact, the rate of extraction in the low-extractive steady state () is independent of the exogenously given minimum level of extractive capacity ().12 In other words, small changes in extractive capacity do not affect the rate of extraction, as long as the economy is not pushed out of the low-extractive steady state. Rather, the result refers specifically to a steady-state comparison. In the high-extractive steady state the ruler chooses a higher rate of extraction to finance investment in future extractive capacity, which is worthwhile precisely because of the large effective tax base in that steady state.
Shocks to or As explained above, given a configuration with multiple steady states, such as that in Fig. 1, the economy converges over time to one of the stable steady-state equilibria. Which one it converges to depends on its initial position relative to the saddle-path trajectory leading to the unstable steady state.
This means that an economy can transition from the low-extractive to the high-extractive steady state in the wake of a one-period shock to either extractive capacity (), or output (), or a combination of the two. Intuitively, the shock raises the ruler’s effective tax base in period t, inducing him to invest more in productive and/or extractive capacity, possibly putting the economy on a trajectory leading to the high-extractive steady state. For this to happen, the shock must push () above the threshold saddle path, into the basin of attraction of the high-extractive steady state.
A transition due to a shock to output would be consistent with the Surplus Theory, and could perhaps be interpreted as the result of temporary climatic variations, and/or a temporary phase of good harvests. A transition due to a shock to extractive capacity relates conceptually to the Appropriability Theory.
Exogenous changes to B Above we considered shocks to extractive capacity () or output (). We can also analyze exogenous increases in the geographically determined land productivity factor, B. As shown in Sect. 3 of the Appendices, this shifts up the ()-locus, thus raising output in the low-extractive steady state; note from (17) that is increasing in B. It also expands the basin of attraction for the high-extractive steady state. At some point the low-extractive steady state ceases to exist. Intuitively, a rise in B implies more output, which in turn can be used to accumulate both productive and extractive capacities.
Changes in B need not be interpreted as shocks. Very gradual increases in B would have small effects at first, but eventually lead to rapid changes in and , as the dynamic configuration changes and the high-extractive steady state becomes the unique steady state (i.e., when B exceeds ). The economy can thus initially change slowly in response to improvements in B, and then go through a rapid spurt in extractive capacity and output, stabilizing at and , respectively. From there, output expands more slowly again (as is increasing in B).
Closing down channels
In the benchmark model the ruler could invest in both extractive and productive capacities. To see why this matters, we next consider what happens when we close down either of these channels.
Closing down investment in extractive capacity
To remove investment in extractive capacity from the model, we ignore (6), setting , and let equal some exogenous constant, here denoted . In this setting, an increase in represents a rise in extractive capacity independent of any actions taken by the ruler, conceptually similar to Mayshar et al. (2020, Online Appendices B), who treat extractive capacity as exogenous.
The ruler’s optimization problem now becomes:
| 18 |
subject to
| 19 |
The solution to this model resembles that analyzed in the previous section in the case when the non-negativity constraint on was binding (); see Sect. 1 of the Appendices for details. The dynamics of output becomes
| 20 |
where (recall) , and where G depends on exogenous parameters and is increasing in both agricultural productivity ( B), and extractive capacity (); see (60) in the Appendices. The following proposition summarizes the main results in this setting.
Proposition 2
Consider the model without investment in extractive capacity, as described by (18) and (19). In this model, there exists a unique (non-zero) steady-state equilibrium where the following holds: extractive capacity equals its exogenous level, ; output equals ; and the rate of extraction equals
| 21 |
Thus, taking investment in extractive capacity out of the model rules out multiplicity of steady states. It can be seen that is increasing in both B and (since G is), so we do get the expected predictions from increases in both land productivity and extractive capacity; note that extractive capacity still affects tax revenues and thus investment in productive capacity, .
However, optimal is here constant. [Indeed, the expression in (21) is the same as in the bottom row in (14), which applies to the benchmark model when , i.e., .] Since the extraction rate does not depend on either B or , this setting cannot explain the rise of statehood as an endogenous outcome of changes in B and/or . In that sense, without investment in extractive capacity the model is inconsistent with both the Surplus and Appropriability Theories.13
Closing down investment in productive capacity
Next we remove investment in productive capacity, setting in all periods, but keep investment in extractive capacity. The ruler’s budget constraint, analogous to that in (7), becomes . The expression for output in (1) becomes .
The ruler’s optimization problem can now be written:
| 22 |
subject to
| 23 |
This model coincides with that in the benchmark setting in Sect. 3 when goes to infinity, i.e., when we make investment in productive capacity prohibitively expensive. Specifically, there are two thresholds for the effective tax base, and , below and above which investment in extractive capacity is constrained to its minimum or maximum levels, respectively. Letting go to infinity in (12) and (13), these thresholds can now be written
| 24 |
and
| 25 |
That is, if , then and ; if , then and .
The dynamical system describing the evolution of and is derived in Sect. 2 of the Appendices, and can also be derived from (15) and (16) by letting go to infinity, and setting . Because the resulting expressions for and are so qualitatively similar to those in (15) and (16), we suppress these to the Appendices.
We sum up the main results in the following proposition.
Proposition 3
Consider the model without investment in productive capacity, as described by (22) and (23). In this model, there exist and , such that:
If, and only if, does there exist a low-extractive steady state, , such that .
If, and only if, does there exist a high-extractive steady state, , such that .
. That is, the low- and the high-extractive steady states cannot coexist.
- If , then there exists a unique steady state, , such that . Furthermore, it holds that:
-
(i)The steady-state extraction rate, , is increasing in B and ;
-
(ii)The steady-state level of extractive capacity rate, , is increasing in B and ;
-
(iii)The steady-state level of output, , is increasing in B and decreasing in ;
-
(iv)The steady-state level of population density, , does not depend on B and is decreasing in .
-
(i)
Parts (a) and (b) of Proposition 3 are consistent with the corresponding claims in Proposition 1.14 More (less) productive land makes the high-extractive (low-extractive) steady state more likely to exist. This is broadly consistent with the Surplus Theory.
However, part (c) of the proposition shows that multiple steady-state equilibria are not possible in this setting. If land productivity, B, is high enough that the high-extractive steady state exists (meaning ), then it is also too high for the low-extractive steady state to exist (since ). Intuitively, multiplicity of steady states requires strong enough feedback from current extraction to future extraction, and this feedback is weakened when rulers are not able to invest in productive capacity.
Part (d) takes this point further, by considering the case when . Here neither the low- or high-extractive steady state exists. Rather, the economy converges to a unique interior steady state. Interestingly, this steady state has many properties—summarized by parts (i)-(iv) of (d)—that seem inconsistent with the facts. For example, a (small) rise in land productivity, B, leads to a higher steady-state extraction rate and higher levels of extractive capacity, but leaves steady-state population density unchanged. Intuitively, higher land productivity raises population in the usual Malthusian way, but that is counteracted by the higher rate of extraction, and here the net effect is zero. Both those effects were present in the benchmark model, but there higher tax revenues also generated higher investments in productive capacity, which tended to increase steady-state population density. That third channel is closed down here.
Similarly, a rise in (which, recall, measures how easy it is to build extractive capacity) raises the steady-state extraction rate and extractive capacity, but lowers population density. This implies a negative association between statehood and population density, which is inconsistent with the empirical facts.
Empirical results
The results of the model build on a complementarity between extractive and productive capacities. Intuitively, the possibility of a high-extractive steady state hinges on land productivity affecting the effective tax base and thus investment in future extractive capacity. The implication is that an increase in land productivity, B, is more likely to generate statehood if investments in extractive capacity are easier to undertake, i.e., if is large.15
We can explore if this holds empirically by comparing the correlation between statehood and land productivity for samples of countries with high and low . To measure , we may lean on a literature emphasizing how much easier elites have found it to build a state when they already have a blueprint. For example, the earliest states developed writing and bookkeeping, which were copied by elites developing states later (Scott, 2009, pp. 226–234); Stasavage, 2020, pp. 91–93). Similarly, Ertman (1997, p. 27) argues that European state building became easier at a point when rulers could hire from an existing pool of experts to serve as administrators and in the military. In a multi-society interpretation of our model, this suggests that the return to investing in extractive capacity in one society, as captured by , could depend on the level of extractive capacity across a range of societies.
To fix ideas, suppose a group of countries have transitioned into statehood in a first wave. Since they did not have any statehood blueprints they faced a very low , but transitioned anyhow, possibly for reasons not modelled here, and once they have transitioned they are more likely to maintain statehood moving forward (due to the multiplicity of locally stable steady states). The remaining countries, being able to draw on the state knowledge accumulated by the first wave of countries, face a higher . The complementarity between B and should then imply that countries in the second wave transition earlier if they have higher B.
A simulation example
To better understand the dynamics of a model where changes over time, we can first consider a simulation where in each period is a function of the average level of extractive capacity, , across 200 societies. (For details, see Sect. 1 of the Appendices) We let these 200 societies be endowed with different levels of land productivity, B, which is uniformly distributed between the two thresholds discussed in Proposition 1, and . Thus, two steady states exist initially.
All societies start off in a low-extractive steady state, with minimum extractive capacity (), but 20 are exogenously hit by a shock at , giving them maximum extractive capacity (). These 20 represent early states, and have levels of B distributed in the same way as among the other 180. (Here we select them as every tenth society when ranked by B, but one can also select them randomly.) Their function in this simulation is to initiate a process through which statehood can spread: the initial rise in average raises , in turn inducing more societies to invest in , thus raising further, creating a self-propelling dynamic.
Figure 2 shows the simulated time paths of the log of for three societies out of the 180 not hit by the shock. A higher B is associated with an earlier rise in , since higher land productivity induces earlier investments in when starts to rise; the rise in is in turn driven by the rise in average across the 200 societies, shown as a dotted line.
Fig. 2.
Simulated time paths over 100 periods showing log extractive capacity, , for three societies in a setting where depends on the average level of across all societies (shown as a dotted path). These three are all among those societies not hit by a shock to extractive capacity
Some paths in Fig. 2 show a non-monotonic rise (hardly visible unless we log ), which reflects that the dynamics for a fixed exhibit two locally stable steady states. Depending on parameter values, not all societies need ever transition into statehood, but in this simulation all 200 societies make the transition within 60 periods. In any given period, societies with higher B have higher levels of .
Figure 3 illustrates the cross-sectional relationship between land productivity and a cumulative statehood measure, namely mean extractive capacity over the 100 periods. The 20 societies with the highest levels of statehood are those that experienced a positive shock. By assumption, these have levels of B distributed across the same interval as the remaining 180, and thus show little association between land productivity and state history.16 Among the remainder, however, we see a clear positive relationship between land productivity and mean extractive capacity, such that the highest levels of statehood are found in societies with the highest land productivity.
Fig. 3.

Plot showing the cross-sectional relationship between land productivity, B, and mean extractive capacity over 100 periods, based on the same simulation as in Fig. 2. Each circle represents one society
Cross-country evidence from Eurasia
Next we explore if this pattern is consistent with cross-country data. We focus on the continent of Eurasia, where most state building has spread from a couple of centers (see discussion below). We use accumulated State Antiquity over different periods from 3500 BCE to 1500 CE from Borcan et al. (2018) to measure statehood (corresponding to mean extractive capacity over time in the simulation). We use the Caloric Suitability Index (CSI) from Galor and Özak (2016) to measure land productivity. (See Sect. 2 of the Appendices for more details about the data.)
Table 1 presents results from regressing State Antiquity on CSI for different subsamples, namely countries which developed statehood before and after different temporal cutoffs. Columns (1)–(3) consider 450 CE, a common benchmark for the end of the classical-age state building era (see, e.g., Mayshar et al., 2020). Columns (4)–(9) consider 1000 BCE, an earlier point at which much fewer countries had begun to develop statehood.
Table 1.
Agricultural productivity and statehood: countries with late and early state development
| Dependent variable is State Antiquity over the period | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 3500 BCE to 1500 CE | 450 CE to 1500 CE | 3500 BCE to 1500 CE | 1000 BCE to 1500 CE | 3500 BCE to 450 CE | 1000 BCE to 450 CE | ||||
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | (9) | |
| Galor–Özak CSI | 30.38** | − 90.00** | 18.96 | 69.10*** | − 23.24 | 18.14 | 28.59** | − 44.05 | − 3.22 |
| (11.36) | (35.33) | (11.88) | (24.83) | (40.90) | (30.87) | (10.76) | (35.65) | (26.64) | |
| State Antiquity 3500 BCE-450 CE |
0.00 (0.05) |
||||||||
| State Antiquity 3500-1000 BCE |
0.16 (0.14) |
0.17 (0.11) |
|||||||
| 0.14 | 0.17 | 0.09 | 0.13 | 0.01 | 0.07 | 0.07 | 0.05 | 0.11 | |
| Number of obs. | 23 | 52 | 52 | 56 | 19 | 19 | 56 | 19 | 19 |
| Arrival of statehood | After 450 CE | Before 450 CE | Before 450 CE | After 1000 BCE | Before 1000 BCE | Before 1000 BCE | After 1000 BCE | Before 1000 BCE | Before 1000 BCE |
Ordinary least squares regressions across Eurasian countries with robust standard errors in parentheses. The dependent variable is accumulated State Antiquity over different periods. The sample is split between countries which developed statehood early and late, respectively: before and after 450 CE in columns (1)–(3), and before and after 1000 BCE in columns (4)–(9)
*; **; ***
Consider first columns (1), (4), and (7) in Table 1, which use samples of countries with relatively late state development. Here we find a positive and significant correlation between the Galor–Özak CSI index and statehood. The relationship among countries with earlier state development in the remaining columns is mostly insignificant, at least when controlling for existing state development up until the cutoff year; see columns (3), (6), and (9). This is consistent with the simulation results in Fig. 3. That is, the relationship between accumulated statehood and land productivity tends to be positive for countries that developed statehood later, and close to zero for those with early statehood.
Figure 4 illustrates the relationship between land productivity and statehood for early and late state developers, using 1000 BCE as cutoff; cf. columns (4) and (5) in Table 1. Note that the pattern is qualitatively similar to the simulated one in Fig. 3.
Fig. 4.
Plot showing the relationship between statehood and the Galor–Özak CSI index for Eurasian countries that already had some statehood before 1000 BCE, and those that did not
Table 2 explores these cross-country data further when using 1000 BCE as cutoff for late and early state development, but using the full sample of Eurasian countries and instead interacting land productivity with an indicator for late state development. Column (1) first documents a negative but insignificant unconditional relationship between Galor–Özak CSI and statehood. This turns positive and significant in column (2), where we enter a Late Statehood Dummy, equal to one for countries which developed statehood after 1000 BCE. The Late Statehood Dummy itself carries a significant negative coefficient for obvious reasons.
Table 2.
Agricultural productivity and statehood: interacting late statehood with agricultural productivity
| Dependent variable is State Antiquity 3500 BCE to 1500 CE | ||||||
|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | |
| Galor–Özak CSI | − 58.76 | 46.41** | − 23.24 | − 42.63 | − 42.65 | − 40.82 |
| (37.03) | (21.86) | (39.76) | (32.39) | (32.67) | (35.41) | |
| Late Statehood Dummy |
− 1383.62*** (132.09) |
− 1912.32*** (338.10) |
− 1766.34*** (307.02) |
− 1752.91*** (280.72) |
− 1736.35*** (296.55) |
|
| Late Statehood Galor–Özak CSI |
92.34* (47.00) |
99.02*** (36.23) |
98.07*** (34.65) |
95.71** (38.25) |
||
| Distance from State Origin |
− 0.01 (0.09) |
− 0.02 (0.10) |
||||
| Log Absolute Latitude |
− 25.06 (169.25) |
|||||
| 0.05 | 0.61 | 0.63 | 0.78 | 0.78 | 0.78 | |
| Number of obs. | 75 | 75 | 75 | 75 | 75 | 75 |
| Region fixed effects | No | No | No | Yes | Yes | Yes |
Ordinary least squares regressions across Eurasian countries with robust standard errors in parentheses. The dependent variable is accumulated State Antiquity 3500 BCE to 1500 CE. Late Statehood is an indicator for a country not having a state before 1000 CE. Region Fixed Effects is a set of dummies for each of nine regions
*; **; ***
In column (3) we interact the Late Statehood Dummy and the Galor–Özak CSI index. The interaction term comes out as positive and significant just below the 5% level. It stays positive and becomes much more precisely estimated in column (4), where we include region fixed effects. Column (5) also controls for the geodetic distance from country centroids to Baghdad or Beijing, whichever is closest, conjectured centers for state origins in Eurasia. Column (6) adds a control for Log Absolute Latitude. Throughout, the positive coefficient on the interaction term stays significant at the 5% level, or better. In other words, land productivity shows a positive association with statehood among countries that developed statehood later, just as we should expect.
As mentioned, we here focus on the Eurasian continent, since state building did not spread between Eurasia and other continents prior to 1500. When including the Americas, or the rest of the world, the results in Tables 1 and 2 tend to weaken. This seems consistent with the idea that land productivity should matter more when state building tools can be copied or imported more easily.
Anecdotal evidence from Sweden
The data presented above end in 1500 CE, but state building continued after that, in particular in Northern Europe, which lagged behind the continent (cf. Fig. 4). Sweden offers some concrete examples of how rulers of younger states could use tax revenue to import state building after 1500.
As described by Ertman (1997, pp. 313–314), in 1538 Sweden’s first king Gustav I (or Gustav Vasa) hired a German minister, Conrad von Pyhy, to organize its central administration following a template from the Holy Roman Empire. From 1611, Gustavus Adolphus continued state centralization by borrowing from more recent German and Dutch models.
Architecture offers another example. The oldest and most famous castles and monuments from Sweden’s so-called Great Power era in the 17th century were designed by foreign architects, in particular Simon de la Vallée and Nicodemus Tessin the Elder, who acquired their skills on the continent (Stevens Curl & Wilson, 2015). There may be more important (and productive) aspects of state building than castles, but this does illustrate that skills related to state building could indeed be imported.
Concluding remarks
There are many competing explanations of what caused the rise and spread of statehood, or social stratification more generally. The Surplus Theory posits that a non-producing elite could only be supported with a “surplus” supply of food. This surplus, goes the argument, arrived when land productivity rose in the wake of the Neolithic Revolution, i.e., when humans transitioned from food procurement through hunting and gathering to using agriculture. A different theory has been labelled the Appropriability Theory. It holds that the rise of states was rather about the arrival of new crops, which were easier for a ruling elite to confiscate.
This paper has presented a model which incorporates mechanisms related to those emphasized by both the Surplus and Appropriability Theories. A ruler extracts resources from a subject population, the size of which evolves over time in a Malthusian fashion, dependent on the ruler’s rate of extraction. The ruler can invest the extracted resources in what we call extractive and productive capacities. These complement each other in such a way that the model can give rise to multiple steady states holding constant land productivity and other exogenous factors. One steady state has low extractive capacity, a low extraction rate, and low population density and output; the other has high extractive capacity, a high extraction rate, and high population density and output.
Not only can the combination of extractive and productive capacities give rise multiple steady states. This paper has shown that both of these elements are needed for such multiplicity to arise. In that sense, the Surplus and Appropriability Theories, as modelled here, can generate richer theoretical results together than each theory on its own.
To illustrate the empirical relevance of the model we exploit its complementarity between land productivity and the return to state building. Intuitively, countries which develop statehood later are able to draw on the state knowledge accumulated by earlier states, and thus face a higher return to efforts and resources directed towards state building compared to countries which developed statehood from scratch. Therefore, among countries which transition into statehood relatively late, we should expect too see a positive association between land productivity and state antiquity, but not necessarily among earlier states. Evidence from across Eurasian countries supports this prediction.
Supplementary Information
Below is the link to the electronic supplementary material.
Appendices
The ruler’s maximization problem
Finding optimal , and
First note from (1) and (5) that output in period can be written
| 26 |
Substituting , (7), and (26) into (8), we can write as a function of , , and , namely
| 27 |
where
contains only variables taken as given by the ruler. The problem is to maximize (27) subject to , , , , and ; the last two constraints correspond to and , respectively.
The first-order conditions for an interior solution state that and satisfy
| 28 |
and
| 29 |
where ; recall (7).
It is straightforward to see that the constraints , , and never bind, so (28) and (29) always give optimal and for any . Using (7), (28), and (29) we can solve for and as follows:
| 30 |
| 31 |
Also, using (7), (30), and (31) we can write the ruler’s consumption as
| 32 |
Below we use (30) to (32) to find the optimal choices of and for three cases: when ; when ; and when .
Corner solutions where
If the marginal effect on from an increase in is negative when , then is optimal. This happens when
| 33 |
Using (30) and (31) we see that is simply the expression for in (32), evaluated at . Thus, the inequality in (33) can be written
| 34 |
which translates to , where is given by (13).
It thus follows that if , then . Moreover, optimal and can be found by setting in (30) and (31). This gives the bottom rows of (14) and (43) below.
Corner solutions where
If the marginal effect on from an increase in is positive when , then is optimal. This happens when
| 35 |
The expression in square brackets in (35) equals in (32), evaluated at . Substituted into (35), this gives
| 36 |
or
| 37 |
which can in turn be simplified to , where is given by (12).
To sum up, if , then and optimal and can be found by setting in (30) and (31). This gives the top rows of (14) and (43) below.
Interior solutions
Consider next interior solutions for , which can be found when , and are derived from the first-order condition
| 38 |
where is given by (32). We can use (32) and (38) to find optimal , but we are rather interested in the associated expression for . Since in an interior solution for , we can write the first-order condition in (38) as
| 39 |
This can be solved for to give
| 40 |
which is the middle row of (15).
Using (40), we can also derive the associated solutions for and . Dividing (28) by (38), and rearranging, investment in next period’s technology becomes
| 41 |
where the second equality follows from (40). Solving (41) for gives the middle row in (43) below.
Similarly, dividing (29) by (38), using (40), and rearranging, gives
| 42 |
which can be solved to give the middle row in (14).
Complete characterization of the solution
To sum up, we can write the optimal expressions for as in (14), for as in (15), and the optimal expression for can be written
| 43 |
Dynamics of
This section finds an expression for in terms of and (and exogenous variables, such as B). Consider first the case when , meaning neither of the constraints on binds. Using (14), it is then seen that
| 44 |
Now (26), (43), and (44) tell us that
| 45 |
To simplify this expression, first define
| 46 |
and
| 47 |
where we note that follows from . Using (46) and (47), we can rewrite (45) more compactly as
| 48 |
which is the middle row of (16).
Consider next the case when . From (14), it now follows that
| 49 |
Following similar steps as we followed above for the interior solution, we can use (26), (43), and (49) to show that
| 50 |
where
| 51 |
and (recall) is given by (46).
Finally, for the case when , we can use (14) again to see that
| 52 |
Applying (26), (43), and (52) some algebra shows that
| 53 |
where and are given by (46) and (51).
Finally, using (46), (47), and (51) we can define as
| 54 |
Now, using (50), (53), and substituting for , we arrive at the top and bottom rows of (16).
The phase diagram
The ()-locus
The following can be seen directly from (15): for , it holds that when ; for , it holds that when ; for , it holds that when , or . In sum, the ()-locus can be written
| 55 |
The (inverse of) (55) is graphed in Fig. 1 as a three-segment solid blue curve.
The ()-locus
From (16) we learn the following: for , it holds that when (from using ); for , it holds that when , defined from ; for , it holds that when , defined from . To summarize, the ()-locus can be written
| 56 |
The red solid curves in Fig. 1 show the graphs of the three different segments of the ()-locus in (56).
Change in configuration when changing B
Note that the ()-locus does not depend on B. It is easy to see, from the definitions above, that and are strictly increasing in B, that , and that . It follows that we can adjust B to shift the ()-locus to alter the configuration of the two-dimensional dynamical system. When B is sufficiently small the ()- and ()-loci intersect only once, and this unique intersection lies in the region where . When B is sufficiently large the two loci also intersect only once, now in the region where .
Closing down channels
Closing down investment in extractive capacity
In this setting, the first-order conditions for and can be written
| 57 |
and
| 58 |
Solving for gives the same expression as in the bottom row in ( 14). The expression for becomes identical to that in (30), but with and , i.e.,
| 59 |
Using and , together with the expressions for in the bottom row in (14), and in (59), some algebra shows that , where
| 60 |
and (recall) .
Using (51), it can also be seen that , which shows that can be derived from (16), setting . That is, the dynamics in the model without investment in extractive capacity coincide with those in the benchmark model in the relevant corner solution.
Closing down investment in productive capacity
In the model without investment in productive capacity, the first-order condition for (which always holds with equality) becomes
| 61 |
where we have used . It can be seen from (61) that can be written
| 62 |
Dynamics for extractive capacity,
The optimal choice of (which determines ) involves corner solutions. If the marginal effect on from an increase in is negative when , then is optimal. The condition for this can be written:
| 63 |
If , we see from (62) that
| 64 |
Using (64) and (63), we see that is the ruler’s optimal choice when , where is given by (25). That is, when , it holds that and .
Next, if the marginal effect on from an increase in is positive when , then is optimal. The condition for this can be written:
| 65 |
Evaluating (62) at gives
| 66 |
Now (65) and (66) show that is optimal when , where is given by (24). That is, when , it holds that and .
An interior solutions for , which can be found when , can be derived from the first-order condition
| 67 |
From (62) and (67) we find an expression for when , namely
| 68 |
Thus, we can write
| 69 |
where we recall that the respective corner solutions coincide with the interior solution when and .
Dynamics for output,
To find the dynamics for output, we use and ; see (23 ). When , we can use the expression for in (64) to write
| 70 |
which gives
| 71 |
When , we see from (66) that
| 72 |
which gives
| 73 |
When , we use the expression for in (68), which applies when . Together with this gives an expression for , which can be substituted into (62) to show that
| 74 |
Using and (74) gives
| 75 |
In sum, (71), (73), and (75) can be written in the same way as in (16), but with replaced by :
| 76 |
and with the following new definitions of D and :
| 77 |
Proof of propositions
Proof of Proposition 1
(a) Let be defined as the level of B that generates . From (17) follows that
| 78 |
where is given by (13). Note from (17) that is increasing in B. By implication, is equivalent to .
(b) Let be defined as the level of B that generates . Recall that imposing steady state in the top row of (16) defines from , which shows that is increasing in B. Setting and gives
| 79 |
where the second equality follows from (12) and (13). Since is increasing in B, it follows that is equivalent to .
(c) Using (78) and (79), and letting things cancel, we can write as
| 80 |
where the equality recalls from (46). Next, using (12) and (13), and dividing both sides by , we can write the inequality in (80) as
| 81 |
which expresses the condition for in terms of the ratio . Letting go to zero (keeping constant), the right-hand side of (81) goes to something strictly positive, while the left-hand side goes to zero. Thus, the inequality in (81) must hold for sufficiently close to zero, implying in turn that holds for sufficiently close to zero.
(d) Part (i): The result follows from (14). The bottom row equals , and is defined as the top row, evaluated at , which can be written:
| 82 |
Part (ii): Given , and the way we defined and in the proof of (a) and (b), we know that the output levels in the two steady states, and , must be such that (since ) and (since ). From (12) and (13) it follows that
| 83 |
implying that (since ).
Part (iii): Using (5) the population levels in the two steady states can be written and , respectively. From (31) follows that
| 84 |
where we have used and ; recall that when , i.e., when binds. Using (31) again, we see that
| 85 |
where we have used and ; recall that when , i.e., when binds. Now (83), (84 ), and (85) together imply that .
Proof of Proposition 2
The expression for follows from imposing steady state on ( 20). The expression for in (21) can be found by solving (57) and (58) for . The expression for is identical to that in the bottom row in (14 ), which was derived by setting in (31) in the benchmark setting.
Proof of Proposition 3
(a) Solving for from the bottom row in (76) gives
| 86 |
Let be defined as the level of B that generates . From (86) follows that
| 87 |
From (86) we see that is increasing in B. By implication, if, and only if, , then .
(b) The top row in (76) gives an implicit definition of :
| 88 |
which shows that is increasing in B. Let be defined as the level of B that generates . Setting and in (88) gives
| 89 |
where the second equality follows from (24) and (25). Since is increasing in B, it follows that is equivalent to .
(c) Using the expression for in (89), we see that
| 90 |
where the last equality uses (87). From (24) and (25) we see that , which implies that the expression in square brackets following the last equality in (90) is greater than one. Thus, .
(d) If the steady state exists, it must be such that . This follows from the assumption , and parts (a) and (b) of the proposition: implies that cannot hold; and implies that cannot hold.
To show that the steady state exists and is unique we derive closed-form expressions for and . Consider the maximization problem in (22) and (23 ) for some given levels of and , such that , meaning the solution to the maximization problem must be interior.
The first-order conditions for and in an interior solution can be written
| 91 |
where we recall that in an interior solution. Together the conditions in (91) give
| 92 |
Imposing steady state, and using super-index “int” to denote steady-state levels, we can now write:
| 93 |
where the top row imposes steady-state on (92), and the middle and bottom rows do the same for and , respectively; see (23). Solving (93) for , and we get
| 94 |
Finally, we derive an expression for the steady-state level of , denoted . To that end, we first rewrite the first-order condition for in (91) as
| 95 |
where we have used and , implying ; recall (23) again. Rearranging (95), and imposing steady state, gives us in terms of :
| 96 |
Next we can use (94) to find that
| 97 |
Substituting (97) into (96) gives
| 98 |
The existence and uniqueness of the steady state is shown by the closed-form expressions for and in (94) and (98). One can also use (94) and (98) to verify that and when .
The claims in (i)–(iv) are confirmed by differentiating the expressions in (94) and (98) with respect to B and .
Empirics
Simulation
Let and be output and extractive capacity, respectively, of society i in period t. The simulation is done by iterating on (15) and (16), given some initial values for and , with , D, , and replaced by , , , and (the time-dependent levels of the same variables), and with B replaced by (the society-specific level of B). Compactly, this can be written as
| 99 |
where , , and are given by (47), (12), and (13), with replacing , and where the functions and are defined in (15) and (16). To determine , first let
| 100 |
denote the mean of in period t across the 200 societies. We then let depend on according to
| 101 |
where is the exogenously given initial value for . Note that and that the weight increases from zero to one as goes from to , implying an increase in by a factor of 30, which is sufficient to ensure that all 200 societies make a full transition.
The values of are uniformly distributed on the interval , given by (78) and (79), with , D, , and replaced by , , , and (see above). That is, and . This implies that all societies exhibit multiple steady states for fixed .
Parameter values are set to , , , , , and . We set to approximately 96.81, targeting to 100.
The initial value for is set to , which together with the parameter values above ensures that .
Initial values for are set at the minimum level, , for all i. Initial levels of for each society are set at the low-extractive steady-state values associated with their respective , i.e., ; see (17).
Since all societies are dropped off in the low-extractive steady-state they stay there until an exogenous shock is introduced at . At that point, the levels of increase from to for every tenth society when ranked by (the first being and the last ). From that point on, all societies follow the dynamical process described be (99) to (101), eventually transitioning to the high-extractive steady state.
Data
The measure of statehood is from Borcan et al. (2018), in turn building on Bockstette et al. (2002). They report a score on the extent of statehood across territories defined by modern countries and by half century, from 3500 BCE until today. This index is based on three different criteria: whether any government above the tribal level was present; whether this government was local or foreign; and how much of the territory of the modern country that was controlled by the government. Here we use the accumulated state index score from 3500 BCE to to 1500 CE or 450 CE. Both endpoints precede European colonization and the change in crop composition following the Columbian exchange.
Countries without statehood before 450 CE and 1000 BCE, respectively, are defined as those with zero state index score from 3500 BCE to that point in time.
Land productivity is measured by the Caloric Suitability Index, which is from Galor and Özak (2016) and available here:
https://ozak.github.io/Caloric-Suitability-Index/
Specifically, we use the country-level measure of mean productivity across crops and locations in a country, excluding non-productive locations, and using only crops available before 1500 CE.
Distances to state origin are obtained by applying the geodist package in Stata to calculate the distance from the centroids of modern country borders to the geo-coordinates of Baghdad and Beijing, respectively. Distance to state origin is the shortest of those two distances.
To measure country borders we use publicly available shapefiles shared through the ESRI/ArcGIS website, downloadable here:
https://www.arcgis.com/home/item.html?id=2ca75003ef9d477fb22db19832c9554f
Latitude and regional dummies are from the same data. Latitude refers to the country centroid.
Footnotes
For example, Hibbs and Olsson (2004, p. 3718) write that “[t]he superior agricultural mode of production made possible specialization of economic activity and the establishment of a non-food producing class devoted to the creation and codification of knowledge and the development of technology.” Diamond (1997, p. 285) writes that “food production [i.e., agriculture] may be organized so as to generate stored food surpluses, which permit economic specialization and social stratification.” In Mann (1986), an oft-cited overview of the literature on early state development, the index lists 26 pages referencing the term “surplus” in various contexts.
Dal Bó et al. (2016) capture the interaction between what we may call productive and defensive capacities, while we here focus on productive and extractive capacities. Conceptually, extractive capacity may here represent the powers of a domestic ruler to tax his own people. By contrast, defensive capacity would rather capture the ability to protect against extraction by external and less benevolent actors.
Besley et al. (2013) set up a dynamic, but non-Malthusian, model of investment in state capacity.
In that case, the ruling collective is assumed to be cohesive enough to act as one agent. It also carries fixed size, meaning each member has one offspring, replacing the (single) parent in the next period.
The effective tax base may correspond to what Scott (2009, p. 73) has called “state-accessible product.”
We could also let include external defense, which is a type of public good. See Sect. 3.6.
One could imagine other interpretations too. Following Carneiro (1970), one may also think of as capturing the degree of environmental circumscription. For example, creating records over tax payers may be easier when their ability to move is limited.
This utility function is chosen for tractability. Another approach would be a dynastic model where the ruler cares about the utility of the next generation. Letting be the ruler’s value function, the associated Bellman equation could then be written , subject to the budget constraints in (11) below.
That is, one can let the production function in (1) be written , where is an exogenous lower bound for productive capacity.
One element that the extended model in the Online Appendices does not capture is an endogenous decision by the potential predator, which can generate a link from output to the probability of theft. For such a model, see Dal Bó et al. (2016).
Note that and are exogenously given, so nothing forces the economy to end up on that saddle path. Put another way, if and were drawn from a joint continuous distribution, then the economy would end up on the saddle path with zero probability.
That is, is given by the bottom row in (14), which does not depend on .
While the (gross) extraction rate is a constant , following Mayshar et al. (2020) we may instead consider the net extraction rate. This is the same as the rate of extraction, , minus the (implicit) cost of extraction, ; cf. Sect. 3.6.5. The net extraction rate here equals just , which is increasing in (since does not depend on ). This is consistent with Proposition B2 in Mayshar et al. (2020, Online Appendices B).
It can be seen that and coincide with the corresponding expressions in the benchmark setting, and , when goes to infinity. That is, and .
One way to see this more formally is to note that the two thresholds for B , above which the high-extractive steady-state exists and the low-extractive one does not, are both decreasing in . These are the ones denoted and , respectively, in Proposition 1.
The small dip in mean extractive capacity for those with the lowest levels of B is due to temporarily falling below in the transition to the high-extractive steady state.
Earlier versions of this paper have been titled “Multiple Steady Statehood” and “Land Productivity and Statehood: The Surplus Theory Revisited.” I thank three anonymous referees for detailed comments. I am also grateful for input from: Oded Galor, Fabio Mariani, Omer Moav, Andreas Irmen, Holger Strulik, David Weil, Balazs Zelity, and participants at presentations that I gave at a May 2018 Growth Lab workshop at Brown University, a June 2018 seminar at the University of Göttingen, and the December 2018 CREA Workshop on Culture and Comparative Development at the University of Luxembourg. This research was supported in part by funding from the Social Sciences and Humanities Research Council. All errors are mine.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Allen RC. Agriculture and the origins of the state in ancient Egypt. Explorations in Economic History. 1997;34:135–154. doi: 10.1006/exeh.1997.0673. [DOI] [Google Scholar]
- Ashraf Q, Galor O. Dynamics and stagnation in the Malthusian epoch. American Economic Review. 2011;101:2003–2041. doi: 10.1257/aer.101.5.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Besley T, Persson T. The origins of state capacity: Property rights, taxation, and politics. American Economic Review. 2009;99:1218–1244. doi: 10.1257/aer.99.4.1218. [DOI] [Google Scholar]
- Besley T, Persson T. Pillars of prosperity: The political economics of development clusters. Princeton University Press; 2011. [Google Scholar]
- Besley T, Ilzetzki E, Persson T. Weak states and steady states: The dynamics of fiscal capacity. American Economic Journal: Macroeconomics. 2013;5:205–235. [Google Scholar]
- Bockstette V, Chanda A, Putterman L. States and markets: The advantage of an early start. Journal of Economic Growth. 2002;7:347–69. doi: 10.1023/A:1020827801137. [DOI] [Google Scholar]
- Borcan O, Olsson O, Putterman L. State history and economic development: Evidence from six millennia. Journal of Economic Growth. 2018;23:1–40. doi: 10.1007/s10887-017-9152-0. [DOI] [Google Scholar]
- Borcan, O., Olsson, O., & Putterman, L. (2020). Transition to agriculture and first state presence: A global analysis. University of Gothenburg working paper in economics no. 741.
- Carneiro RL. A theory of the origin of the state. Science. 1970;169:733–738. doi: 10.1126/science.169.3947.733. [DOI] [PubMed] [Google Scholar]
- Chanda A, Putterman L. Early starts, reversals and catch-up in the process of economic development. Scandinavian Journal of Economics. 2007;109:387–413. doi: 10.1111/j.1467-9442.2007.00497.x. [DOI] [Google Scholar]
- Chanda A, Cook CJ, Putterman L. Persistence of fortune: Accounting for population movements, there was no post-Columbian reversal. American Economic Journal: Macroeconomics. 2014;6:1–28. [Google Scholar]
- Childe, V. G. (1936). Man makes himself. The New American Library, Inc. (Reprinted 1951).
- Childe VG. The urban revolution. The Town Planning Review. 1950;21:3–17. doi: 10.3828/tpr.21.1.k853061t614q42qh. [DOI] [Google Scholar]
- Dal Bó, E., Hernández, P., Mazzuca, S. (2016). The paradox of civilization: pre-institutional sources of security and prosperity, mimeo, UC Berkeley, NYU Abu Dabi, and Johns Hopkins.
- Depetris-Chauvin, E., (2016). State history and contemporary conflict: Evidence from Sub-Saharan Africa, mimeo, Pontificia Universidad Cat ólica de Chile.
- Depetris-Chauvin, E., & Özak, Ö. (2016) Population diversity, division of labor and comparative development, mimeo, Pontificia Universidad Católica de Chile and Southern Methodist University.
- Diamond J. Guns, germs, and steel: The fates of human societies. W.W. Norton & Company; 1997. [Google Scholar]
- Ertman T. Birth of the leviathan: Building states and regimes in medieval and early modern Europe. Cambridge University Press; 1997. [Google Scholar]
- Fenske J. Ecology, trade, and states in pre-colonial Africa. Journal of the European Economic Association. 2014;12:612–640. doi: 10.1111/jeea.12042. [DOI] [Google Scholar]
- Flannery KV. The cultural evolution of civilizations. Annual Review of Ecology and Systematics. 1972;3:399–426. doi: 10.1146/annurev.es.03.110172.002151. [DOI] [Google Scholar]
- Galor O. The 2008 Lawrence R. Klein lecture-comparative economic development: Insights from Unified Growth Theory. International Economic Review. 2010;51:1–44. doi: 10.1111/j.1468-2354.2009.00569.x. [DOI] [Google Scholar]
- Galor O, Özak Ö. The agricultural origins of time preference. American Economic Review. 2016;106:3064–3103. doi: 10.1257/aer.20150020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hariri JG. The autocratic legacy of early statehood. American Political Science Review. 2012;106:471–494. doi: 10.1017/S0003055412000238. [DOI] [Google Scholar]
- Heldring, L., Allen, R. C., & Bertazzin, M. C. (2019). Institutional adaptation to environmental change, mimeo, Institute on Behavior & Inequality (briq), Bonn.
- Hibbs DR, Jr, Olsson O. Geography, biogeography, and why some countries are rich and others are poor. Proceedings of the National Academy of Sciences. 2004;101:3715–3720. doi: 10.1073/pnas.0305531101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lagerlöf N-P. Statehood, democracy and preindustrial development. Journal of Economic Dynamics and Control. 2016;67:58–72. doi: 10.1016/j.jedc.2016.04.001. [DOI] [Google Scholar]
- Lindstrom L. “Big man:” A short terminological history. American Anthropologist. 1981;83:900–905. doi: 10.1525/aa.1981.83.4.02a00140. [DOI] [Google Scholar]
- Litina, A. (2014). The geographical origins of early state formation, CREA Discussion Paper Series 14-28, Center for Research in Economic Analysis, University of Luxembourg.
- Mann M. The sources of social power, volume 1: a history of power from the beginning to AD 1760. 2. Cambridge University Press; 1986. [Google Scholar]
- Mayoral, L., & Olsson, O. (2020). Pharaoh’s cage: Environmental circumscription and appropriability in early state development, mimeo, University of Gothenburg.
- Mayshar J, Moav O, Neeman Z. Geography, transparency, and institutions. American Political Science Review. 2017;111:622–636. doi: 10.1017/S0003055417000132. [DOI] [Google Scholar]
- Mayshar, J., Moav, O., Neeman, Z., & Pascali, L. (2020). The origin of the state: Land productivity or appropriability? Mimeo, Hebrew University of Jerusalem, University of Warwick, Tel-Aviv University, and Pompeu Fabra University.
- Nissen HJ, Heine P. From Mesopotamia to Iraq: A concise history. The University of Chicago Press; 2009. [Google Scholar]
- Putterman L. Agriculture, diffusion and development: Ripple effects of the Neolithic Revolution. Economica. 2008;75:729–748. doi: 10.1111/j.1468-0335.2007.00652.x. [DOI] [Google Scholar]
- Read KE. Leadership and consensus in a New Guinea society. American Anthropologist. 1959;61:425–436. doi: 10.1525/aa.1959.61.3.02a00060. [DOI] [Google Scholar]
- Sahlins MD. Poor man, rich man, big-man, chief: Political types in Melanesia and Polynesia. Comparative Studies in Society and History. 1963;5:285–303. doi: 10.1017/S0010417500001729. [DOI] [Google Scholar]
- Scott JC. The art of not being governed: An anarchist history of Upland Southeast Asia. Yale University Press; 2009. [Google Scholar]
- Scott JC. Against the grain: A deep history of the earliest states. Yale University Press; 2017. [Google Scholar]
- Service, E. R . Origins of the state and civilization: The process of cultural evolution. W.W. Norton & Co; 1975. [Google Scholar]
- Schönholzer D. The origin of the state: Incentive compatible extraction under environmental circumscription, mimeo. IIES, Stockholm University; 2019. [Google Scholar]
- Stasavage D. The decline and rise of democracy: A global history from antiquity to today. Princeton University Press; 2020. [Google Scholar]
- Stevens Curl J, Wilson S. A dictionary of architecture and landscape architecture. 3. Oxford University Press; 2015. [Google Scholar]
- Wittfogel KA. Oriental despotism: A comparative study of total power. Yale University Press; 1957. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


