Skip to main content
Springer logoLink to Springer
. 2021 Mar 13;26(2):113–152. doi: 10.1007/s10887-021-09188-9

Multiple steady statehood: the roles of productive and extractive capacities

Nils-Petter Lagerlöf 1,
PMCID: PMC8550320  PMID: 34720668

Abstract

This paper proposes a model of statehood, defined as elite extraction of resources from a subject population. Different from most of the existing literature, the size of the subject population evolves endogenously in a Malthusian fashion, and the elite take into account the effects on future population levels when taxing the current population. The elite can spend extracted resources by investing in productive and extractive capacities. Productive capacity increases the size of the pie, while extractive capacity makes it easier for the elite to tax it. Together—but not each on its own—these two types of investment can give rise to multiple steady-state equilibria, such that one steady state has both a higher rate of extraction, and higher population density and output, than the other steady state. The model can also account for a positive empirical relationship between land productivity and state antiquity among countries with relatively late state development.

Supplementary Information

The online version contains supplementary material available at 10.1007/s10887-021-09188-9.

Keywords: Malthusian model, Statehood, Multiple steady states

Introduction

For most of its existence the human species has lived in small bands of hunters and gatherers. Organized, complex, and hierarchical social structures—what we often call states—are a relatively recent phenomenon. States emerged gradually from around 3500 BCE, starting in a few corners of the world, in particular Mesopotamia, China, the Nile and Indus River Valleys, Mesoamerica, and the Andes (e.g., Service, 1975, Ch. 1; Borcan et al., 2018). A few millennia earlier, these same regions were also the first to enter the Neolithic Revolution, i.e., develop agriculture.

Many have therefore hypothesized a causal link from the rise of agriculture to statehood. One proposed mechanism has been labelled the Surplus Theory. The idea is that agriculture caused, or allowed, the rise of states by raising output per unit of land, thus creating a “surplus” which could be stored, and then feed a ruling elite. By contrast, in human societies that rely on relatively low-yielding techniques to obtain food, no such elite population can be sustained, since everyone’s labor is needed for procuring food. Variations on this broad explanatory theme can be found in, e.g., Childe (1936, 1950), Allen (1997), Diamond (1997), Hibbs and Olsson (2004), Putterman (2008, Section IV), and Borcan et al. (2020).1

Another mechanism, proposed by Scott (2009, 2017), Mayshar et al. (2017, 2020), has been labelled the Appropriability Theory. This emphasizes the characteristics of new crops that arrived with the Neolithic Revolution, in particular cereals. These were easier to expropriate than foods obtained through gathering or horticulture, specifically tubers. In support of this theory, Mayshar et al. (2020) document that statehood did not arise earlier in locations with higher agricultural yields overall, when controlling for the relative productivity of cereals and tubers. They also make the theoretical point that the Surplus Theory is hard to reconcile with a Malthusian model. This relates to the standard Malthusian result that steady-state incomes per agent are independent of land productivity, implying that the rate of extraction chosen by the elite should also be independent of land productivity.

In this paper we propose a unified Malthusian framework that incorporates some elements of both of these theories. Decisions in this model are made by a ruler, representing an “embryonic” state, and by a continuum of subjects, whose incomes the ruler has some ability to expropriate. [The pre-existence of a ruler is not crucial. Prior to full-fledged statehood, we can think of this agent as a “chief,” or what Sahlins (1963) labelled a “big man.” This is discussed further in Sect. 3.6.] The size of the subject population evolves over time in a Malthusian fashion and depends on how much the (embryonic) ruler extracts.

The extracted resources can be used for the ruler’s own consumption, or for two types of investment. First, he can invest in public goods, or what we call productive capacity. This captures the observation that early states were often instrumental in providing, e.g., irrigation (cf. Wittfogel, 1957; Nissen & Heine, 2009) and external defense (cf. Dal Bó et al., 2016).

Second, the ruler can accumulate power, or capacity, to more easily extract resources in the future. We refer to this as investment in extractive capacity. One example of such investments could be the costly acquisition of knowledge about writing and record keeping, which have been important components of a state’s extractive apparatus (Scott, 2009, pp. 226–234; Stasavage, 2020, pp. 93–96). Another example could be the hiring of skilled administrators (Ertman, 1997, Ch. 1).

Extractive and productive capacities are complementary: expanding production is more valuable when extracting it is easy, and improving extraction is more valuable when there is more to extract. This can give rise to multiple steady-state equilibria: one has low extractive capacity, low rates of extraction, and low levels of land productivity, population density, and output; another has high extractive capacity, high rates of extraction, and high levels of productivity, population density, and output.

The way these steady states differ is a non-trivial insight. The population is denser in the very steady state where it is taxed more heavily, which is surprising given the Malthusian framework. It is the higher productive capacity in the high-extractive steady state that sustains that denser population.

Also, the higher rate of extraction does not follow trivially from a higher level of extractive capacity. Rather, the ruler extracts more to finance investment in future extractive capacity.

As in any model with multiple steady states, shocks can push the economy from one steady state to another. For example, a positive shock to extractive capacity, holding productive capacity constant, can push it from the low-extractive to the high-extractive steady state; a shock to productive capacity can cause the same type of transition, holding extractive capacity constant. In that sense, the workings of the model seem consistent with both the Appropriability and Surplus Theories.

Moreover, we show that multiplicity of steady states hinges on the ruler being able to invest in both extractive and productive capacities; removing either channel renders the steady state unique. In other words, investments in extractive and productive capacities produce richer results together than each of them can on its own.

To explore the empirical relevance of the model, we lean on the complementarity between productive and extractive capacities. This complementarity implies that land productivity should have a greater impact on state building when the return to investing in extractive capacity is higher. That return should arguably depend on how many existing states there are to copy from.

To illustrate this, we consider an extended setting with many societies, and assume that the return to investing in extractive capacity faced by each ruler is increasing with the average level of extractive capacity across all societies. We then simulate the model, and let a few societies experience a positive shock to extractive capacity at some point, which pushes these to the high-extractive steady state. This in turn raises the return to investing in extractive capacity for the remaining societies, among which those with higher land productivity transition into statehood earlier than those with lower land productivity. This generates a positive relationship between land productivity and statehood across societies with late state development, but not among those with early state development. This pattern is consistent with cross-country data for the Eurasian continent.

The rest of this paper is organized as follows. Next, Sect. 2 discusses some of the existing literature. Section 3 sets up the benchmark model, and arrives at its main prediction about multiplicity of steady states. Section 4 then shows how this result falls apart when dropping investment in either extractive or productive capacities. Section 5 presents a simulation and some empirical evidence. Section 6 ends with a concluding discussion.

Existing literature

This paper seeks to contribute to a strand of the economics literature studying early state development. One reason this topic matters to economists is that there seems to be long-lasting effects from early statehood on modern development. For example, Borcan et al. (2018) document that countries with very early and very late statehood tend to have lower GDP/capita levels than those with states of intermediate age. Other studies using earlier installments of the same state antiquity data (e.g., Bockstette et al., 2002; Chanda & Putterman, 2007; Chanda et al., 2014) find a mostly positive relationship. There are also some interesting correlations between early statehood and other modern outcome variables: Hariri (2012) documents that countries with older states are currently less democratic; Depetris-Chauvin (2016) finds links between early statehood and modern conflict in Africa. Theories linking the timing of statehood to democracy and other modern development outcomes include Lagerlöf (2016).

Empirical studies into the origins of statehood often focus on the natural environment as a deep-rooted factor. For example, Fenske (2014), Litina (2014), Depetris-Chauvin and Özak (2016) find that states emerge where ecological conditions promote trade and specialization. Heldring et al. (2019) link state development in the Fertile Crescent from 5000 BCE to shifts in rivers, which they argue induced provision of public goods.

One particularly influential theory of how the environment can induce state building is the so-called circumscription theory by Carneiro (1970), which holds that states tend to emerge where fertile lands are geographically delimited, e.g., by mountains. Recent research has found support for this theory. Schönholzer (2019) documents that states form at locations with locally high agricultural productivity, surrounded by areas with lower productivity. Looking at data from ancient Egypt, Mayoral and Olsson (2020) find that changes over time in the degree of circumscription—defined as the productivity gap between the taxable and non-taxable activity, and induced by variation in rainfall—seems to impact state stability. In our model, we may think of the parameters guiding the accumulation of extractive capacity as factors encompassing the degree of environmental circumscription.

Theories on the emergence of states also often focus on the environment. For example, Dal Bó et al. (2016) and Schönholzer (2019) present models where land productivity, and the degree of geographical circumscription, are drivers of state formation.2 Different from these models our setting is Malthusian, allowing us to study population density as an endogenous outcome.

Using a Malthusian framework should also help address some of the critique against theories linking land productivity to state formation, or what we here label the Surplus Theory. As discussed in Sect. 1, Mayshar et al. (2020) argue that such theories are hard to reconcile with Malthusian population dynamics. This poses a conundrum, given the broad consensus about the relevance of the Malthusian model for preindustrial development (see, e.g., Galor, 2010; Ashraf & Galor 2011). In the Malthusian model presented here, land productivity can indeed affect state building. This hinges on extractive capacity being endogenous: when closing down this channel agricultural productivity no longer has any effect on the rate of extraction, similar to the results of Mayshar et al. (2020, Online Appendices B); see Sect. 4.1 below. Our empirical findings suggest that endogenous extractive capacity may be most relevant when state building is done by copying and learning from existing states. This does not contradict that earlier state building could be better understood from a framework where extractive capacity is exogenous and a function of crop composition, as argued by Mayshar et al. (2020).

Finally, this paper leans on a theoretical literature, starting with Besley and Persson (2009, 2011), on investment in fiscal and legal state capacities; what we here call extractive capacity corresponds closest to fiscal capacity in their jargon. Again, one difference is that we use a Malthusian setting, where population density is endogenous.3

The model

Consider a world with two classes: subjects and what we for simplicity call a “ruler.” The term ruler, and many model assumptions, are discussed further in Sect. 3.6.

The subjects live in overlapping generations for two periods: as passive children and active adults. In the adult phase of life, a subject works, pays taxes, and produces offspring. This means that the size of the subject population evolves endogenously over time, as a function of the ruler’s extraction rate.

The ruler has one single offspring who replaces him in the next period. We refer to him by the singular male pronoun, but this can also be interpreted as a collective of agents (an elite, or proto-elite).4

The ruler decides on the rate at which subjects are taxed, denoted τt. A fraction 1-zt of the taxed (extracted) resources are lost, where zt(0,1]. We refer to zt as extractive capacity. The subjects thus get a fraction 1-τt of total output, the ruler gets a fraction τtzt, while the remainder, τt(1-zt), is lost. As discussed in Sect. 3.6, lost tax revenue can be interpreted as theft by a class of tax collectors.

Since the ruler’s income equals τtztYt, we shall refer to ztYt as the ruler’s effective tax base.5

Production

Output in period t, denoted Yt, is produced with the production function

Yt=(MBAt)αLt1-α, 1

where α is the land share of output, Lt is the size of the subject population, M denotes the size of land (below normalized to one, M=1), and B and At are the two different land productivity factors. We refer to Lt as just population, but since land is normalized to unity, it also measures population density.

The factor B is taken as given by the ruler, and captures time-invariant factors determined by geography, such as the caloric content of the crops that can be grown in a particular environment. By contrast, At depends on productivity-enhancing investment undertaken by the ruler, representing public goods such as irrigation systems, or knowledge. We shall refer to At as productive capacity.6

Extraction and population dynamics

Each subject earns the average product of labor, yt=Yt/Lt=(BAt/Lt)α, which is taxed at rate τt[0,1]. Each subject’s income after tax thus equals (1-τt)yt.

Subjects care about consumption, ctS, and fertility, nt, and utility is given by

UtS=(1-γ~)lnctS+γ~lnnt, 2

where γ~0,1. Each subject takes her income as given and maximizes (2) subject to the budget constraint

ctS=(1-τt)yt-qnt, 3

where q>0 is the cost per child. This gives optimal fertility as

nt=γ(1-τt)yt. 4

where γγ~/q. Since each subject is replaced by nt offspring, the subject population in the next period equals Lt+1=ntLt. Applying (4) and yt=Yt/Lt gives

Lt+1=γ(1-τt)ytLt=γ(1-τt)Yt. 5

The subject population thus constitutes a capital stock to the ruler, in the sense that its size in the next period, Lt+1, decreases with the ruler’s current rate of extraction, τt. Put another way, 1-τt is the fraction of output that the ruler “invests” in the subject population.

Investment in extractive capacity

Let the ruler’s investment in next period’s extractive capacity be denoted xt0, which builds extractive capacity in the next period, zt+1 , at a rate ϕ>0. We let extractive capacity be bounded from above and below at levels z¯ and z_, respectively, such that 0<z_<z¯1 (discussed further in Sect. 3.6 below). More precisely,

zt+1=min{z¯,z_+ϕxt}=z¯ifxtz¯-z_ϕ,z_+ϕxtifxt0,z¯-z_ϕ,z_ifxt=0. 6

The parameter ϕ is a measure of how easy extractive capacity is to build. For now this is treated as exogenous. In Sect. 5 we are going to interpret ϕ as a function of extractive capacity among other societies, the idea being that state building is often done by copying existing states.7

Investment in productive capacity

Consider next investment in productive capacity. We let the cost of At+1 in terms of period-t consumption be ηAt+1σ, where η>0 and σ>1. Assuming σ>1 ensures that output and population converge to constant non-growing levels. The ruler’s budget constraint can now be written

ctR=τtztYt-ηAt+1σ-xt, 7

where ctR is the ruler’s consumption.

Utility

The ruler’s preferences are defined over ctR and the total effective tax base in the next period, zt+1Yt+1, with utility function

UtR=(1-β)lnctR+βln(zt+1Yt+1), 8

where β0,1.8

Discussion

Before we set up the ruler’s maximization problem, it is helpful to scrutinize some of the (implicit and explicit) assumptions in the set-up so far.

Minimum extractive capacity

As mentioned, we assume upper and lower bounds for extractive capacity, denoted z¯ and z_, respectively. The upper bound is not critical and can be set to one, z¯=1. The assumption that z_>0 is more important. If z_=0, then the economy would under certain conditions converge to a steady state with zero population and output, a special case of what we will later call a low-extractive steady state. Intuitively, in that steady state the ruler would have no extractive capacity, and thus lack tax revenue with which to invest in productive capacity, which is necessary for production, and thus for the population to reproduce. Assuming a minimum level of extractive capacity ensures that this steady state has positive population.

There are other ways to avoid the outcome with a vanishing population. For example, one can impose an exogenous lower bound for productive capacity instead.9 However, that type of model would be mechanically similar to the one set up here, the main difference being that a non-negativity constraint on investment in productive capacity would replace that for extractive capacity in the current set-up.

Egalitarianism and the assumed pre-existence of a ruler

The model presumes that a so-called ruler exists, which might ostensibly contradict the idea of an egalitarian social structure from which statehood emerges. Again, this is mostly for simplicity and clarity, and not completely at odds with the stylized facts pertaining to many pre-state societies.

First of all, the ruler does not need to be richer than other agents. The Online Appendices shows that the ruler’s steady-state income can be lower than, or equal to, that of his subjects, if z_ is sufficiently small. What distinguishes the ruler from the subjects is not his income, but rather that he chooses taxes and invests in extractive and productive capacities.

Second, in any economic model where variation in statehood is the endogenous result of a choice, that choice needs to be vested with some agent, whether we call that agent a “ruler” or something else, and whatever the exact choice is. When interpreting the model, we may think of the decision maker more abstractly, standing in for various mechanisms through which pre-state societies solve collective-action problems, e.g., processes involving collaboration and negotiation.

Third, the conjectured presence of some type of ruler may in fact hold true for many quasi-egalitarian and pre-agrarian societies. It is common to categorize the political organization of human societies on a gradient from egalitarian bands, via more unequal tribes and chiefdoms, to fully fledged and highly hierarchical states (Flannery, 1972; Service, 1975; Diamond, 1997). In our model, equilibrium outcomes with low extractive capacity could at least correspond to chiefdoms.

Moreover, some societies at the earlier political stages have also been described as having embryonic rulers, tasked with rudimentary forms of public goods provision. Read (1959) coined the term “big man” for such leader figures among pre-state societies in New Guinea. Sahlins (1963) used the same term to contrast leader figures in Melanesia to those in more politically advanced Polynesian chiefdoms; see Lindstrom (1981) for other terminology used in the literature, such as “head man” and “center man.” Different from rulers of states, these leaders were typically not bestowed their powers through office or inheritance, but rather personal traits (Service, 1975, pp. 49–53). This may correspond to z_ in our model, applying when the preceding ruler did not invest in extractive capacity (by setting xt=0).

Defense against external predators

The variable At is referred to as productive capacity. This may also include defensive (or protective) capacity. Specifically, we could let some fraction of the output be stolen by external predators, and allow the ruler to undertake costly investments to limit that fraction. That setting is explored in the Online Appendices, and shown to boil down to the same one presented here. The main difference is that some of the variables that we here treat as exogenous, such as η and σ, in that setting become functions of the “deep” parameters characterizing the costs of investing in productive and defensive capacities, respectively.

One insight from that model set-up is that land that is less costly to protect corresponds to more productive land in the current setting (i.e., a higher B). Intuitively, resources not needed for protection can be invested in productive capacity instead, which translates to more output at a given level of total investment in defensive and productive capacities. In that sense, we can think of B as a measure not only of land productivity, but also of how well protected output is.10

Tax collectors

We have conceptualized extractive capacity in this model as the fraction of the taxes collected that end up with the ruler, rather than being lost in the process of collecting them.

In order to not restrict ourselves to one single interpretation, we have not explicitly modelled how those tax revenues are lost. The Online Appendices proposes one way to capture that process more explicitly by introducing a new class of agents, called tax collectors. These can run off with the taxes they collect, and the ruler can invest in capacity to retrieve (some of) those lost revenues. The upshot is a model producing the same functional form for accumulation of extractive capacity as that in (6), but with z¯, z_, and ϕ being functions of “deep” model parameters.

Alternative ways to model extractive capacity

There are other ways to model extractive capacity. We can let the ruler face a cost of levying taxes, incurred in the same period they are levied. Then extractive capacity, zt, could be a variable characterizing that cost function, such that a higher zt implies a lower cost of tax collection. This formulation resembles that of Mayshar et al. (2020, Online Appendices B).

Specifically, let the cost of levying a tax rate of τt on total output Yt equal Cτt,ztYt, where Cτt,zt is increasing in the tax rate, τt, and decreasing in zt. Then the ruler’s budget constraint, corresponding to that in (7), becomes

ctR=τt-Cτt,ztYt-ηAt+1σ-xt. 9

Our setting can be seen as a special case of this formulation, where Cτt,zt=τt(1-zt), which makes (9 ) identical to (7). Similarly, what we can call the net tax (or extraction) rate, τt-Cτt,zt, then equals just ztτt, which corresponds more closely to the variable used to measure statehood in Mayshar et al. (2020, Online Appendices B). In our benchmark model both τt and zt are endogenous, while they treat the latter as exogenous.

The ruler’s optimization problem

We are now ready to set up the ruler’s optimization problem. Recall that he chooses τt, xt, and At+1 to maximize (8), subject to (5), (6), (7), (1) forwarded one period, and a non-negativity constraint on xt. More compactly, the problem can be written as follows:

maxτt,xt,At+1(1-β)lnctR+βln(zt+1Yt+1), 10

subject to

xt0,zt+1=min{z¯,z_+ϕxt},ctR=τtztYt-ηAt+1σ-xt,Yt+1=(BAt+1)αLt+11-α,Lt+1=γ(1-τt)Yt. 11

We refer to this as the benchmark model. Its results can be understood from three different trade-offs that the ruler faces. First, higher investment in productive capacity, At+1, generates a larger tax base in the next period (higher Yt+1), at the cost of less consumption for the ruler today (lower ctR).

Second, a higher extraction rate, τt, gives higher income and consumption today (by raising more tax revenue, τtztYt); this comes at the cost of a smaller future tax base (lower Yt+1), in turn due to the Malthusian way in which more extraction reduces the future population size (Lt+1).

Third, investment in future extractive capacity, zt+1, is costly in terms of current consumption.

Due to the assumed linear functional form, and the upper and lower bounds on zt+1, this last trade-off can be seen to generate corner solutions: by setting xt=0, and thus zt+1=z_, the ruler invests nothing in extractive capacity, keeping it at its minimum level; by setting xt=(z¯-z_)/ϕ, and thus zt+1=z¯, the ruler chooses maximum extractive capacity.

The ruler’s investment in future extractive capacity depends on his current effective tax base, ztYt. If this is small, then a marginal increase in τt generates relatively little revenue, thus making it costly to finance investment in extractive capacity. If the effective tax base is small enough it is optimal to set xt=0; if it is sufficiently large, then it is optimal to set xt=(z¯-z_)/ϕ. In that sense, a currently strong and rich state is more likely to remain strong also in the next period. The next section derives explicit expressions for the ruler’s choice variables as functions of the effective tax base and exogenous parameters (with details deferred to Sect. 1 of the Appendices).

The ruler’s optimal choices

Let X_ and X¯ denote the thresholds for ztYt , above and below which the two constraints on zt+1 in (6) bind. That is, xt=0 and zt+1=z_ if ztYtX_; and xt=(z¯-z_)/ϕ and zt+1=z¯ if ztYtX¯. A weak ruler, with a low effective tax base (ztYtX_), finds current extraction costly, making it optimal not to build any future extractive capacity, thus preserving the weak state. A strong ruler, with a large effective tax base (ztYtX¯), finds it easy to extract resources, and chooses to maintain a strong state by investing enough to keep extractive capacity to its maximum, z¯.

As shown in Sect. 1 of the Appendices, these thresholds are given by

X¯=1ϕz¯βσ(1-α)+σ+αββσ-z_, 12

and

X_=1ϕσ(1-αβ)+αββσz_. 13

It is straightforward to show that 0<X_<X¯ follows from 0<z_<z¯.

The ruler’s choices thus depend on how the effective tax base falls relative to these thresholds. Consider first how the ruler sets the rate of extraction. Section 1 of the Appendices shows that the ruler’s optimal extraction rate can be written:

τt=1-βσ(1-α)σ(1-αβ)+αβ1-z¯-z_ϕ1ztYtifztYtX¯,1-βσ(1-α)βσ(1-α)+σ+αβ1+z_ϕztYtifztYtX_,X¯,1-βσ(1-α)σ(1-αβ)+αβ=σ(1-β)+αβσ(1-αβ)+αβifztYtX_. 14

It can be see from (14) that the relationship between τt and ztYt is inversely U-shaped. First, τt is constant for ztYtX_, i.e., when investment in extractive capacity is not operative. This constant rate is the same as in the corresponding model without any investment in extractive capacity (see Sect. 4.1).

We also see that τt is increasing in ztYt for ztYtX_,X¯. Over this interval, rulers respond to marginal increases in the effective tax base (ztYt) by extracting more resources, in order to fund more investment in future extractive capacity. Finally, we see that τt decreases with ztYt for ztYtX¯. Intuitively, the cost of maintaining maximum extractive capacity falls relative to income as the effective tax base grows.

As ztYt approaches infinity, τt approaches the same level as when ztYtX_. However, for any finite level of ztYt, the extraction rate is always higher when the ruler invests the maximum amount in future extractive capacity (ztYtX¯ and zt+1=z¯) than when he invests the minimum amount (ztYtX_ and zt+1=z_). That is, the top row of (14) is always greater than the bottom row, for finite ztYt. This means that any steady state with maximum investment in extractive capacity must have a higher extraction rate than one with no such investment. Below we explore if two such steady states can coexist.

Dynamics

Since the optimal extraction rate in (14) depends on the effective tax base, ztYt, the dynamics of the economy are most easily described in terms of the two state variables Yt and zt.

Dynamics of zt

As shown in Sect. 1 of the Appendices, the ruler’s optimal choice of zt+1 (as implied by the choice of xt) can be written

zt+1=Φ(Yt,zt)z¯ifztYtX¯,βσβσ(1-α)+σ+αβϕztYt+z_ifztYtX_,X¯,z_ifztYtX_. 15

That is, zt+1z_ binds when ztYt<X_, and zt+1z¯ binds when ztYt>X¯. When these constraints are non-binding (i.e., when ztYtX_,X¯) the next period’s extractive capacity (zt+1) increases linearly with the current period’s effective tax base (ztYt). It is also easy to verify that the respective corner solutions coincide with the interior solution when ztYt=X_ and ztYt=X¯.

Dynamics of Yt

From (1) we see that Yt+1=(BAt+1)αLt+11-α , and from (5) we recall that Lt+1=γ(1-τt)Yt. Once we have the ruler’s optimal At+1 and τt in terms of zt and Yt, we can thus derive an expression for Yt+1 in terms of the same state variables. Section 2 of the Appendices shows that

Yt+1=Ψ(Yt,zt,B)κDBαztα-1ϕztYt+z_-z¯ρifztYtX¯,DBαztα-1ϕztYt+z_ρifztYtX_,X¯,κDBαztα-1ϕztYtρifztYtX_, 16

where ρ=(α/σ)+1-α<1, and where D>0 and κ>1 depend only on the exogenous and time-invariant variables α, β , γ, ϕ, σ, and η [see (47) and (54 ) in the Appendices], and play no role for the dynamics.

Note that Yt+1 depends on B, i.e., the land productivity factor that is independent of the ruler’s investment. This has interesting implications for how changes in B impact the dynamic configuration, as discussed below.

Multiple steady states

Now (15) and (16) define a two-dimensional dynamical system for zt and Yt, which is illustrated in the phase diagram in Fig. 1. It shows the loci along which zt and Yt are constant (derived in Sect. 3 of the Appendices), and the regions where the constraints on extractive-capacity investment bind: zt+1z_ binds when ztYt<X_, and zt+1z¯ binds when ztYt>X¯.

Fig. 1.

Fig. 1

Phase diagram illustrating the dynamics. The loci along which zt and Yt are constant are indicated by the red and blue solid curves. The green dashed curves indicate the loci above and below which the constraints zt+1z¯ and zt+1z_ bind. In this configuration, there exist two stable steady states (Color figure online)

Generally, the configuration depends on exogenous variables, in particular B. Figure 1 illustrates a case where there are two locally stable steady-state equilibria, and one unstable. (Exact conditions for this type of configuration are stated in Proposition 1 below.) One stable steady-state equilibrium can be labelled a low-extractive steady state. Here the ruler undertakes no investment in extractive capacity, so zt=z_, and output can be written

Y_=κDBαz_α-1ϕz_ρ11-ρ, 17

which is illustrated in Fig. 1, and derived by setting Yt+1=Yt=Y_ and zt=z_ in the bottom row of ( 16). The associated extraction rate, which we can denote τ_, is given by the bottom row of (14), i.e., τ_=[σ(1-β)+αβ]/[σ(1-αβ)+αβ] . Population is given by (5) as L_=γ(1-τ_)Y_.

The other stable steady state, at which zt=z¯, can be labelled the high-extractive steady state. Here output equals Y¯, defined from Y¯=κDBαz¯α-1ϕz¯Y¯+z_-z¯ρ; cf. the top row of (16). The extraction rate in this steady state, τ¯, is given by the top row of (14 ), setting ztYt=z¯Y¯. From (5), population can be written L¯=γ(1-τ¯)Y¯.

A saddle path separates the phase diagram into two basins of attraction, each associated with one of the two steady states.11 An economy starting off above the saddle path (i.e., with a large initial effective tax base, z0Y0) will converge over time to the high-extractive steady state. An economy starting off below the saddle path converges to the low-extractive steady state.

A trajectory leading to the high-extractive steady state eventually enters a region where ztYt>X¯, at which point the upper bound on extractive capacity investment starts to bind. From there, zt stays constant at z¯, while Yt continues to grow, stabilizing at Y¯, as illustrated in Fig. 1. Similarly, a trajectory leading to the low-extractive steady state eventually enters a region where ztYt<X_, after which zt stays constant at z_, while Yt declines, approaching Y_.

We can also compare levels of population, output, extractive capacity, and rates of extraction in the two steady states. This is a nontrivial exercise, since these are all endogenous and jointly determined. The following proposition summarizes these results, and provides conditions for the existence and uniqueness of each steady state, respectively.

Proposition 1

Consider the model with investment in both productive and extractive capacities, as described by (10) and (11). In this model, there exist B^>0 and B^^>0, such that:

  1. If, and only if, B<B^ does there exist a low-extractive steady state, (z_,Y_), such that z_Y_<X_.

  2. If, and only if, B>B^^ does there exist a high-extractive steady state, (z¯,Y¯), such that z¯Y¯>X¯.

  3. For z_ small enough, it holds that B^^<B^. That is, the low- and the high-extractive steady states coexist for B(B^^,B^).

  4. Assume that B(B^^,B^), so that both steady states exist. Then the following holds:
    • (i)
      The low-extractive steady state has a lower extraction rate than the high-extractive steady state, i.e., τ_<τ¯;
    • (ii)
      The low-extractive steady state has lower output than the high-extractive steady state, i.e., Y_<Y¯;
    • (iii)
      The low-extractive steady state has lower population than the high-extractive steady state, i.e., L_<L¯.

All proofs are in Sect. 5 of the Appendices.

The possibility of multiple steady states is quite intuitive, and has to do with how current extraction affects future extraction. A larger initial level of the effective tax base—i.e., a larger ztYt—induces the ruler to invest more in both zt+1 and Yt+1, leading to a larger effective tax base in the next period. This can sustain high levels of extractive and productive capacities across generations of rulers. As we shall see in Sect. 4 below, investment in productive and extractive capacities are both needed for multiplicity of steady-state equilibria to arise.

The claims in part (d) in Proposition 1, comparing the properties of these steady states, are far less obvious.

For example, part (d) (iii) states that the high-extractive steady state has larger population (density) than the low-extractive one (L_<L¯). This may seem counter-intuitive, since a higher rate of extraction [see (d) (i)] would imply a smaller population for a given level of output; to see this one can impose steady state on (5). The result still holds because output is higher in the high-extractive steady state [see (d) (ii)], in turn due to higher investment in productive capacity, which is sustained by the ruler’s larger tax revenues.

Part (d) (i) of Proposition 1 is not obvious either (despite the ostensibly self-explanatory labels). We gleaned some of the intuition from ( 14). It is not merely about higher extractive capacity inducing a higher rate of extraction. In fact, the rate of extraction in the low-extractive steady state (τ_) is independent of the exogenously given minimum level of extractive capacity (z_).12 In other words, small changes in extractive capacity do not affect the rate of extraction, as long as the economy is not pushed out of the low-extractive steady state. Rather, the result refers specifically to a steady-state comparison. In the high-extractive steady state the ruler chooses a higher rate of extraction to finance investment in future extractive capacity, which is worthwhile precisely because of the large effective tax base in that steady state.

Shocks to zt or Yt As explained above, given a configuration with multiple steady states, such as that in Fig. 1, the economy converges over time to one of the stable steady-state equilibria. Which one it converges to depends on its initial position relative to the saddle-path trajectory leading to the unstable steady state.

This means that an economy can transition from the low-extractive to the high-extractive steady state in the wake of a one-period shock to either extractive capacity (zt), or output (Yt), or a combination of the two. Intuitively, the shock raises the ruler’s effective tax base in period t, inducing him to invest more in productive and/or extractive capacity, possibly putting the economy on a trajectory leading to the high-extractive steady state. For this to happen, the shock must push (zt,Yt) above the threshold saddle path, into the basin of attraction of the high-extractive steady state.

A transition due to a shock to output would be consistent with the Surplus Theory, and could perhaps be interpreted as the result of temporary climatic variations, and/or a temporary phase of good harvests. A transition due to a shock to extractive capacity relates conceptually to the Appropriability Theory.

Exogenous changes to B Above we considered shocks to extractive capacity (zt) or output (Yt). We can also analyze exogenous increases in the geographically determined land productivity factor, B. As shown in Sect. 3 of the Appendices, this shifts up the (Yt+1=Yt)-locus, thus raising output in the low-extractive steady state; note from (17) that Y_ is increasing in B. It also expands the basin of attraction for the high-extractive steady state. At some point the low-extractive steady state ceases to exist. Intuitively, a rise in B implies more output, which in turn can be used to accumulate both productive and extractive capacities.

Changes in B need not be interpreted as shocks. Very gradual increases in B would have small effects at first, but eventually lead to rapid changes in zt and Yt, as the dynamic configuration changes and the high-extractive steady state becomes the unique steady state (i.e., when B exceeds B^). The economy can thus initially change slowly in response to improvements in B, and then go through a rapid spurt in extractive capacity and output, stabilizing at z¯ and Y¯, respectively. From there, output expands more slowly again (as Y¯ is increasing in B).

Closing down channels

In the benchmark model the ruler could invest in both extractive and productive capacities. To see why this matters, we next consider what happens when we close down either of these channels.

Closing down investment in extractive capacity

To remove investment in extractive capacity from the model, we ignore (6), setting xt=0, and let zt equal some exogenous constant, here denoted z~(0,1]. In this setting, an increase in z~ represents a rise in extractive capacity independent of any actions taken by the ruler, conceptually similar to Mayshar et al. (2020, Online Appendices B), who treat extractive capacity as exogenous.

The ruler’s optimization problem now becomes:

maxτt,At+1(1-β)lnctR+βln(z~Yt+1), 18

subject to

ctR=τtz~Yt-ηAt+1σ,Yt+1=(BAt+1)αLt+11-α,Lt+1=γ(1-τt)Yt. 19

The solution to this model resembles that analyzed in the previous section in the case when the non-negativity constraint on xt was binding (xt=0); see Sect. 1 of the Appendices for details. The dynamics of output becomes

Yt+1=GYtρ, 20

where (recall) ρ=(α/σ)+1-α<1, and where G depends on exogenous parameters and is increasing in both agricultural productivity ( B), and extractive capacity (z~); see (60) in the Appendices. The following proposition summarizes the main results in this setting.

Proposition 2

Consider the model without investment in extractive capacity, as described by (18) and (19). In this model, there exists a unique (non-zero) steady-state equilibrium where the following holds: extractive capacity equals its exogenous level, z~; output equals Y~=G1/(1-ρ); and the rate of extraction equals

τ~=σ(1-β)+αβσ(1-αβ)+αβ. 21

Thus, taking investment in extractive capacity out of the model rules out multiplicity of steady states. It can be seen that Y~ is increasing in both B and z~ (since G is), so we do get the expected predictions from increases in both land productivity and extractive capacity; note that extractive capacity still affects tax revenues and thus investment in productive capacity, At+1.

However, optimal τt is here constant. [Indeed, the expression in (21) is the same as in the bottom row in (14), which applies to the benchmark model when xt=0, i.e., ztYt<X_.] Since the extraction rate does not depend on either B or z~, this setting cannot explain the rise of statehood as an endogenous outcome of changes in B and/or z~. In that sense, without investment in extractive capacity the model is inconsistent with both the Surplus and Appropriability Theories.13

Closing down investment in productive capacity

Next we remove investment in productive capacity, setting At=1 in all periods, but keep investment in extractive capacity. The ruler’s budget constraint, analogous to that in (7), becomes ctR=τtztYt-xt. The expression for output in (1) becomes Yt=BαLt1-α.

The ruler’s optimization problem can now be written:

maxτt,xt(1-β)lnctR+βln(zt+1Yt+1), 22

subject to

xt0,zt+1=min{z¯,z_+ϕxt},ctR=τtztYt-xt,Yt+1=BαLt+11-α,Lt+1=γ(1-τt)Yt. 23

This model coincides with that in the benchmark setting in Sect. 3 when σ goes to infinity, i.e., when we make investment in productive capacity prohibitively expensive. Specifically, there are two thresholds for the effective tax base, X_ and X¯, below and above which investment in extractive capacity is constrained to its minimum or maximum levels, respectively. Letting σ go to infinity in (12) and (13), these thresholds can now be written

X¯=1ϕz¯β(1-α)+1β-z_, 24

and

X_=(1-αβ)z_βϕ. 25

That is, if ztYtX_, then zt+1=z_ and xt=0; if ztYtX_, then zt+1=z¯ and xt=(z¯-z_)/ϕ.

The dynamical system describing the evolution of zt and Yt is derived in Sect. 2 of the Appendices, and can also be derived from (15) and (16) by letting σ go to infinity, and setting ρ=1-α. Because the resulting expressions for zt+1 and Yt+1 are so qualitatively similar to those in (15) and (16), we suppress these to the Appendices.

We sum up the main results in the following proposition.

Proposition 3

Consider the model without investment in productive capacity, as described by (22) and (23). In this model, there exist B>0 and B>0, such that:

  1. If, and only if, B<B does there exist a low-extractive steady state, (z_,Y_), such that z_Y_<X_.

  2. If, and only if, B>B does there exist a high-extractive steady state, (z¯,Y¯), such that z¯Y¯>X¯.

  3. B>B. That is, the low- and the high-extractive steady states cannot coexist.

  4. If B(B,B), then there exists a unique steady state, (zint,Yint), such that zintYint(X_,X¯). Furthermore, it holds that:
    • (i)
      The steady-state extraction rate, τint, is increasing in B and ϕ;
    • (ii)
      The steady-state level of extractive capacity rate, zint, is increasing in B and ϕ;
    • (iii)
      The steady-state level of output, Yint, is increasing in B and decreasing in ϕ;
    • (iv)
      The steady-state level of population density, Lint, does not depend on B and is decreasing in ϕ.

Parts (a) and (b) of Proposition 3 are consistent with the corresponding claims in Proposition 1.14 More (less) productive land makes the high-extractive (low-extractive) steady state more likely to exist. This is broadly consistent with the Surplus Theory.

However, part (c) of the proposition shows that multiple steady-state equilibria are not possible in this setting. If land productivity, B, is high enough that the high-extractive steady state exists (meaning B>B), then it is also too high for the low-extractive steady state to exist (since B>B). Intuitively, multiplicity of steady states requires strong enough feedback from current extraction to future extraction, and this feedback is weakened when rulers are not able to invest in productive capacity.

Part (d) takes this point further, by considering the case when B(B,B). Here neither the low- or high-extractive steady state exists. Rather, the economy converges to a unique interior steady state. Interestingly, this steady state has many properties—summarized by parts (i)-(iv) of (d)—that seem inconsistent with the facts. For example, a (small) rise in land productivity, B, leads to a higher steady-state extraction rate and higher levels of extractive capacity, but leaves steady-state population density unchanged. Intuitively, higher land productivity raises population in the usual Malthusian way, but that is counteracted by the higher rate of extraction, and here the net effect is zero. Both those effects were present in the benchmark model, but there higher tax revenues also generated higher investments in productive capacity, which tended to increase steady-state population density. That third channel is closed down here.

Similarly, a rise in ϕ (which, recall, measures how easy it is to build extractive capacity) raises the steady-state extraction rate and extractive capacity, but lowers population density. This implies a negative association between statehood and population density, which is inconsistent with the empirical facts.

Empirical results

The results of the model build on a complementarity between extractive and productive capacities. Intuitively, the possibility of a high-extractive steady state hinges on land productivity affecting the effective tax base and thus investment in future extractive capacity. The implication is that an increase in land productivity, B, is more likely to generate statehood if investments in extractive capacity are easier to undertake, i.e., if ϕ is large.15

We can explore if this holds empirically by comparing the correlation between statehood and land productivity for samples of countries with high and low ϕ. To measure ϕ, we may lean on a literature emphasizing how much easier elites have found it to build a state when they already have a blueprint. For example, the earliest states developed writing and bookkeeping, which were copied by elites developing states later (Scott, 2009, pp. 226–234); Stasavage, 2020, pp. 91–93). Similarly, Ertman (1997, p. 27) argues that European state building became easier at a point when rulers could hire from an existing pool of experts to serve as administrators and in the military. In a multi-society interpretation of our model, this suggests that the return to investing in extractive capacity in one society, as captured by ϕ, could depend on the level of extractive capacity across a range of societies.

To fix ideas, suppose a group of countries have transitioned into statehood in a first wave. Since they did not have any statehood blueprints they faced a very low ϕ, but transitioned anyhow, possibly for reasons not modelled here, and once they have transitioned they are more likely to maintain statehood moving forward (due to the multiplicity of locally stable steady states). The remaining countries, being able to draw on the state knowledge accumulated by the first wave of countries, face a higher ϕ. The complementarity between B and ϕ should then imply that countries in the second wave transition earlier if they have higher B.

A simulation example

To better understand the dynamics of a model where ϕ changes over time, we can first consider a simulation where in each period ϕ is a function of the average level of extractive capacity, zt, across 200 societies. (For details, see Sect. 1 of the Appendices) We let these 200 societies be endowed with different levels of land productivity, B, which is uniformly distributed between the two thresholds discussed in Proposition 1, B^^ and B^. Thus, two steady states exist initially.

All societies start off in a low-extractive steady state, with minimum extractive capacity (z_), but 20 are exogenously hit by a shock at t=40, giving them maximum extractive capacity (z¯). These 20 represent early states, and have levels of B distributed in the same way as among the other 180. (Here we select them as every tenth society when ranked by B, but one can also select them randomly.) Their function in this simulation is to initiate a process through which statehood can spread: the initial rise in average zt raises ϕ, in turn inducing more societies to invest in zt, thus raising ϕ further, creating a self-propelling dynamic.

Figure 2 shows the simulated time paths of the log of zt for three societies out of the 180 not hit by the shock. A higher B is associated with an earlier rise in zt, since higher land productivity induces earlier investments in zt when ϕ starts to rise; the rise in ϕ is in turn driven by the rise in average zt across the 200 societies, shown as a dotted line.

Fig. 2.

Fig. 2

Simulated time paths over 100 periods showing log extractive capacity, ln(zt), for three societies in a setting where ϕ depends on the average level of zt across all societies (shown as a dotted path). These three are all among those societies not hit by a shock to extractive capacity

Some paths in Fig. 2 show a non-monotonic rise (hardly visible unless we log zt), which reflects that the dynamics for a fixed ϕ exhibit two locally stable steady states. Depending on parameter values, not all societies need ever transition into statehood, but in this simulation all 200 societies make the transition within 60 periods. In any given period, societies with higher B have higher levels of zt.

Figure 3 illustrates the cross-sectional relationship between land productivity and a cumulative statehood measure, namely mean extractive capacity over the 100 periods. The 20 societies with the highest levels of statehood are those that experienced a positive shock. By assumption, these have levels of B distributed across the same interval as the remaining 180, and thus show little association between land productivity and state history.16 Among the remainder, however, we see a clear positive relationship between land productivity and mean extractive capacity, such that the highest levels of statehood are found in societies with the highest land productivity.

Fig. 3.

Fig. 3

Plot showing the cross-sectional relationship between land productivity, B, and mean extractive capacity over 100 periods, based on the same simulation as in Fig. 2. Each circle represents one society

Cross-country evidence from Eurasia

Next we explore if this pattern is consistent with cross-country data. We focus on the continent of Eurasia, where most state building has spread from a couple of centers (see discussion below). We use accumulated State Antiquity over different periods from 3500 BCE to 1500 CE from Borcan et al. (2018) to measure statehood (corresponding to mean extractive capacity over time in the simulation). We use the Caloric Suitability Index (CSI) from Galor and Özak (2016) to measure land productivity. (See Sect. 2 of the Appendices for more details about the data.)

Table 1 presents results from regressing State Antiquity on CSI for different subsamples, namely countries which developed statehood before and after different temporal cutoffs. Columns (1)–(3) consider 450 CE, a common benchmark for the end of the classical-age state building era (see, e.g., Mayshar et al., 2020). Columns (4)–(9) consider 1000 BCE, an earlier point at which much fewer countries had begun to develop statehood.

Table 1.

Agricultural productivity and statehood: countries with late and early state development

Dependent variable is State Antiquity over the period
3500 BCE to 1500 CE 450 CE to 1500 CE 3500 BCE to 1500 CE 1000 BCE to 1500 CE 3500 BCE to 450 CE 1000 BCE to 450 CE
(1) (2) (3) (4) (5) (6) (7) (8) (9)
Galor–Özak CSI 30.38** − 90.00** 18.96 69.10*** − 23.24 18.14 28.59** − 44.05 − 3.22
(11.36) (35.33) (11.88) (24.83) (40.90) (30.87) (10.76) (35.65) (26.64)
State Antiquity 3500 BCE-450 CE

0.00

(0.05)

State Antiquity 3500-1000 BCE

0.16

(0.14)

0.17

(0.11)

R2 0.14 0.17 0.09 0.13 0.01 0.07 0.07 0.05 0.11
Number of obs. 23 52 52 56 19 19 56 19 19
Arrival of statehood After 450 CE Before 450 CE Before 450 CE After 1000 BCE Before 1000 BCE Before 1000 BCE After 1000 BCE Before 1000 BCE Before 1000 BCE

Ordinary least squares regressions across Eurasian countries with robust standard errors in parentheses. The dependent variable is accumulated State Antiquity over different periods. The sample is split between countries which developed statehood early and late, respectively: before and after 450 CE in columns (1)–(3), and before and after 1000 BCE in columns (4)–(9)

*p<0.10; **p<0.05; ***p<0.01

Consider first columns (1), (4), and (7) in Table 1, which use samples of countries with relatively late state development. Here we find a positive and significant correlation between the Galor–Özak CSI index and statehood. The relationship among countries with earlier state development in the remaining columns is mostly insignificant, at least when controlling for existing state development up until the cutoff year; see columns (3), (6), and (9). This is consistent with the simulation results in Fig. 3. That is, the relationship between accumulated statehood and land productivity tends to be positive for countries that developed statehood later, and close to zero for those with early statehood.

Figure 4 illustrates the relationship between land productivity and statehood for early and late state developers, using 1000 BCE as cutoff; cf. columns (4) and (5) in Table 1. Note that the pattern is qualitatively similar to the simulated one in Fig. 3.

Fig. 4.

Fig. 4

Plot showing the relationship between statehood and the Galor–Özak CSI index for Eurasian countries that already had some statehood before 1000 BCE, and those that did not

Table 2 explores these cross-country data further when using 1000 BCE as cutoff for late and early state development, but using the full sample of Eurasian countries and instead interacting land productivity with an indicator for late state development. Column (1) first documents a negative but insignificant unconditional relationship between Galor–Özak CSI and statehood. This turns positive and significant in column (2), where we enter a Late Statehood Dummy, equal to one for countries which developed statehood after 1000 BCE. The Late Statehood Dummy itself carries a significant negative coefficient for obvious reasons.

Table 2.

Agricultural productivity and statehood: interacting late statehood with agricultural productivity

Dependent variable is State Antiquity 3500 BCE to 1500 CE
(1) (2) (3) (4) (5) (6)
Galor–Özak CSI − 58.76 46.41** − 23.24 − 42.63 − 42.65 − 40.82
(37.03) (21.86) (39.76) (32.39) (32.67) (35.41)
Late Statehood Dummy

− 1383.62***

(132.09)

− 1912.32***

(338.10)

− 1766.34***

(307.02)

− 1752.91***

(280.72)

− 1736.35***

(296.55)

Late Statehood × Galor–Özak CSI

92.34*

(47.00)

99.02***

(36.23)

98.07***

(34.65)

95.71**

(38.25)

Distance from State Origin

− 0.01

(0.09)

− 0.02

(0.10)

Log Absolute Latitude

− 25.06

(169.25)

R2 0.05 0.61 0.63 0.78 0.78 0.78
Number of obs. 75 75 75 75 75 75
Region fixed effects No No No Yes Yes Yes

Ordinary least squares regressions across Eurasian countries with robust standard errors in parentheses. The dependent variable is accumulated State Antiquity 3500 BCE to 1500 CE. Late Statehood is an indicator for a country not having a state before 1000 CE. Region Fixed Effects is a set of dummies for each of nine regions

*p<0.10; **p<0.05; ***p<0.01

In column (3) we interact the Late Statehood Dummy and the Galor–Özak CSI index. The interaction term comes out as positive and significant just below the 5% level. It stays positive and becomes much more precisely estimated in column (4), where we include region fixed effects. Column (5) also controls for the geodetic distance from country centroids to Baghdad or Beijing, whichever is closest, conjectured centers for state origins in Eurasia. Column (6) adds a control for Log Absolute Latitude. Throughout, the positive coefficient on the interaction term stays significant at the 5% level, or better. In other words, land productivity shows a positive association with statehood among countries that developed statehood later, just as we should expect.

As mentioned, we here focus on the Eurasian continent, since state building did not spread between Eurasia and other continents prior to 1500. When including the Americas, or the rest of the world, the results in Tables 1 and 2 tend to weaken. This seems consistent with the idea that land productivity should matter more when state building tools can be copied or imported more easily.

Anecdotal evidence from Sweden

The data presented above end in 1500 CE, but state building continued after that, in particular in Northern Europe, which lagged behind the continent (cf. Fig. 4). Sweden offers some concrete examples of how rulers of younger states could use tax revenue to import state building after 1500.

As described by Ertman (1997, pp. 313–314), in 1538 Sweden’s first king Gustav I (or Gustav Vasa) hired a German minister, Conrad von Pyhy, to organize its central administration following a template from the Holy Roman Empire. From 1611, Gustavus Adolphus continued state centralization by borrowing from more recent German and Dutch models.

Architecture offers another example. The oldest and most famous castles and monuments from Sweden’s so-called Great Power era in the 17th century were designed by foreign architects, in particular Simon de la Vallée and Nicodemus Tessin the Elder, who acquired their skills on the continent (Stevens Curl & Wilson, 2015). There may be more important (and productive) aspects of state building than castles, but this does illustrate that skills related to state building could indeed be imported.

Concluding remarks

There are many competing explanations of what caused the rise and spread of statehood, or social stratification more generally. The Surplus Theory posits that a non-producing elite could only be supported with a “surplus” supply of food. This surplus, goes the argument, arrived when land productivity rose in the wake of the Neolithic Revolution, i.e., when humans transitioned from food procurement through hunting and gathering to using agriculture. A different theory has been labelled the Appropriability Theory. It holds that the rise of states was rather about the arrival of new crops, which were easier for a ruling elite to confiscate.

This paper has presented a model which incorporates mechanisms related to those emphasized by both the Surplus and Appropriability Theories. A ruler extracts resources from a subject population, the size of which evolves over time in a Malthusian fashion, dependent on the ruler’s rate of extraction. The ruler can invest the extracted resources in what we call extractive and productive capacities. These complement each other in such a way that the model can give rise to multiple steady states holding constant land productivity and other exogenous factors. One steady state has low extractive capacity, a low extraction rate, and low population density and output; the other has high extractive capacity, a high extraction rate, and high population density and output.

Not only can the combination of extractive and productive capacities give rise multiple steady states. This paper has shown that both of these elements are needed for such multiplicity to arise. In that sense, the Surplus and Appropriability Theories, as modelled here, can generate richer theoretical results together than each theory on its own.

To illustrate the empirical relevance of the model we exploit its complementarity between land productivity and the return to state building. Intuitively, countries which develop statehood later are able to draw on the state knowledge accumulated by earlier states, and thus face a higher return to efforts and resources directed towards state building compared to countries which developed statehood from scratch. Therefore, among countries which transition into statehood relatively late, we should expect too see a positive association between land productivity and state antiquity, but not necessarily among earlier states. Evidence from across Eurasian countries supports this prediction.

Supplementary Information

Below is the link to the electronic supplementary material.

Appendices

The ruler’s maximization problem

Finding optimal At+1, zt+1 and τt

First note from (1) and (5) that output in period t+1 can be written

Yt+1=(BAt+1)αγ(1-τt)Yt1-α. 26

Substituting zt+1=z_+ϕxt, (7), and (26) into (8), we can write UtR as a function of At+1, xt, and τt, namely

UtR=(1-β)lnτtztYt-ηAt+1σ-xt+βln(z_+ϕxt)+αβlnAt+1+β(1-α)ln(1-τt)+Ωt 27

where

Ωt=αβlnB+β(1-α)ln(Yt)+β(1-α)lnγ

contains only variables taken as given by the ruler. The problem is to maximize (27) subject to At+10, τt0, τt1, xt0, and xt(z¯-z_)/ϕ; the last two constraints correspond to zt+1z_ and zt+1z¯, respectively.

The first-order conditions for an interior solution state that At+1 and τt satisfy

(1-β)ctR-1ησAt+1σ-1=αβAt+1-1, 28

and

(1-β)ctR-1ztYt=β(1-α)1-τt-1, 29

where ctR=τtztYt-ηAt+1σ-xt; recall (7).

It is straightforward to see that the constraints At+10, τt0, and τt1 never bind, so (28) and (29) always give optimal At+1 and τt for any xt[0,(z¯-z_)/ϕ]. Using (7), (28), and (29) we can solve for ηAt+1σ and 1-τt as follows:

ηAt+1σ=αβσ(1-αβ)+αβztYt-xt, 30
1-τt=βσ(1-α)σ(1-αβ)+αβ1-xtztYt. 31

Also, using (7), (30), and (31) we can write the ruler’s consumption as

ctR=σ(1-β)σ(1-αβ)+αβztYt-xt. 32

Below we use (30) to (32) to find the optimal choices of At+1 and τt for three cases: when xt=0; when xt=(z¯-z_)/ϕ; and when 0<xt<(z¯-z_)/ϕ.

Corner solutions where xt=0

If the marginal effect on UtR from an increase in xt is negative when xt=0, then xt=0 is optimal. This happens when

UtRxtxt=0=-(1-β)τtztYt-ηAt+1σ-1+ϕβz_-1<0. 33

Using (30) and (31) we see that τtztYt-ηAt+1σ is simply the expression for ctR in (32), evaluated at xt=0. Thus, the inequality in (33) can be written

(1-β)σ(1-β)σ(1-αβ)+αβztYt-1>ϕβz_-1, 34

which translates to ztYt<X_, where X_ is given by (13).

It thus follows that if ztYt<X_, then xt=0. Moreover, optimal At+1 and τt can be found by setting xt=0 in (30) and (31). This gives the bottom rows of (14) and (43) below.

Corner solutions where xt=(z¯-z_)/ϕ

If the marginal effect on UtR from an increase in xt is positive when xt=(z¯-z_)/ϕ, then xt=(z¯-z_)/ϕ is optimal. This happens when

UtRxtxt=z¯-z_ϕ=-(1-β)τtztYt-ηAt+1σ-z¯-z_ϕ-1+ϕβz¯-1>0. 35

The expression in square brackets in (35) equals ctR in (32), evaluated at xt=(z¯-z_)/ϕ. Substituted into (35), this gives

(1-β)σ(1-β)σ(1-αβ)+αβztYt-z¯-z_ϕ-1<ϕβz¯-1, 36

or

z¯<σβσ(1-αβ)+αβϕztYt-z¯-z_, 37

which can in turn be simplified to ztYt>X¯, where X¯ is given by (12).

To sum up, if ztYt>X¯, then xt=(z¯-z_)/ϕ and optimal At+1 and τt can be found by setting xt=(z¯-z_)/ϕ in (30) and (31). This gives the top rows of (14) and (43) below.

Interior solutions

Consider next interior solutions for xt, which can be found when ztYt(X_,X¯), and are derived from the first-order condition

(1-β)ctR-1=ϕβz_+ϕxt-1, 38

where ctR is given by (32). We can use (32) and (38) to find optimal xt, but we are rather interested in the associated expression for zt+1. Since zt+1=z_+ϕxt in an interior solution for xt, we can write the first-order condition in (38) as

(1-β)σ(1-β)σ(1-αβ)+αβztYt-zt+1-z_ϕ-1=ϕβzt+1-1. 39

This can be solved for zt+1 to give

zt+1=z_+ϕxt=βσβσ(1-α)+σ+αβϕztYt+z_, 40

which is the middle row of (15).

Using (40), we can also derive the associated solutions for τt and At+1. Dividing (28) by (38), and rearranging, investment in next period’s technology becomes

ηAt+1σ=ασϕz_+ϕxt=αββσ(1-α)+σ+αβztYt+z_ϕ, 41

where the second equality follows from (40). Solving (41) for At+1 gives the middle row in (43) below.

Similarly, dividing (29) by (38), using (40), and rearranging, gives

(1-τt)ztYt=1-αϕz_+ϕxt=βσ(1-α)βσ(1-α)+σ+αβztYt+z_ϕ 42

which can be solved to give the middle row in (14).

Complete characterization of the solution

To sum up, we can write the optimal expressions for τt as in (14), for zt+1 as in (15), and the optimal expression for At+1 can be written

At+1=1ηαβσ(1-αβ)+αβztYt-z¯-z_ϕ1σifztYtX¯,1ηαββσ(1-α)+σ+αβztYt+z_ϕ1σifztYtX_,X¯,1ηαβσ(1-αβ)+αβztYt1σifztYtX_. 43

Dynamics of Yt

This section finds an expression for Yt+1 in terms of Yt and zt (and exogenous variables, such as B). Consider first the case when ztYtX_,X¯, meaning neither of the constraints on zt+1 binds. Using (14), it is then seen that

γ(1-τt)Yt=γϕztβσ(1-α)βσ(1-α)+σ+αβϕztYt+z_. 44

Now (26), (43), and (44) tell us that

Yt+1=Bα1ηϕασαββσ(1-α)+σ+αβασϕztYt+z_ασAt+1α×γϕzt1-αβσ(1-α)βσ(1-α)+σ+αβ1-αϕztYt+z_1-αγ(1-τt)Yt1-α. 45

To simplify this expression, first define

ρ=ασ+1-α(0,1) 46

and

D=αησ(1-α)ασγ1-α1ϕρβσ(1-α)βσ(1-α)+σ+αβρ, 47

where we note that ρ<1 follows from σ>1. Using (46) and (47), we can rewrite (45) more compactly as

Yt+1=DBαztα-1ϕztYt+z_ρ, 48

which is the middle row of (16).

Consider next the case when ztYt>X¯. From (14), it now follows that

γ(1-τt)Yt=γϕztβσ(1-α)σ(1-αβ)+αβϕztYt+z_-z¯. 49

Following similar steps as we followed above for the interior solution, we can use (26), (43), and (49) to show that

Yt+1=D^Bαztα-1ϕztYt+z_-z¯ρ, 50

where

D^=αησ(1-α)ασγ1-α1ϕρβσ(1-α)σ(1-αβ)+αβρ, 51

and (recall) ρ is given by (46).

Finally, for the case when ztYt<X_, we can use (14) again to see that

γ(1-τt)Yt=γβσ(1-α)σ(1-αβ)+αβYt. 52

Applying (26), (43), and (52) some algebra shows that

Yt+1=D^BαztασϕYtρ=D^Bαztα-1ϕztYtρ, 53

where ρ and D^ are given by (46) and (51).

Finally, using (46), (47), and (51) we can define κ as

κ=D^D=βσ(1-α)+σ+αβσ(1-αβ)+αβρ>1. 54

Now, using (50), (53), and substituting for D^=κD, we arrive at the top and bottom rows of (16).

The phase diagram

The (zt+1=zt)-locus

The following can be seen directly from (15): for ztYtX_, it holds that zt+1=zt when zt=z_; for ztYtX¯, it holds that zt+1=zt when zt=z¯; for ztYtX_,X¯ , it holds that zt+1=zt when zt=βσβσ(1-α)+σ+αβϕztYt+z_, or zt=βσz_/{βσ(1-α)+σ+αβ-βσϕYt} . In sum, the (zt+1=zt)-locus can be written

zt=z¯ifztYtX¯,βσz_βσ(1-α)+σ+αβ-βσϕYtifztYtX_,X¯,z_ifztYtX_. 55

The (inverse of) (55) is graphed in Fig. 1 as a three-segment solid blue curve.

The (Yt+1=Yt)-locus

From (16) we learn the following: for ztYtX_, it holds that Yt+1=Yt when Yt=κDBαϕρztα/σ1/(1-ρ) (from using ρ=α/σ+1-α); for ztYtX¯, it holds that Yt+1=Yt when Yt=ξ(zt,B), defined from ξ(zt,B)=κDBαztα-1ϕztξ(zt,B)+z_-z¯ρ; for ztYtX_,X¯, it holds that Yt+1=Yt when Yt=ϑ(zt,B), defined from ϑ(zt,B)=κDBαztα-1ϕztϑ(zt,B)+z_ρ. To summarize, the (Yt+1=Yt)-locus can be written

Yt=ξ(zt,B)ifztYtX¯,ϑ(zt,B)ifztYtX_,X¯,κDBαϕρztασ11-ρifztYtX_. 56

The red solid curves in Fig. 1 show the graphs of the three different segments of the (Yt+1=Yt)-locus in (56).

Change in configuration when changing B

Note that the (zt+1=zt)-locus does not depend on B. It is easy to see, from the definitions above, that ξ(zt,B) and ϑ(zt,B) are strictly increasing in B, that limBξ(zt,B)=limBϑ(zt,B)=, and that ξ(zt,0)=ϑ(zt,0)=0. It follows that we can adjust B to shift the (Yt+1=Yt)-locus to alter the configuration of the two-dimensional dynamical system. When B is sufficiently small the (Yt+1=Yt)- and (zt+1=zt)-loci intersect only once, and this unique intersection lies in the region where ztYt<X_. When B is sufficiently large the two loci also intersect only once, now in the region where ztYt>X¯.

Closing down channels

Closing down investment in extractive capacity

In this setting, the first-order conditions for At+1 and τt can be written

(1-β)τtz~Yt-ηAt+1σ-1ησAt+1σ-1=αβAt+1-1, 57

and

(1-β)τtz~Yt-ηAt+1σ-1z~Yt=β(1-α)1-τt-1. 58

Solving for τt gives the same expression as in the bottom row in ( 14). The expression for ηAt+1σ becomes identical to that in (30), but with xt=0 and zt=z~, i.e.,

ηAt+1σ=αβσ(1-αβ)+αβz~Yt. 59

Using Yt+1=(BAt+1)αLt+11-α and Lt+1=γ(1-τt)Yt, together with the expressions for τt in the bottom row in (14), and At+1 in (59), some algebra shows that Yt+1=GYtρ, where

G=Bααz~ησ(1-α)ασγ1-αβσ(1-α)σ(1-αβ)+αβρ 60

and (recall) ρ=(α/σ)+1-α<1.

Using (51), it can also be seen that G=BαD^ϕρz~ασ, which shows that Yt+1=GYtρ can be derived from (16), setting zt=z~. That is, the dynamics in the model without investment in extractive capacity coincide with those in the benchmark model in the relevant corner solution.

Closing down investment in productive capacity

In the model without investment in productive capacity, the first-order condition for τt (which always holds with equality) becomes

(1-β)τtztYt-xt-1ztYt=β(1-α)1-τt-1. 61

where we have used ctR=τtztYt-xt. It can be seen from (61) that τt can be written

τt=1-β(1-α)1-αβ1-xtztYt. 62
Dynamics for extractive capacity, zt

The optimal choice of xt (which determines zt+1) involves corner solutions. If the marginal effect on UtR from an increase in xt is negative when xt=0, then xt=0 is optimal. The condition for this can be written:

UtRxtxt=0=-(1-β)τtztYt-1+ϕβz_-1<0. 63

If xt=0, we see from (62) that

τt=1-β1-αβ. 64

Using (64) and (63), we see that xt=0 is the ruler’s optimal choice when ztYt<X_, where X_ is given by (25). That is, when ztYt<X_, it holds that xt=0 and zt+1=z_.

Next, if the marginal effect on UtR from an increase in xt is positive when zt+1=z¯, then xt=(z¯-z_)/ϕ is optimal. The condition for this can be written:

UtRxtxt=z¯-z_ϕ=-(1-β)τtztYt-z¯-z_ϕ-1+ϕβz¯-1>0. 65

Evaluating (62) at xt=(z¯-z_)/ϕ gives

τt=1-β(1-α)1-αβ1-1ztYtz¯-z_ϕ. 66

Now (65) and (66) show that xt=(z¯-z_)/ϕ is optimal when ztYt>X¯, where X¯ is given by (24). That is, when ztYt>X¯, it holds that xt=(z¯-z_)/ϕ and zt+1=z¯.

An interior solutions for xt, which can be found when ztYt(X_,X¯), can be derived from the first-order condition

(1-β)τtztYt-xt-1=ϕβz_+ϕxt-1. 67

From (62) and (67) we find an expression for zt+1=z_+ϕxt when ztYt(X_,X¯), namely

zt+1=βϕztYt+z_1+β(1-α). 68

Thus, we can write

zt+1=z¯ifztYtX¯,βϕztYt+z_1+β(1-α)ifztYtX_,X¯,z_ifztYtX_, 69

where we recall that the respective corner solutions coincide with the interior solution when ztYt=X_ and ztYt=X¯.

Dynamics for output, Yt

To find the dynamics for output, we use Yt+1=BαLt+11-α and Lt+1=γ(1-τt)Yt; see (23 ). When ztYt<X¯, we can use the expression for τt in (64) to write

Lt+1=γ(1-τt)Yt=γβ1-α1-αβYt, 70

which gives

Yt+1=Bαγβ1-α1-αβ1-αYt1-α. 71

When ztYt>X¯, we see from (66) that

Lt+1=γ(1-τt)Yt=γβ(1-α)1-αβYt-1ztz¯-z_ϕ, 72

which gives

Yt+1=Bαγβ(1-α)1-αβ1-αYt-1ztz¯-z_ϕ1-α. 73

When ztYt(X_,X¯), we use the expression for zt+1 in (68), which applies when ztYt(X_,X¯). Together with zt+1=z_+ϕxt this gives an expression for xt, which can be substituted into (62) to show that

Lt+1=γ(1-τt)Yt=γβ1-α1+β1-αϕztYt+z_ϕzt. 74

Using Yt+1=BαLt+11-α and (74) gives

Yt+1=Bαγβ1-α1+β1-α1-αϕztYt+z_ϕzt1-α. 75

In sum, (71), (73), and (75) can be written in the same way as in (16), but with ρ replaced by 1-α:

Yt+1=Ψ(Yt,zt,B)κDBαztα-1ϕztYt+z_-z¯1-αifztYtX¯,DBαztα-1ϕztYt+z_1-αifztYtX_,X¯,κDBαϕYt1-αifztYtX_, 76

and with the following new definitions of D and κ:

D=γβ1-αϕ1+β1-α1-α,κ=1+β1-α1-αβ1-α. 77

Proof of propositions

Proof of Proposition 1

(a) Let B^ be defined as the level of B that generates z_Y_=X_. From (17) follows that

B^=1z_X_1-ρκDϕρ1α=1z_X_κDϕX_ρ1α>0, 78

where X_ is given by (13). Note from (17) that Y_ is increasing in B. By implication, B<B^ is equivalent to z_Y_<X_.

(b) Let B^^ be defined as the level of B that generates z¯Y¯=X¯. Recall that imposing steady state in the top row of (16) defines Y¯ from Y¯=κDBz¯α-1ϕz¯Y¯+z_-z¯ρ, which shows that Y¯ is increasing in B. Setting B=B^^ and z¯Y¯=X¯ gives

B^^=1z¯X¯κDϕX¯+z_-z¯ρ1α=1z¯X¯κDϕX_z¯z_ρ1α>0, 79

where the second equality follows from (12) and (13). Since Y¯ is increasing in B, it follows that B>B^^ is equivalent to z¯Y¯>X¯.

(c) Using (78) and (79), and letting things cancel, we can write B^^<B^ as

z_z¯ρ+α=z_z¯1+ασ<X_X¯. 80

where the equality recalls ρ=(α/σ)+1-α from (46). Next, using (12) and (13), and dividing both sides by z_/z¯, we can write the inequality in (80) as

z_z¯ασ<X_/z_X¯/z¯=σ(1-αβ)+αββσ(1-α)+σ+αβ-βσz_z¯, 81

which expresses the condition for B^^<B^ in terms of the ratio z_/z¯<1. Letting z_ go to zero (keeping z¯ constant), the right-hand side of (81) goes to something strictly positive, while the left-hand side goes to zero. Thus, the inequality in (81) must hold for z_ sufficiently close to zero, implying in turn that B^^<B^ holds for z_ sufficiently close to zero.

(d) Part (i): The result follows from (14). The bottom row equals τ_=[σ(1-β)+αβ]/[σ(1-αβ)+αβ], and τ¯ is defined as the top row, evaluated at ztYt=z¯Y¯, which can be written:

τ¯=1-(1-τ_)1-z¯-z_ϕ1z¯Y¯=τ_+z¯-z_ϕ1-τ_z¯Y¯>τ_. 82

Part (ii): Given B(B^^,B^), and the way we defined B^^ and B^ in the proof of (a) and (b), we know that the output levels in the two steady states, Y¯ and Y_, must be such that Y¯>X¯/z¯ (since B>B^^) and Y_<X_/z_ (since B<B^). From (12) and (13) it follows that

X¯z¯-z¯-z_z¯ϕ=X_z_, 83

implying that Y¯>X¯/z¯>X_/z_>Y_ (since z¯>z_).

Part (iii): Using (5) the population levels in the two steady states can be written L¯=γ(1-τ¯)Y¯ and L_=γ(1-τ_)Y_, respectively. From (31) follows that

L¯=γ(1-τ¯)Y¯>γβσ(1-α)σ(1-αβ)+αβX¯z¯-z¯-z_z¯ϕ, 84

where we have used xt=(z¯-z_)/ϕ and Y¯>X¯/z¯; recall that xt=(z¯-z_)/ϕ when z¯Y¯>X¯, i.e., when zt+1z¯ binds. Using (31) again, we see that

L_=γ(1-τ_)Y_<γβσ(1-α)σ(1-αβ)+αβX_z_, 85

where we have used xt=0 and Y_<X_/z_ ; recall that xt=0 when Y_<X_/z_, i.e., when zt+1z_ binds. Now (83), (84 ), and (85) together imply that L¯>L_.

Proof of Proposition 2

The expression for Y~ follows from imposing steady state on ( 20). The expression for τ~ in (21) can be found by solving (57) and (58) for τt. The expression for τ~ is identical to that in the bottom row in (14 ), which was derived by setting xt=0 in (31) in the benchmark setting.

Proof of Proposition 3

(a) Solving for Y_ from the bottom row in (76) gives

Y_=BκDϕ1-α1α. 86

Let B be defined as the level of B that generates z_Y_=X_. From (86) follows that

B=X_z_1κDϕ1-α1α>0. 87

From (86) we see that Y_ is increasing in B. By implication, if, and only if, B<B, then z_Y_<X_.

(b) The top row in (76) gives an implicit definition of Y¯:

Y¯=κDBαz¯α-1ϕz¯Y¯+z_-z¯1-α, 88

which shows that Y¯ is increasing in B. Let B be defined as the level of B that generates z¯Y¯=X¯. Setting B=B and z¯Y¯=X¯ in (88) gives

B=1z¯X¯κDϕX¯+z_-z¯1-α1α=1z¯X¯κDϕX_z¯z_1-α1α>0, 89

where the second equality follows from (24) and (25). Since Y¯ is increasing in B, it follows that B>B is equivalent to z¯Y¯>X¯.

(c) Using the expression for B in (89), we see that

B=1z¯X¯κDϕX_z¯z_1-α1α=X_z_X¯z¯z_X_κDϕ1-α1α=BX¯z¯z_X_1α. 90

where the last equality uses (87). From (24) and (25) we see that X¯/z¯=X_/z_+(z¯-z_)/(ϕz¯)>X_/z_ , which implies that the expression in square brackets following the last equality in (90) is greater than one. Thus, B>B.

(d) If the steady state (zint,Yint) exists, it must be such that zintYint(X_,X¯). This follows from the assumption B(B,B), and parts (a) and (b) of the proposition: B>B implies that zintYint<X_ cannot hold; and B<B implies that zintYint>X¯ cannot hold.

To show that the steady state (zint,Yint) exists and is unique we derive closed-form expressions for zint and Yint. Consider the maximization problem in (22) and (23 ) for some given levels of zt and Yt, such that ztYt(X_,X¯), meaning the solution to the maximization problem must be interior.

The first-order conditions for xt and τt in an interior solution can be written

(1-β)ctR-1=ϕβzt+1-1,(1-β)ctR-1ztYt=β(1-α)(1-τt)-1, 91

where we recall that z_+ϕxt=zt+1 in an interior solution. Together the conditions in (91) give

zt+1=ϕ1-α(1-τt)ztYt. 92

Imposing steady state, and using super-index “int” to denote steady-state levels, we can now write:

(1-τint)Yint=1-αϕ,Lint=γ(1-τint)Yint,Yint=BαLint1-α, 93

where the top row imposes steady-state on (92), and the middle and bottom rows do the same for Lt+1=γ(1-τt)Yt and Yt=BαLt1-α, respectively; see (23). Solving (93) for Lint, Yint and τint we get

Lint=γ1-αϕ,Yint=Bαγ1-αϕ1-α,τint=1-1γ1-α1-αBϕα. 94

Finally, we derive an expression for the steady-state level of zt, denoted zint. To that end, we first rewrite the first-order condition for xt in (91) as

(1-β)τtztYt-zt+1-z_ϕ-1=ϕβzt+1-1, 95

where we have used ctR=τtztYt-xt and zt+1=z_+ϕxt, implying xt=zt+1-z_/ϕ; recall (23) again. Rearranging (95), and imposing steady state, gives us zint in terms of τintYint:

zint=βz_1-βϕτintYint. 96

Next we can use (94) to find that

βϕτintYint=βϕBαγ1-αϕ1-α-1-αϕ=βϕBαγ1-α1-α-β1-α 97

Substituting (97) into (96) gives

zint=βz_1+β(1-α)-βϕBαγ1-α1-α. 98

The existence and uniqueness of the steady state is shown by the closed-form expressions for Yint and zint in (94) and (98). One can also use (94) and (98) to verify that zint(z_,z¯) and zintYint(X_,X¯) when B(B,B).

The claims in (i)–(iv) are confirmed by differentiating the expressions in (94) and (98) with respect to B and ϕ.

Empirics

Simulation

Let Yi,t and zi,t be output and extractive capacity, respectively, of society i in period t. The simulation is done by iterating on (15) and (16), given some initial values for Yi,t and zi,t , with ϕ, D, X_, and X¯ replaced by ϕt, Dt, X_t, and X¯t (the time-dependent levels of the same variables), and with B replaced by Bi (the society-specific level of B). Compactly, this can be written as

zi,t+1=Φ(Yi,t,zi,t;ϕt,Dt,X_t,X¯t),Yi,t+1=Ψ(Yi,t,zi,t,Bi;ϕt,Dt,X_t,X¯t), 99

where Dt, X_t, and X¯t are given by (47), (12), and (13), with ϕt replacing ϕ, and where the functions Φ and Ψ are defined in (15) and (16). To determine ϕt, first let

ztmean=1200i=1200zi,t 100

denote the mean of zi,t in period t across the 200 societies. We then let ϕt depend on ztmean according to

ϕt=1-ztmean-z_z¯-z_ϕ0+ztmean-z_z¯-z_×30ϕ0, 101

where ϕ0 is the exogenously given initial value for ϕt. Note that ztmean[z_,z¯] and that the weight (ztmean-z_)/(z¯-z_) increases from zero to one as ztmean goes from z_ to z¯, implying an increase in ϕt by a factor of 30, which is sufficient to ensure that all 200 societies make a full transition.

The values of Bi are uniformly distributed on the interval (B^^,B^), given by (78) and (79), with ϕ , D, X_, and X¯ replaced by ϕ0, D0, X_0, and X¯0 (see above). That is, B1=B^^ and B200=B^. This implies that all societies exhibit multiple steady states for fixed ϕ0.

Parameter values are set to α=.5, σ=2, β=.95, η=1, z_=.01, and z¯=.99. We set γ to approximately 96.81, targeting D0 to 100.

The initial value for ϕt is set to ϕ0=.01, which together with the parameter values above ensures that B^^<B^.

Initial values for zi,t are set at the minimum level, zi,0=z_=.01, for all i. Initial levels of Yi,t for each society are set at the low-extractive steady-state values associated with their respective Bi, i.e., Yi,0=κD0Biαz_α-1ϕ0z_ρ1/(1-ρ); see (17).

Since all societies are dropped off in the low-extractive steady-state they stay there until an exogenous shock is introduced at t=40. At that point, the levels of zi,t increase from z_=.01 to z¯=.99 for every tenth society when ranked by Bi (the first being i=10 and the last i=191). From that point on, all societies follow the dynamical process described be (99) to (101), eventually transitioning to the high-extractive steady state.

Data

The measure of statehood is from Borcan et al. (2018), in turn building on Bockstette et al. (2002). They report a score on the extent of statehood across territories defined by modern countries and by half century, from 3500 BCE until today. This index is based on three different criteria: whether any government above the tribal level was present; whether this government was local or foreign; and how much of the territory of the modern country that was controlled by the government. Here we use the accumulated state index score from 3500 BCE to to 1500 CE or 450 CE. Both endpoints precede European colonization and the change in crop composition following the Columbian exchange.

Countries without statehood before 450 CE and 1000 BCE, respectively, are defined as those with zero state index score from 3500 BCE to that point in time.

Land productivity is measured by the Caloric Suitability Index, which is from Galor and Özak (2016) and available here:

https://ozak.github.io/Caloric-Suitability-Index/

Specifically, we use the country-level measure of mean productivity across crops and locations in a country, excluding non-productive locations, and using only crops available before 1500 CE.

Distances to state origin are obtained by applying the geodist package in Stata to calculate the distance from the centroids of modern country borders to the geo-coordinates of Baghdad and Beijing, respectively. Distance to state origin is the shortest of those two distances.

To measure country borders we use publicly available shapefiles shared through the ESRI/ArcGIS website, downloadable here:

https://www.arcgis.com/home/item.html?id=2ca75003ef9d477fb22db19832c9554f

Latitude and regional dummies are from the same data. Latitude refers to the country centroid.

Footnotes

1

For example, Hibbs and Olsson (2004, p. 3718) write that “[t]he superior agricultural mode of production made possible specialization of economic activity and the establishment of a non-food producing class devoted to the creation and codification of knowledge and the development of technology.” Diamond (1997, p. 285) writes that “food production [i.e., agriculture] may be organized so as to generate stored food surpluses, which permit economic specialization and social stratification.” In Mann (1986), an oft-cited overview of the literature on early state development, the index lists 26 pages referencing the term “surplus” in various contexts.

2

Dal Bó et al. (2016) capture the interaction between what we may call productive and defensive capacities, while we here focus on productive and extractive capacities. Conceptually, extractive capacity may here represent the powers of a domestic ruler to tax his own people. By contrast, defensive capacity would rather capture the ability to protect against extraction by external and less benevolent actors.

3

Besley et al. (2013) set up a dynamic, but non-Malthusian, model of investment in state capacity.

4

In that case, the ruling collective is assumed to be cohesive enough to act as one agent. It also carries fixed size, meaning each member has one offspring, replacing the (single) parent in the next period.

5

The effective tax base may correspond to what Scott (2009, p. 73) has called “state-accessible product.”

6

We could also let At include external defense, which is a type of public good. See Sect. 3.6.

7

One could imagine other interpretations too. Following Carneiro (1970), one may also think of ϕ as capturing the degree of environmental circumscription. For example, creating records over tax payers may be easier when their ability to move is limited.

8

This utility function is chosen for tractability. Another approach would be a dynastic model where the ruler cares about the utility of the next generation. Letting V(zt,Yt) be the ruler’s value function, the associated Bellman equation could then be written V(zt,Yt)=maxlnctR+βV(zt+1,Yt+1), subject to the budget constraints in (11) below.

9

That is, one can let the production function in (1) be written Yt=(BAt+A_)αLt1-α, where A_ is an exogenous lower bound for productive capacity.

10

One element that the extended model in the Online Appendices does not capture is an endogenous decision by the potential predator, which can generate a link from output to the probability of theft. For such a model, see Dal Bó et al. (2016).

11

Note that Y0 and z0 are exogenously given, so nothing forces the economy to end up on that saddle path. Put another way, if Y0 and z0 were drawn from a joint continuous distribution, then the economy would end up on the saddle path with zero probability.

12

That is, τ_ is given by the bottom row in (14), which does not depend on z_.

13

While the (gross) extraction rate is a constant τ~, following Mayshar et al. (2020) we may instead consider the net extraction rate. This is the same as the rate of extraction, τ~, minus the (implicit) cost of extraction, (1-z~)τ~; cf. Sect. 3.6.5. The net extraction rate here equals just τ~-(1-z~)τ~=z~τ~, which is increasing in z~ (since τ~ does not depend on z~). This is consistent with Proposition B2 in Mayshar et al. (2020, Online Appendices B).

14

It can be seen that B and B coincide with the corresponding expressions in the benchmark setting, B^ and B^^, when σ goes to infinity. That is, limσB^=B and limσB^^=B.

15

One way to see this more formally is to note that the two thresholds for B , above which the high-extractive steady-state exists and the low-extractive one does not, are both decreasing in ϕ. These are the ones denoted B^^ and B^, respectively, in Proposition 1.

16

The small dip in mean extractive capacity for those with the lowest levels of B is due to zt temporarily falling below z¯ in the transition to the high-extractive steady state.

Earlier versions of this paper have been titled “Multiple Steady Statehood” and “Land Productivity and Statehood: The Surplus Theory Revisited.” I thank three anonymous referees for detailed comments. I am also grateful for input from: Oded Galor, Fabio Mariani, Omer Moav, Andreas Irmen, Holger Strulik, David Weil, Balazs Zelity, and participants at presentations that I gave at a May 2018 Growth Lab workshop at Brown University, a June 2018 seminar at the University of Göttingen, and the December 2018 CREA Workshop on Culture and Comparative Development at the University of Luxembourg. This research was supported in part by funding from the Social Sciences and Humanities Research Council. All errors are mine.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. Allen RC. Agriculture and the origins of the state in ancient Egypt. Explorations in Economic History. 1997;34:135–154. doi: 10.1006/exeh.1997.0673. [DOI] [Google Scholar]
  2. Ashraf Q, Galor O. Dynamics and stagnation in the Malthusian epoch. American Economic Review. 2011;101:2003–2041. doi: 10.1257/aer.101.5.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Besley T, Persson T. The origins of state capacity: Property rights, taxation, and politics. American Economic Review. 2009;99:1218–1244. doi: 10.1257/aer.99.4.1218. [DOI] [Google Scholar]
  4. Besley T, Persson T. Pillars of prosperity: The political economics of development clusters. Princeton University Press; 2011. [Google Scholar]
  5. Besley T, Ilzetzki E, Persson T. Weak states and steady states: The dynamics of fiscal capacity. American Economic Journal: Macroeconomics. 2013;5:205–235. [Google Scholar]
  6. Bockstette V, Chanda A, Putterman L. States and markets: The advantage of an early start. Journal of Economic Growth. 2002;7:347–69. doi: 10.1023/A:1020827801137. [DOI] [Google Scholar]
  7. Borcan O, Olsson O, Putterman L. State history and economic development: Evidence from six millennia. Journal of Economic Growth. 2018;23:1–40. doi: 10.1007/s10887-017-9152-0. [DOI] [Google Scholar]
  8. Borcan, O., Olsson, O., & Putterman, L. (2020). Transition to agriculture and first state presence: A global analysis. University of Gothenburg working paper in economics no. 741.
  9. Carneiro RL. A theory of the origin of the state. Science. 1970;169:733–738. doi: 10.1126/science.169.3947.733. [DOI] [PubMed] [Google Scholar]
  10. Chanda A, Putterman L. Early starts, reversals and catch-up in the process of economic development. Scandinavian Journal of Economics. 2007;109:387–413. doi: 10.1111/j.1467-9442.2007.00497.x. [DOI] [Google Scholar]
  11. Chanda A, Cook CJ, Putterman L. Persistence of fortune: Accounting for population movements, there was no post-Columbian reversal. American Economic Journal: Macroeconomics. 2014;6:1–28. [Google Scholar]
  12. Childe, V. G. (1936). Man makes himself. The New American Library, Inc. (Reprinted 1951).
  13. Childe VG. The urban revolution. The Town Planning Review. 1950;21:3–17. doi: 10.3828/tpr.21.1.k853061t614q42qh. [DOI] [Google Scholar]
  14. Dal Bó, E., Hernández, P., Mazzuca, S. (2016). The paradox of civilization: pre-institutional sources of security and prosperity, mimeo, UC Berkeley, NYU Abu Dabi, and Johns Hopkins.
  15. Depetris-Chauvin, E., (2016). State history and contemporary conflict: Evidence from Sub-Saharan Africa, mimeo, Pontificia Universidad Cat ólica de Chile.
  16. Depetris-Chauvin, E., & Özak, Ö. (2016) Population diversity, division of labor and comparative development, mimeo, Pontificia Universidad Católica de Chile and Southern Methodist University.
  17. Diamond J. Guns, germs, and steel: The fates of human societies. W.W. Norton & Company; 1997. [Google Scholar]
  18. Ertman T. Birth of the leviathan: Building states and regimes in medieval and early modern Europe. Cambridge University Press; 1997. [Google Scholar]
  19. Fenske J. Ecology, trade, and states in pre-colonial Africa. Journal of the European Economic Association. 2014;12:612–640. doi: 10.1111/jeea.12042. [DOI] [Google Scholar]
  20. Flannery KV. The cultural evolution of civilizations. Annual Review of Ecology and Systematics. 1972;3:399–426. doi: 10.1146/annurev.es.03.110172.002151. [DOI] [Google Scholar]
  21. Galor O. The 2008 Lawrence R. Klein lecture-comparative economic development: Insights from Unified Growth Theory. International Economic Review. 2010;51:1–44. doi: 10.1111/j.1468-2354.2009.00569.x. [DOI] [Google Scholar]
  22. Galor O, Özak Ö. The agricultural origins of time preference. American Economic Review. 2016;106:3064–3103. doi: 10.1257/aer.20150020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hariri JG. The autocratic legacy of early statehood. American Political Science Review. 2012;106:471–494. doi: 10.1017/S0003055412000238. [DOI] [Google Scholar]
  24. Heldring, L., Allen, R. C., & Bertazzin, M. C. (2019). Institutional adaptation to environmental change, mimeo, Institute on Behavior & Inequality (briq), Bonn.
  25. Hibbs DR, Jr, Olsson O. Geography, biogeography, and why some countries are rich and others are poor. Proceedings of the National Academy of Sciences. 2004;101:3715–3720. doi: 10.1073/pnas.0305531101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Lagerlöf N-P. Statehood, democracy and preindustrial development. Journal of Economic Dynamics and Control. 2016;67:58–72. doi: 10.1016/j.jedc.2016.04.001. [DOI] [Google Scholar]
  27. Lindstrom L. “Big man:” A short terminological history. American Anthropologist. 1981;83:900–905. doi: 10.1525/aa.1981.83.4.02a00140. [DOI] [Google Scholar]
  28. Litina, A. (2014). The geographical origins of early state formation, CREA Discussion Paper Series 14-28, Center for Research in Economic Analysis, University of Luxembourg.
  29. Mann M. The sources of social power, volume 1: a history of power from the beginning to AD 1760. 2. Cambridge University Press; 1986. [Google Scholar]
  30. Mayoral, L., & Olsson, O. (2020). Pharaoh’s cage: Environmental circumscription and appropriability in early state development, mimeo, University of Gothenburg.
  31. Mayshar J, Moav O, Neeman Z. Geography, transparency, and institutions. American Political Science Review. 2017;111:622–636. doi: 10.1017/S0003055417000132. [DOI] [Google Scholar]
  32. Mayshar, J., Moav, O., Neeman, Z., & Pascali, L. (2020). The origin of the state: Land productivity or appropriability? Mimeo, Hebrew University of Jerusalem, University of Warwick, Tel-Aviv University, and Pompeu Fabra University.
  33. Nissen HJ, Heine P. From Mesopotamia to Iraq: A concise history. The University of Chicago Press; 2009. [Google Scholar]
  34. Putterman L. Agriculture, diffusion and development: Ripple effects of the Neolithic Revolution. Economica. 2008;75:729–748. doi: 10.1111/j.1468-0335.2007.00652.x. [DOI] [Google Scholar]
  35. Read KE. Leadership and consensus in a New Guinea society. American Anthropologist. 1959;61:425–436. doi: 10.1525/aa.1959.61.3.02a00060. [DOI] [Google Scholar]
  36. Sahlins MD. Poor man, rich man, big-man, chief: Political types in Melanesia and Polynesia. Comparative Studies in Society and History. 1963;5:285–303. doi: 10.1017/S0010417500001729. [DOI] [Google Scholar]
  37. Scott JC. The art of not being governed: An anarchist history of Upland Southeast Asia. Yale University Press; 2009. [Google Scholar]
  38. Scott JC. Against the grain: A deep history of the earliest states. Yale University Press; 2017. [Google Scholar]
  39. Service, E. R . Origins of the state and civilization: The process of cultural evolution. W.W. Norton & Co; 1975. [Google Scholar]
  40. Schönholzer D. The origin of the state: Incentive compatible extraction under environmental circumscription, mimeo. IIES, Stockholm University; 2019. [Google Scholar]
  41. Stasavage D. The decline and rise of democracy: A global history from antiquity to today. Princeton University Press; 2020. [Google Scholar]
  42. Stevens Curl J, Wilson S. A dictionary of architecture and landscape architecture. 3. Oxford University Press; 2015. [Google Scholar]
  43. Wittfogel KA. Oriental despotism: A comparative study of total power. Yale University Press; 1957. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Journal of Economic Growth (Boston, Mass.) are provided here courtesy of Springer

RESOURCES