Significance
More than 40 y ago, Schelling introduced one of the first agent-based models in the social sciences. The model showed that even if people only have a mild preference for living with neighbors of the same color, complete segregation will occur. This model has been much discussed by social scientists and analyzed by physicists using analogies with spin-1 Ising models and other systems. Here, we study the metapopulation version of the model, which mimics the division of a city into neighborhoods, and we present the first analysis to our knowledge that gives detailed information about the structure of equilibria and explicit formulas for their densities.
Keywords: segregation, large deviations
Abstract
In 1971, Schelling introduced a model in which families move if they have too many neighbors of the opposite type. In this paper, we will consider a metapopulation version of the model in which a city is divided into N neighborhoods, each of which has L houses. There are ρNL red families and ρNL blue families for some ρ < 1/2. Families are happy if there are ≤ρcL families of the opposite type in their neighborhood and unhappy otherwise. Each family moves to each vacant house at rates that depend on their happiness at their current location and that of their destination. Our main result is that if neighborhoods are large, then there are critical values ρb < ρd < ρc, so that for ρ < ρb, the two types are distributed randomly in equilibrium. When ρ > ρb, a new segregated equilibrium appears; for ρb < ρ < ρd, there is bistability, but when ρ increases past ρd the random state is no longer stable. When ρc is small enough, the random state will again be the stationary distribution when ρ is close to 1/2. If so, this is preceded by a region of bistability.
In 1971, Schelling (1) introduced one of the first agent-based models in the social sciences. Families of two types inhabit cells in a finite square, with 25–30% of the squares vacant. Each family has a neighborhood that consists of a 5 × 5 square centered at their location. Schelling used a number of different rules for picking the next family to move, but the most sensible seems to be that we pick a family at random on each step. If the fraction of neighbors of the opposite type is too large, then they move to the closest location that satisfies their constraints. Schelling simulated this and many other variants of this model (using dice and checkers) to argue that if people have a preference for living with those of their own color, the movements of individual families invariably led to complete segregation (2).
As Clark and Fossett (3) explain “The Schelling model was mostly of theoretical interest and was rarely cited until a significant debate about the extent and explanations of residential segregation in US urban areas was engaged in the 1980s and 1990s. To that point, most social scientists offered an explanation that invoked housing discrimination, principally by whites.” At this point Schelling’s article has been cited more than 800 times. For a sampling of results from the social sciences literature, see Fossett’s lengthy survey (4) or other more recent treatments (5–7). About 10 y ago, physicists discovered this model and analyzed a number of variants of it using techniques of statistical mechanics (8–14). However, to our knowledge, the only rigorous work is ref. 15, which studies the 1D model in which the threshold for happiness is ρc = 0.5 and two unhappy families within distance w swap places at rate 1.
Here, we will consider a metapopulation version of Schelling’s model in which there are N neighborhoods that have L houses, but we ignore spatial structure within the neighborhoods and their physical locations. We do this to make the model analytically tractable, but these assumptions are reasonable from a modeling point of view. Many cities in the United States are divided into neighborhoods that have their own identities. In Durham, these neighborhoods have names like Duke Park, Trinity Park, Watts-Hillendale, Duke Forest, Hope Valley, Colony Park, etc. They are often separated by busy roads and have identities that are reinforced by e-mail newsgroups that allow people to easily communicate with everyone in their neighborhood. Because of this, it is the overall composition of the neighborhood that is important not just the people who live next door. In addition, when a family decides to move they can easily relocate anywhere in the city.
Families, which we suppose are indivisible units, come in two types that we call red and blue. There are ρNL of each type, leaving (1 − 2ρ)NL empty houses. This formulation was inspired by Grauwin et al. (16), who studied segregation in a model with one type of individual whose happiness is given by a piecewise linear unimodal function of the density of occupied sites in their neighborhood. To define the rules of movement, we introduce the threshold level ρc such that a neighborhood is happy for a certain type of agent if the fraction of agents of the opposite type is ≤ρc. For each family and empty house, movements occur at rates that depend on the state of the source and destination houses:
where q, r < 1, and ϵ are small, e.g., 0.1 or smaller. Because there are O(NL) vacant houses, dividing the rates by NL makes each family moves at a rate O(1). Because ϵ is small, happy families are very reluctant to move to a neighborhood in which they would be unhappy, whereas unhappy families move at rate 1 to neighborhoods that will make them happy. As we will see later, the equilibrium distribution does not depend on the values of the rates q and r for transitions that do not change a family’s happiness. We do not have an intuitive explanation for this result.
Convergence to a Deterministic Limit
To describe the dynamics more precisely, let ni,j(t), be the number of neighborhoods with i red and j blue families for (i, j) ∈ Ω = {(i, j): i, j ≥ 0, i + j ≤ L}. The configuration of the system at time t can be fully described by the numbers . If one computes infinitesimal means and variances, it is natural to guess (and not hard to prove) that if we keep L fixed and let N → ∞, then converges to a deterministic limit.
Motivated by individual-based models in finance, Remenik (17) proved a general result that takes care of our example. To describe the limit, we need some notation. Let λ(a1, b1; a2, b2) be N times the total rate of movement from one (a1, b1) neighborhood to one (a2, b2) neighborhood. Let be N times the rate at which a movement from one ω1 = (a1, b1) neighborhood to one ω2 = (a2, b2) neighborhood turns the pair ω1, ω2 into . The exact formulas for these quantities are not important, so they are shown in SI Text.
Theorem 1. As N → ∞ the converge in probability to the solution of the ordinary differential equation:
| [1] |
We do not sum over ω′ because its value is determined by ω1 and ω2. The first term comes from the fact that a migration from (i, j) → ω or ω → (i, j) destroys an (i, j) neighborhood, whereas the second reflects the fact that a migration ω → ω′ may create an (i, j) neighborhood at the source or at the destination.
Special Case L = 2
To illustrate the use of Theorem 1, we consider the case L = 2, and let ℓc = [ρcL] be the largest number of neighbors of the opposite type that allows a family to be happy. Here, [x] is the largest integer ≤x. When L = 2, a neighborhood with both types of families must be (1, 1), so the situation in which ℓc ≥ 1 is trivial because there are never any unhappy families. In the case L = 2 and ℓc = 0, it is easy to find the equilibrium because there is detailed balance, i.e., the rate of each transition is exactly balanced by the one in the opposite direction.
After a little algebra (SI Text), we find that this holds if and only if
At first, it may be surprising that the rate r has nothing to do with the fixed point, but if you look at the first two equations you see that the r appears on both sides. The parameter q does not appear either, but when L = 2, it is for the trivial reason that transitions (1, 1)(1, 0) → (1, 0)(1, 1), which occur at rate q, do not change the state of the system.
Using now the fact that the equilibrium must preserve the red and blue densities, we can solve for x and y to conclude that
The argument given here shows that this is the only fixed point that satisfies detailed balance. We prove in SI Text that it is the only fixed point. Because the formulas, which result from solving a quadratic equation, are somewhat complicated, Fig. 1 shows how the equilibrium probabilities νi,j vary as a function of ρ.
Fig. 1.
Equilibrium probabilities νi,j for the case L = 2 plotted against ρ.
Unfortunately, when L ≥ 3, there is no stationary distribution that satisfies detailed balance. One can, of course, solve for the stationary distribution numerically. Fig. 2 shows the limit behavior of the system with L = 20 and ρc = 0.3, i.e., ℓc = 6 for initial densities ρ = 0.1, 0.2, 0.25, and 0.35. In the first two cases, most of the families are happy. In the third situation, the threshold is ℓc = 6, whereas the average number of reds and blues per neighborhood is five, but because fluctuations in the makeup of neighborhoods can lead to unhappiness, there is a tendency toward segregation. In the fourth case, segregation is almost complete, with most neighborhoods having 0 or 1 of the minority type.
Fig. 2.

Limiting behavior of limit differential equation, with ρc = 0.3, ϵ = 0.01, and ρ = 0.1, 0.2, 0.25, and 0.35.
Outline of Our Solution
Finding the stationary distribution requires solving roughly L2/2 equations. To be precise, there are 231 equations when L = 20 and 5,151 when L = 100. In this section, we will adopt a different approach: we concentrate on the evolution of neighborhood 1 and consider the other N − 1 neighborhoods to be its environment, which can be summarized by the following four parameters: (i) the average number of happy red and blue families per neighborhood, and , and (ii) the average number of vacant sites happy for red or blue, and , again per neighborhood.
The first in our solution is to identify the stationary distribution of the neighborhood–environment chain that are self-consistent. That is, if we calculate the expected values of , , , and in equilibrium in neighborhood, they agree with the original parameters. To begin to do this, we divide the state space Ω into four quadrants based on red and blue happiness. Writing 0 for H and 1 for U, we have
If we let Tri(pR, pB) be the trinomial distribution
| [2] |
then inside Qk,ℓ, the detailed balance condition is satisfied by Tri(pR, pB) where
and formulas for αk,ℓ and βk,ℓ are given in SI Text.
Unfortunately, there is no stationary distribution that satisfies detailed balance on the entire state space. To verify this, we note that if the stationary distribution π and the jump rates q satisfy detailed balance π(x)q(x, y) = π(y)q(y, x), then around any closed path x0, x1,…xn = x0 with q(xi−1, xi) > 0 for 1 ≤ i ≤ n, we must have
but this is not satisfied around loops that visit two or more quadrants. To avoid this problem, we cut the connections between the different quadrants and identify the self-consistent distributions for the modified chain. At this point, we can only do this under
Assumption 1. Stationary distributions are symmetric under interchange of red and blue.
The answer given in the next section is a one-parameter family of stationary distributions indexed by a ∈ [0, 1/2]. There, and in the next two steps, we have a pair of results: one for ρ < ρc and one for ρ > ρc.
In the second step, we investigate the flow of probability between quadrants when transitions between the quadrants are restored. The key idea is that the measures in each quadrant are trinomial, so the probabilities will decay exponentially away from the mean (pRL, pBL). This observation implies that the flow between quadrants occurs at rate exp(−cL), which is much smaller than the time, O(1), it takes the probability distributions to reach equilibrium. In words, the process comes to equilibrium on a fast time scale, whereas the parameters change on a much slower one. We will prove this separation of time scales in a version of the paper for a mathematical audience. Here, we will only give the answers that result under
Assumption 2. The process is always in one of self-consistent stationary distributions, but the value of a changes over time.
The third step is to use the stability results to show that the only possible stable equilibria are at the following end points: a = 0, which represents a random distribution; and a = 1/2, which represents a segregated state.
We will call these measures μr and μs when ρ < ρc and and when ρ > ρc. Theorems 4A and 4B describe when they are stable fixed points.
Self-Consistent Stationary Distributions
The results given here are proved in SI Text. The formulas, which again come from solving a quadratic equation, are not simple but they are explicit and easily evaluated.
Theorem 2A. Suppose ρ < ρc. For a ∈ (0, 1/2] let
| [3] |
where
Let ρ1(0, ρ) = lima→0 ρ1(a, ρ) = ρ/[1 − ρ(1 − ϵ)] and for a ∈ [0, 1/2] let
A symmetric distribution μ is self-consistent if and only if it has the form above with parameters ρ1 > ρc, ρ2 = ϵρ1 < ρc, and ρ0 = ρ1/[1 + (1 − ϵ)ρ1] < ρc.
To clarify the last sentence: the definition of ρ1 does not guarantee that the three conditions are satisfied for all values of a ∈ [0, 1/2], so the inequalities are additional conditions. As shown in SI Text, a → ρ1(a, ρ) is increasing, so the range of possible values of ρ1 for a fixed value of ρ is
| [4] |
The possible self-consistent stationary distributions are similar in the second case but the formulas are different.
Theorem 2B. Suppose ρ ≥ ρc. For a ∈ (0, 1/2] let
| [5] |
where
Let , and for a ∈ [0, 1/2] let
A symmetric distribution is self-consistent if and only if it has the form above with parameters , and .
This time a → ρ1(a, ρ) is decreasing, so the range of possible values of ρ1 for a fixed value of ρ is
| [6] |
i.e., the old upper bound on the range of ρ1 in 4 has become the lower bound. See Fig. 3 for a picture.
Fig. 3.
Picture to explain calculation of the phase transition when ρc = 0.2 and ϵ = 0.1. The x axis gives the value of ρ. Dots on the axis are the locations of ρb, ρd, , and . The two curves are ρ1(0, ρ) for ρ < ρc and for ρ ≥ ρc, whereas the straight line is .
Stability Calculations
The results in this section are proved in SI Text. Using large deviations for the trinomial distribution, which in this case is just calculating probabilities using Stirling’s formula, we conclude the following:
Theorem 3A. Suppose ρ < ρc and recall μa has no mass on Q1,1. The flow into Q0,0 from Q0,1 and Q1,0 is larger than the flow out if and only if
| [7] |
Theorem 3B. Suppose ρ ≥ ρc and recall has no mass on Q0,0. The flow out of Q1,1 to Q0,1 and Q1,0 is larger than the flow in if and only if
| [8] |
Phase Transition
Combining Theorems 2A and 3A, we can determine the behavior of the process for ρ < ρc. The set of possible values for ρ1(a, ρ) for a fixed ρ is the interval [ρ1(0, ρ), ρ1(1/2, ρ)] given in 4. Because 0 ≤ ρ ≤ ρc, we are looking for a solution to
with x0 ∈ [0, 2ρc/(1 + ϵ)). In SI Text, we show that x0 exists and is unique. Here, we will concentrate on what happens in the example ρc = 0.2 and ϵ = 0.1. When ρc = 0.2, the interval is [0, 0.4/1.1], and we have x0 = 0.2183.
Let ρb be chosen so that x0 = ρ1(1/2, ρb) and ρd be chosen so that x0 = ρ1(0, ρd). See Fig. 3 for a picture. When a solution x0 exists in the desired interval
In our special case, ρb = 0.1201 and ρd = 0.1825.
Theorem 4A. The stable stationary distributions for ρ < ρc are
Why is this true? When ρ < ρb, ρ0 > ρ1(1/2, ρ), the flow into Q0,0 is always larger than the flow out, so μr is the stationary distribution. When ρb < ρ < ρd, there will be an ac ∈ (0, 1/2) so that ρ1(ac, ρ) = ρ0. The flow into Q0,0 is larger than the flow out when a < ac and the a in the mixture will decrease, whereas for a > ac, the flow out of Q0,0 will be larger than the flow in and a will increase. Thus, we have stable fixed points at 0 and 1/2. When ρd < ρ < ρc, ρ < ρ1 (0, ρ), the flow out of Q0,0 is always larger than the flow in, and the segregated fixed point with a = 1/2 is the stationary distribution.
Using Theorems 2B and 3B, we can determine the behavior of the process for ρ ≥ ρc. The set of possible values for for a fixed ρ is the interval given in 6. Because ρc ≤ ρ ≤ 0.5, we are looking for a solution to
with . In SI Text, we show that there is a solution in the desired interval if and only if , and it is unique. The condition comes from having = in 8 at the right end point 1/(1 + ϵ). When ϵ = 0.1, the condition is ρc < 0.25964.
When ρc = 0.2 and ϵ = 0.1, this interval is [0.4/1.1, 1/1.1], and . Let be chosen so that and be chosen so that . When a solution exists in the desired interval, we have
In our example, and .
Theorem 4B. The stable stationary distributions for ρ ≥ ρc are
The reasoning behind this result is the same as for Theorem 4A. To show that these result can be used to explicitly describe the phase transition, Fig. 4 shows how the four critical values depend on ρc when ϵ = 0.1.
Fig. 4.
as a function of ρc when ϵ = 0.1.
Discussion
Here, we considered a metapopulation version of Schelling’s model, which we believe is a better model for studying the dynamics of segregation in a city than a nearest neighborhood interaction on the 2D square lattice. Due to the simple structure of the model, we are able to describe the phase transition in great detail. For ρ < ρb, a random distribution of families μr = Tri(ρ, ρ) is the unique stationary distribution. As ρ increases there is a discontinuous phase transition to a segregated state, μs at ρd preceded by an interval (ρb, ρd), in which both μr and μs are stable. Surprisingly the phase transition occurs to a segregated state occurs at a value ρd < ρc, i.e., at a point where in a random distribution most families are happy. This shift in behavior occurs because random fluctuations create segregated neighborhoods, which, as our analysis shows, are more stable than the random ones.
If ρc is small enough, then as ρ nears 1/2, there is another discontinuous transition at , which returns the equilibrium to the random state , and this is preceded by an interval of bistability. To explain this, we note that when families are distributed randomly, everyone is unhappy and moves at rate 1, maintaining the random distribution. In our concrete example, ρc = 0.2 and ϵ = 0.1, the fraction of vacant houses at is only 4.04%, so it is very difficult to make segregated neighborhoods where one type is happy. Our stability analysis implies that these segregated neighborhoods are created at a slower rate than they are lost, so the random state prevails.
The results in this paper have been derived under two assumptions: (i) stationary distributions are invariant under interchange of red and blue and (ii) the process is always in one of a one-parameter family of self-consistent stationary distributions indexed by a ∈ [0,1/2], but the value of a changes over time. We are confident that ii can be rigorously proved. Removing i will be more difficult, because when symmetry is dropped, there is a two-parameter family of self-consistent distributions. A more interesting problem, which is important for applications to real cities, is to allow the initial densities of reds and blues and their threshold for happiness to differ. Although our solution is not yet complete, we believe it is an important first step in obtaining a detailed understanding of the equilibrium behavior of Schelling’s model in a situation that is relevant for applications.
Supplementary Material
Acknowledgments
We thank David Aldous, Nicholas Lanchier, Simon Levin, and Thomas Liggett for helpful comments. R.D. and Y.Z. were partially supported by Grants DMS 10-05470 and DMS13-05997 from the probability program at the National Science Foundation.
Footnotes
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1414915111/-/DCSupplemental.
References
- 1.Schelling TC. Dynamic models of segregation. J Math Sociol. 1971;1(2):143–186. [Google Scholar]
- 2.Schelling TC. Micromotives and Macrobehavior. New York: Norton; 1978. [Google Scholar]
- 3.Clark WAV, Fossett M. Understanding the social context of the Schelling segregation model. Proc Natl Acad Sci USA. 2008;105(11):4109–4114. doi: 10.1073/pnas.0708155105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Fossett M. Ethnic preferences, social science dynamics, and residential segregation: Theoretical explanations using simulation analysis. J Math Sociol. 2006;30(3-4):185–274. [Google Scholar]
- 5.Pancs R, Vriend NJ. Schelling’s spatial proximity model of segregation revisited. J Public Econ. 2007;91(1-2):1–24. [Google Scholar]
- 6.Kandler A, Perreault C, Steele J. 2012. Cultural evolution in spatially structured populations: A review of alternative modeling frameworks. Adv Complex Syst 15:1203001.
- 7.Hatna E, Benenson I. The Schelling model of ethnic residential dyanmics: Beyon the integrated-segregation dichotomy of patterns. J Artif Soc Soc Simul. 2009;15(1):6. [Google Scholar]
- 8.Vinkovic D, Kirman A. A physical analogue of the Schelling model. Proc Natl Acad Sci USA. 2006;103(51):19261–19265. doi: 10.1073/pnas.0609371103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Stauffer D, Solomon S. Ising, Schelling and self-organizing segregation. Eur Phys J B. 2007;57(4):473–479. [Google Scholar]
- 10.Singh A, Vainchtein D, Weiss H. 2007. Schelling’s segregation model: Parameters, scaling, and aggregation. arXiv:0711.2212.
- 11.Dall’Asta L, Castellano C, Marsili M. 2008. Statistical physics of the Schelling model of segregation. J Stat Mechanics: Theory and Experiment 7:L07002.
- 12.Gauvin L, Vannimenus J, Nadal J-P. Phase diagram of a Schelling segregation model. Eur Phys J B. 2009;70(2):293–304. [Google Scholar]
- 13.Rogers T, McKane AJ. 2011. A unified framework for Schelling’s model of segregation. J Stat Mechanics: Theory and Experiment 7:P07006.
- 14.Domic NG, Goles E, Rica S. Dynamics and complexity of the Schelling segregation model. Phys Rev E. 2011;83(5):056111. doi: 10.1103/PhysRevE.83.056111. [DOI] [PubMed] [Google Scholar]
- 15.Brandt C, Immorlica N, Kamath G, Kleinberg R. 2012. An analysis of one-dimensional Schelling segregation. arXiv:1203.6346.
- 16.Grauwin S, Bertin E, Lemoy R, Jensen P. Competition between collective and individual dynamics. Proc Natl Acad Sci USA. 2009;106(49):20622–20626. doi: 10.1073/pnas.0906263106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Remenik D. Limit theorems for individual-based models in economics and finance. Stoch Proc Appl. 2009;119(8):2401–2435. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.



