Travel in city road networks follows similar transport trade-off principles to neural and plant arbors

Jonathan Y Suen; Saket Navlakha

doi:10.1098/rsif.2019.0041

. 2019 May 15;16(154):20190041. doi: 10.1098/rsif.2019.0041

Travel in city road networks follows similar transport trade-off principles to neural and plant arbors

Jonathan Y Suen ^1,^✉, Saket Navlakha ¹

PMCID: PMC6544892 PMID: 31088262

Abstract

Both engineered and biological transportation networks face trade-offs in their design. Network users desire to quickly get from one location in the network to another, whereas network planners need to minimize costs in building infrastructure. Here, we use the theory of Pareto optimality to study this design trade-off in the road networks of 101 cities, with wide-ranging population sizes, land areas and geographies. Using a simple one parameter trade-off function, we find that most cities lie near the Pareto front and are significantly closer to the front than expected by alternate design structures. To account for other optimization dimensions or constraints that may be important (e.g. traffic congestion, geography), we performed a higher-order Pareto optimality analysis and found that most cities analysed lie within a region of design space bounded by only four archetypal cities. The trade-offs studied here are also faced and well-optimized by two biological transport networks—neural arbors in the brain and branching architectures of plant shoots—suggesting similar design principles across some biological and engineered transport systems.

Keywords: network theory, Pareto optimality, transport networks, biological networks

1. Introduction

Networks face trade-offs between conflicting goals. In engineering, this trade-off is often between the performance of the network, benefiting individual users, and the costs of constructing the network. Users desire to transport a payload, such as information or a physical object, at an unlimited capacity and at the fastest speed possible. Conversely, a network fulfilling these goals is expensive to construct, consuming significant resources, such as money, space and energy. Thus, network designers seek to build networks that are efficient, maximizing benefits provided by allocated resources.

Nearly all engineered networks are subject to this trade-off. For example, a data network, like the Internet, would need dedicated links between every pair of users to achieve maximum performance, since high bandwidth and low latency are desired by the user. To the network provider, however, the number and length of physical links impose major costs [1,2]. Similar trade-offs are also faced by water pipeline networks [3], power grids [4] and wireless sensor networks [5]. In practice, most networks are not likely to be optimally efficient due to constraints (e.g. mountains that necessitate a circuitous path) or difficulty in modifying or pruning certain parts of the network in response to changing user demands. There may also be other factors to optimize in the network, such as reliability and robustness [2].

Here, we study travel paths in metropolitan area road networks, where users in suburban locations desire to commute downtown. We make the following contributions:

(1) We collected travel paths for 11 615 users over 101 cities worldwide, with metro area populations ranging from 7000 to 38 million, with city proper land areas ranging from 5 to 16 400 km², and with a variety of geographical constraints, including rivers and bays, mountainous terrain and geopolitical boundaries (figure 1a).
(2) We study the trade-off between travel distance (user performance) and total network length (road cost). We use a network design algorithm to generate near-Pareto optimal topologies that balance between these two competing objectives based on a single parameter (α) denoting how the two objectives are weighted against each other. We find that many, though not all, cities lie near the Pareto front with a surprisingly tight range of α values. This indicates similar balancing of the two objectives across cities, despite the broad diversity in population, city size and region.
(3) We find that when three additional time-related optimization measures (speed, congestion and number of manoeuvres) are added to travel distance and network length, the dimensionality of the data increases by only one, i.e. most cities lie within a three-dimensional subspace. The polytope that bounds all cities within this subspace has four corners, representing four archetypical cities: Jerusalem, Juba, Amarillo and Delhi. These cities formed extremum in trade-offs, some favouring optimizing network length (Jerusalem and Juba), optimizing travel distance (Amarillo and Delhi) and optimizing travel speed (Amarillo), while also performing poorly in handling congestion (Delhi) and minimizing travel distances (Jerusalem). Our results suggest a certain simplicity to the structure of global travel paths that have emerged through various growth processes.

Figure 1. — Pareto optimality analysis of travel paths. (a) Travel paths from randomly distributed points in Beijing to a central downtown location (Tiananmen Square), as provided by Google Maps. (b) The Pareto front that optimally trades-off construction costs (α = 1) and travel distance (α = 0). The variable α defines the relative weights between the two competing goals. (c) Example cities that lie close (Ithaca) and far (Abbottabad) to the Pareto front. The distance to the Pareto front is calculated using a variable ε. Cities with ε > 1 require greater construction costs and/or travel distance than a Pareto-optimal city. This diagram is only to illustrate how ε can vary across cities; in reality, different road networks likely have different Pareto fronts. (Online version in colour.)

We conclude by discussing analogous trade-offs faced in biological networks, and how the study of one may shed light on the other.

2. Related work

Prior work has largely focused on analysing the structure of road networks as graphs, or on developing highly parameterized models of traffic flow and routing. Our focus instead is on quantifying the trade-off between user performance and network length in road structures using travel path data from numerous cities.

2.1. Analysing road structures as graphs

Graphs are a common abstraction used to analyse the structure of road networks [6,7]. Such analyses have shown, for example, that roads form scale-free networks [8] and display self-similarity [9]. Others have studied the hierarchy of road networks, ranging from slow residential streets with many intersections to large controlled-access highways. These works studied the structural properties (e.g. ‘beltness’, ‘treeness’, etc.) used in each layer of the hierarchy, and the interfaces between layers [7]. Prior works have also studied how to optimally redesign existing road structures by integrating data on connectivity, broad geography and road usage statistics [10]. However, there are many external factors that affect the design and routing of roads, especially on a sub-kilometre scale. This includes local terrain (small hills, ravines and subsurface geology) [11], environmental concerns (noise, pollution), land acquisition costs, and political factors [12], among others. While broad scale geography, at 1 km resolution, has been analysed to determine where roads should be built [13], it is difficult to gather the data necessary to generate a representative model on a city level.

In contrast to these works, we derive a measure to describe the network not in a strictly topological or structural sense, but rather in terms of a user-network trade-off. The focus of this trade-off is on travel paths, which describe how the underlying roads are used to get from one location in the network to another, while implicitly taking road type into account. In contrast to previous studies that only study one road type, or that treat all road types as identical [7–9,14,15], our travel paths consider residential streets, central arteries and freeways together; we are also able to study more rural and lesser-developed cities, which may not have larger types of roads. Previous studies using travel paths have focused on morphological features, such as ‘inness’ [16], but have not studied the user-network trade-off. The user optimality of small-scale non-road networks has been explored before [17], but has not been applied to the continuum of network topologies between user optimal and network optimal configurations.

2.2. Models of traffic flow and routing

There are many features important to road travel that are not captured by static analysis of road graphs. To get from one location in the network to another, a user will often travel on different types of roads, such as surface streets, arteries and highways. Determining realistic travel paths given a map is thus often done by either applying a routing algorithm [18], or by collecting empirical data [19]. The decision process for a user to select between these types of roads has been modelled in terms of a continuous flow on a discrete network [20], or as an empirical factor based on probabilistic processes [18]. Accurate routing algorithms must also be privy to other roadway characteristics such as intersection types (e.g. a two-way versus a four-way stop) and utilization (e.g. unprotected turns against traffic), and the effects of varying capacity, speed limits and traffic congestion. However, these factors can vary significantly based on local environmental factors and may not be apparent from large-scale mapping data [21].

Routing algorithms are further complicated in that a user would like to reach the destination in both the fastest time and shortest travel distance possible. When this is not possible, users are willing to trade-off some additional distance travelled for a reduction in travel time, as exhibited by drivers travelling a longer route to avoid traffic congestion and delay at intersections [22]. In routing algorithms, this trade-off is described by a variable equal to the assumed cost per minute of travel time divided by the assumed cost per kilometre travelled (sometimes called the cost index [23]). Other studies have analysed how travel times can be decreased if vehicles were routed using centrally determined, system-optimal paths rather than user-optimal paths [24], whereas our more realistic scenarios assume ordinary greedy driver behaviour.

Rather than developing a complex traffic model and routing algorithm for each city, or limiting our study to certain model structures [20], we used the routing algorithms developed by Google Maps [25]. Though we treat Google Maps as a ‘black box’, the travel paths suggested by Google Maps are followed by millions of users every day and are generally regarded as accurate. Notably, Google Maps has a traffic congestion and travel time model based on crowdsourced empirical data. The unmatched scale and quality of Google Maps allows us to sample long routes over hundreds of square kilometres, across 101 global cities, in a uniform manner and more accurately than theoretical models.

2.3. Relationship to biological network design and routing

Our study is also motivated by similar network design trade-offs that occur in biology. For example, Conn et al. [26] showed that plant architectures can also be viewed as transport networks, used to shuttle nutrients from one organ of the plant to another. These networks also must straddle between optimizing for fast nutrient transport (user performance) while also balancing for costs in building the architecture (network cost). Similarly, Budd et al. [27] and Cuntz et al. [28] showed that neural arbors (dendrites and axons) can also be viewed as transport networks used to shuttle information (action potentials or signals) from one neuron to another. These networks must balance between the distance or time required to route the signal along the arbor (user performance) versus the amount of wiring needed to realize a specific topology given a space-constrained brain (network cost). Prior work has suggested that both of these systems have evolved mechanisms to generate Pareto-optimal topologies [29]. Our goal here is to test these ideas in human engineered road networks that have likely evolved using different mechanisms.

2.4. Additional work in Pareto optimality of complex networks

Seoane and Sole [30,31] study a trade-off between network efficiency and cost using Pareto optimization, and they study how these competing forces can lead to phase transitions in design space, also called network morphospace [32]. Our work studies a specific example of this morphospace as instantiated by travel paths in cities. Moses et al. [33] study trade-offs between minimizing delivery time and energy dissipation required for delivery of a resource, and they find scaling principles between the two in vascular networks and computer chip designs. Thus, while the idea of studying network design trade-offs across engineered and biological contexts is not new, our study instantiates these ideas in a specific domain of single-source transport systems.

3. Results

First, we describe a framework based on the theory of Pareto optimality to study user-performance trade-offs in road network travel paths. Second, we analyse travel paths in 101 cities and show that most fall significantly closer to the Pareto front than expected with surprisingly little variability in how different cities trade-off competing objectives. Third, we analyse the travel paths using a higher-order Pareto optimization and show that all cities can be bounded within a small region of trait space, exemplified by four archetype cities.

3.1. A framework for quantifying user-network trade-offs in road structures

The idea of the Pareto front is often used in engineering and economics to find optimal trade-offs between competing objectives [34–36]. This theory has had a long history of being used to study trade-offs in biological systems [37–40], for which evolution and natural selection are hypothesized to optimize biological function with respect to competing demands. Intuitively, consider two competing objectives, o₁ and o₂. If some solution a₁ outperforms solution a₂ on both objectives, then a₁ is said to ‘dominate’ a₂, leading to a₂ being removed from the population. Repeating this argument for all possible solutions, the only solutions that remain lie along the Pareto front of o₁ and o₂. This front is defined as the set of solutions where increasing the performance on one objective necessarily results in a loss in performance on the other objective (figure 1b)

In our case, we are given as input a set of points, $P = {p_{1}, p_{2}, \dots, p_{n}}$ , corresponding to the 2D locations of n random users in a city; and a root point r, corresponding to the location of downtown, where the users seek to travel to (Methods). Our goal is to output a tree G = (V, E), where $V = P \cup {r}$ , and E are the set of edges connecting the nodes in V.

The first objective is to minimize the road cost (network length), which is equal to the sum of all the edge lengths:

C (G) = \sum_{(u, v) \in E} d (u, v),

3.1

where d(u, v) is the Euclidean distance between u and v. Ideally, all points in $P$ would have the same position relative to downtown for all cities. This was not possible because some points for a given city may lie in bodies of water or uninhabited areas (Methods). We thus normalized C(G) by the mean straight-line distance between r and each $p_{i} \in P$ .

The second objective is to minimize the user travel distances, which is equal to the sum of the distances from each point to the root (downtown):

T (G) = \sum_{i} l (G, p_{i}, r),

3.2

where l(G, p_i, r) is the distance to travel from p_i to r along the edges in G. Similar to the reasoning above, T(G) was normalized by the sum of straight-line distances.

Both of these measures are clearly approximations of actual performance and cost. For example, network length is one aspect of road costs, but there are other factors, as well, including the number of lanes and the terrain. Travel distance itself can be determined fairly accurately; however, users also assign value to minimizing travel time. Distance and time are often correlated—the shortest path is often the fastest path—but there are factors such as roadway speeds, traffic congestion and intersection delays. We consider these and other factors in our higher-order Pareto analysis discussed later.

The objectives of minimizing network length and minimizing travel distance are at odds with one another. For example, minimizing network length is achieved by a minimum spanning tree of the graph (or a Steiner tree, if branch points are allowed). This solution, however, would not be very attractive from a user’s perspective, who may have to travel long distances to get to downtown. On the other hand, minimizing travel distance is achieved using a ‘satellite’ tree, where the root is connected via a straight line to each user location. However, this solution would not be very attractive from a network cost perspective since it would require significant resources to build. These two cases bound possible networks: there can be no shorter travel distance than the straight-line paths of the satellite tree, nor can network costs be lower than the Steiner tree.

One natural way to define an optimization function that incorporates these two objectives together is to solve:

\begin{aligned} minimize & T (G) \\ subject to \\ C (G) < B, \end{aligned}

3.3

where B is a budget on the total length of the network. This formulation sets the network length to be a hard constraint. To soften this constraint, we seek to solve:

minimize α C (G) + (1 - α) T (G),

3.4

where 0 ≤ α ≤ 1 is a parameter (similar to the Lagrange multiplier) that controls how much weight or priority should be placed on each objective. For example, if α ≈ 0, then the network promotes minimizing travel distance and would look satellite-like. If α ≈ 1, then the network promotes minimizing total length and would look closer to a spanning tree. Importantly, this parameter α provides a single number representing how a network trades-off these two objectives.

The Pareto front consists of solutions to equation (3.4) for all values of α ∈ [0, 1]. This front is also bounded by the two extremal networks, the satellite and a minimum spanning tree. The objective in equation (3.4) is clearly NP-hard, as it equals the Steiner tree problem for α = 1. We, therefore, use an empirically near-optimal greedy algorithm [26] to compute the Pareto front (Methods). Each city has its own Pareto front defined by r and $P$ (its set of n user locations). The use of a single downtown destination allowed us to examine travel paths towards a primary location in a city. Large cities may have multiple population centres, and our framework can similarly be used to analyse these polycentric structures.

3.2. Comparison to baseline networks

To establish a baseline to compare how close a city lies to the Pareto front, we compared with two baseline architectures. The first baseline, Random, creates a random spanning tree using the points $P$ and r as input and is used to test whether Pareto optimality can be achieved trivially. The second baseline, PrefAttach, uses the Barabasi–Albert model [41] to build a tree on the input points. This generative graph model starts with two random nodes connected by an edge. Then, in each iteration of the algorithm, it connects a new node to one existing node biased by the degree of the existing node. This results in a ‘rich get richer’ scale-free tree, with many potential hubs, as are commonly found in many transport networks.

3.3. Quantifying optimality, i.e. the distance to the Pareto front

For each city, we computed both its value of ε (how far away it is to the Pareto front) and α (where it lies on the scaled Pareto front).

Given the network G of a city, we can compute its values for x = C(G) and y = T(G). Call this point p_G = (x, y). To determine how far away p_G lies from the Pareto front, we scaled the Pareto front until it intersected with p_G, and we then used the amount of scaling required as a measure of optimality. Specifically, let the Pareto front be defined by a set of points, {(x₁, y₁), (x₂, y₂), …, (x_k, y_k)}, where (x_i, y_i) corresponds to the optimal travel distance and total length of the network for α_i. To scale the front, we multiply each (x_i, y_i) pair by a small value ε > 1 and test if this new Pareto front intersects with p_G. We keep increasing the value of ε until it intersects with p_G. The larger the value of ε, the further away the network lies to the Pareto front (figure 1c). If the network lies exactly on the Pareto front, its ε = 1.0.

This scaling allows a direct interpretation of ε: a city with ε = 1.5 imposes the same road length and travel distance as a Pareto-optimal city with the points $P$ with radius 1.5 times farther from the root. This is more explanatory than computing the Euclidean distance (in kilometres) from p_G to the closest point on the Pareto curve, since this distance combines two indirectly related lengths (network length and travel distance). Our method determines the Pareto trade-off α that results in the smallest ε, in contrast to other methods, such as the one utilized by Gastner et al. [17] which strictly assumes α = 0.

3.4. Using Google Maps to infer travel paths in a city

We used Google Maps to collect travel paths for 11 615 users in 101 global cities. These cities spanned different geographical regions, sizes, ages and levels of economic development. For example, cities had metropolitan area populations ranging from 7000 (Great Barrington, MA, USA) to 38 million (Tokyo, Japan), and had city land areas ranging from 5 km (Byblos, Lebanon) to 16 400 km (Beijing, China). The oldest city (Byblos, Lebanon) has been inhabited for 7000 years, and we also included newly built cities, such as Brasilia, Brazil and Abuja, Nigeria, (founded 1960 and 1984, respectively). The diversity of city features considered here makes this dataset a strong benchmark for studying any principle of travel paths.

For each city, we selected n = 115 random locations (P), which each had a uniformly distributed random 360° azimuth and radius $ρ \in [5, 30] km$ from a centrally located point (downtown) r. The point r was in the central business district of the city, most often a city hall or central monument. We then obtained the travel paths from each $p_{i} \in P$ to r for every city. To compute the network length C(G), we formed a tree G by merging all paths and deleting overlapping segments, then summing the lengths of all the edges. To compute the travel distance T(G), we summed the lengths of all 115 travel paths (Methods).

3.5. Other objectives of interest

Google Maps’ suggested travel paths incorporate factors that we do not explicitly try to optimize with our greedy algorithm, including time spent waiting at intersections, speed limits, and traffic congestion. As discussed previously, as opposed to developing detailed and highly parameterized routing and road construction models, our goal is to use network length and travel distance as a proxy for construction costs and travel times, to determine how much these two simple factors alone dictate the structure of travel paths. Later, we also use a higher-order Pareto optimality technique to incorporate a number of these other factors, including traffic congestion and travel time.

3.6. Are travel paths in city road networks close to being Pareto optimal?

For each city, we used as input the locations of the n random points $P$ in the city, and the location of downtown (r). We used the greedy algorithm to generate a set of Pareto-optimal trees for this city by varying α ∈ [0, 1]. We then computed the network length and travel distance for the actual city’s road network and assessed how far the city was from the Pareto front (ε). Using the same set of points, we also computed the distance to the Pareto front for two baseline networks (Random and PrefAttach, averaged over 1000 trees) to assess significance.

We found that the travel paths in every city were closer to the Pareto front than the baseline networks (figure 2a). For example, the distance to the Pareto front for Beijing was ε = 1.36, compared to 3.82 and 7.47 for PrefAttach and Random, respectively. This indicates that achieving the level of Pareto optimality achieved by the city road network is not trivial. This also suggests that despite the diverse constraints faced by actual road topologies compared to our graph-theoretic algorithm, most cities still performed better than baseline algorithms (Random, PrefAttach), which did not face such constraints.

We observed a range of ε values across the 101 cities, indicating that some cities were closer to the Pareto front than others (figure 2b). For example, Houston was relatively close to the Pareto front (ε = 1.26), whereas Abbottabad was much further away (ε = 1.87), and Melbourne was in between (ε = 1.40). The topology of the travel paths of these cities was also quite diverse (figure 2c). For example, Fresno (ε = 1.21) has many straight-lined paths indicative of a very grid-like topology, whereas Abbottabad (ε = 1.87) has many jagged edges due to the mountainous region. Later, we study the effect of other city attributes (such as geography and land area) on ε.

The overall range of ε values was limited (figure 3b); the mean $\bar{ϵ} = 1.40$ , with a small variance $σ_{ϵ}^{2} = 0.0175$ . This indicates that most cities were within a relatively narrow band of optimality, despite the diversity of cities analysed.

3.7. Where near the Pareto front do cities tend to lie?

Next, we analysed where near the Pareto front the travel paths from each city lie. The smaller the value of α, the more the travel paths look like the satellite tree, and the larger the α, the more the paths resemble a spanning tree.

We observed α ranging from α = 0.57 (Reykjavik), which favoured minimizing network length to α = 0.14 (Ithaca), which minimized travel distance (figure 2d). The values of α were all less than 0.5, except for Reykjavik (mean $\bar{α} = 0.243$ , with variance $σ_{α}^{2} = 0.0062$ ; figure 3a). This again suggests that similar trade-offs between travel distance and network length are made by these diverse cities.

3.8. Do other city features correlate with Pareto optimality?

Here, we explore how ε (distance to the Pareto front) and α (trade-off) values for each city vary with three types of city measures: (i) city demographics, including metropolitan area population, age of the city and geographical region; (ii) traffic measures from Google Maps for each of our travel paths, including average number of manoeuvres (turns, merges, etc.), average speed with and without traffic, average travel time with and without traffic, and the traffic slowdown factor (the ratio of speed with traffic over speed without traffic); and (iii) additional traffic measures that are not specific to our travel paths but rather to the city as a whole. These measures are collected from the 2016 INRIX Global Traffic Scorecard [42]. The INRIX data were only available for 46 of the 101 cities and included peak hours spent in congestion (relevant for a rush-hour commuter), percentage of time spent in congestion (an overall congestion index using congestion rates weighted to account for the typical driver and a city’s average journey times), and an overall ranking of cities (by peak hours spent in congestion, where a rank of 1 signifies the worst).

We found significant positive correlations between ε and the average number of manoeuvres and the travel time with and without traffic (table 1). This lends credence to our model, showing that cities that were further from the Pareto front yielded longer travel times and more manoeuvres per travel path. We also found significant, albeit slightly weaker, correlations between ε and the INRIX congestion rank and per cent time in congestion, which similarly suggests that cities further away from the Pareto front are more susceptible to traffic jams.

Table 1.

Pearson correlation coefficient (r) and p-value for α and ε versus city attributes. Significant correlations are shown in italics. Attributes that are significant after a Bonferroni correction (n = 13) are denoted with an asterisk.

	α		ε
attribute	r	p-value	r	p-value
population	− 0.216	0.0309	0.051	0.6162
area	0.040	0.6921	0.189	0.0611
age	− 0.212	0.0331	0.180	0.0712
average number of manoeuvres	− 0.071	0.4850	0.381	<0.0001*
average speed	0.119	0.2369	− 0.116	0.2512
average speed in traffic (σ)	0.156	0.1183	− 0.123	0.2212
average time	0.071	0.4834	0.485	<0.0001*
average time in traffic	0.006	0.9517	0.457	<0.0001*
traffic slowdown factor	0.260	0.0089	− 0.047	0.6438
INRIX hours spent in congestion	− 0.081	0.5914	0.143	0.3433
INRIX congestion index	− 0.079	0.6041	0.188	0.2102
INRIX all cities congestion rank	0.007	0.9619	− 0.372	0.0109
INRIX percentage of time in congestion	− 0.077	0.6119	0.451	0.0016*

Open in a new tab

For α, there was a weak correlation with population and age (table 1), suggesting that smaller and younger cities are more satellite in nature. Furthermore, the weak correlation of α with the traffic slowdown factor suggests that the more satellite-tree-like a city is, the greater effect traffic congestion has. Other studies have found relationships with travel miles and gross domestic product [43].

Next, we studied how α and ε vary according to regional differences (figure 3c,d). For each region, we performed a Kolmogorov–Smirnov test between the cities in that region and the cities in all other regions, to test if the region had a significantly different distribution of α or ε values compared to the rest of the world. For ε, we found that the Middle East had significantly larger values of ε (i.e. was more sub-optimal) compared to the rest of the world (p = 0.002), whereas North America and Africa had slightly lower values of ε (p = 0.051 and p = 0.063, respectively). For α, we find no significant differences between regions.

Thus, terrain appears to influence ε more so than α. The eight cities with the highest values of ε, specifically, Abbottabad (ε = 1.87), Jerusalem (ε = 1.80), Byblos (ε = 1.72), Medellin (ε = 1.68), Cusco (ε = 1.67), Rio de Janeiro (ε = 1.66), Caracas (ε = 1.64) and Taipei (ε = 1.63), are all in mountainous or hilly areas that require construction of winding roads (figure 2c). On the other hand, a number of the lowest ε cities are flat, namely Juba (ε = 1.15), Fresno (ε = 1.21), Harare (ε = 1.21) and Kampala (ε = 1.22). This was in contrast to α, where three of the 10 highest and three of the 10 lowest α cities were mountainous: Abbottabad (α = 0.35), Rio de Janeiro (α = 0.39) and Cusco (α = 0.41); Byblos (α = 0.1), Barcelona (α = 0.13) and Ithaca (α = 0.14).

The impact of terrain on Pareto trade-offs is further supported by a positive correlation between ε and α, (r = 0.248, p = 0.0131) indicating that cities that lie far from the Pareto front also require more indirect travel. Obstacles that raise the cost of construction and block direct paths, e.g. mountainous regions, may cause this.

We also assessed how proximity to a large body of water influenced transport efficiency in a city. Specifically, for each city, we manually inspected its terrain and classified it as being bordered by water (including being adjacent to oceans, or near bays and major rivers), or otherwise. We found no significant difference in the distributions of ε nor α values for cities near water versus otherwise (Kolmogorov–Smirnoff test, p = 0.510, 0.738 for ε and α, respectively). In many cases, this was because there was no need for the road network to traverse the obstacle, e.g. the city was coastal. Examples include Osaka (ε = 1.34) and Vancouver (ε = 1.35). Otherwise, bridges enabled efficient connectivity over bays and rivers, such as in San Francisco and London (both ε = 1.34), and Portland, Oregon (ε = 1.37).

Overall, these results suggest that the closer a city lies to the Pareto front, the shorter the travel times and the better it deals with traffic congestion. Geography also imposes some constraints, especially those cities in mountainous regions. Interestingly, there was little effect of these city features on α; the tight distribution of α (figure 3a) suggests that the trade-off between total length and travel distance used may be similar globally.

3.9. A higher-order Pareto front to further bound travel path complexity

Many cities lie near the Pareto front; however, there were some exceptions to this rule, indicating that other optimization factors or constraints also influence travel path structure. Ideally, there would be a simple way to incorporate additional objectives (e.g. travel time, traffic congestion, number of manoeuvres) into a joint optimization framework amenable to Pareto analysis. However, this would create several more variables to parameterize, including weighing factors among objectives. We are not aware of a theoretical model that describes these trade-offs for studying travel paths.

Instead, we leverage recent advances in Pareto optimality that seek to find ‘archetypes’ by analysing the dimensionality of traits of travel paths. Specifically, we used the PartTI package [44] to analyse five traits of the 101 cities: travel distance, network length, average travel speed, traffic slowdown factor (speed decrease due to traffic congestion), and number of manoeuvres. The latter three traits were derived from our Google Maps dataset (Methods). Overall, this generated a matrix with 101 rows (cities) and five columns corresponding to the five traits for each city.

PartTI performed a principal component analysis (PCA) on the z-score normalized matrix, and then attempted to enclose the 101 points in the reduced-dimensional space with a minimum-volume simplex (an n-dimensional triangle). We found that the cities in this five-dimensional space were well bounded within a tetrahedral simplex in a three-dimensional principal component subspace. This simplex had an explained variance of 73.5% and contained 80 of 101 cities (figure 4). This three-dimensional subspace is spanned by four corner points, or ‘archetypes’. These archetypes represent extremal variations in travel path structures, whose traits can reveal how the five different traits may be weighed differently. They can also be viewed as a type of ‘dictionary’ that encodes the diversity of travel path structures across cities.

The four cities closest to the four archetypes were Amarillo, Jerusalem, Juba and Delhi (figure 4). These cities showed different trade-offs with respect to their rankings of ε, α and σ, a new variable representing the average speed in the presence of traffic (lower is worse). Amarillo had the fastest travel speeds (highest σ) and had the ninth lowest ε (lying relatively close to the Pareto front). Jerusalem, on the other hand, was far from the Pareto front (second-worst ε) and had relatively slow traffic (low σ), but had a high value of α, meaning it was spanning-tree-like. Similarly, Delhi exhibited the absolute slowest traffic, but in contrast to Jerusalem, had the eighth lowest value of α, meaning it was satellite-like. Finally, Juba had the lowest value of ε (very close to the Pareto front) with the 10th highest value of α (spanning-tree-like), but showed low speeds in traffic. We also used feature enrichment analysis from the package, which similarly found that ε and α were properties statistically enriched near the archetypes (electronic supplementary material, ParTI Feature Enrichment Analysis, table S1).

Overall, our theoretical model found that travel paths within many cities were nearly Pareto optimal with respect to a trade-off between road length and travel distance; when adding three additional time-related variables, the Pareto space only increased by roughly one dimension. Therefore, our archetype analysis showed that time may be described as a single factor in the network.

4. Discussion

We showed that travel paths in road networks provide transport in a nearly Pareto optimal manner, with topologies that trade-off network length versus travel distance. When also including time-related measures, cities were bounded by only four archetypical topologies, suggesting that the design space of travel paths is encapsulated by a few canonical structures. The closer a city lies to the Pareto front the shorter its travel times and less is the impact of traffic congestion. Finally, despite wide variation in city features, we found a very tight distribution of α values, suggesting that the balance between objectives across these cities was very similar, and perhaps, universal. These measures may give light to the objectives and cost functions that drive the formation and evolution of travel paths in road networks.

We analysed a set of 101 cities, selected to represent a broad range of population sizes, land areas and geographies, to test the generality of this trade-off principle. The cities selected varied in population by over four orders of magnitude, by land area by over three orders of magnitude, and by age by over three orders of magnitude. While the analysis of additional cities is possible via Google Maps, these cities appeared generally representative of the diversity of land structures in the world. We studied the effects of additional features, such as population size and geography, on the Pareto optimality of cities (table 1). Travel path structure, however, may also be effected by other features, including gross domestic product [43], culture or dominant mode of transportation.

Our study was not meant to analyse all objectives of a city-scale transport system. We focused on one important aspect of city travel, which was in part motivated by similar challenges faced by two biological transport systems, where there is a single ‘city centre’ (analogous to a neuron’s soma or a plant’s base) through which transport is organized. However, our general framework could be extended to study additional objectives, for example, those where users seek to travel to multiple locations. In the extreme case, where users desire fast transport from any location in the city to any other, the satellite structure would be replaced by a clique structure, which minimizes the distance between any pair of nodes in a network.

There are two reasons why exact Pareto optimality may be difficult to achieve by travel paths in city road networks. First, road growth is typically ‘online’, where some set of roads are initially built, and then, based on feedback and demand, subsequent roads are built. This hampers optimality because design decisions made earlier in time strongly constrain those that are made subsequently. Our Pareto front generation algorithm, on the other hand, assumed that all the information about which points to connect was provided a priori. Thus, some difference in performance of ‘online’ versus ‘offline’ network design should be expected, and future work should attempt to quantify this gap. Second, road networks typically do not ‘prune’ the past; i.e. road networks rarely exhibit the abandonment of large stretches of road. This can also affect optimality because past decisions may no longer be as beneficial under current demands. Despite these limitations, city networks have managed to achieve a level of Pareto optimality that is significantly higher than baseline networks.

Finally, our work was motivated by prior analyses of Pareto-optimal transport in two biological structures, an observation we sought to test here for a human engineered system. The extent of Pareto optimality remains debated in biology. For example, Barve & Wagner [45] show that metabolic traits may have non-adaptive origins, and Valverde et al. [46] suggest that ecological networks have traits that represent evolutionary spandrels. Thus, we do not expect that all biological networks make Pareto-optimal trade-offs, and understanding the evolutionary trajectories that gave rise to these structures remains important.

However, at least two biological transport networks demonstrate Pareto-optimal trade-offs. First, plant shoot (above-ground) architectures can be viewed as transport networks, where the leaves are analogous to the locations $P$ in the city, and the base of the plant is considered downtown (root, r). For plants, travel distance corresponds to a measure of how efficiently nutrients (e.g. sugars) can be transported along the branching architectures; network length measures how much resources are required to build the architecture. Conn et al. [26] showed that plant architectures for several species are Pareto optimal under the same cost-performance trade-off considered here. Second, neural branching arbors (dendrites and axons) can also be viewed as transport networks used to transmit information from one neuron to another. Here, the locations $P$ correspond to the locations of synapses, and the root corresponds to the cell soma. Travel distance corresponds to a measure of conduction delay, which seeks to minimize the time required to transmit a signal from one neuron to another; network length is a measure of arbor wiring length, which is a commodity in space-constrained brains [27,28]. On a higher level, it has been shown that minimizing feedback latency, including transport times, is critical for the performance of biological control systems such as bipedal balance and locomotion [47,48]. Finally, there may also be other biological transport networks constrained by these principles, such as vascular networks that deliver oxygen throughout the body [49–52], and other nutrient supply networks [53,54]. Thus, similar network design principles shape the design of some biological and engineered structures.

5. Methods

5.1. Travel path data

We obtained travel path data from Google Maps via the Directions API [25]. Routing data for each travel path were requested for Wednesday, 13 December 2017 at 12.00 noon local time, run on 5 December 2017, about one week in advance. This allowed the routes to consider a moderate level of traffic, while avoiding congestion found during rush hour. Additionally, the future date meant one-time events, such as accidents, were not considered. The destination point, downtown, was the default destination for the city returned by Google Maps. If a default was undefined (applicable to only approx. 10 cities), downtown was manually chosen.

We attempted to use the same set of 120 origin points for every city, with respect to distance and azimuth, however, certain paths returned by Google Maps were unable to be routed, or clearly unrepresentative of typical road travel patterns. This was because the origin point was placed in an inaccessible or uninhabited location, such as an island, body of water, or an area without roads. In some of these cases, Google Maps attempted to snap the origin point to ferries or to roads many kilometres away. Thus, points were rejected if the snapping distance was greater than $π / 120 \approx 2.6 %$ of the distance from the point to downtown. Additionally, if a ferry was required or the path had a length of over 10 times the straight-line distance, the point was rejected as well. We replaced rejected points with the next points from the same random sequence. Finally, to remove outliers, the five longest paths were removed, leaving 115 total paths for each city.

The travel distance for each path was provided by the Google Maps API. To determine the total length of the utilized road network, all paths were merged. The API returns each path as a series of latitude/longitude points, which are not identical across multiple paths that share the same road. Therefore, we wrote an overlap detection algorithm that considered points that were closer than 100 m to be sharing the same road, so the duplicate path section between such points was removed. The length of the remaining sections were then summed to determine the network length.

The 120 origin points represented, on average, 1 point every 1.3 km², which is smaller than the area of most neighbourhoods. Thus, higher sampling density would likely select points within similar neighbourhoods, which would have similar travel paths. We also considered origin points located in a circle of radius 5, 15 and 30 km from downtown. We found that the α and ε values for these three datasets were well-correlated with the random set, with Pearson correlations r_α = [0.46, 0.72, 0.61] and r_ε = [0.58, 0.91, 0.90] for radii of [5, 15, 30] km respectively, suggesting that 120 origin points was adequate.

5.2. Greedy algorithm to generate a near-optimal Pareto front

The algorithm used to generate the Pareto front (i.e. to minimize equation (3.4) for a given value of α) was described in Conn et al. [26], and a similar algorithm was used by Gastner et al. [17]. The algorithm is initialized with just the root node in the tree. In each step of the greedy algorithm, an edge is added that connects a node outside the tree to a node within the tree that minimizes equation (3.4). Steiner nodes may also be added along each edge to offer more potential edges to consider in each step, at the expense of longer running time. While this method is clearly not optimal—as minimizing equation (3.4) is NP-hard—Conn et al. [26] provide evidence that this method generates very close to optimal trees and outperforms prior heuristics for this problem.

Our greedy algorithm does not generate the Pareto-optimal front, as finding this front is an NP-hard problem [26]. However, this algorithm has been shown effective for approximating the Pareto front, and performs comparably to the optimal Steiner tree for α = 1. Other heuristics, for example, based on genetic algorithms [55] and other graph-theoretic optimizations [56], were found to be too slow and generally worst-performing, respectively, compared to the greedy algorithm [29]. Thus, while our greedy algorithm is effective in practice, finding near-optimal Pareto fronts for large point sets and all values of α remains an open problem.

5.3. Archetype analysis and traffic data

Traffic data, used in both the correlation and archetype analysis, was obtained from Google Maps. Estimated trip times, without the impact of traffic, was returned with each travel path. Where Google Maps had traffic data, the trip time including the impact of traffic was obtained, using Google Maps’ congestion model at the aforementioned 12.00 noon, local time.

Average speed was determined by the sum of all trip lengths in a city divided by the sum of the trip times. The traffic slowdown factor was the ratio of the sum of trip times without traffic to that with traffic. Thus, 1 indicates no traffic, and 0.5 indicates speed halved on average due to traffic. Finally, the average number of manoeuvres per trip, as returned by Google Maps, counted both turns, roundabout use, road forks, merges and exits on ramps. Straight travel through highway interchanges were counted as a manoeuvre, but not straight travel through surface street intersections [25].

For archetype analysis with the ParTI package [44], each variable was first z-score normalized. The package calculated the simplex using the PCHA algorithm, and the number of archetypes was determined using the elbow method. The cities closest to the archetypes were found in regards to the ℓ²-norm of the principal components.

Supplementary Material

Supplementary ParTI analysis, city data table and Pareto trade-off diagrams

rsif20190041supp1.pdf^{(3.1MB, pdf)}

Supplementary Material

City Data Table

rsif20190041supp2.csv^{(13.4KB, csv)}

Data accessibility

The data produced by the algorithms, demographic information used, and downtown coordinates are provided as a comma-separated value (CSV) file available as electronic supplementary material. The electronic supplementary material also lists the same data in tabular format, as well as plots the Pareto trade-off curves for all 101 cities. Source code is available at https://bitbucket.org/navlakha/city_pareto.

Author contributions

J.Y.S. and S.N. conceived the study, gathered and analysed the data, and wrote the manuscript.

Competing interests

We declare we have no competing interests.

Funding

S.N. was supported by the Pew Charitable Trusts, the National Science Foundation under award CAREER DBI-1846554, and the NIDCD of the National Institutes of Health under award number 1R01DC017695.

References

1.Jay S, Illic D, Plückebaum T, Zoz K. 2012. Bottom-up cost model for the fixed access network in Spain. Technical report. Rhündorfer Str. 68, 53604 Bad Honnef, WIK-Consult GmbH, Germany.
2.Chen J, Wosinska L, Machuca CM, Jaeger M. 2010. Cost vs. reliability performance study of fiber access network architectures. IEEE Commun. Mag. 48, 56–65. ( 10.1109/MCOM.2010.5402664) [DOI] [Google Scholar]
3.Swamee PK, Sharma AK. 2008. Design of water supply pipe networks. New York, NY: Wiley. [Google Scholar]
4.Pagani GA, Aiello M. 2013. The power grid as a complex network: a survey. Physica A 392, 2688–2700. ( 10.1016/j.physa.2013.01.023) [DOI] [Google Scholar]
5.Akkaya K, Younis M. 2005. A survey on routing protocols for wireless sensor networks. Ad Hoc Netw. 3, 325–349. ( 10.1016/j.adhoc.2003.09.010) [DOI] [Google Scholar]
6.Magnanti TL, Wong RT. 1984. Network design and transportation planning: models and algorithms. Transp. Sci. 18, 1–55. ( 10.1287/trsc.18.1.1) [DOI] [Google Scholar]
7.Xie F, Levinson D. 2007. Measuring the structure of road networks. Geogr. Anal. 39, 336–356. ( 10.1111/gean.2007.39.issue-3) [DOI] [Google Scholar]
8.Kalapala V, Sanwalani V, Clauset A, Moore C. 2006. Scale invariance in road networks. Phys. Rev. E 73, 026130 ( 10.1103/PhysRevE.73.026130) [DOI] [PubMed] [Google Scholar]
9.Zhang H, Li Z. 2012. Fractality and self-similarity in the structure of road networks. Ann. Assoc. Am. Geogr. 102, 350–365. ( 10.1080/00045608.2011.620505) [DOI] [Google Scholar]
10.Snelder M, Wagelmans AP, Schrijver JM, van Zuylen HJ, Immers LH. 2005. Optimal redesign of the Dutch road network. Technical report no. EI 2005-55. Delft, Econometric Institute, TNO, The Netherlands.
11.Beaven PJ, Lawrance CJ. 1982. Terrain evaluation for highway planning and design. Transp. Res. Rec. 892, 36–46. [Google Scholar]
12.Nilsson J-E. 1991. Investment decisions in a public bureaucracy: a case study of Swedish road planning practices. J. Transp. Econ. Policy 25, 163–175. [Google Scholar]
13.Laurance WF. et al. 2014. A global strategy for road building. Nature 513, 229–232. ( 10.1038/nature13717) [DOI] [PubMed] [Google Scholar]
14.Cotilla-Sanchez E, Hines PDH, Barrows C, Blumsack S. 2012. Comparing the topological and electrical structure of the North American electric power infrastructure. IEEE Syst. J. 6, 616–626. ( 10.1109/JSYST.2012.2183033) [DOI] [Google Scholar]
15.Dunn S, Wilkinson S, Ford A. 2016. Spatial structure and evolution of infrastructure networks. Sustain. Cities Soc. 27, 23–31. ( 10.1016/j.scs.2016.08.011) [DOI] [Google Scholar]
16.Lee M, Barbosa H, Youn H, Holme P, Ghoshal G. 2017. Morphology of travel routes and the organization of cities. Nat. Commun. 8, 2229 ( 10.1038/s41467-017-02374-7) [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Gastner MT, Newman MEJ. 2006. Shape and efficiency in spatial distribution networks. J. Stat. Mech: Theory Exp. 2006, P01015 ( 10.1088/1742-5468/2006/01/P01015) [DOI] [Google Scholar]
18.Wang P, Liu L, Li X, Li G, González MC. 2014. Empirical study of long-range connections in a road network offers new ingredient for navigation optimization models. New J. Phys. 16, 013012 ( 10.1088/1367-2630/16/1/013012) [DOI] [Google Scholar]
19.Yuan J, Zheng Y, Zhang C, Xie W, Xie X, Sun G, Huang Y. 2010. T-drive: driving directions based on taxi trajectories. In Proc. of the 18th SIGSPATIAL Int. Conf. on Advances in Geographic Information Systems. GIS ’10. San Jose, CA, pp. 99–108. New York, NY: ACM.
20.Yang H, Yagar S, Iida Y. 1994. Traffic assignment in a congested discrete/continuous transportation system. Transp. Res. B: Methodol. 28, 161–174. ( 10.1016/0191-2615(94)90023-X) [DOI] [Google Scholar]
21.Yan X, Radwan E. 2007. Effect of restricted sight distances on driver behaviors during unprotected left-turn phase at signalized intersections. Transp. Res. Part F: Traffic Psychol. Behav. 10, 330–344. ( 10.1016/j.trf.2007.01.001) [DOI] [Google Scholar]
22.Mainali MK, Mabu S, Yu S, Eto S, Hirasawa K. 2010. Dynamic optimal route search algorithm for car navigation systems with preferences by dynamic programming. IEEJ Trans. Electr. Electron. Eng. 6, 14–22. ( 10.1002/tee.v6.1) [DOI] [Google Scholar]
23.Linden S. 1986. Practical considerations in optimal flight management computations. J. Guid. Control Dyn. 9, 427–432. ( 10.2514/3.20128) [DOI] [Google Scholar]
24.Boyce D, Xiong Q. 2004. User-optimal and system-optimal route choices for a large road network. Rev. Netw. Econ. 3, 371–380. ( 10.2202/1446-9022.1058) [DOI] [Google Scholar]
25.Google. Directions API. Developer guide. https://developers.google.com/maps/documentation/directions/intro.
26.Conn A, Pedmale UV, Chory J, Navlakha S. 2017. High-resolution laser scanning reveals plant architectures that reflect universal network design principles. Cell Syst. 5, 53–62. ( 10.1016/j.cels.2017.06.017) [DOI] [PubMed] [Google Scholar]
27.Budd JM, Kováacs K, Ferecskó AS, Buzás P, Eysel UT, Kisvárday ZF. 2010. Neocortical axon arbors trade-off material and conduction delay conservation. PLoS Comput. Biol. 6, e1000711 ( 10.1371/journal.pcbi.1000711) [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Cuntz H, Forstner F, Borst A, Häusser M. 2010. One rule to grow them all: a general theory of neuronal branching and its practical application. PLoS Comput. Biol. 6, e1000877 ( 10.1371/journal.pcbi.1000877) [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Chandrasekhar A, Navlakha S. 2019. Neural arbors are Pareto optimal. Proc. R. Soc. B 286, 20182727 ( 10.1098/rspb.2018.2727) [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Seoane LF, Solé R. 2015. Phase transitions in Pareto optimal complex networks. Phys. Rev. E 92, 032807 ( 10.1103/PhysRevE.92.032807) [DOI] [PubMed] [Google Scholar]
31.Seoane LF, Solé R. 2016. Multiobjective optimization and phase transitions. In Proc. of ECCS 2014. (eds S Battiston, F De Pellegrini, G Caldarelli, E Merelli), pp. 259–270. Cham, Switzerland: Springer International Publishing.
32.Avena-Koenigsberger A, Goñi J, Solé R, Sporns O. 2015. Network morphospace. J. R. Soc. Interface 12, 20140881 ( 10.1098/rsif.2014.0881) [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Moses M, Bezerra G, Edwards B, Brown J, Forrest S. 2016. Energy and time determine scaling in biological and computer designs. Phil. Trans. R. Soc. B 371, 20150446 ( 10.1098/rstb.2015.0446) [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Pardalos PM, Migdalas A, Pitsoulis L (eds). 2008. Pareto optimality, game theory and equilibria. Springer optimization and its applications, vol. 17 New York, NY: Springer. [Google Scholar]
35.Diamond PA, Mirrlees JA. 1971. Optimal taxation and public production I: production efficiency. Am. Econ. Rev. 61, 8–27. [Google Scholar]
36.Samuelson PA. 1954. The pure theory of public expenditure. Rev. Econ. Stat. 36, 387–389. ( 10.2307/1925895) [DOI] [Google Scholar]
37.Shoval O, Sheftel H, Shinar G, Hart Y, Ramote O, Mayo A, Dekel E, Kavanagh K, Alon U. 2012. Evolutionary trade-offs, Pareto optimality, and the geometry of phenotype space. Science 336, 1157–1160. ( 10.1126/science.1217405) [DOI] [PubMed] [Google Scholar]
38.Tullock G. 1971. Biological externalities. J. Theor. Biol. 33, 565–576. ( 10.1016/0022-5193(71)90097-X) [DOI] [PubMed] [Google Scholar]
39.Handl J, Kell DB, Knowles J. 2007. Multiobjective optimization in bioinformatics and computational biology. IEEE/ACM Trans. Comput. Biol. Bioinf. 4, 279–292. ( 10.1109/TCBB.2007.070203) [DOI] [PubMed] [Google Scholar]
40.Heinrich R, Schuster S. 1996. The regulation of cellular systems. London, UK: Chapman & Hall. [Google Scholar]
41.Barabási A-L, Albert R. 1999. Emergence of scaling in random networks. Science 286, 509–512. ( 10.1126/science.286.5439.509) [DOI] [PubMed] [Google Scholar]
42.Cookson G, Pishue B. 2017. INRIX global traffic scorecard. Technical report no. 10210 NE Points Drive, Suite 400, Kirkland, WA 98033. INRIX Research, United States.
43.Bettencourt LMA. 2013. The origins of scaling in cities. Science 340, 1438–1441. ( 10.1126/science.1235823) [DOI] [PubMed] [Google Scholar]
44.Hart Y, Sheftel H, Hausser J, Szekely P, Ben-Moshe NB, Korem Y, Tendler A, Mayo AE, Alon U. 2015. Inferring biological tasks using Pareto analysis of high-dimensional data. Nat. Methods 12, 233–235. ( 10.1038/nmeth.3254) [DOI] [PubMed] [Google Scholar]
45.Barve A, Wagner A. 2013. A latent capacity for evolutionary innovation through exaptation in metabolic systems. Nature 500, 203 ( 10.1038/nature12301) [DOI] [PubMed] [Google Scholar]
46.Valverde S, Piñero J, Corominas-Murtra B, Montoya J, Joppa L, Solé R. 2018. The architecture of mutualistic networks as an evolutionary spandrel. Nat. Ecol. Evol. 2, 94–99. ( 10.1038/s41559-017-0383-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Milton J, Cabrera JL, Ohira T, Tajima S, Tonosaki Y, Eurich CW, Campbell SA. 2009. The time-delayed inverted pendulum: implications for human balance control. Chaos: an Interdiscipl. J. Nonlinear Sci. 19, 026110 ( 10.1063/1.3141429) [DOI] [PubMed] [Google Scholar]
48.Doyle JC, Csete M. 2011. Architecture, constraints, and behavior. Proc. Natl Acad. Sci. USA 108, 15 624–15 630. ( 10.1073/pnas.1103557108) [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Newberry MG, Ennis DB, Savage VM. 2015. Testing foundations of biological scaling theory using automated measurements of vascular networks. PLoS Comput. Biol. 11, e1004455 ( 10.1371/journal.pcbi.1004455) [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Tekin E, Hunt D, Newberry MG, Savage VM. 2016. Do vascular networks branch optimally or randomly across spatial scales? PLoS Comput. Biol. 12, e1005223 ( 10.1371/journal.pcbi.1005223) [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Murray CD. 1926. The physiological principle of minimum work: I. The vascular system and the cost of blood volume. Proc. Natl Acad. Sci. USA 12, 207–214. ( 10.1073/pnas.12.3.207) [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Gafiychuk V, Lubashevsky I. 2001. On the principles of the vascular network branching. J. Theor. Biol. 212, 1–9. ( 10.1006/jtbi.2001.2277) [DOI] [PubMed] [Google Scholar]
53.Dodds PS, Rothman DH, Weitz JS. 2001. Re-examination of the ‘3/4-law’ of metabolism. J. Theor. Biol. 209, 9–27. ( 10.1006/jtbi.2000.2238) [DOI] [PubMed] [Google Scholar]
54.West GB, Brown JH, Enquist BJ. 1997. A general model for the origin of allometric scaling laws in biology. Science 276, 122–126. ( 10.1126/science.276.5309.122) [DOI] [PubMed] [Google Scholar]
55.Chen G, Chen S, Guo W, Chen H. 2007. The multi-criteria minimum spanning tree problem based genetic algorithm. Inf. Sci. 177, 5050–5063. ( 10.1016/j.ins.2007.06.005) [DOI] [Google Scholar]
56.Khuller S, Raghavachari B, Young N. 1995. Approximating the minimum equivalent digraph. SIAM J. Comput. 24, 859–872. ( 10.1137/S0097539793256685) [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary ParTI analysis, city data table and Pareto trade-off diagrams

rsif20190041supp1.pdf^{(3.1MB, pdf)}

City Data Table

rsif20190041supp2.csv^{(13.4KB, csv)}

Data Availability Statement

[RSIF20190041C1] 1.Jay S, Illic D, Plückebaum T, Zoz K. 2012. Bottom-up cost model for the fixed access network in Spain. Technical report. Rhündorfer Str. 68, 53604 Bad Honnef, WIK-Consult GmbH, Germany.

[RSIF20190041C2] 2.Chen J, Wosinska L, Machuca CM, Jaeger M. 2010. Cost vs. reliability performance study of fiber access network architectures. IEEE Commun. Mag. 48, 56–65. ( 10.1109/MCOM.2010.5402664) [DOI] [Google Scholar]

[RSIF20190041C3] 3.Swamee PK, Sharma AK. 2008. Design of water supply pipe networks. New York, NY: Wiley. [Google Scholar]

[RSIF20190041C4] 4.Pagani GA, Aiello M. 2013. The power grid as a complex network: a survey. Physica A 392, 2688–2700. ( 10.1016/j.physa.2013.01.023) [DOI] [Google Scholar]

[RSIF20190041C5] 5.Akkaya K, Younis M. 2005. A survey on routing protocols for wireless sensor networks. Ad Hoc Netw. 3, 325–349. ( 10.1016/j.adhoc.2003.09.010) [DOI] [Google Scholar]

[RSIF20190041C6] 6.Magnanti TL, Wong RT. 1984. Network design and transportation planning: models and algorithms. Transp. Sci. 18, 1–55. ( 10.1287/trsc.18.1.1) [DOI] [Google Scholar]

[RSIF20190041C7] 7.Xie F, Levinson D. 2007. Measuring the structure of road networks. Geogr. Anal. 39, 336–356. ( 10.1111/gean.2007.39.issue-3) [DOI] [Google Scholar]

[RSIF20190041C8] 8.Kalapala V, Sanwalani V, Clauset A, Moore C. 2006. Scale invariance in road networks. Phys. Rev. E 73, 026130 ( 10.1103/PhysRevE.73.026130) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C9] 9.Zhang H, Li Z. 2012. Fractality and self-similarity in the structure of road networks. Ann. Assoc. Am. Geogr. 102, 350–365. ( 10.1080/00045608.2011.620505) [DOI] [Google Scholar]

[RSIF20190041C10] 10.Snelder M, Wagelmans AP, Schrijver JM, van Zuylen HJ, Immers LH. 2005. Optimal redesign of the Dutch road network. Technical report no. EI 2005-55. Delft, Econometric Institute, TNO, The Netherlands.

[RSIF20190041C11] 11.Beaven PJ, Lawrance CJ. 1982. Terrain evaluation for highway planning and design. Transp. Res. Rec. 892, 36–46. [Google Scholar]

[RSIF20190041C12] 12.Nilsson J-E. 1991. Investment decisions in a public bureaucracy: a case study of Swedish road planning practices. J. Transp. Econ. Policy 25, 163–175. [Google Scholar]

[RSIF20190041C13] 13.Laurance WF. et al. 2014. A global strategy for road building. Nature 513, 229–232. ( 10.1038/nature13717) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C14] 14.Cotilla-Sanchez E, Hines PDH, Barrows C, Blumsack S. 2012. Comparing the topological and electrical structure of the North American electric power infrastructure. IEEE Syst. J. 6, 616–626. ( 10.1109/JSYST.2012.2183033) [DOI] [Google Scholar]

[RSIF20190041C15] 15.Dunn S, Wilkinson S, Ford A. 2016. Spatial structure and evolution of infrastructure networks. Sustain. Cities Soc. 27, 23–31. ( 10.1016/j.scs.2016.08.011) [DOI] [Google Scholar]

[RSIF20190041C16] 16.Lee M, Barbosa H, Youn H, Holme P, Ghoshal G. 2017. Morphology of travel routes and the organization of cities. Nat. Commun. 8, 2229 ( 10.1038/s41467-017-02374-7) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C17] 17.Gastner MT, Newman MEJ. 2006. Shape and efficiency in spatial distribution networks. J. Stat. Mech: Theory Exp. 2006, P01015 ( 10.1088/1742-5468/2006/01/P01015) [DOI] [Google Scholar]

[RSIF20190041C18] 18.Wang P, Liu L, Li X, Li G, González MC. 2014. Empirical study of long-range connections in a road network offers new ingredient for navigation optimization models. New J. Phys. 16, 013012 ( 10.1088/1367-2630/16/1/013012) [DOI] [Google Scholar]

[RSIF20190041C19] 19.Yuan J, Zheng Y, Zhang C, Xie W, Xie X, Sun G, Huang Y. 2010. T-drive: driving directions based on taxi trajectories. In Proc. of the 18th SIGSPATIAL Int. Conf. on Advances in Geographic Information Systems. GIS ’10. San Jose, CA, pp. 99–108. New York, NY: ACM.

[RSIF20190041C20] 20.Yang H, Yagar S, Iida Y. 1994. Traffic assignment in a congested discrete/continuous transportation system. Transp. Res. B: Methodol. 28, 161–174. ( 10.1016/0191-2615(94)90023-X) [DOI] [Google Scholar]

[RSIF20190041C21] 21.Yan X, Radwan E. 2007. Effect of restricted sight distances on driver behaviors during unprotected left-turn phase at signalized intersections. Transp. Res. Part F: Traffic Psychol. Behav. 10, 330–344. ( 10.1016/j.trf.2007.01.001) [DOI] [Google Scholar]

[RSIF20190041C22] 22.Mainali MK, Mabu S, Yu S, Eto S, Hirasawa K. 2010. Dynamic optimal route search algorithm for car navigation systems with preferences by dynamic programming. IEEJ Trans. Electr. Electron. Eng. 6, 14–22. ( 10.1002/tee.v6.1) [DOI] [Google Scholar]

[RSIF20190041C23] 23.Linden S. 1986. Practical considerations in optimal flight management computations. J. Guid. Control Dyn. 9, 427–432. ( 10.2514/3.20128) [DOI] [Google Scholar]

[RSIF20190041C24] 24.Boyce D, Xiong Q. 2004. User-optimal and system-optimal route choices for a large road network. Rev. Netw. Econ. 3, 371–380. ( 10.2202/1446-9022.1058) [DOI] [Google Scholar]

[RSIF20190041C25] 25.Google. Directions API. Developer guide. https://developers.google.com/maps/documentation/directions/intro.

[RSIF20190041C26] 26.Conn A, Pedmale UV, Chory J, Navlakha S. 2017. High-resolution laser scanning reveals plant architectures that reflect universal network design principles. Cell Syst. 5, 53–62. ( 10.1016/j.cels.2017.06.017) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C27] 27.Budd JM, Kováacs K, Ferecskó AS, Buzás P, Eysel UT, Kisvárday ZF. 2010. Neocortical axon arbors trade-off material and conduction delay conservation. PLoS Comput. Biol. 6, e1000711 ( 10.1371/journal.pcbi.1000711) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C28] 28.Cuntz H, Forstner F, Borst A, Häusser M. 2010. One rule to grow them all: a general theory of neuronal branching and its practical application. PLoS Comput. Biol. 6, e1000877 ( 10.1371/journal.pcbi.1000877) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C29] 29.Chandrasekhar A, Navlakha S. 2019. Neural arbors are Pareto optimal. Proc. R. Soc. B 286, 20182727 ( 10.1098/rspb.2018.2727) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C30] 30.Seoane LF, Solé R. 2015. Phase transitions in Pareto optimal complex networks. Phys. Rev. E 92, 032807 ( 10.1103/PhysRevE.92.032807) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C31] 31.Seoane LF, Solé R. 2016. Multiobjective optimization and phase transitions. In Proc. of ECCS 2014. (eds S Battiston, F De Pellegrini, G Caldarelli, E Merelli), pp. 259–270. Cham, Switzerland: Springer International Publishing.

[RSIF20190041C32] 32.Avena-Koenigsberger A, Goñi J, Solé R, Sporns O. 2015. Network morphospace. J. R. Soc. Interface 12, 20140881 ( 10.1098/rsif.2014.0881) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C33] 33.Moses M, Bezerra G, Edwards B, Brown J, Forrest S. 2016. Energy and time determine scaling in biological and computer designs. Phil. Trans. R. Soc. B 371, 20150446 ( 10.1098/rstb.2015.0446) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C34] 34.Pardalos PM, Migdalas A, Pitsoulis L (eds). 2008. Pareto optimality, game theory and equilibria. Springer optimization and its applications, vol. 17 New York, NY: Springer. [Google Scholar]

[RSIF20190041C35] 35.Diamond PA, Mirrlees JA. 1971. Optimal taxation and public production I: production efficiency. Am. Econ. Rev. 61, 8–27. [Google Scholar]

[RSIF20190041C36] 36.Samuelson PA. 1954. The pure theory of public expenditure. Rev. Econ. Stat. 36, 387–389. ( 10.2307/1925895) [DOI] [Google Scholar]

[RSIF20190041C37] 37.Shoval O, Sheftel H, Shinar G, Hart Y, Ramote O, Mayo A, Dekel E, Kavanagh K, Alon U. 2012. Evolutionary trade-offs, Pareto optimality, and the geometry of phenotype space. Science 336, 1157–1160. ( 10.1126/science.1217405) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C38] 38.Tullock G. 1971. Biological externalities. J. Theor. Biol. 33, 565–576. ( 10.1016/0022-5193(71)90097-X) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C39] 39.Handl J, Kell DB, Knowles J. 2007. Multiobjective optimization in bioinformatics and computational biology. IEEE/ACM Trans. Comput. Biol. Bioinf. 4, 279–292. ( 10.1109/TCBB.2007.070203) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C40] 40.Heinrich R, Schuster S. 1996. The regulation of cellular systems. London, UK: Chapman & Hall. [Google Scholar]

[RSIF20190041C41] 41.Barabási A-L, Albert R. 1999. Emergence of scaling in random networks. Science 286, 509–512. ( 10.1126/science.286.5439.509) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C42] 42.Cookson G, Pishue B. 2017. INRIX global traffic scorecard. Technical report no. 10210 NE Points Drive, Suite 400, Kirkland, WA 98033. INRIX Research, United States.

[RSIF20190041C43] 43.Bettencourt LMA. 2013. The origins of scaling in cities. Science 340, 1438–1441. ( 10.1126/science.1235823) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C44] 44.Hart Y, Sheftel H, Hausser J, Szekely P, Ben-Moshe NB, Korem Y, Tendler A, Mayo AE, Alon U. 2015. Inferring biological tasks using Pareto analysis of high-dimensional data. Nat. Methods 12, 233–235. ( 10.1038/nmeth.3254) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C45] 45.Barve A, Wagner A. 2013. A latent capacity for evolutionary innovation through exaptation in metabolic systems. Nature 500, 203 ( 10.1038/nature12301) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C46] 46.Valverde S, Piñero J, Corominas-Murtra B, Montoya J, Joppa L, Solé R. 2018. The architecture of mutualistic networks as an evolutionary spandrel. Nat. Ecol. Evol. 2, 94–99. ( 10.1038/s41559-017-0383-4) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C47] 47.Milton J, Cabrera JL, Ohira T, Tajima S, Tonosaki Y, Eurich CW, Campbell SA. 2009. The time-delayed inverted pendulum: implications for human balance control. Chaos: an Interdiscipl. J. Nonlinear Sci. 19, 026110 ( 10.1063/1.3141429) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C48] 48.Doyle JC, Csete M. 2011. Architecture, constraints, and behavior. Proc. Natl Acad. Sci. USA 108, 15 624–15 630. ( 10.1073/pnas.1103557108) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C49] 49.Newberry MG, Ennis DB, Savage VM. 2015. Testing foundations of biological scaling theory using automated measurements of vascular networks. PLoS Comput. Biol. 11, e1004455 ( 10.1371/journal.pcbi.1004455) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C50] 50.Tekin E, Hunt D, Newberry MG, Savage VM. 2016. Do vascular networks branch optimally or randomly across spatial scales? PLoS Comput. Biol. 12, e1005223 ( 10.1371/journal.pcbi.1005223) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C51] 51.Murray CD. 1926. The physiological principle of minimum work: I. The vascular system and the cost of blood volume. Proc. Natl Acad. Sci. USA 12, 207–214. ( 10.1073/pnas.12.3.207) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190041C52] 52.Gafiychuk V, Lubashevsky I. 2001. On the principles of the vascular network branching. J. Theor. Biol. 212, 1–9. ( 10.1006/jtbi.2001.2277) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C53] 53.Dodds PS, Rothman DH, Weitz JS. 2001. Re-examination of the ‘3/4-law’ of metabolism. J. Theor. Biol. 209, 9–27. ( 10.1006/jtbi.2000.2238) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C54] 54.West GB, Brown JH, Enquist BJ. 1997. A general model for the origin of allometric scaling laws in biology. Science 276, 122–126. ( 10.1126/science.276.5309.122) [DOI] [PubMed] [Google Scholar]

[RSIF20190041C55] 55.Chen G, Chen S, Guo W, Chen H. 2007. The multi-criteria minimum spanning tree problem based genetic algorithm. Inf. Sci. 177, 5050–5063. ( 10.1016/j.ins.2007.06.005) [DOI] [Google Scholar]

[RSIF20190041C56] 56.Khuller S, Raghavachari B, Young N. 1995. Approximating the minimum equivalent digraph. SIAM J. Comput. 24, 859–872. ( 10.1137/S0097539793256685) [DOI] [Google Scholar]

PERMALINK

Travel in city road networks follows similar transport trade-off principles to neural and plant arbors

Jonathan Y Suen

Saket Navlakha

Abstract

1. Introduction

Figure 1.

2. Related work

2.1. Analysing road structures as graphs

2.2. Models of traffic flow and routing

2.3. Relationship to biological network design and routing

2.4. Additional work in Pareto optimality of complex networks

3. Results

3.1. A framework for quantifying user-network trade-offs in road structures

3.2. Comparison to baseline networks

3.3. Quantifying optimality, i.e. the distance to the Pareto front

3.4. Using Google Maps to infer travel paths in a city

3.5. Other objectives of interest

3.6. Are travel paths in city road networks close to being Pareto optimal?

Figure 2.

Figure 3.

3.7. Where near the Pareto front do cities tend to lie?

3.8. Do other city features correlate with Pareto optimality?

Table 1.

3.9. A higher-order Pareto front to further bound travel path complexity

Figure 4.

4. Discussion

5. Methods

5.1. Travel path data

5.2. Greedy algorithm to generate a near-optimal Pareto front

5.3. Archetype analysis and traffic data

Supplementary Material

Supplementary Material

Data accessibility

Author contributions

Competing interests

Funding

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases