Abstract
The movements of individuals within and among cities influence critical aspects of our society, such as well-being, the spreading of epidemics, and the quality of the environment. When information about mobility flows is not available for a particular region of interest, we must rely on mathematical models to generate them. In this work, we propose Deep Gravity, an effective model to generate flow probabilities that exploits many features (e.g., land use, road network, transport, food, health facilities) extracted from voluntary geographic data, and uses deep neural networks to discover non-linear relationships between those features and mobility flows. Our experiments, conducted on mobility flows in England, Italy, and New York State, show that Deep Gravity achieves a significant increase in performance, especially in densely populated regions of interest, with respect to the classic gravity model and models that do not use deep neural networks or geographic data. Deep Gravity has good generalization capability, generating realistic flows also for geographic areas for which there is no data availability for training. Finally, we show how flows generated by Deep Gravity may be explained in terms of the geographic features and highlight crucial differences among the three considered countries interpreting the model’s prediction with explainable AI techniques.
Subject terms: Applied mathematics, Statistics, Geography, Computer science
The movements of individuals within and among cities influence critical aspects of our society, such as well-being, the spreading of epidemics, and the quality of the environment. Here, the authors use deep neural networks to discover non-linear relationships between geographical variables and mobility flows.
Introduction
Cities are complex and dynamic ecosystems that define where people live, how they move around, whom they interact with, and how they consume services1–5. Most of the world’s population live now in urban areas, whose evolution in structure and size influences crucial aspects of our society such as the objective and subjective well-being6–11 and the diffusion of innovations4,12,13. It is therefore not surprising that the study of human mobility has attracted particular interest in recent years3,14–17, with a particular focus on the migration between cities and from rural to urban areas18,19, the study and modeling of mobility patterns in urban environments15,20–24, the estimation of city population25–27, the migration induced by natural disasters, climate change, and conflicts28–32, the prediction of traffic and crowd flows14,33–37, and the forecasting of the spreading of epidemics38–42. Human mobility modelling has important applications in these research areas. Traffic congestion, domestic migration, and the spread of infectious diseases are processes in which the presence of mobility flows induces a net change of the spatial distribution of some quantity of interest (e.g., vehicles, population, pathogens). The ability to accurately describe the dynamics of these processes depends on our understanding of the characteristics of the underlying spatial flows and it is crucial to make cities and human settlements inclusive, safe, resilient, and sustainable43–45.
Among all relevant problems in the study of human mobility, the generation of mobility flows, also known as flow generation14,15,17, is particularly challenging. In simple words, this problem consists of generating the flows between a set of locations (e.g., how many people move from a location to another) given the demographics and geographic characteristics (e.g., population and distance) and without any historical information about the real flows.
Flow generation has attracted interest for a long time. Notably, in 1946 George K. Zipf proposed a model to estimate mobility flows, drawing an analogy with Newton’s law of universal gravitation46. This model, known as the gravity model, is based on the assumption that the number of travelers between two locations (flow) increases with the locations’ populations while decreases with the distance between them15,47. Given its ability to generate spatial flows and traffic demand between locations, the gravity model has been used in various contexts such as transport planning48, spatial economics18,49,50, and the modeling of epidemic spreading patterns51–54.
Although the gravity model has the clear advantage of being interpretable by design and of requiring a few parameters, it suffers from several drawbacks, such as the inability to accurately capture the structure of the real flows and the greater variability of real flows than expected15,47,55. Since the gravity model relies on a restricted set of variables, usually just the population and the distance between locations, flows are generated without considering information that is essential to account for the complexity of the geographical landscape, such as land use, the diversity of points of interest (POIs), and the transportation network56–59. Therefore, we need more detailed input data and more flexible models to generate more realistic mobility flows. The former can be achieved by extracting a rich set of geographical features from data sources freely available online; the latter by using powerful nonlinear models like deep artificial neural networks. Deep-learning approaches exist for a different declination of the problem, namely flow prediction: they use historical flows between geographic locations to forecast the future ones, but they are not able to generate flows between pairs of locations for which historical flows are not available14,33–37,60–65. To what extent deep learning can generate realistic flows without any knowledge about historical ones is barely explored in the literature14. Finally, since deep-learning models are not transparent, explainability is crucial to gain a deeper understanding of the patterns underlying mobility flows. We may achieve this goal using explainable AI techniques66–69, which unveil the most important variables overall as well as explain single flows between locations on the basis of their geographic characteristics.
We design an approach to flow generation that considers a large set of variables extracted from OpenStreetMap70,71, a public and voluntary geographic information system. These variables describe essential aspects of urban areas such as land use, road network features, transportation, food, health, education, and retail facilities. We use these geographical features to train a deep neural network, namely the Deep Gravity model, to estimate mobility flows between census areas in England, Italy, and New York State. We prefer neural networks over other machine learning models because they are the natural extension of the state-of-the-art model for flow generation, i.e., the singly constrained gravity model15,47, which corresponds to a multinomial logistic regression that is formally equivalent to a linear neural network with one softmax layer. Our approach is based on a nonlinear variant of the multinomial logistic regression obtained by adding hidden layers, which introduce nonlinearities and build more complex representations of the input geographic features.
We find that Deep Gravity outperforms flow generation models that use shallow neural networks or that do not exploit complex geographic features, with a relative improvement in the realism with respect to the classic gravity model of up to 66% (Italy), 246% (England), and 1076% (New York State) in highly populated areas, where flows are harder to predict because of the high number of relevant locations. Deep Gravity also has a good generalization capability, making it applicable to areas that are geographically disjoint from those used for training the model. Finally, we show how to explain Deep Gravity’s predictions on the basis of the collected geographic features. This allows us to observe that, while in Italy and New York State the nonlinear relationship between population and distance captured by the model provides the strongest contribution to predict the flow probability, in England the interplay between the various geographic features plays a key role in boosting the model’s predictions.
Results
We define a geographical region of interest, R, as the portion of territory for which we are interested in generating the flows. Over the region of interest, a set of geographical polygons called tessellation, , is defined with the following properties: (1) the tessellation contains a finite number of polygons, li, called locations, ; (2) the locations are nonoverlapping, ; (3) the union of all locations completely covers the region of interest, .
The flow, y(li, lj), between locations li and lj denotes the total number of people moving for any reason from location li to location lj per unit time. As a concrete example, if the region of interest is England and we consider commuting (i.e., home to work) trips between England postcodes, a flow y(SW1W0NY, PO167GZ) may be the total number of people that commute every day between location (postcode) SW1W0NY and location PO167GZ. The total outflow, Oi, from location li is the total number of trips per unit time originating from location li, i.e., Oi = ∑jy(li, lj).
Given a tessellation, , over a region of interest R, and the total outflows from all locations in , we aim to estimate the flows, y, between any two locations in . Note that this problem definition does not allow to use flows within the region of interest as input data. That is, we cannot use a subset of the flows between the locations in the region of interest neither historical information to generate other flows in the same region. This means that a model tested to predict flows in region R must have been trained on a different region , nonoverlapping with R.
The most common metric used to evaluate the performance of flow generation models is the Sørensen-Dice index, also called Common Part of Commuters (CPC)14,15,47, which is a well-established measure to compute the similarity between real flows, yr, and generated flows, yg:
1 |
CPC is always positive and contained in the closed interval [0, 1] with 1 indicating a perfect match between the generated flows and the ground truth and 0 highlighting bad performance with no overlap. Note that when the generated total outflow is equal to the real total outflow, as for all the models we consider in this paper, CPC is equivalent to the accuracy, i.e., the fraction of trips’ destinations correctly predicted by the model. In fact, when the generated total outflow is equal to the real total outflow, the denominator becomes 2∑i,jyr(li, lj) and the CPC measures the fraction of all trips that were assigned to the correct destination, i.e., the fraction of correct predictions or accuracy.
For a more comprehensive evaluation, we also use the Pearson correlation coefficient, the Normalized Root Mean Squared Error (NRMSE) and the Jensen-Shannon divergence (JSD), which measure the linear correlation, the error, and the dissimilarity between the distributions of the real and the generated flows, respectively14 (see Supplementary Note 1 for details).
Derivation of Deep Gravity
Deep Gravity originates from the observation that the state-of-the-art model of flow generation, the gravity model15,46,47,72, is equivalent to a shallow linear neural network. Based on this equivalence, we naturally define Deep Gravity by adding nonlinearity and hidden layers to the gravity model, as well as considering additional geographical features.
The singly constrained gravity model15,47 prescribes that the expected flow, , between an origin location li and a destination location lj is generated according to the following equation:
2 |
where Oi is the origin’s total outflow, mj is the resident population of location lj, pij is the probability to observe a trip (unit flow) from location li to location lj, β1 is a parameter and f(rij) is called deterrence function. Typically, the deterrence function f(rij) can be either an exponential, , or a power-law function, , where β2 is another parameter. In these two cases, the gravity model can be formulated as a Generalized Linear Model with a multinomial distribution73. Thanks to the linearity of the model, the maximum likelihood’s estimate of parameters β1 and β2 in Eq. (2) can be found efficiently, for example using Newton’s method, maximizing the model’s loglikelihood:
3 |
where y is the matrix of observed flows, β = [β1, β2] is the vector of parameters and the input feature vector is x(li, lj) = concat[xj, rij] for the exponential deterrence function ( for the power-law deterrence function) with . Note that the negative of loglikelihood in Eq. (3) is proportional to the cross-entropy loss, , of a shallow neural network with an input of dimension two and a single linear layer followed by a softmax layer.
This equivalence suggests to interpret the flow generation problem as a classification problem, where each observation (trip or unit flow from an origin location) should be assigned to the correct class (the actual location of destination) chosen among all possible classes (all locations in tessellation ). In practice, for each possible destination in the tessellation, the model outputs the probability that an individual from a given origin would move to that destination. To compute the average flows from an origin, these probabilities are multiplied by the origin’s total outflow. According to this interpretation, the gravity model is a linear classifier based on two explanatory variables, i.e., population and distance. The interpretation of the flow generation problem as a classification problem allows us to naturally extend the gravity model’s shallow neural network introducing hidden layers and nonlinearities.
Architecture of Deep Gravity
To generate the flows from a given origin location (e.g., li), Deep Gravity uses a number of input features to compute the probability pi,j that any of the n locations in the region of interest (e.g., lj) is the destination of a trip from li. Specifically, the model output is a n-dimensional vector of probabilities pi,j for j = 1, . . . , n. These probabilities are computed in three steps (see Fig. 1b).
First, the input vectors x(li, lj) = concat[xi, xj, ri,j] for j = 1, . . . , n are obtained performing a concatenation of the following input features: xi, the feature vector of the origin location li; xj the feature vector of the destination location lj; and the distance between origin and destination ri,j. For each origin location (e.g., li), n input vectors x(li, lj) with j = 1, . . . , n are created, one for each location in the region of interest that could be a potential destination.
Second, the input vectors x(li, lj) are fed in parallel to the same feed-forward neural network. The network has 15 hidden layers of dimensions 256 (the bottom six layers) and 128 (the other layers) with LeakyReLu74 activation function, a. Specifically, the output of hidden layer h is given by the vector z(0)(li, lj) = a(W(0) ⋅ x(li, lj)) for the first layer (h = 0) and z(h)(li, lj) = a(W(h) ⋅ z(h−1)(li, lj)) for h > 0, where W are matrices whose entries are parameters learned during training.
The output of the last layer is a scalar s(li, lj) ∈ [−∞, +∞] called score: the higher the score for a pair of locations (li, lj), the higher the probability to observe a trip from li to lj according to the model. Finally, scores are transformed into probabilities using a softmax function, , which transforms all scores into positive numbers that sum up to one. The generated flow between two locations is then obtained by multiplying the probability (i.e., the model’s output) and the origin’s total outflow.
The location feature vector xi provides a spatial representation of an area, and it contains features describing some properties of location li, e.g., the total length of residential roads or the number of restaurants therein. Its dimension, d, is equal to the total number of features considered. The location features we use include the population size of each location and geographical features extracted from OpenStreetMap70,71 belonging to the following categories:
Land-use areas (5 features): total area (in km2) for each possible land-use class, i.e., residential, commercial, industrial, retail, and natural;
Road network (3 features): total length (in km) for each different types of roads, i.e., residential, main and other;
Transport facilities (2 features): total count of Points Of Interest (POIs) and buildings related to each possible transport facility, e.g., bus/train station, bus stop, car parking;
Food facilities (2 features): total count of POIs and buildings related to food facilities, e.g., bar, cafe, restaurant;
Health facilities (2 features): total count of POIs and buildings related to health facilities, e.g., clinic, hospital, pharmacy;
Education facilities (2 features): total count of POIs and buildings related to education facilities, e.g., school, college, kindergarten;
Retail facilities (2 features): total count of POIs and buildings related to retail facilities, e.g., supermarket, department store, mall.
In addition, we include as feature of Deep Gravity the geographic distance, ri,j, between two locations li and lj, which is defined as the distance measured along the surface of the earth between the centroids of the two polygons representing the locations. All values of features for a given location (excluding distance) are normalized dividing them by the location’s area.
Each flow in Deep Gravity is hence described by 39 features (18 geographic features of the origin and 18 of the destination, distance between origin and destination, and their populations). We also consider a light version of Deep Gravity in which we just count a location’s total number of POIs without distinguishing among the categories (5 features per flow in total), and a heavy version of it in which we include the average of the geographic features of the k nearest locations to a flow’s origin and destination (e.g., 77 features per flow in total for k = 2). The performance of these two models is comparable to, or worse than, the performance of Deep Gravity (see Supplementary Note 2 and Supplementary Figs. 4 and 5).
The loss function of Deep Gravity is the cross-entropy:
4 |
where y(li, lj)/Oi is the fraction of observed flows from li that go to lj and pi,j is the model’s probability of a unit flow from li to lj. Note that the sum over i of the cross-entropies of different origin locations follows from the assumption that flows from different locations are independent events, which allows us to apply the additive property of the cross-entropy for independent random variables. The network is trained for 20 epochs with the RMSprop optimizer with momentum 0.9 and learning rate 5 ⋅ 10−6 using batches of size 64 origin locations. To reduce the training time, we use negative sampling and consider up to 512 randomly selected destinations for each origin location.
Experiments
We perform a series of experiments to estimate mobility flows in England (UK), Italy (EU), and New York State (US). In England and Italy, the mobility flows are among 885 and 1551 regions of interest, respectively, consisting of nonoverlapping square regions of 25 by 25 km2, which cover the whole of the country. Half of these regions are used to train the models and the other half are used for testing. Each region of interest is further subdivided into locations: in England we use Output Areas (OAs) provided by the UK Census, in Italy we use Census Areas (CAs) provided by the Italian census. We also consider mobility flows among 5367 Census Tracts (CTs) provided by the United States Census Bureau in New York State extracted from millions of anonymous mobile phone users’ visits to various places75. Supplementary Table 1 summarizes the characteristics of the datasets. For details on the definition of the regions of interest, the locations, their features, and the real flows used to train and validate the models, see Methods.
Our experiments aim to assess the effectiveness of the models in generating mobility flows within the region of interest belonging to the test set. Given the formal similarity between Deep Gravity (DG) and the gravity model (G), we use the latter as a baseline to assess Deep Gravity’s improved predictive performance. Indeed, the gravity model is the state-of-the-art model for flow generation and it is thus preferred to a null model in which flows are evenly distributed at random across the edges of the mobility network15,46,47. Additionally, we define two hybrid models to understand the performance gain obtained by adding either multiple nonlinear hidden layers or complex geographical features to the gravity model:
the Nonlinear Gravity model (NG) uses a feed-forward neural network with the same structure of Deep Gravity, but, similarly to the gravity model, its input features are only population and distance;
the Multi-Feature Gravity model (MFG) has the same multiple input features of Deep Gravity, including various geographical variables extracted from OpenStreetMap but, similarly to the gravity model, these features are processed by a single-layer linear neural network;
Table 1 compares the performance of the models. For England, DG has CPC = 0.32, an improvement of 39% over MFG (CPC = 0.23), 166% over NG (CPC = 0.12), and 190% over G (CPC = 0.11) (see Table 1, Supplementary Fig. 1, and Supplementary Note 3). Note that DG’s improvement on G is a common characteristic (see Fig. 3a-c). Although an overall CPC = 0.32 may seem low, we should consider that human mobility is a highly complex system: on the one hand, the number of factors influencing the decision underlying people’s displacements are far more than those captured by the available features; on the other hand, mobility flows have an intrinsic random component and hence the prediction of a single event cannot be determined in a deterministic way. Figure 2a-c compares real flows with flows generated by DG and G on a region of interest in England. As suggested by the value of CPC computed on the flows in that region of interest, DG’s network of flows is visually more similar to the real ones than G’s one, both in terms of structure and distribution of flow values.
Table 1.
Decile of population | Global metrics | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | CPC | NRMSE | Corr. | JSD | ||
England | |||||||||||||||
G | Mean CPC | 0.66 | 0.51 | 0.40 | 0.34 | 0.28 | 0.25 | 0.20 | 0.16 | 0.12 | 0.08 | ||||
std CPC | 0.18 | 0.09 | 0.07 | 0.04 | 0.04 | 0.03 | 0.03 | 0.02 | 0.02 | 0.02 | 0.11 | 0.51 | 0.35 | 0.73 | |
Mean CPC | 0.64 | 0.50 | 0.41 | 0.36 | 0.31 | 0.27 | 0.21 | 0.16 | 0.13 | 0.08 | |||||
std CPC | 0.18 | 0.07 | 0.06 | 0.07 | 0.06 | 0.04 | 0.03 | 0.02 | 0.02 | 0.02 | |||||
NG | Rel. Imp. | -1.52 | -1.88 | 0.35 | 5.79 | 6.41 | 4.41 | 3.82 | 4.46 | 3.99 | 4.53 | 0.12 | 0.45 | 0.56 | 0.72 |
Mean CPC | 0.66 | 0.52 | 0.45 | 0.41 | 0.36 | 0.36 | 0.32 | 0.30 | 0.26 | 0.19 | |||||
std CPC | 0.17 | 0.09 | 0.07 | 0.05 | 0.04 | 0.04 | 0.06 | 0.05 | 0.04 | 0.04 | |||||
MFG | Rel. Imp. | 1.29 | 1.55 | 13.46 | 20.11 | 26.89 | 43.01 | 61.43 | 87.83 | 105.64 | 139.46 | 0.23 | 0.47 | 0.48 | 0.65 |
Mean CPC | 0.67 | 0.57 | 0.50 | 0.48 | 0.44 | 0.45 | 41 | 0.39 | 0.35 | 0.28 | |||||
std CPC | 0.17 | 0.07 | 0.06 | 0.06 | 0.04 | 0.05 | 0.05 | 0.05 | 0.04 | 0.05 | |||||
DG | Rel. Imp. | 3.20 | 11.72 | 24.91 | 41.47 | 54.35 | 75.76 | 108.47 | 143.54 | 174.97 | 246.88 | 0.32 | 0.41 | 0.61 | 0.60 |
Italy | |||||||||||||||
G | Mean CPC | 0.26 | 0.38 | 0.41 | 0.37 | 0.31 | 0.29 | 0.27 | 0.24 | 0.21 | 0.13 | ||||
std CPC | 0.27 | 0.14 | 0.09 | 0.09 | 0.08 | 0.07 | 0.06 | 0.06 | 0.05 | 0.05 | 0.18 | 0.48 | 0.49 | 0.69 | |
Mean CPC | 0.31 | 0.44 | 0.48 | 0.43 | 0.38 | 0.35 | 0.34 | 0.30 | 0.25 | 0.15 | |||||
std CPC | 0.31 | 0.14 | 0.10 | 0.10 | 0.08 | 0.07 | 0.06 | 0.07 | 0.06 | 0.06 | |||||
NG | Rel. Imp. | 19.30 | 14.63 | 16.41 | 16.82 | 19.76 | 22.26 | 23.67 | 22.93 | 19.93 | 19.45 | 0.21 | 0.45 | 0.57 | 0.67 |
Mean CPC | 0.29 | 0.41 | 0.45 | 0.41 | 0.37 | 0.33 | 0.31 | 0.28 | 0.23 | 0.14 | |||||
std CPC | 0.30 | 0.16 | 0.09 | 0.09 | 0.09 | 0.07 | 0.06 | 0.07 | 0.06 | 0.06 | |||||
MFG | Rel. Imp. | 10.96 | 5.95 | 10.68 | 10.88 | 15.62 | 14.76 | 14.69 | 15.70 | 13.99 | 14.20 | 0.20 | 0.50 | 0.44 | 0.67 |
Mean CPC | 0.34 | 51 | 0.55 | 0.49 | 0.46 | 0.43 | 0.41 | 0.37 | 0.31 | 0.21 | |||||
std CPC | 0.34 | 0.16 | 0.09 | 0.10 | 0.07 | 0.08 | 0.07 | 0.08 | 0.07 | 0.08 | |||||
DG | Rel. Imp. | 30.02 | 31.62 | 32.98 | 33.26 | 43.97 | 48.46 | 49.18 | 52.35 | 51.46 | 66.02 | 0.27 | 0.43 | 0.62 | 0.63 |
New York State | |||||||||||||||
G | Mean CPC | 0.29 | 0.35 | 0.24 | 0.25 | 0.13 | 0.13 | 0.16 | 0.09 | 0.06 | 0.04 | ||||
std CPC | 0.31 | 0.28 | 0.25 | 0.26 | 0.19 | 0.18 | 0.19 | 0.14 | 0.04 | 0.02 | 0.06 | 0.74 | 0.03 | 0.78 | |
Mean CPC | 0.29 | 0.35 | 0.43 | 0.46 | 0.42 | 0.49 | 0.45 | 0.50 | 0.49 | 0.47 | |||||
std CPC | 0.31 | 0.28 | 0.20 | 0.22 | 0.19 | 0.13 | 0.16 | 0.05 | 0.02 | 0.03 | |||||
NG | Rel. Imp. | 0 | 0 | 75.71 | 81.90 | 206.35 | 278.09 | 180.91 | 405.66 | 607.17 | 1031.14 | 0.68 | 0.26 | 0.93 | 0.34 |
Mean CPC | 0.29 | 0.35 | 0.36 | 0.37 | 0.31 | 0.33 | 0.28 | 0.28 | 0.27 | 0.18 | |||||
std CPC | 0.31 | 0.28 | 0.18 | 0.19 | 0.15 | 0.10 | 0.13 | 0.07 | 0.08 | 0.03 | |||||
MFG | Rel. Imp. | 0 | 0 | 49.70 | 48.08 | 126.68 | 157.01 | 74.47 | 184.04 | 297.44 | 351.59 | 0.28 | 0.48 | 0.35 | 0.62 |
Mean CPC | 0.29 | 0.35 | 0.43 | 0.46 | 0.42 | 0.49 | 0.45 | 0.51 | 0.52 | 0.49 | |||||
std CPC | 0.31 | 0.28 | 0.20 | 0.22 | 0.19 | 0.13 | 0.16 | 0.06 | 0.05 | 0.03 | |||||
DG | Rel. Imp. | 0 | 0 | 75.80 | 82.41 | 208.03 | 282.91 | 184.33 | 416.43 | 661.03 | 1076.93 | 0.70 | 0.19 | 0.93 | 0.33 |
Comparison of the performance, in terms of Common Part of Commuters (CPC), of Gravity (G), Nonlinear Gravity (NG), Multi-Feature Gravity (MFG), and Deep Gravity (DG), for England, Italy, and New York State, varying the decile of the population of the regions of interest (side size of 25 km). We also provide a global evaluation of the goodness of each model in terms of CPC, Pearson correlation coefficient, Normalized Root Mean Squared Error (NRMSE), and the Jensen-Shannon divergence (JSD) between the distribution of real and generated flows. For each model, and for each decile of the distribution of population, we show the average CPC and the standard deviation of the CPC obtained over five runs of the model. For NG, MFG, and DG we also show the relative improvement in terms of CPC with respect to G. We put in bold the values over the deciles that correspond to the best mean CPC and relative improvement. Regardless of the evaluation metric, DG significantly improves on the other models in all the geographic areas considered.
We obtain similar results for Italy and New York State (Table 1): DG performs significantly better than the other models, with an improvement in terms of global CPC over G of 66% (Italy) and 1076% (New York State). The improvement of DG over G is again spread in all areas of the two countries (Fig. 3d-i). This difference in the performance of DG among countries may be due to several factors, such as the differences in shapes and sizes of the spatial units, sparsity of flows, and mobility data sources.
To investigate the performance of the model in high and low populated regions, we split each country’s regions of interest into ten equal-sized groups, i.e., deciles, based on their population, where decile 1 includes the regions of interest with the smaller population and decile 10 includes the regions of interest with the larger population, and we analyze the performance of the four models in each decile (Fig. 4 and Table 1). In England and Italy, all models degrade (i.e., CPC decreases) as the decile of the population increases, denoting that they are more accurate in sparsely populated regions of interest (Fig. 4a, c). This is not the case for New York State, in which the model’s performance increases slightly as the decile of the population increases (Fig. 4e). Nevertheless, in all three countries, the relative improvement of DG with respect to G increases as the population increases (Fig. 4b, d, f). In other words, the performance of DG degrades less as population increases. This is a remarkable outcome because in highly populated regions of interest there are many relevant locations, and hence predicting the correct destinations of trips is harder. DG improves especially where current models are unrealistic.
The introduction of the geographic features (MFG) and of nonlinearity and hidden layers (DG) leads to a significant improvement of the overall performance. In England, the relative improvement of MFG and DG with respect to G is significant, with values of about 139% and 246%, respectively, in the last decile of population (see Fig. 4a,b and Table 1). Even in the first decile of the population, we find a relative improvement of DG with respect to G of 3%. Similarly, in the other two countries we have an improvement of MFG and DG over G of 14% and 66% (Italy) and of 351% and 1076% (New York State), respectively. Note that DG’s improvement on G is a common characteristic, as DG improves on G in all the regions of interest for all countries (see Fig. 3 and 4). We find that the performances of NG and MFG are country specific. In England, MFG outperforms NG, as opposed to Italy and New York State where NG outperforms MFG (Fig. 4). Despite these country-specific differences, we observe a clear pattern in our results that is valid for all countries: G has always the worst performance and DG has always the best performance, while MFG and NG have intermediate performances. This confirms our hypothesis that the increased performance of DG originates from the interplay between a richer set of geographics features (present in MFG but not in NG) and the model’s nonlinearity (present in NG but not in MFG).
The performance of all models does not change significantly if we use regions of interest of 10 by 10 km2. In particular, all models have a CPC25km around 0.03 higher than CPC10km (see Supplementary Fig. 3, Supplementary Table 2, and Supplementary Note 3). However, the relative improvement of DG over G on the last decile is slightly smaller with a region of interest size of 10km (Supplementary Table 2): for example, in England it is about 220%, i.e., about 26% less than the improvement on the same decile for a region of interest size of 25 km.
Geographic transferability
Neural networks trained on spatial data may suffer from low generalization capabilities when applied to different geographical regions than the ones used for training. In the previous experimental settings, it may happen that for a large city covered by multiple regions of interest (e.g., London, Manchester) some of its locations are used during the training phase, hence leading to a good performance when applied to test locations of the same city. To investigate the model generalization capability, we design specific training and testing datasets so that a city is never seen during the training phase. This setting allows us to discover whether we can generate flows for a city where no flows have been used to train the model, a peculiarity that we cannot fully investigate if the model partially see a city (e.g., use some of the city flows during the training phase).
Given the nine England major cities, i.e., the so-called Core Cities76 and London, the training dataset contains the locations and the information of eight cities and the test set contains information on the city excluded from the training. In particular, we select 15 regions of interest corresponding to London, eight to Leeds, seven to Sheffield, five to Birmingham, four to Bristol, Liverpool, Manchester and Newcastle, and three to Nottingham. In this way, we can test whether DG is able to generalize by analyzing its performances according to a leave-one-city-out validation mechanism, i.e., generate flows on a city whose regions of interest never appear in the training set. We denote this implementation with Leave-one-city-out-DG (LDG).
LDG produces average CPCs that are remarkably close to the DG’s ones (see Supplementary Fig. 2 and Supplementary Note 4). For instance, the average CPC slightly improves by testing LDG on London’s locations using the locations of the other cities as training. We find similar results by testing the model on Newcastle, Liverpool and Nottingham. The average CPC slightly decreases when tested on Bristol and Sheffield, while it does not change significantly with respect to DG on Leeds, Birmingham and Manchester. The negligible difference between the performance DG and LDG shows that our model can generate flow probabilities also for geographic areas for which there is no data availability for training the model.
Explaining generated flows
Understanding why a model makes a certain prediction is crucial to interpret results, explain differences between models, and assess to what extent we understand the phenomenon under analysis67–69. Moreover, the Ethics Guidelines for Trustworthy AI of the EU High-Level Expert Group on AI suggest that the behavior of AI system should be transparent, explainable, and trustworthy77,78.
We use SHapley Additive exPlanations (SHAP)79,80 to understand how the input geographic features contribute to determine the output of Deep Gravity. SHAP is based on game theory81 and estimates the contribution of each feature based on the optimal Shapley value79, which denotes how the presence or absence of that feature change the model prediction of a particular instance compared to the average prediction for the dataset66 (see Methods for details). We show some insights provided by SHAP for global explanations in the three countries considered (Fig. 5) and for local explanations for an origin-destination pair in England (Fig. 6).
From a global perspective (Fig. 5), one of the most relevant features with large Shapely values is the geographic distance: as expected, a large distance between origin and destination contributes to a reduction of flow probability, while a small distance leads to an increase. The population of the destination (“D: Population” in Fig. 5) is also globally relevant, especially in Italy and New York State. In England, however, in contrast with the usual assumption of the gravity model that the flow probability is an increasing function of the population, we find that population has a mixed effect, with high values of the population’s feature (red points in Fig. 5a) that may also contribute to a decrease of the predicted flow. A possible explanation is that residential areas have a high population, but are not likely destinations of commuting trips, while other geographical features related to commercial and industrial land use, healthcare, and food are more relevant than population. For instance, locations having a large number of food facilities, retail, and industrial zones are predicted to attract commuters. On the other hand, locations with health-related POIs and commercial land use are predicted to have fewer commuters. Differently from England, in Italy and New York State (Fig. 5b,c) the populations in the origin and destination locations are the features with the strongest impact on the model output. In particular, both a small population in the origin and a large population in the destination increase the flow probability. The fact that populations and distance are more relevant than other geographic features in Italy and New York State explains why the Nonlinear Gravity model (NG) outperforms the Multi-Feature Gravity model (MFG) in these two countries: a deep-learning model that is able to capture the existing nonlinear relationship between populations and distance can accurately predict the flow probabilities, while the other geographic features only bring a marginal contribution.
Finally, we show how to explain the contribution of each feature to Deep Gravity’s prediction for a single origin-destination pair. We select two locations in England, E00137201 (population of 238 individuals) and E00137194 (population of 223 individuals) in a highly populated region of interest (in the 8th decile of population) situated in Corby, a city with about 50 thousands inhabitants (see Fig. 6a). We consider the two flows between them, i.e., from E00137201 to E00137194 and from E00137194 to E00137201. While the gravity model (G) generates identical flows because distances and populations are the same, Deep Gravity assigns different probabilities for the two flows and the Shapely values indicate that various geographical features (like transportation points and land use) are more relevant than population in this case (Fig. 6b,c). This example illustrates how DG’s predictions for individual flows depend on the various geographical variables considered, and that the most relevant features for a specific origin-destination pair can differ from the most relevant features overall (Fig. 5a). Examples of local explanations for Italy and New York State are available in Supplementary Note 5.
Discussion
The comparison of the performance of Deep Gravity with models that do not use nonlinearity or do not include the geographic information reveals several key results.
All models are generally more accurate on scarcely populated regions, which have fewer locations and trips’ destinations are thus easier to predict. More importantly, in highly populated regions where there are many relevant locations and hence predicting the correct destinations of trips is harder, the improvement of Deep Gravity with respect to its competitors becomes much higher, suggesting that our model improves especially where current models fail. We observe that Deep Gravity still outperforms all its competitors even when using a smaller region of interest size.
The addition of the geographic features is crucial to boost the realism of our approach. In this regard, Deep Gravity’s architecture allows for adding many other geographic features such as travel time with different transportation modalities, the structure of the underlying road network, or socio-economic information such as a neighborhood’s house price, degree of gentrification, and segregation. Anyhow, our analysis clearly shows that it is the combination of deep neural networks and voluntary geographic information that significantly boosts the realism in the generation of mobility flows, paving the road to a new breed of data-driven flow generation models. In this regard, as depicted by the explanations extracted from DG, the impact of the voluntary geographic information and nonlinearity varies from country to country: while in England geographic information plays the strongest role in the model’s performance, the nonlinearity predominates in Italy and New York State. More work is needed to delve into these interesting differences.
Deep Gravity is a geographic agnostic model able to generate flows between locations for any urban agglomeration, given the availability of appropriate information such as the tessellation, the total outflow per location, the population in the locations, and the information about POIs. This opens the doors to a set of intriguing questions regarding the model’s geographical explainability and transferability14. Regarding explainability, while we use agnostic techniques to explain the role of geographic variables to the model’s predictions, there is the need for more sophisticated explanations tailored for human mobility models. These explanations should take into account the peculiarities of flow generation (e.g., spatiality, networking), providing more suitable global and local explanations of mobility flows. Regarding transferability, flow generation may be extended to include the generation of each location’s total outflow to allow the application of the model to regions for which only the population and public POIs are available. Moreover, a future improvement of the model may consist in analyzing whether we can apply geographic transferability on other scales: Can we use rural areas flows to generate flows in cities? On the other hand, can we use cities’ flows to generate flows in rural areas? And can we use a model trained on an entire country to generate flows on a different one?
Methods
Regions of interest
First, we define a squared tessellation over the original polygonal shape of England. Formally, let C be the polygon composed by q vertices v1, . . . , vq that define the polygon boundary. We define the grid G as the square tessellation covering C with Lx × Ly regions of interest: , where Rij is the square cell (i, j) defined by two vertices representing the top-left and bottom-right coordinates. Depending on the nature of the problem, such tessellation can be formed with any triangle or quadrilateral tile or, as in the case of Voronoi tessellation, with tile defined as the set of points closest to one of the points in a discrete set of defining points. In this paper, we build a square grid using the tessellation builder from the python library scikit-mobility82, defining 885 regions of interest of 25 by 25 km2, which cover the whole of England. Half of these regions are used to train the models and the other half are used for testing: the regions of interest have been randomly allocated to the train and test sets in a stratified fashion based on the regions’ populations, so that the two sets have the same number of regions belonging to the various population deciles. Similarly, we have 1551 regions of interest of 25 by 25 km2 covering Italy and 475 regions of 25 by 25 km2 covering New York State.
Locations
The area covered by each region of interest is further divided into locations using a tessellation provided by the UK Census in 2011. The UK Census defines 232,296 nonoverlapping polygons called census Output Areas (OAs), which cover the whole of England. By construction, OAs should all contain a similar number of households (125), hence in cities and urban areas where population density is higher, there is a larger number of OAs and they have a smaller size than average. For a given region of interest Rij, its locations are defined as all OAs whose centroids are contained in Rij. Unfortunately, information about real commuting flows at country level are provided by official statistics bureaus at the level of OAs only, which are administrative units of different shapes. It is not possible to aggregate or disaggregate flows between OAs onto flows between locations of the same size (e.g., using a squared tessellation) without introducing significant distortions in the data or obtaining aggregated locations of size much larger than the area of the largest OA. Regarding Italy, the Italian national statistics bureau (ISTAT) defines 402,678 nonoverlapping polygons known as Census Areas (CAs), which cover the entire country. Similarly to England, for a given region of interest, its locations are defined as all Census Areas (CAs) whose centroids are contained in Ri,j. Finally, the US Census Bureau defines Census Tracts (CTs) with characteristic similar to those of England and Italy. In particular, in New York State there are 5367 CTs.
Location features
We collect information about the geographic features of each location from OpenStreetMap (OSM)70,71, an online collaborative project aimed to create an open source map of the world with geographic information collected from volunteers. The OSM data contain three types of geographical objects: nodes, lines and polygons. Nodes are geographic points, stored as latitude and longitude pairs, which represent points of interests (e.g., restaurants, schools). Lines are ordered lists of nodes, representing linear features such as streets or railways. Polygons are lines that form a closed loop enclosing an area and may represent, for example, land use or buildings. We use OpenStreetMap (OSM) data to compute the 18 geographic features for the origin and 18 for the destination. Regarding the population, we include the number of inhabitants for each location as an input feature and we use the number of residents in each OA provided by the UK Census for the year 2011 and for each CA provided by the Italian Census for the year 2011, and the number of people estimated in each CT for New York State computed as the sum of outgoing flows from each CT.
Mobility flows
The UK Census collects information about commuting flows between OAs. We use the commuting flows collected by the UK Census in 2011 and consider flows that have origin location and destination location in the same region of interest only. The UK Census covers 30,008,634 commuters with an average flow of 1.78 and a standard deviation of 3.21. The Italian census covers 15,003,287 commuters with an average flow of 2.07 and a standard deviation of 4.27. Finally, in New York State, there are 41,070,279 commuters with an average of 66.86 people traveling between CTs and a standard deviation of 364.58.
SHAP explanations
SHAP (SHapley Additive exPlanations) applies a game theoretic approach to explain the output of any machine learning model79. It relies on the Shapley values from game theory81, which connect optimal credit allocation with local explanations. Shapley values consist in the average of the marginal contributions across all the permutations of the players solving a game. They are obtained by composing a combination of variables and their average change depending on the presence or absence of the variables to determine the importance of a single variable based on game theory79. Based on this idea, SHAP values are used as a unified measure of feature importance. The interpretation of the SHAP value for variable value j is: the value of the jth variable contributed ϕj to the prediction of a particular instance compared to the average prediction for the dataset66. SHAP values allow us to give both a global and local explainability of Deep Gravity. In the first case, we use the collective SHAP values to understand which predictors (i.e., geographic variables, distance, populations) contributed either positively or negatively to the prediction. In the latter, we use single observations or smaller sets of observations (e.g., a specific decile) to both understand which features played a role in a specific prediction or, more in general, if the set of features used to predict flows in different deciles vary and how much it changes.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
L.P. has been partially supported by EU project SoBigData++ grant agreement 871042. F.S. has been supported by EPSRC (EP/P012906/1). This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357. We acknowledge the OpenStreetMap contributors, OpenStreetMap data are available under the Open Database License and licensed as CC BY-SA https://creativecommons.org/licenses/by-sa/2.0/. We thank Daniele Fadda for his support on data visualization and plots design.
Author contributions
F.S. designed the model and collected the data for England and Italy. M.L. collected the data for US. M.L. and L.P. performed the experiments. L.P. directed the study. All authors contributed to interpreting the results and writing the paper. G.B. developed this work prior joining Amazon.
Data availability
The commuting data for England are freely available at https://census.ukdataservice.ac.uk/use-data/guides/flow-data.aspx and https://census.ukdataservice.ac.uk/use-data/guides/boundary-data. he commuting data for Italy are freely available at http://datiopen.istat.it/datasetPND.php and https://www.istat.it/it/archivio/104317. The flows data in New York State are freely available at github.com/GeoDS/COVID19USFlows and are described in Kang et al.75.
Code availability
The Python code of Deep Gravity is available at github.com/scikit-mobility/DeepGravity. The version of the code used to make the experiments in this paper is available at ref. 83.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-26752-4.
References
- 1.Batty, M. The New Science of Cities (MIT press, 2013).
- 2.Byrne D. Class and ethnicity in complex cities: the cases of leicester and bradford. Environ. Plan. A Econ. Space. 1998;30:703–720. [Google Scholar]
- 3.Andrienko, G. et al. (so) big data and the transformation of the city. Int. J. Data Sci. Anal.11, 311–340 (2021).
- 4.Bettencourt LM, Lobo J, Helbing D, Kühnert C, West GB. Growth, innovation, scaling, and the pace of life in cities. Proc. Natl Acad. Sci. USA. 2007;104:7301–7306. doi: 10.1073/pnas.0610172104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.De Nadai, M. et al. The death and life of great italian cities: a mobile phone data perspective. in Proc. 25th international conference on world wide web, 413–423 (International World Wide Web Conferences, 2016).
- 6.Voukelatou, V. et al. Measuring objective and subjective well-being: dimensions and data sources. Int. J. Data Sci. Anal.10.1007/s41060-020-00224-2 (2020).
- 7.Bettencourt LM, Lobo J, Strumsky D, West GB. Urban scaling and its deviations: revealing the structure of wealth, innovation and crime across cities. PLoS ONE. 2010;5:e13541. doi: 10.1371/journal.pone.0013541. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Pappalardo L, et al. An analytical framework to nowcast well-being using mobile phone data. Int. J. Data Sci. Anal. 2016;2:75–92. [Google Scholar]
- 9.Pappalardo, L., Pedreschi, D., Smoreda, Z. & Giannotti, F. Using big data to study the link between human mobility and socio-economic development. in 2015 IEEE International Conference on Big Data (Big Data), 871–878 (IEEE, 2015).
- 10.Soto, V., Frias-Martinez, V., Virseda, J. & Frias-Martinez, E. Prediction of socioeconomic levels using cell phone records. in International Conference on User Modeling, Adaptation, and Personalization, 377–388 (Springer, 2011).
- 11.De Nadai, M., Xu, Y., Emmanuel, L., González, M. C. & Lepri, B. Socio-economic, built environment, and mobility conditions associated with crime: a study of multiple cities. Sci. Rep.10, 13871 (2020). [DOI] [PMC free article] [PubMed]
- 12.Chen D, Gao H, Luo J, Ma Y. The effects of rural–urban migration on corporate innovation: evidence from a natural experiment in china. Financial Manag. 2020;49:521–545. [Google Scholar]
- 13.Lissoni F. International migration and innovation diffusion: an eclectic survey. Regional Stud. 2018;52:702–714. [Google Scholar]
- 14.Luca, M., Barlacchi, G., Lepri, B. & Pappalardo, L. A survey on deep learning for human mobility. arXivhttps://arxiv.org/abs/2012.02825 (2020).
- 15.Barbosa H, et al. Human mobility: models and applications. Phys. Rep. 2018;734:1–74. [Google Scholar]
- 16.Pappalardo, L., Barlacchi, G., Pellungrini, R. & Simini, F. Human mobility from theory to practice: Data, models and applications. in Companion Proceedings of The 2019 World Wide Web Conference, 1311–1312 (Association for Computing Machinery, 2019).
- 17.Wang, J., Kong, X., Xia, F. & Sun, L. Urban human mobility: Data-driven modeling and prediction. in ACM SIGKDD Explorations Newsletter, 1–19 (Association for Computing Machinery, 2019).
- 18.Prieto Curiel R, Pappalardo L, Gabrielli L, Bishop SR. Gravity and scaling laws of city to city migration. PLoS ONE. 2018;13:1–19. doi: 10.1371/journal.pone.0199892. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Sirbu A, et al. Human migration: the big data perspective. Int. J. Data Sci. Anal. 2021;11:341–360. [Google Scholar]
- 20.González MC, Hidalgo CA, Barabási A-L. Understanding individual human mobility patterns. Nature. 2008;453:779 EP –. doi: 10.1038/nature06958. [DOI] [PubMed] [Google Scholar]
- 21.Pappalardo L, et al. Returners and explorers dichotomy in human mobility. Nat. Commun. 2015;6:8166. doi: 10.1038/ncomms9166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Song C, Qu Z, Blumm N, Barabási A-L. Limits of predictability in human mobility. Science. 2010;327:1018–1021. doi: 10.1126/science.1177170. [DOI] [PubMed] [Google Scholar]
- 23.Pappalardo L, Simini F. Data-driven generation of spatio-temporal routines in human mobility. Data Min. Knowl. Discov. 2018;32:787–829. doi: 10.1007/s10618-017-0548-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Alessandretti L, Sapiezynski P, Sekara V, Lehmann S, Baronchelli A. Evidence for a conserved quantity in human mobility. Nat. Hum. Behav. 2018;2:485–491. doi: 10.1038/s41562-018-0364-x. [DOI] [PubMed] [Google Scholar]
- 25.Deville P, et al. Dynamic population mapping using mobile phone data. Proc. Natl Acad. Sci. USA. 2014;111:15888–15893. doi: 10.1073/pnas.1408439111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Pappalardo L, Ferres L, Sacasa M, Cattuto C, Bravo L. Evaluation of home detection algorithms on mobile phone data using individual-level ground truth. EPJ Data Sci. 2021;10:29. doi: 10.1140/epjds/s13688-021-00284-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Vanhoof, M., Lee, C. & Smoreda, Z. in Performance and Sensitivities of Home Detection on Mobile Phone Data, Chap. 8, 245–271 (John Wiley & Sons, Ltd, 2020).
- 28.Gray CL, Mueller V. Natural disasters and population mobility in bangladesh. Proc. Natl Acad. Sci. USA. 2012;109:6000–6005. doi: 10.1073/pnas.1115944109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Paul BK. Evidence against disaster-induced migration: the 2004 tornado in north-central bangladesh. Disasters. 2005;29:370–385. doi: 10.1111/j.0361-3666.2005.00298.x. [DOI] [PubMed] [Google Scholar]
- 30.Reuveny R. Climate change-induced migration and violent conflict. Political Geogr. 2007;26:656–673. [Google Scholar]
- 31.Salah, A. A. et al. Data for refugees: the d4r challenge on mobility of syrian refugees in turkey. arXiv preprint arXiv:1807.00523 (2018).
- 32.Myers CA, Slack T, Singelmann J. Social vulnerability and migration in the wake of disaster: the case of hurricanes katrina and rita. Popul. Environ. 2008;29:271–291. [Google Scholar]
- 33.Jayarajah, K., Tan, A. & Misra, A. Understanding the interdependency of land use and mobility for urban planning. in Proc. 2018 ACM International Joint Conference and 2018 International Symposium on Pervasive and Ubiquitous Computing and Wearable Computers, UbiComp ’18, 1079-1087 (Association for Computing Machinery, 2018).
- 34.Yuan, H., Li, G. A Survey of Traffic Prediction: from Spatio-Temporal Data to Intelligent Transportation. Data Sci. Eng. 6, 63–85 10.1007/s41019-020-00151-z (2021).
- 35.Xie P, et al. Urban flow prediction from spatiotemporal data using machine learning: a survey. Inf. Fusion. 2020;59:1–12. [Google Scholar]
- 36.Ebrahimpour Z, Wan W, Cervantes O, Luo T, Ullah H. Comparison of main approaches for extracting behavior features from crowd flow analysis. ISPRS Int. J. Geo Inf. 2019;8:440. [Google Scholar]
- 37.Shi, Y., Feng, H., Geng, X., Tang, X. & Wang, Y. A survey of hybrid deep learning methods for traffic flow prediction. in Proc. 2019 3rd International Conference on Advances in Image Processing, ICAIP 2019, 133–138 (Association for Computing Machinery, 2019).
- 38.Pepe E, et al. Covid-19 outbreak response, a dataset to assess mobility changes in italy following national lockdown. Sci. Data. 2020;7:1–7. doi: 10.1038/s41597-020-00575-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Lai S, Farnham A, Ruktanonchai NW, Tatem AJ. Measuring mobility, disease connectivity and individual risk: a review of using mobile phone data and health for travel medicine. J. Travel Med. 2019;26:taz019. doi: 10.1093/jtm/taz019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Ruktanonchai NW, et al. Assessing the impact of coordinated covid-19 exit strategies across europe. Science. 2020;369:1465–1470. doi: 10.1126/science.abc5096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Kraemer MU, et al. The effect of human mobility and control measures on the covid-19 epidemic in china. Science. 2020;368:493–497. doi: 10.1126/science.abb4218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Oliver, N. et al. Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle. Sci. Adv.6, eabc0764 (2020). [DOI] [PMC free article] [PubMed]
- 43.Le Blanc D. Towards integration at last? the sustainable development goals as a network of targets. Sustain. Dev. 2015;23:176–187. [Google Scholar]
- 44.Kroll C, Warchold A, Pradhan P. Sustainable development goals (sdgs): are we successful in turning trade-offs into synergies? Palgrave Commun. 2019;5:1–11. [Google Scholar]
- 45.United Nations General Assembly. Transforming our world: the 2030 agenda for sustainable development. Tech. Rep. https://sdgs.un.org/2030agenda (2015).
- 46.Zipf GK. The p 1 p 2/d hypothesis: on the intercity movement of persons. Am. Sociol. Rev. 1946;11:677–686. [Google Scholar]
- 47.Lenormand M, Bassolas A, Ramasco JJ. Systematic comparison of trip distribution laws and models. J. Transp. Geogr. 2016;51:158–169. [Google Scholar]
- 48.Erlander, S. & Stewart, N. F. The Gravity Model in Transportation Analysis: Theory and Extensions, vol. 3 (Vsp, 1990).
- 49.Karemera D, Oguledo VI, Davis B. A gravity model analysis of international migration to north america. Appl. Econ. 2000;32:1745–1755. [Google Scholar]
- 50.Patuelli R, Reggiani A, Gorman SP, Nijkamp P, Bade F-J. Network analysis of commuting flows: a comparative static approach to german data. Netw. Spat. Econ. 2007;7:315–331. [Google Scholar]
- 51.Balcan D, et al. Modeling the spatial spread of infectious diseases: the global epidemic and mobility computational model. J. Comput. Sci. 2010;1:132–145. doi: 10.1016/j.jocs.2010.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Li X, Tian H, Lai D, Zhang Z. Validation of the gravity model in predicting the global spread of influenza. Int. J. Environ. Res. Public Health. 2011;8:3134–3143. doi: 10.3390/ijerph8083134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Cevik, S. Going Viral: A Gravity Model of Infectious Diseases and Tourism Flows. Open Econ Rev. 10.1007/s11079-021-09619-5 (Springer, 2021).
- 54.Zhang Y, Zhang A, Wang J. Exploring the roles of high-speed train, air and coach services in the spread of covid-19 in china. Transp. Policy. 2020;94:34 – 42. doi: 10.1016/j.tranpol.2020.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Simini F, González MC, Maritan A, Barabási A-L. A universal model for mobility and migration patterns. Nature. 2012;484:96. doi: 10.1038/nature10856. [DOI] [PubMed] [Google Scholar]
- 56.Zhang, C. et al. React: Online multimodal embedding for recency-aware spatiotemporal activity modeling. in Proc. 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 245–254 (ACM, 2017).
- 57.Krumm, J. & Krumm, K. Land use inference from mobility traces. in Proc. 3rd ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery, 1–4 (ACM, 2019).
- 58.Rossi, A., Barlacchi, G., Bianchini, M. & Lepri, B. Modelling taxi drivers’ behaviour for the next destination prediction. in IEEE Transactions on Intelligent Transportation Systems (IEEE, 2019).
- 59.Barlacchi, G., Rossi, A., Lepri, B. & Moschitti, A. Structural semantic models for automatic analysis of urban areas. in Joint European Conference on Machine Learning and Knowledge Discovery in Databases, 279–291 (Springer, 2017).
- 60.Iwata, T. & Shimizu, H. Neural collective graphical models for estimating spatio-temporal population flow from aggregated data. in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, 3935–3942 (AAAI Press, 2019).
- 61.Rong, C., Feng, J. & Li, Y. Deep learning models for population flow generation from aggregated mobility data. in Adjunct Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers, 1008–1013 (Association for Computing Machinery, 2019).
- 62.Tanaka, Y., Iwata, T., Kurashima, T., Toda, H. & Ueda, N. Estimating latent people flow without tracking individuals. in IJCAI, 3556–3563 (AAAI Press, 2018).
- 63.Zhang, J., Zheng, Y. & Qi, D. Deep spatio-temporal residual networks for citywide crowd flows prediction. in Thirty-First AAAI Conference on Artificial Intelligence (AAAI Press, 2017).
- 64.Iwata T, Shimizu H, Naya F, Ueda N. Estimating people flow from spatiotemporal population data via collective graphical mixture models. ACM Trans. Spat. Algorithms Syst. 2017;3:1–18. [Google Scholar]
- 65.Robinson, C. & Dilkina, B. A machine learning approach to modeling human migration. in Proc. 1st ACM SIGCAS Conference on Computing and Sustainable Societies, COMPASS ’18 (Association for Computing Machinery, 2018).
- 66.Molnar, C. Interpretable Machine Learning (Lulu. com, 2020).
- 67.Guidotti, R. et al. A survey of methods for explaining black box models. ACM Comput. Surv. 10.1145/3236009 (2018).
- 68.Burkart N, Huber MF. A survey on the explainability of supervised machine learning. J. Artif. Intell. Res. 2021;70:245–317. [Google Scholar]
- 69.Hofman JM, et al. Integrating explanation and prediction in computational social science. Nature. 2021;595:181–188. doi: 10.1038/s41586-021-03659-0. [DOI] [PubMed] [Google Scholar]
- 70.OpenStreetMap contributors. Planet dump. https://planet.osm.org. https://www.openstreetmap.org. (2017).
- 71.Mooney, P. & Minghini, M. in Mapping and the Citizen Sensor, (eds. Foody, G. et al.) Chap. 3, 37–60 (Ubiquity Press, 2017).
- 72.Barthélemy M. Spatial networks. Phys. Rep. 2011;499:1–101. [Google Scholar]
- 73.Agresti, A. Foundations of Linear and Generalized Linear Models (John Wiley & Sons, 2015).
- 74.Nair, V. & Hinton, G. E. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th International Conference on International Conference on Machine Learning (Omnipress, 2010).
- 75.Kang Y, et al. Multiscale dynamic human mobility flow dataset in the us during the covid-19 epidemic. Sci. Data. 2020;7:1–13. doi: 10.1038/s41597-020-00734-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.CoreCitiesUK. Core cities uk. https://www.corecities.com/. (2021).
- 77.Commission, E. Ethics guidelines for trustworthy ai. https://digital-strategy.ec.europa.eu/en/library/ethics-guidelines-trustworthy-ai. (2019).
- 78.Smuha NA. The eu approach to ethics guidelines for trustworthy artificial intelligence. Computer Law Rev. Int. 2019;20:97–106. [Google Scholar]
- 79.Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. in Advances in Neural Information Processing Systems (eds. Guyon, I. et al.) vol. 30, 4765–4774 (Curran Associates, Inc., 2017).
- 80.Lundberg, S. M., Erion, G. G. & Lee, S.-I. Consistent individualized feature attribution for tree ensembles. arXiv preprint arXiv:1802.03888 (2018).
- 81.Štrumbelj E, Kononenko I. Explaining prediction models and individual predictions with feature contributions. Knowl. Inf. Syst. 2014;41:647–665. [Google Scholar]
- 82.Pappalardo, L., Simini, F., Barlacchi, G. & Pellungrini, R. scikit-mobility: A python library for the analysis, generation and risk assessment of mobility data. arXiv preprint arXiv:1907.07062 (2019).
- 83.Simini, F., Barlacchi, G., Luca, M. & Pappalardo, L. Deep gravity (1.1.0). Zenodo. 10.5281/zenodo.5573573 (2021). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The commuting data for England are freely available at https://census.ukdataservice.ac.uk/use-data/guides/flow-data.aspx and https://census.ukdataservice.ac.uk/use-data/guides/boundary-data. he commuting data for Italy are freely available at http://datiopen.istat.it/datasetPND.php and https://www.istat.it/it/archivio/104317. The flows data in New York State are freely available at github.com/GeoDS/COVID19USFlows and are described in Kang et al.75.
The Python code of Deep Gravity is available at github.com/scikit-mobility/DeepGravity. The version of the code used to make the experiments in this paper is available at ref. 83.