Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2021 Jul 12;17(7):e1009162. doi: 10.1371/journal.pcbi.1009162

Detecting behavioural changes in human movement to inform the spatial scale of interventions against COVID-19

Hamish Gibbs 1,*, Emily Nightingale 2, Yang Liu 1, James Cheshire 3, Leon Danon 4,5,6, Liam Smeeth 7, Carl A B Pearson 1, Chris Grundy 1; LSHTM CMMID COVID-19 working group1,, Adam J Kucharski 1, Rosalind M Eggo 1,*
Editor: Alex Perkins8
PMCID: PMC8297940  PMID: 34252085

Abstract

On March 23 2020, the UK enacted an intensive, nationwide lockdown to mitigate transmission of COVID-19. As restrictions began to ease, more localized interventions were used to target resurgences in transmission. Understanding the spatial scale of networks of human interaction, and how these networks change over time, is critical to targeting interventions at the most at-risk areas without unnecessarily restricting areas at low risk of resurgence. We use detailed human mobility data aggregated from Facebook users to determine how the spatially-explicit network of movements changed before and during the lockdown period, in response to the easing of restrictions, and to the introduction of locally-targeted interventions. We also apply community detection techniques to the weighted, directed network of movements to identify geographically-explicit movement communities and measure the evolution of these community structures through time. We found that the mobility network became more sparse and the number of mobility communities decreased under the national lockdown, a change that disproportionately affected long distance connections central to the mobility network. We also found that the community structure of areas in which locally-targeted interventions were implemented following epidemic resurgence did not show reorganization of community structure but did show small decreases in indicators of travel outside of local areas. We propose that communities detected using Facebook or other mobility data be used to assess the impact of spatially-targeted restrictions and may inform policymakers about the spatial extent of human movement patterns in the UK. These data are available in near real-time, allowing quantification of changes in the distribution of the population across the UK, as well as changes in travel patterns to inform our understanding of the impact of geographically-targeted interventions.

Author summary

Large-scale intensive interventions in response to the COVID-19 pandemic have been implemented globally, significantly affecting human movement patterns. Mobility data provide an empirical measurement of UK travel within a spatially-explicit network, but it is not clear how the structure of that network changed in response to national or locally-targeted interventions. We use daily mobility data aggregated from Facebook users to quantify changes in the travel network in the UK during the national lockdown, and in response to local interventions. We identified changes in human behaviour in response to interventions and identified the community structure inherent in these networks. This approach can help quantify the extent of strongly connected communities of interaction and their relationship to the extent of spatially-explicit interventions. We show that spatial mobility data available in near real-time can give information on connectivity that can be used to understand the impact of geographically-targeted interventions and in the future, to inform these intervention strategies.

Introduction

Fine-scale geographic monitoring of large populations can potentially increase the accuracy and responsiveness of epidemiological modelling, outbreak response, and intervention planning in response to public health emergencies like the COVID-19 pandemic [16]. Population and mobility datasets collected from the movement of individuals’ mobile phones provide empirical, near-real time metrics of population movement between different geographic regions [68]. The COVID-19 pandemic response has coincided with the availability of new data sources for measuring human movement, aggregated from mobile devices by network providers and popular applications including Google Maps, Apple Maps, Citymapper, and Facebook [7,9,10].

The spread of COVID-19 through travel networks has been demonstrated in China, where connectivity to Wuhan was shown to predict the timing of arrival of COVID-19 cases in each region [11,12]. The volume of mobility has also been shown to correlate with transmission of COVID-19, where movement can be used as a proxy for the measurement of the degree of social distancing [13]. Travel and movement behavior during epidemics may also change in response to imposed interventions, perceived risk, and due to seasonal activities such as vacations [11,12]. During the COVID-19 pandemic, mobility data has been used to assess adherence to movement restrictions [13,14], the impact of movement restrictions on the transmission dynamics of COVID-19 [1517], and demonstrate differential adherence to movement restrictions among socioeconomic groups [1821].

In this analysis, we use movement and population data provided by Facebook from March 10 to November 1 2020, which records approximately 15 million daily locations of 4.8 million users [7]. We also used population, age, ethnicity, and Index of Multiple Deprivation (IMD) data from UK national statistics agencies to understand the population of users recorded in the movement and population data [2225]. We identify changes in travel behavior in response to initially stringent movement restrictions (March to May 2020) and subsequent easing of restrictions, paired with a policy of spatially-targeted interventions in response to local resurgences (May to October 2020). Using network analytic methods to understand the structure of interconnected communities in the movement network, we trace the evolution of geographic communities through time, comparing them to intervention measures implemented in response to local resurgences.

Materials and methods

Ethics statement

This research was approved by the LSHTM Observational Research Ethics Committee (ref 16834–1).

Facebook data

Data provided by the Facebook Data for Good partner program [7] uses aggregated and anonymized user data to record user locations in grid cells (S1 Fig). This data is generated from the population of Facebook users with location services actively enabled and is released approximately 48 hours after data collection.

We used movement data, describing users’ modal location in map cells in sequential 8 hour periods. This means that, in each period, a user is assigned to the cell in which they record the largest number of locations. For this individual, the beginning and end points of a network edge are defined as their location in sequential periods. This individual’s movement is then aggregated with others to form a weighted Origin-Destination matrix of movement between grid cells. Edges with fewer than 10 travellers were removed by Facebook prior to data sharing to preserve privacy (S2 Fig). Any cell that did not record any between-cell travels with greater than 10 travellers in a given time window was omitted from the dataset, regardless of whether that cell recorded an internal number of users greater than 10. In our network analysis, we constructed a weighted, directed network where nodes were cells, and edge weights were the number of users observed travelling between cells.

We use movement outside of local areas as a measurement of movement from one cell to any other cell in the network. The movement data also records connections from one cell to the same cell in sequential periods. This type of movement may indicate movement within a local area (less than the area of a given cell), or completely stationary individuals. Because of the spatial resolution of the data, it is not possible to quantify the volume of within cell movement that corresponds to each of these behaviours within the same cell. We therefore use the volume of movement outside of the cell as a measurement of the overall movement behaviour, making the assumption that travellers leaving a cell will primarily return to the same cell in sequential periods.

Bing maps tile system

Movement data is referenced to the Bing Maps Tile System, a standard geospatial reference used primarily for serving web maps [26]. The system is divided into 23 zoom levels ranging from global level 1, (map scale: 1:295,829,355.45) to detailed level 23, (map scale: 1:70.53). Each Bing Map cell is identified by a “quadkey”, or unique identifier of the zoom level and pixel coordinates of an individual cell. In this analysis, all mobility and census datasets were referenced to Bing Maps cells. The movement dataset was referenced to cells at zoom level 12 (approximately 4.8 to 6.2 km2 in the UK—measured at 60.77° and 50.59° respectively). The ground resolution of cells varies with latitude, with cells at higher latitudes covering a smaller ground area than those at lower latitudes. This distortion is caused by the distortion inherent in the Web Mercator projection (EPSG:3857) used by the Bing Maps Tile System.

Demographic information

We compared the age, population, ethnicity, and IMD of each cell to the population of users recorded in the movement data to identify relationships between the percentage of Facebook users and these demographic factors [2732]. We extracted these variables from national statistics agencies (Office for National Statistics, Northern Ireland Statistics and Research Agency, Scottish Government, and Welsh Government) and aggregated them to grid cells. Census variables were referenced to different statistical units by country. In Northern Ireland, census variables were referenced to Super Output areas (SOAs), in England and Wales, Lower Super Output areas (LSOAs), and in Scotland, Data Zones (DZs). Detailed population data was also collected from national statistics agencies, providing a measure of population for Small Areas (Northern Ireland), Output Areas (OA; England and Wales), and Data Zones (Scotland) [33].

Census variables referenced to different national statistical areas were aggregated to align with mobility datasets at zoom level 12. First, we combined the 2011 population weighted centroids of each OA (or equivalent) from the UK Census with 2020 mid-year population estimates in each UK country. We then assigned each OA centroid to the cell it falls within. We then joined 2011-derived census variables (Age, Ethnicity and IMD) to the OA centroids and computed an average of each census variable for each cell, weighted by the OA population estimates. For IMD data (recorded as ranks) we ranked the weighted average values to create a rank of cells by their population weighted IMD. OAs are much more granular than cells and therefore nested within them in the majority of cases, minimising the risk of the cells detrimentally intersecting OAs during the spatial assignment of demographic variables.

To assess the correlation between census variables and the proportion of Facebook users in each cell, we computed the Pearson correlation coefficient and two-sided p-values between the proportion of Facebook users in a cell and each census variable.

Temporal aggregation

Both Facebook movement and population data are recorded in 8 hour intervals. These data display strong and consistent intraday and intraweek patterns. To isolate changes in daily mobility, data collected in 8 hour periods were aggregated to daily periods by taking the sum of the observed number of travellers along an edge for all periods within a day.

Baseline population estimates

To obtain an accurate measurement of the number of users in the Facebook movement and population datasets relative to census population estimates, we used baseline measurements of travel from one cell to the same cell in sequential time periods computed during the 45 days prior to the creation of the data collection, from January 29th to March 9th 2020. This baseline population recorded the median number of users in a cell for each daily time window in the reference period.

Edge betweenness centrality

We computed the edge betweenness centrality of the movement network, a measure of which edges are most responsible for maintaining connection between different parts of the network. Betweenness centrality was calculated for the weighted, directed network. Comparing this measure of centrality provides information about changes in network topology, the volume of travel along the network, and which connections might be preferred targets for interventions.

Community detection

Community detection methods are algorithms for identifying groups of meaningfully connected vertices in a network. Many methods exist, with various tradeoffs on computational performance, resolution, or other characteristics [3437]. Different community detection methods produce different results because of differences in the network characteristics that they use to define communities. To understand the robustness of the communities detected in this study, we employed two different algorithms, InfoMap and Leiden. InfoMap is a community detection algorithm based on the Map equation which records the movements of a random walker along the network. The algorithm tests network partitions, attempting to identify the partition that minimizes the description length required to describe the walker’s path [38]. In other words, the algorithm compresses the description of movements in parts of the network where the walker spends a large amount of time, thereby forming communities from these strongly connected sections of the network. The Leiden algorithm maximizes the modularity of different node partitions. To do this, the algorithm assigns network nodes to a partition, assesses the modularity of the partitioned groups, aggregates the nodes in a given partition, and repeats this process until there is no further improvement in the modularity of the partitions. This process results in a partition in which communities possess stronger connections to members of their own community than to other network nodes [39,40]. There is a penalty for both algorithms if they identify a number of communities larger than the minimum number required to maximize their respective objective functions (InfoMap: description length, Leiden: modularity).

We compared the effect of the different community detection algorithms, and found that they aligned hierarchically, where the Leiden algorithm identified geographically larger communities. If the communities detected by one method are largely a superset of the communities detected by another, with shared boundaries between the defined communities, this likely represents a differing hierarchical structure, compared to a different interpretation of community structure. We assessed the agreement between community detection methods to understand the stability of detected communities by comparing the proportion of nodes in each community detected using InfoMap with all communities determined using Leiden, and vice versa (S9 Fig). This comparison allows for the computation of the proportion of shared nodes between both algorithms. The maximum and mean overlap of communities in each algorithm helped to identify the agreement between each method of community detection. In general we found that Leiden detected larger communities, for which the InfoMap communities were (for the most part) sub-communities.

Community label inheritance

The community detection methods used in this study identified communities each day. To track the evolution of communities over the study period, we employed a heuristic approach, assigning the label of a given community identified in a certain time step to that community with the highest number of shared nodes in the following time step [41]. When multiple communities in a certain time step “claim” the same community in the following timestep, the community with the closest size to the community in the following timestep “wins” the right to pass its own label to the following timestep.

COVID-19 data

We used confirmed COVID-19 cases from the UK Pillar 1 and Pillar 2 testing schemes available for England [42]. Pillar 1 is predominantly hospital-based tests including patients and health care workers. Pillar 2 is symptomatic community testing on demand, and represents the bulk of the testing in the UK. Data on the number of confirmed SARS-CoV-2 positive tests by specimen date were available at the Lower Tier Local Authority (LTLA) level [43].

To compare confirmed COVID-19 cases to movement indicators, we measured the total proportion of travellers leaving a grid cell in monthly periods for all cells in England. Cells were then assigned to LTLAs by their maximum areal overlap.

Results

Movement patterns observed

To understand the representativeness of Facebook data to the general population, we explored the size of the population of Facebook users included in the movement dataset, and compared this population to 2019 UK census population estimates. Facebook recorded an average of 4.5 million users per 8 hour period, ranging from 5.8 million on March 29th between 4pm and midnight, to 3.7 million on August 9th between midnight and 8am (Fig 1A).

Fig 1. Characteristics of the between-cell Facebook mobility dataset.

Fig 1

a) The daily average number of users included in the movement dataset. b) The probability density functions of the percentage of Facebook users in the census population for cells by country. c) The relationship between the number of Facebook users and census population for all cells. d) The spatial distribution of the percentage of Facebook users in the census population for all cells. The proportion of Facebook users to population is homogenous between regions although there are local variations in the proportion of Facebook users to population. White cells are censored due to low numbers. Note that 12 cells around the town of Swindon are missing due to a data processing error prior to data sharing. Base map data from Natural Earth [55].

The percentage of Facebook users per grid cell was comparable in the four nations of the UK (Fig 1B). The population of Facebook users was highly correlated (R = 0.97, p < 2.2e-16) with census population estimates for all cells (Fig 1C) although there is variance in the proportion of Facebook users to census population across cells, t (Fig 1D). We compared the proportion of Facebook users to identify whether the distribution of Facebook users was biased in relation to a specific demographic variable, finding no strong associations between the percentage of Facebook users and the average age, percent minority ethnic, population density, or IMD of each cell (S3 and S5 Figs).

Using COVID-19 case data at LTLA level in England, we identified a consistent association between the proportion of users travelling outside of grid cells and the number of cases in LTLAs per month during the study period (Figs 2 and S6). While the strength of this association varies at different stages of the pandemic, it supports previous research demonstrating the relationship between increased rates of movement, which we measure as the percentage of travel outside of local areas, and increased COVID-19 incidence.

Fig 2. The relationship between movement and COVID-19 cases per capita.

Fig 2

a) Daily confirmed SARS-CoV-2 cases in each NHS region. b) The relationship between the percentage of users travelling outside their cell and the total number of confirmed SARS-CoV-2 cases per capita, by month of the study period. The rate of between-cell movement is correlated with the number of COVID-19 cases and the strength of this correlation varies through time. Dots show LTLAs coloured by their NHS region.

Network structure

To quantify how the structure of the overall network changed through time, we computed the edge betweenness centrality of connections between cells for the weighted movement networks, a measure of the relative importance of a given connection in the network (Fig 3A and 3B). Overall, the network experiences a reduction in travel following the announcement of movement restrictions in the first national lockdown, introduced on March 23, 2020 (Fig 3C). Comparing the period preceding the first national lockdown between March 10 and March 22, 2020, with the period immediately following, from March 23 to April 4, 2020, we compared which edges remained in the same betweenness quantile during the intervention period (Fig 3D). We observe a disproportionate reduction in edges which were highly central before the introduction of national restrictions, reflecting the reduction of travel along long distance connections (Fig 3E and 3F) which tend to be highly central in the network, and an increased fragmentation of the network.

Fig 3. Network structure through time.

Fig 3

All network connections in a) the period preceding the first national lockdown (March 10 to March 22, 2020) and b) following the first national lockdown (March 23 to April 4, 2020). The thickness of edges corresponds to their edge betweenness centrality. c) The percentage of users travelling out of their cell through time, shown as 90%, 75%, and 50% density intervals; and d) the edges that remain highly central after the introduction of national interventions. We compare the betweenness quantile for edges from March 10 to March 23, 2020 (preceding national interventions) and from March 23 to May 10, 2020 (period of national interventions). This shows a disproportionate elimination of network connections for edges which were highly central before the announcement of the first national lockdown. e) The distribution of edge betweenness and distance pre- and post-intervention on March 23rd. The red highlight demonstrates the loss of long-distance, high betweenness edges. f) The reduction of travel by distance during national interventions. Most long-distance edges are removed from the dataset entirely as a result of decreased travel. Note that the longest remaining connection is between Liverpool and Belfast. Base map data from Natural Earth [55].

While we continued to observe a weekly trend of increased between-tile movements during weekdays, the variance of weekly between-tile connections decreased during the period of national interventions (S7 Fig). Overall, the network was reduced by 10,690 edges, with only 46 new edges observed in the post-intervention period. All of these newly created edges had a distance less than 47km. The network also became more disconnected, with the number of components increasing from 23 (largest component size: 3394) to 37 (largest component size: 2579).

We also measured the distance travelled per user in the weighted, directed network, observing a decrease in the overall distance travelled by users during the period of national interventions and a subsequent increase in this distance throughout the summer. As the overall travel in the network decreased, we observed a sharper decrease in the volume of long distance connections, with most long distance connections absent from the movement network during national restrictions (Fig 3F). This decrease reflects both the decreasing volume of long distance travel, and the increased effect of censoring during periods of lower travel volumes.

Community detection

We identified geographically-explicit “communities of interaction” in the network of user movements using the InfoMap and Leiden algorithms (S8, S9 and S10 Figs).

We observed an increase in the number of identified communities and a corresponding decrease in their area and population size following the introduction of nationwide intervention measures on March 23rd, 2020 (Figs 4, S11 and S12). Overall, these communities were smaller than LTLAs and Travel to Work Areas (TTWAs) in terms of their size and population, although in cities like London and Manchester, communities intersected multiple TTWAs and LTLAs (S13 Fig). The cell-level network also became more sparse as cells were censored from the dataset due to the lower number of edges connecting cells. Restrictions were eased incrementally between May and July 2020, during which time we observed an increase in the volume of between-cell movements and an increase in the geographic area and connections between communities.

Fig 4. Community detection using the InfoMap algorithm.

Fig 4

Communities detected on individual days through the time series. Areas delimit the maximum extent of all cells in detected communities and are colored based on the NHS region they intersect. Network plots show the volume of travel between communities for the network of connections between communities on each day. Missing tiles which record fewer than 10 users moving have been censored for privacy and appear white. The number of communities on each day (left to right): 189, 288, 218, 209. Base map data from Natural Earth [55].

We found that the most persistent communities existed in some large population centers (S14 Fig). This reflects both the smaller influence of censoring on higher population cells as well as the continued existence of movement networks, though reduced, around population centres. Persistent communities were identified in Manchester, Newcastle, Glasgow, and Edinburgh, but not in London, which regularly split into more communities on weekends (S15 Fig). We did not find evidence that community stability is associated with population density (S16 Fig). We found strong stability of communities calculated for networks aggregated to daily, weekly, and monthly intervals, with a Normalised Mutual Information, a measure of similarity between network partitions, between 0.89 and 0.92 for all partitions (S17 Fig).

By transferring community labels between time windows, we constructed a network of communities in which each community is a node, connected to other communities in a directed network weighted by the number of users travelling between community pairs (Fig 4). In this network, the degree of all nodes decreased after the implementation of nationwide interventions and the overall reduction of between-cell edges (S18 Fig). Nonetheless, we did not observe a significant reorganisation in the hierarchy of connections between communities.

Local lockdown extents

After the period of national interventions, the UK introduced local area interventions at differing levels of stringency. The first such intervention was implemented in Leicester on June 30, 2020 in response to a local resurgence. To understand the impact on the mobility of users, we assessed changes in the volume of travel and network topology before and after introduction.

We measured the connection between cells overlapping areas of local interventions in four areas: Leicester, Manchester, the North West, and the North East (S19, S20 and S21 Figs) to assess the impact on volume of travel and the isolation of intervention areas from the broader UK movement network. We found that, while movement indicators did decrease, particularly in Leicester, the response was smaller than during the first national lockdown (Fig 5A). The introduction of local interventions measures in Leicester was followed by a reduction in cases in the area, asomething that was not observed for the other local interventions (Figs 5B, S19, S20, and S21).

Fig 5. Response to the introduction of local interventions.

Fig 5

a) The daily percentage of users travelling between cells in the intervention area and the connected community, and the weekly average degree of intervention cells. This shows the change in movement indicators through time, as well as the change in connectivity of this community to the remaining network of communities. b) Confirmed cases in the intervention area. c) Changes in the community structure before and after the introduction of local intervention. Base map data from Natural Earth [55].

Motivated by the need to identify communities associated with epidemic resurgences and responses to reactive interventions, we compared the extent and date of local interventions with the spatial extent and temporal persistence of network communities (Fig 5C). We found that network communities remained relatively stable after the introduction of intervention measures in all areas, with some peripheral changes to movement communities in Manchester, the North West, and North East (S19, S20 and S21 Figs).

From July 2020 onwards, the geographic extent of local area interventions was in closer agreement with movement communities, particularly in Manchester (S19 Fig). Some interventions also spanned multiple movement communities, as in the North West and North East of England (S20, and S21 Figs). Early local intervention measures at limited spatial extents may not have fully encompassed the area of transmission resurgence, however UK policy changed over time, beginning to enforce collaborative local area interventions comprising multiple Local Authorities. Additionally, movement communities evolve over time, and have the potential to shift following local area interventions, requiring an understanding of real-time patterns of movement to monitor the appropriateness of a given measure.

Discussion

This study used a large, anonymized movement dataset to quantify changes in the UK movement network and assessed how geographic communities were affected by interventions. Using movement communities, we can identify strongly connected areas that can be used to inform the spatial scale of geographically-targeted interventions to respond to resurgences of COVID-19. These communities can also be used to monitor adherence to public health measures in near-real time, as changes in community structure indicate a reorganization of the patterns of travel for specific populations. We found that overall, these communities were smaller in population and area than LTLAs and TTWAs, providing a finer-grained understanding of mobility connections between areas. We also explored the structure of the UK travel network through the pandemic, identifying variations in the central connections between population centres and changing travel patterns in response to the introduction of public health measures.

Gridded mobility datasets such as those used in this study provide granular, near real-time information about the movement patterns of a large sample of the UK population. While these datasets could usefully inform epidemic responses [18,4446], there remain questions about the generalizability of the movement recorded in these datasets to the movements of the overall population [47,48]. The privacy preserving structure of the Facebook movement dataset means that low frequency connections are not recorded, precise locations are replaced by grid cell references, and data is provided in uniform-area grid cells which vary in population size. Ideally, multiple mobility datasets should be analysed to improve the interpretation of changes in mobility indicators.

In response to the nationwide lockdown introduced March 23rd, the UK mobility network changed drastically. The movement network became more sparse, with reductions in travel volume and distance. These changes disproportionately affected long distance and highly central edges, important connections which integrated geographically distant areas in the broader UK travel network.

We identified geographically distinct communities with strong interconnections that are relevant to policy responses focused on limiting transmission in response to geographically-limited disease resurgences. These communities delimit the boundaries of areas with strong internal movement connections and therefore provide empirical boundaries that can inform policy responses or form the basis for spatially-explicit transmission models. Community structure has been shown to influence the progression of modelled disease transmission in simulated and real-world networks [4951]. Variations in community structure can affect or limit mixing between people or between populations, giving rise to epidemics with multiple peaks or geographically-localised resurgences [52]. Identifying communities is therefore important for understanding and projecting epidemic dynamics.

There is a demonstrated relationship between human mobility and transmission of COVID-19 relating both to the spatial introduction of disease, where connectivity is correlated with the time of first introduction [15], and rates of movement in specific locations, which are correlated with disease transmission [53]. In the movement dataset used in this study, we identified groups of network nodes which have strong movement connections to each other, meaning that they share a relatively high number of travellers with other members of their community. Similar delimiting of movement communities has been applied to define “functional zones” in the European Union [54], where these zones have been recommended as a way to define the extent of locally targeted interventions when responding to spatially-limited resurgences of COVID-19.

In response to resurgences in a particular area, determining the geographic extent of reactive interventions should be driven by areas at risk of increased transmission, which may not intersect with administrative boundaries. The geographic areas identified here could be used to delimit the extent of areas with strong connections to a particular resurgence or new outbreak and to define the extent of interventions such as surge testing or social distancing measures. We found that while communities tended to stabilise around settlements, there was disagreement between the extent of these communities and the boundaries at which local area interventions have been introduced in the UK thus far. These communities provide valuable information in near-real time about the extent of typical patterns of travel, their temporal variations, and “catchment areas” of movement around a given area.

There are several caveats to the methods of community detection used in this study, as the extent of communities could be influenced by the level of aggregation of the Facebook mobility data, and cells were assigned to a single community each day. While we conducted a sensitivity analysis using two methods for identifying communities, there are a wide variety of community detection algorithms which emphasize different aspects of network structure. Questions also remain about the general reliability of community detection methods, which have been developed on well understood network structures, when applied to real-world networks [35]. The effect of local area interventions on travel depends on the specifics of each intervention and their stringency. Additionally, interventions occur at multiple spatial scales, and across overlapping time periods. For example, in the UK, national interventions coincide with local interventions, and each may contribute differently to changes in movement behaviour.

Conclusion

Data-driven approaches using mobility data can help to quantify patterns of travel and inform geographically-targeted public health interventions.

Supporting information

S1 Fig. An overview of different geometries used in this study, intersecting London.

a) Zoom level 12 tiles, b) Zoom level 13 tiles, c) Lower Super Output Areas. Base map data from Natural Earth and Office for National Statistics [5556].

(TIF)

S2 Fig. The number of cells included in the mobility dataset (zoom level 12).

Cells recording fewer than 10 persons moving between cells along any connection are censored from the dataset to preserve user privacy.

(TIF)

S3 Fig. The cell-level geographic distribution of the four census variables used in the analysis.

In each case, a white cell means the data were missing from the Facebook mobility data and so are not displayed here. In most cases this is due to censoring of low numbers, except for the small discontinuity around Swindon, mentioned in the Main Text. a) IMD rank. Each country has a different colour because the measure of IMD is different in each country. In each case, the darker shade is higher IMD. b) Population density per cell (log scale). c) Percentage of the population self-identifying as any other ethnicity than “Any white background”. d) Mean age of the population resident in each cell. Base map data from Natural Earth [55].

(TIF)

S4 Fig. A comparison of the percentage of Facebook users and census variables at the cell level.

a) IMD, b) mean age, c) percent minority ethnic, and d) population. Variables were aggregated from mid-level census geographies for each country. The mean value of each variable was assigned to intersecting tiles, weighted by small area population estimates. Correlation is shown on the panel as the Pearson correlation coefficient (R) and two-sided p values.

(TIF)

S5 Fig. A comparison of movement by IMD quintile.

Percent change from baseline for movement between cells by IMD quintiles in each country. IMD data was aggregated to cell level and weighted by small area population estimates. IMD quintiles range from 1 (most deprived) to 5 (least deprived).

(TIF)

S6 Fig. The relationship between movement and COVID-19 cases.

a) Daily confirmed SARS-CoV-2 tests in each NHS region. b) The relationship between the percentage of users travelling outside their cell and the total number of confirmed SARS-CoV-2 positive tests, by month of the study period. Dots show lower-tier local-authorities coloured by their NHS region as in panel a.

(TIF)

S7 Fig. Weekly variance of between-cell movement.

The standard deviation of between cell movements through time. Decreased variance indicated smaller differences in daily between-cell travel measurements per week. This reflects a reduction in the weekly pattern of between-cell movements.

(TIF)

S8 Fig. Spatial comparison of community detection algorithms.

The extent of communities detected by InfoMap (a) and Leiden (b) on March 19th. Leiden communities are largely a superset of communities detected by Infomap, indicating the detection of a different hierarchical structure, but an agreement of community boundaries between the two algorithms. Base map data from Natural Earth [55].

(TIF)

S9 Fig. InfoMap and Leiden community detection methods.

A comparison of communities detected with the InfoMap (red) and Leiden (black) methods for a selection of dates. Base map data from Natural Earth [55].

(TIF)

S10 Fig. Comparing community detection algorithms.

A comparison of the spatial intersection between communities detected by the Infomap and Leiden algorithms through time. a) The average % overlap of InfoMap communities with Leiden communities. b) The number of Leiden communities each InfoMap community intersects, and c) the inverse comparison of Leiden communities to InfoMap communities. For a specific date, d) the maximum % areal overlap of each InfoMap community with all Leiden communities. e) The number of Leiden communities each InfoMap community intersects, and c) the inverse comparison of Leiden communities to InfoMap communities. This shows spatial alignment between InfoMap and Leiden communities, where Leiden communities tend to be larger than those detected by InfoMap.

(TIF)

S11 Fig. Number of InfoMap communities.

The number (red) of InfoMap communities through time. An increase in the number of communities reflects more local patterns of travel during national interventions.

(TIF)

S12 Fig. Area and population of InfoMap communities.

a) The average and median area of InfoMap communities through time. b) The average and median census population of communities through time. The distribution of population is more skewed than that of area, reflecting high populations in specific communities.

(TIF)

S13 Fig. Comparison of TTWAs, LTLAs, and detected communities.

Maps of 2011 TTWAs, LTLAs, and InfoMap communities detected on a specific date. While some InfoMap communities are similar in size to TTWAs and LTLAs, they do not share the same boundaries. There are fewer communities (193) than TTWAs (228) and LTLAs (380) and these communities have a smaller average area across all time periods (389.73 km2) compared to TTWAs (1069.14 km2) and LTLAs (605.28 km2). Base map data from Natural Earth and Office for National Statistics [55,57,58].

(TIF)

S14 Fig. Persistence of communities.

a) the total number of community labels that each cell has had (i.e. number of communities that the cell has ever been in) during the study period. The darkest shade indicates that a cell was always in the same community. b) the number of community labels for a given cell as a proportion of the number of days that cell was present in the dataset. This was calculated as the (number of unique community labels/number of days a cell was present). c) stable communities, marked as those which had the same community label for the entire study period. Base map data from Natural Earth [55].

(TIF)

S15 Fig. Community persistence in the dataset.

a) The most persistent communities, those that existed throughout the timeseries, as on March 19th, 2020. b) Community membership from March 19th to March 26th, 2020 in Leicestershire, and c) community membership from March 19th to April 5th, 2020 in London. For both figures, panels are ordered row-wise from top left. Base map data from Natural Earth [55].

(TIF)

S16 Fig. The relationship between community labels and population density.

The relationship between the population density of individual cells and the number of community labels assigned during the period. We do not observe a strong association between population density and the number of community labels assigned to individual cells.

(TIF)

S17 Fig. Community detection with different time aggregations.

Normalised Mutual Information, a measure of similarity between community partitions, measured for InfoMap communities detected in networks aggregated to daily, weekly, and monthly periods. The partition of each network is compared to its parent time aggregation (days to overlapping weeks, weeks to overlapping months).

(TIF)

S18 Fig. The relationship between community degrees during the first national lockdown.

The total degree of movement communities assigned to individual tiles. We observe a relationship between the degree of communities before and during national interventions, indicating that highly connected communities remained highly connected during the period of national interventions. The dashed line indicates where the Pre-lockdown degree is equal to the Post-lockdown degree. All points would fall along this line in the event of no changes to the community network after the intervention.

(TIF)

S19 Fig. Communities connected to Manchester local area restrictions.

The daily percentage of users travelling between cells in the intervention area and the connected community, and the weekly average degree of intervention cells (a). Confirmed cases in the intervention area (b). Changes in the community structure before and after the introduction of local intervention (c). Base map data from Natural Earth [55].

(TIF)

S20 Fig. Communities connected to North East England local area restrictions.

The daily percentage of users travelling between cells in the intervention area and the connected community, and the weekly average degree of intervention cells (a). Confirmed cases in the intervention area (b). Changes in the community structure before and after the introduction of local intervention (c). Base map data from Natural Earth [55].

(TIF)

S21 Fig. Communities connected to North West England local area restrictions.

The daily percentage of users travelling between cells in the intervention area and the connected community, and the weekly average degree of intervention cells (a). Confirmed cases in the intervention area (b). Changes in the community structure before and after the introduction of local intervention (c). Base map data from Natural Earth [55].

(TIF)

Acknowledgments

The following authors were part of the Centre for Mathematical Modelling of Infectious Disease COVID-19 working group. Each contributed in processing, cleaning and interpretation of data, interpreted findings, contributed to the manuscript, and approved the work for publication: Sam Abbott, Kaja Abbas, Kiesha Prem, Sebastian Funk, Jon C Emery, Georgia R Gore-Langton, Fiona Yueqian Sun, Arminder K Deol, Alicia Showering, Nicholas G. Davies, Nikos I Bosse, Samuel Clifford, Anna M Foss, Graham Medley, C Julian Villabona-Arenas, Timothy W Russell, Amy Gimma, W John Edmunds, Gwenan M Knight, Yung-Wai Desmond Chan, Yalda Jafari, Quentin J Leclerc, Rein M G J Houben, Akira Endo, Sophie R Meakin, Petra Klepac, Joel Hellewell, Naomi R Waterlow, Kevin van Zandvoort, Christopher I Jarvis, Rachel Lowe, Matthew Quaife, Charlie Diamond, Megan Auzenbergs, Simon R Procter, Rosanna C Barnard, Oliver Brady, Katherine E. Atkins, Katharine Sherratt, Thibaut Jombart, Stéphane Hué, Kathleen O’Reilly, Jack Williams, David Simons, Stefan Flasche, Mark Jit, James D Munday, Billy J Quilty, Frank G Sandmann, Damien C Tully, James W Rudge, Alicia Rosello.

Data Availability

Mobility Data used in this study are proprietary data owned by Facebook. The data used in this study is are not publicly available but are available to researchers by application to the Facebook Data for Good Partner Program (https://dataforgood.fb.com/, diseaseprevmaps@fb.com). Census data from national statistics agencies is publicly available. Dataset information is available in the references. Code used to conduct this analysis is available from https://github.com/cmmid/uk_movement_communities_covid.

Funding Statement

The following funding sources are acknowledged as providing funding for the named authors. This research was partly funded by the Bill & Melinda Gates Foundation (https://www.gatesfoundation.org/) (INV-003174: YL; NTD Modelling Consortium OPP1184344: CABP; OPP1183986: EN). DFID/Wellcome Trust (https://www.gov.uk/government/organisations/foreign-commonwealth-development-office, https://wellcome.org/) (Epidemic Preparedness Coronavirus research programme 221303/Z/20/Z: CABP). This project has received funding from the European Union's Horizon 2020 research and innovation programme - project EpiPose (https://cordis.europa.eu/project/id/101003688) (101003688: YL). HDR UK (https://www.hdruk.ac.uk/) (MR/S003975/1: RME). UK DHSC/UK Aid/NIHR (https://www.gov.uk/government/organisations/department-of-health-and-social-care, https://www.ukaiddirect.org/, https://www.nihr.ac.uk/) (PR-OD-1017-20001: HPG). UK MRC (https://mrc.ukri.org/) (MC_PC_19065: RME, YL). Wellcome Trust (https://wellcome.org/) (206250/Z/17/Z: AJK), UK MRC (https://mrc.ukri.org/) (MC_PC_19067, MR/V038613/1 LD), UK EPSRC (https://epsrc.ukri.org/) (EP/V051555/1: LD), The Alan Turing Institute under the EPSRC (https://www.turing.ac.uk/) (grant EP/N510129/1: LD). This research was partly funded by the National Institute for Health Research (NIHR) using UK Aid from the UK Government to support global health research. The views expressed in this publication are those of the author(s) and not necessarily those of the NIHR or the UK Department of Health and Social Care (16/137/109: YL; NIHR200908: RME). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The following funding sources are acknowledged as providing funding for the working group authors. Alan Turing Institute (AE). BBSRC LIDO (BB/M009513/1: DS). This research was partly funded by the Bill & Melinda Gates Foundation (INV-001754: MQ; INV-003174: KP, MJ; NTD Modelling Consortium OPP1184344: GFM; OPP1180644: SRP; OPP1191821: KO'R, MA). BMGF (OPP1157270: KA). DFID/Wellcome Trust (Epidemic Preparedness Coronavirus research programme 221303/Z/20/Z: KvZ). DTRA (HDTRA1-18-1-0051: JWR). Elrha R2HC/UK DFID/Wellcome Trust/This research was partly funded by the National Institute for Health Research (NIHR) using UK aid from the UK Government to support global health research. The views expressed in this publication are those of the author(s) and not necessarily those of the NIHR or the UK Department of Health and Social Care (KvZ). ERC Starting Grant (#757699: JCE, MQ, RMGJH). This project has received funding from the European Union's Horizon 2020 research and innovation programme - project EpiPose (101003688: KP, MJ, PK, RCB, WJE). This research was partly funded by the Global Challenges Research Fund (GCRF) project 'RECAP' managed through RCUK and ESRC (ES/P010873/1: AG, CIJ, TJ). MRC (MR/N013638/1: NRW). Nakajima Foundation (AE). NIHR (16/136/46: BJQ; 16/137/109: BJQ, CD, FYS, MJ; Health Protection Research Unit for Immunisation NIHR200929: NGD; Health Protection Research Unit for Modelling Methodology HPRU-2012-10096: TJ; NIHR200929: FGS, MJ; PR-OD-1017-20002: AR, WJE). Royal Society (Dorothy Hodgkin Fellowship: RL; RP\EA\180004: PK). UK MRC (LID DTP MR/N013638/1: GRGL, QJL; MC_PC_19065: AG, NGD, SC, TJ, WJE; MR/P014658/1: GMK). Authors of this research receive funding from UK Public Health Rapid Support Team funded by the United Kingdom Department of Health and Social Care (TJ). Wellcome Trust (206250/Z/17/Z: TWR; 206471/Z/17/Z: OJB; 208812/Z/17/Z: SC, S Flasche; 210758/Z/18/Z: JDM, JH, KS, NIB, SA, SFunk, SRM). No funding (AKD, AMF, AS, CJVA, DCT, JW, KEA, SH, YJ, YWDC).

References

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009162.r001

Decision Letter 0

Virginia E Pitzer, Alex Perkins

19 Apr 2021

Dear Mr. Gibbs,

Thank you very much for submitting your manuscript "Detecting behavioural changes in human movement to inform the spatial scale of interventions against COVID-19." for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Alex Perkins

Associate Editor

PLOS Computational Biology

Virginia Pitzer

Deputy Editor-in-Chief

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this work, the authors use human mobility data from Facebook users in the UK to analyze the associated spatial network of movements before, during and after the national lockdown and in the period of local interventions. They applied community detection techniques and measured the evolution of the detected communities over time. These analyses revealed that the mobility network became sparser, the number of communities was decreased, and long-distance journeys were less frequent during the lockdown. Local interventions did not change the community structure and only decreased travel outside of local areas weakly.

The authors address a highly relevant topic with potentially big implication for public health and future implementations of non-pharmaceutical interventions against the spread of infections diseases. The authors make sure that their data set is not biased by comparing the local fraction of Facebook users between regions and checking for correlations with census variables.

However, I have reservations regarding the interpretation, implications, and novelties of this study. The paper lacks a clear story and does not live up to its title, because I cannot identify the “spatial scale of interventions against COVID-19”.

Major observations

1. The correlation between movement and new COVID-19 cases was reported in many previous studies, eg in

Badr, Hamada S., et al. "Association between mobility patterns and COVID-19 transmission in the USA: a mathematical modelling study." The Lancet Infectious Diseases 20.11 (2020): 1247-1254.

This should be at least mentioned, if not discussed.

2. To quantify changes in the network structure the authors calculate the edge betweenness of connections. It is unclear whether the authors assigned weights to the links when computing this quantity. If not, it seems to me that only the removal of links due to the censoring of links with less than 10 travelers changes the observable. Also, the motivation for this observable is not clear to me. What do we learn about behavioural changes from it, additionally to just quantifying the length and frequency of trips?

3. Next, in the central part of the paper, the authors identify communities in the network and, subsequently, a corresponding network of communities, with links weighted by travelers. Here, unexpectedly, the study finds that the average node degree decreased during lockdown. However, any real analysis of this community network relevant to disease spread is missing. The same is true for the obtained community distribution. Do they match political organizational units or commuting zones? Do they have typical size?

The network could also serve as the basis for an agent-based or multiscale models of disease spread, which is not discussed.

4. Lastly, the authors analyze the effect of local interventions. Here the main result is that the effect of local interventions was weaker than a national lockdown. Also, the extent of communities did not always match the boundaries of local interventions in the UK. It is unclear if the authors suggest using their identified communities as boundaries for local interventions. I am missing a clear evidence that the proposed strategy of identifying communities using mobility data can improve interventions against COVID-19.

5. Captions of all figures are too short and do not guide the reader to quickly grasp the message of the figure. Two examples:

a. Fig1: The caption only repeats the axes labels but does not contain the message: that the number of travelers is positively correlated with new covid cases!

b. Fig2: Again, it seems to me that the message of the figure is that the percentage of Facebook users is quite homogeneous between regions and can therefore be used to say something about the full population.

Minor observations

1. Caption of fig 1. Speaks of daily reported SARS-CoV-2 tests in a) but the figure shows confirmed cases.

2. Caption of fig 2. Speaks of daily average number of users in a) but the axis label speaks about average movements. Also, areas with more than 75% of the population registered as users were removed. Why?

3. Fig 3d is absolutely unclear to me. The axis label says “number of edges”. The caption speaks about the likelihood of a journey to remain highly central in the lockdown. How do you read this?

4. Fig 4. Is missing labels a),b)

5. Fig 5 is also quite difficult to understand. Clarify message in the caption. Maybe zoom in into the relevant date range for the local interventions? Maybe show the number of travellers in the intervention area relative to the corresponding community, otherwise it is quite hard to see a decrease here.

6. Fig S6. What is IMD?

7. Why is Fig S7 not in the main manuscript? It seems to be quite central as it tells us about the spatial scale of movement in the lockdown (max 50-100 km).

8. Same with Fig S8. It makes it clearer, which journeys are removed in the lockdown. Also, add small ticks to this plot to make the log-log axes clearer and easier to read.

9. Fig S11 is unclear to me. What should the reader see?

10. In Fig S16, what is the dashed line? It does not seem to be a fit.

11. The message of Figs S17-19 is also unclear. They could also benefit from zooming in the time series into the relevant time periods.

Reviewer #2: The paper investigates the emerging reorganisation of mobility network during the first wave of COVID-19 in UK by means of a large-scale fine-grained mobility dataset.

The paper is well written, easy to follow and understand, even if I think that might further improve with a slightly different structure (details below). The Introduction and Methods sections provide a suitable context for the work, but some relevant references are missing (details below). The figures are high quality and appropriate for illustrating the ideas and work presented. Overall, the analysis is sound, the paper is self-contained and presents a suitable selection of results to support the main findings.

While I think the paper is interesting and has merit, it mainly reports a descriptive data analysis, therefore it will greatly improve if the authors will widen the discussion and implications of their findings in the context of public health interventions.

Major comments

- Relationship between movement and COVID-19 cases: corroborated by the correlations shown in Figure 1, the authors suggest that there is an “intuitive relationship between increased rates of travel outside of local areas and increased COVID-19 prevalence”. This statement needs better framing and a deeper discussion. First, the volume of travels outside a local area would be relevant for the local spreading only under the hypothesis that such travels are mainly due to commuting patterns that come back to the local area of origin. While this might be true for the data used in this paper, some discussion about this point is needed. Secondly, it is well known that -in the context of infectious diseases spreading-, travel fluxes are associated with the arrival time of the disease in not-yet-affected areas [Brockmann2013, Balcan2009]. However, after the initial seeding the local outbreak unfolds with an exponential growth and thus further importation of cases should not play a major role in shaping the epidemic curve. Therefore, the author should elaborate on this concept to let the reader grasp their intuition, and why such result is interesting. Finally, I think that a normalized quantity for the number of cases (as reported in Supplemental Figure 3) would be a more suitable observable, as the raw count of cases is clearly influenced by the number of people living in certain area.

- Characteristics of the between-cell Facebook mobility dataset: the authors considered “the percentage of Facebook users per cell […] homogeneous across the study area (Figure 2d)”. However, Figure 2B shows distributions with an average value ~10% and long tails up to ~70%, that seems quite heterogeneous. The authors should clarify or better frame this point. Moreover, the correlation reported in Figure 2C would definitely benefits from a log-log transformation, to limit the impact of few outlying data points in skewing the estimate of the correlation coefficient.

- Network Structure: the authors compared the betweenness quantile for links of the mobility networks extracted before the national interventions (from March 10 to March 23) and during the national interventions (from March 23 to May 10). My first concern is that the two networks are not comparable as they are built aggregating 13 days and 47 days respectively. Therefore, I would expect some major differences even in the absence of exogenous interventions. In principle, a longer aggregation window might result in a denser network (light links from rare travels are reinforced with longer observation time). The authors might consider to perform the analysis on mobility networks extracted on comparable time windows. Moreover, the community detection analysis seems to be performed on daily mobility networks: relying on the same temporal aggregation would help the reader compare the findings of the different network analysis.

- Since the mobility network is largely destroyed by the national interventions, besides the information on betweenness quintiles, some coarser information about mobility network resilience could be helpful to better understand how the system evolved (e.g. how many links of the pre-intervention network are preserved in the post-intervention period? How many new links were created, if any?).

- Some communities detected on mobility networks of individual days are quite stable across time. A natural question is to verify if such pattern persists also when mobility data are aggregated at different temporal scales (e.g. weekly, monthly,…).

- The authors show how the community structure in four areas (Leicester, Manchester, the North West, and the North East) seems to be stable before and after some local interventions, with the only exceptions of some grid cells. There has been some recent work in assessing the association between socio-economic status and change of mobility induced by COVID-19. I am aware that it is rather uncouth of me to suggest referencing my own work, but our paper www.medrxiv.org/content/10.1101/2020.11.16.20232413v1 seems particularly appropriate here as it explore such phenomenon at the microscale of city neighborhoods. I would not expect the authors to implement an extensive analysis in the current paper, but it would be great if they could add some insights about deprivation levels of the cells that change community membership and in general it should certainly be considered as further work. The authors could also consider to widen their discussion to include some missing reference in this direction, such as [Jay2020, Pullano2020, Gauvin2020]. Overall, the authors might consider elaborating a bit on this point, as the only analysis performed is in Supplemental Figure 6 and poorly commented in the main paper. I believe that this aspect could be important to strengthen the paper, as the implication of such analysis might suggest that deprived areas respond differently under top-down intervention, thus potentially limiting their effectiveness on more deprived and vulnerable population.

- The authors might be interested in reading (and probably citing) some references closely related to this work [Iacus2020, Bonato2020]. In those papers, the authors investigated the time evolution of data-driven Mobility Functional Areas (geographic zones with high degree of intra-mobility exchanges) before and after national interventions.

Minor comments

- For the sake of readability, I suggest to move Figure 2 and the considerations about the representativeness of the dataset before Figure 1, where such data are used.

- Figure 3C: an explanation about the meaning of the different colors of the shaded areas is missing.

- In the first paragraph of page 8, the authors refer to Supplemental Figure 9 as a comparison of Infomap vs. Leiden communities, but they should probably refer to Supplemental Figures 10 and 11.

Brockmann, D. and Helbing, D., 2013. The hidden geometry of complex, network-driven contagion phenomena. science, 342(6164), pp.1337-1342.

Balcan, D., Colizza, V., Gonçalves, B., Hu, H., Ramasco, J.J. and Vespignani, A., 2009. Multiscale mobility networks and the spatial spreading of infectious diseases. Proceedings of the National Academy of Sciences, 106(51), pp.21484-21489.

Jay, J., Bor, J., Nsoesie, E.O., Lipson, S.K., Jones, D.K., Galea, S. and Raifman, J., 2020. Neighbourhood income and physical distancing during the COVID-19 pandemic in the United States. Nature human behaviour, 4(12), pp.1294-1302.

Pullano, G., Valdano, E., Scarpa, N., Rubrichi, S. and Colizza, V., 2020. Evaluating the effect of demographic factors, socioeconomic factors, and risk aversion on mobility during the COVID-19 epidemic in France under lockdown: a population-based study. The Lancet Digital Health, 2(12), pp.e638-e649.

Gauvin, L., Bajardi, P., Pepe, E., Lake, B., Privitera, F. and Tizzoni, M., 2020. Socioeconomic determinants of mobility responses during the first wave of COVID-19 in Italy: from provinces to neighbourhoods. medRxiv.

Iacus, S., Santamaria, C., Sermi, F., Spyratos, S., Tarchi, D. and Vespe, M., Mapping Mobility Functional Areas (MFA) using Mobile Positioning Data to Inform COVID-19 Policies, EUR 30291 EN, Publications Office of the European Union, Luxembourg, 2020, ISBN 978-92-76-20429-9, doi:10.2760/076318, JRC121299.

Bonato, P., Cintia, P., Fabbri, F., Fadda, D., Giannotti, F., Lopalco, P.L., Mazzilli, S., Nanni, M., Pappalardo, L., Pedreschi, D. and Penone, F., 2020. Mobile phone data analytics against the covid-19 epidemics in italy: flow diversity and local job markets during the national lockdown. arXiv preprint arXiv:2004.11278.

Reviewer #3: This was a great read. It was very interesting to see fine-scaled movement data quantified and associated in this way. I think the content stands on its own, but could be made more compelling by a stronger connection between movement data and virus reproduction rates.

General

I find myself wondering about the relationship between cell-to-cell travel and transmissible contact rates. Is it possible to characterize the types or proportion of trips that are captured by these movement data (for the average/median individual)? Commutes to work or school, trips to the grocery, etc. For instance there might be statistics on commuting that would allow you to estimate the proportion of commutes that leave a level-13 cell.

Introduction

Perhaps remove “, and network service providers” from the last sentence of first paragraph. Repeated ‘and’ and ‘network providers’.

ONS data citation?

Materials and Methods

Facebook Data: From the text it is difficult to determine if the movement data is (1) simply a geolocation taken every 8 hours, (2) change of map cell at any time during each 8 hour period, or (3) something else. If it is (2), can an individual have multiple journeys in a single 8 hour period?

Facebook Data: is there any latency or is the data for a given day released the following day?

S2 Fig caption: are these level 12 or 13 cells? For what administrative boundary? I assume for all of the U.K., but please clarify.

Demographic information: The first sentence is accurate, but a lot to unpack. I had to read through this section, look at fig S5 and then go back and reread the first sentence to fully understand it. I would suggest leading with “to identify relationships between the percentage of Facebook users and demographic factors, we…” or somesuch. Then describe the details. This is only a readability suggestion, I respect the authors’ preference.

Community Detection: What is the cost/objective function? Is there a penalty in the algorithms for adding additional communities?

S10 Fig: If it is relatively easy, it would be interesting to draw the outline of Leiden communities over the InfoMap communities.

COVID-19 Data: I do not believe LTLA is previously defined

Results

S13 Fig b and c: are these ordered row-wise or column-wise?

Discussion/Conclusion

In addition to using these methods for geographic definition of interventions and an indicator of higher transmission, I would consider discussing how these metrics could be used as real-time feedback concerning the effectiveness of intervention mandates.

A stronger connection between transmission and movement data would be compelling. It would be interesting to compare the movement data to effective reproduction number estimates and check for temporal correlation. This would remove transmission lags and ‘ramp-up’ from the comparison to cases and potentially make for a more direct connection to transmission dynamics. Is this something you considered?

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No: Data used in this study are available from the Facebook Data for Good Partner

Program by application. The computational code is not available.

Reviewer #2: No: Data used in this study are available from the Facebook Data for Good Partner Program by application.

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Paolo Bajardi

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009162.r003

Decision Letter 1

Virginia E Pitzer, Alex Perkins

5 Jun 2021

Dear Mr. Gibbs,

We are pleased to inform you that your manuscript 'Detecting behavioural changes in human movement to inform the spatial scale of interventions against COVID-19.' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Alex Perkins

Associate Editor

PLOS Computational Biology

Virginia Pitzer

Deputy Editor-in-Chief

PLOS Computational Biology

***********************************************************

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009162.r004

Acceptance letter

Virginia E Pitzer, Alex Perkins

5 Jul 2021

PCOMPBIOL-D-21-00363R1

Detecting behavioural changes in human movement to inform the spatial scale of interventions against COVID-19.

Dear Dr Gibbs,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Zsofi Zombor

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. An overview of different geometries used in this study, intersecting London.

    a) Zoom level 12 tiles, b) Zoom level 13 tiles, c) Lower Super Output Areas. Base map data from Natural Earth and Office for National Statistics [5556].

    (TIF)

    S2 Fig. The number of cells included in the mobility dataset (zoom level 12).

    Cells recording fewer than 10 persons moving between cells along any connection are censored from the dataset to preserve user privacy.

    (TIF)

    S3 Fig. The cell-level geographic distribution of the four census variables used in the analysis.

    In each case, a white cell means the data were missing from the Facebook mobility data and so are not displayed here. In most cases this is due to censoring of low numbers, except for the small discontinuity around Swindon, mentioned in the Main Text. a) IMD rank. Each country has a different colour because the measure of IMD is different in each country. In each case, the darker shade is higher IMD. b) Population density per cell (log scale). c) Percentage of the population self-identifying as any other ethnicity than “Any white background”. d) Mean age of the population resident in each cell. Base map data from Natural Earth [55].

    (TIF)

    S4 Fig. A comparison of the percentage of Facebook users and census variables at the cell level.

    a) IMD, b) mean age, c) percent minority ethnic, and d) population. Variables were aggregated from mid-level census geographies for each country. The mean value of each variable was assigned to intersecting tiles, weighted by small area population estimates. Correlation is shown on the panel as the Pearson correlation coefficient (R) and two-sided p values.

    (TIF)

    S5 Fig. A comparison of movement by IMD quintile.

    Percent change from baseline for movement between cells by IMD quintiles in each country. IMD data was aggregated to cell level and weighted by small area population estimates. IMD quintiles range from 1 (most deprived) to 5 (least deprived).

    (TIF)

    S6 Fig. The relationship between movement and COVID-19 cases.

    a) Daily confirmed SARS-CoV-2 tests in each NHS region. b) The relationship between the percentage of users travelling outside their cell and the total number of confirmed SARS-CoV-2 positive tests, by month of the study period. Dots show lower-tier local-authorities coloured by their NHS region as in panel a.

    (TIF)

    S7 Fig. Weekly variance of between-cell movement.

    The standard deviation of between cell movements through time. Decreased variance indicated smaller differences in daily between-cell travel measurements per week. This reflects a reduction in the weekly pattern of between-cell movements.

    (TIF)

    S8 Fig. Spatial comparison of community detection algorithms.

    The extent of communities detected by InfoMap (a) and Leiden (b) on March 19th. Leiden communities are largely a superset of communities detected by Infomap, indicating the detection of a different hierarchical structure, but an agreement of community boundaries between the two algorithms. Base map data from Natural Earth [55].

    (TIF)

    S9 Fig. InfoMap and Leiden community detection methods.

    A comparison of communities detected with the InfoMap (red) and Leiden (black) methods for a selection of dates. Base map data from Natural Earth [55].

    (TIF)

    S10 Fig. Comparing community detection algorithms.

    A comparison of the spatial intersection between communities detected by the Infomap and Leiden algorithms through time. a) The average % overlap of InfoMap communities with Leiden communities. b) The number of Leiden communities each InfoMap community intersects, and c) the inverse comparison of Leiden communities to InfoMap communities. For a specific date, d) the maximum % areal overlap of each InfoMap community with all Leiden communities. e) The number of Leiden communities each InfoMap community intersects, and c) the inverse comparison of Leiden communities to InfoMap communities. This shows spatial alignment between InfoMap and Leiden communities, where Leiden communities tend to be larger than those detected by InfoMap.

    (TIF)

    S11 Fig. Number of InfoMap communities.

    The number (red) of InfoMap communities through time. An increase in the number of communities reflects more local patterns of travel during national interventions.

    (TIF)

    S12 Fig. Area and population of InfoMap communities.

    a) The average and median area of InfoMap communities through time. b) The average and median census population of communities through time. The distribution of population is more skewed than that of area, reflecting high populations in specific communities.

    (TIF)

    S13 Fig. Comparison of TTWAs, LTLAs, and detected communities.

    Maps of 2011 TTWAs, LTLAs, and InfoMap communities detected on a specific date. While some InfoMap communities are similar in size to TTWAs and LTLAs, they do not share the same boundaries. There are fewer communities (193) than TTWAs (228) and LTLAs (380) and these communities have a smaller average area across all time periods (389.73 km2) compared to TTWAs (1069.14 km2) and LTLAs (605.28 km2). Base map data from Natural Earth and Office for National Statistics [55,57,58].

    (TIF)

    S14 Fig. Persistence of communities.

    a) the total number of community labels that each cell has had (i.e. number of communities that the cell has ever been in) during the study period. The darkest shade indicates that a cell was always in the same community. b) the number of community labels for a given cell as a proportion of the number of days that cell was present in the dataset. This was calculated as the (number of unique community labels/number of days a cell was present). c) stable communities, marked as those which had the same community label for the entire study period. Base map data from Natural Earth [55].

    (TIF)

    S15 Fig. Community persistence in the dataset.

    a) The most persistent communities, those that existed throughout the timeseries, as on March 19th, 2020. b) Community membership from March 19th to March 26th, 2020 in Leicestershire, and c) community membership from March 19th to April 5th, 2020 in London. For both figures, panels are ordered row-wise from top left. Base map data from Natural Earth [55].

    (TIF)

    S16 Fig. The relationship between community labels and population density.

    The relationship between the population density of individual cells and the number of community labels assigned during the period. We do not observe a strong association between population density and the number of community labels assigned to individual cells.

    (TIF)

    S17 Fig. Community detection with different time aggregations.

    Normalised Mutual Information, a measure of similarity between community partitions, measured for InfoMap communities detected in networks aggregated to daily, weekly, and monthly periods. The partition of each network is compared to its parent time aggregation (days to overlapping weeks, weeks to overlapping months).

    (TIF)

    S18 Fig. The relationship between community degrees during the first national lockdown.

    The total degree of movement communities assigned to individual tiles. We observe a relationship between the degree of communities before and during national interventions, indicating that highly connected communities remained highly connected during the period of national interventions. The dashed line indicates where the Pre-lockdown degree is equal to the Post-lockdown degree. All points would fall along this line in the event of no changes to the community network after the intervention.

    (TIF)

    S19 Fig. Communities connected to Manchester local area restrictions.

    The daily percentage of users travelling between cells in the intervention area and the connected community, and the weekly average degree of intervention cells (a). Confirmed cases in the intervention area (b). Changes in the community structure before and after the introduction of local intervention (c). Base map data from Natural Earth [55].

    (TIF)

    S20 Fig. Communities connected to North East England local area restrictions.

    The daily percentage of users travelling between cells in the intervention area and the connected community, and the weekly average degree of intervention cells (a). Confirmed cases in the intervention area (b). Changes in the community structure before and after the introduction of local intervention (c). Base map data from Natural Earth [55].

    (TIF)

    S21 Fig. Communities connected to North West England local area restrictions.

    The daily percentage of users travelling between cells in the intervention area and the connected community, and the weekly average degree of intervention cells (a). Confirmed cases in the intervention area (b). Changes in the community structure before and after the introduction of local intervention (c). Base map data from Natural Earth [55].

    (TIF)

    Attachment

    Submitted filename: [Review 1] UK Movement Communities.docx

    Data Availability Statement

    Mobility Data used in this study are proprietary data owned by Facebook. The data used in this study is are not publicly available but are available to researchers by application to the Facebook Data for Good Partner Program (https://dataforgood.fb.com/, diseaseprevmaps@fb.com). Census data from national statistics agencies is publicly available. Dataset information is available in the references. Code used to conduct this analysis is available from https://github.com/cmmid/uk_movement_communities_covid.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES