Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2025 Nov 13;21(11):e1013642. doi: 10.1371/journal.pcbi.1013642

Incorporating human mobility to enhance epidemic response and estimate real-time reproduction numbers

Mousumi Roy 1,*, Hannah E Clapham 1, Swapnil Mishra 1,2,3,4,*
Editor: Nicola Perra5
PMCID: PMC12614531  PMID: 41231844

Abstract

Human mobility plays a critical role in the transmission dynamics of infectious diseases, influencing both their spread and the effectiveness of control measures. In the process of quantifying the real-time situation of an epidemic, the instantaneous reproduction number Rt appears to be one of the useful metrics widely used by public health researchers, officials, and policy makers. Since individuals can contract infections both within their region of origin and in other regions they visit, ignoring human mobility in the estimation process overlooks its impact on transmission dynamics and can lead to biased estimates of Rt, potentially misrepresenting the true epidemic situation. Our study explicitly integrates human mobility into a renewal-equation based disease transmission model to capture the mobility-driven effect on transmission. By incorporating pathogen-specific generation-time distribution, observational delay, the framework is epidemiologically informed and flexible to a wide range of diseases. We primarily validate the approach using simulated data, and demonstrate the limitations of estimating Rt without considering mobility. We then apply it to two real-world mobility settings using SARS-CoV-2 mortality data: the regions of England and the LTLAs of North East region of England, and uncover the mobility driven effect on transmission at different spatial resolutions. This framework uses non-identifiable and widely accessible publicly available datasets, demonstrating its practical applicability and supporting better-informed and more targeted public health measures.

Author summary

The real-time or instantaneous reproduction number Rt is a key metric for assessing the state of an epidemic at any given time. When estimating these numbers across multiple connected regions, human mobility plays a crucial role, as movement patterns significantly influence disease transmission. Traditional epidemic models often assume homogeneous mixing, which does not reflect real-world interactions. On the other hand, individual-based models incorporate heterogeneous mixing at individual level but demands an extremely refined data and substantial computational support. To address these challenges, we employ a renewal equation-based transmission framework, particularly useful for its effectiveness in real-time epidemic analysis, by incorporating heterogeneous mobility flows at a chosen spatial resolution. This yields the estimates of spatially connected instantaneous reproduction number for each region. This improved understanding enables better assessment of the impact of mobility on disease transmission spread, and provides valuable insights for designing targeted epidemic control and intervention strategies.

1 Introduction

Human mobility significantly influences the transmission dynamics of infectious diseases [1,2]. Hence, its been a constant effort from the researchers to incorporate human mobility within a transmission model in the process of analyzing disease spread [35]. While long-range mobility like air-traffic flow between countries quantifies the importation risks for a country, daily commuting travel structure builds the onward transmission routes within a country after initial importation [68]. By daily commuting flows, people move across different regions and increase contact opportunities at distant places, that enables inter-regional disease spread. In contrast, during COVID-19 pandemic, lockdowns and other non-pharmaceutical interventions (NPIs) led to substantial reductions in mobility within any country, and accordingly disease transmission [9,10]. This highlights the need to account for mobility in the real-time estimation of reproduction numbers, and therefore, ignoring mobility in transmission models can lead to biased parameter estimates and compromise the accuracy of future predictions. In this study, we use human mobility to describe the movement of people between regions, which creates spatial connectivity and leads to a mobile population that affects how diseases spread. These three terms human mobility, spatial connectivity, and mobile population are used interchangeably throughout the paper to convey a similar meaning.

Various modeling strategies have been implemented to study infectious diseases at the population level, each presenting unique benefits and drawbacks. Compartmental mechanistic models [11], in the absence of a spatial component and more advanced mixing assumptions (e.g., contact matrices) may lead to biased estimation due to their assumption of homogeneous mixing, a condition that rarely reflects the real-world mixing pattern [12,13]. This leads to questions about the reliability of the model within a heterogeneous setting. A network-based model [14,15], which overlays a network on top of compartmental mechanistic models, can incorporate information about heterogeneous mixing. In this context, Metapopulation model partitions the total population into subpopulations, with mobility between them [4,16], often used to understand the mechanism and study real-time transmission in a spatially explicit setting [17,18]. However it is associated with several challenges, such as incorporating stochastic extinction and reintroduction of the disease [19]. On the other hand, individual-level models [20,21] capture the nuanced complexities of human mobility and individual heterogeneity, but are computationally intensive and require granular data for validation. In practice, studies often use proxy-mobility datasets, such as, mobile-phone, public-transport records [22,23], or substitute established mobility models when empirical data are unavailable [2426]. An alternative transmission model is the approach based on the renewal equation [27,28], which effectively models infection at the population level and is particularly valuable for real-time analysis of disease transmission. Incorporating a spatial component into the renewal equation-based framework allows to model daily commuting flow within a population and greatly enhance the estimation of spatially connected parameters on real-time. This improves the understanding of real-time transmission dynamics in the context of a population with heterogeneous mobility.

In the process of modeling and understanding a real time epidemic situation, the instantaneous or real-time reproduction number (Rt) serves as a metric that quantifies the average number of secondary cases generated from a primary infection at any particular time point. This metric is crucial for assessing the current state of the epidemic and evaluating the effectiveness of any intervention strategies implemented throughout its course [29,30]. Inaccurate estimation of Rt can lead to a misleading understanding of the epidemic situation, potentially resulting in misallocation of resources and improper implementation of control policies. To estimate the reproduction number Rt, various forms of observational data, such as, cases, hospitalizations, deaths, etc. can be utilized [28,31]. Several estimation methods are available, each with its distinct advantages and methodological limitations [27,31,32]. When building the modeling framework, it is essential to consider the delay between the actual infection and the reporting of observational data that are used for estimation, particularly towards the end of the timeline of the datasets [33]. When daily observational data are lacking, a temporally aggregated dataset can also be used for estimation purposes [34,35].

Despite significant efforts by infectious disease modelers to integrate human mobility into transmission models in order to understand its influence on transmission pattern, [3638], this area still requires further investigation. In contrast to previous literature with renewal-equation based framework where in most of the studies mobility reduction appears as an aspect of behavioral changes [28,39], we use explicit human mobility flow within the model to account for inter-regional mobility effect over the transmission dynamics. We apply the framework to the initial phase of the COVID-19 pandemic in England as a case study to illustrate its real-world applicability. Consequently, this study systematically investigates the influence of commuting populations and the epidemic conditions of interconnected regions on disease transmission dynamics, using both simulated data and a case study with necessary adaptations to the model to reflect the specific context. Furthermore, we observe the spatial scale itself is a vital factor, as working with a spatial scale where the joint effects of mobility and epidemic situation are not visible may diminish the observable impact of mobility, which may be more apparent at a comparatively coarse resolution.

2 Materials and methods

In this study, we consider interconnected regions characterized by commuting populations, reflecting daily movement patterns from home to various destinations. We use a discrete-time renewal equation based approach [27], integrated with a spatial component, to assess the real-time impact of mobility on transmission scenario. This approach helps us understand how heterogeneous commuting patterns across different spatial scales influence transmission dynamics. New infections in any region arise not only from the infected population within the region but also from the mobile population between connected regions. It provides a thorough evaluation of how regional connectivity affects epidemic dynamics within a specific region. The estimated instantaneous reproduction number using this framework accounts both within region transmission and the effect of epidemic status of neighboring regions with the corresponding inter-regional mobility.

2.1 Modeling infection within a region

To model the infections within a region, we use the renewal-equation based framework [27,28,32], where new infections at time t are described as,

It=(St/N)Rtτ=0t1Iτgtτ (1)

where, St=N(s=1(t1)Is) represents the susceptible population at time t, and N denotes the total population. Eq 1 illustrates that the number of new infections at any given time depends on the prior infections, with their contributions weighted by the generation time distribution, g, which captures time interval between primary and secondary infections. Here g is derived by discretizing the pathogen-specific density function g(t) as gs=s0.5s+0.5g(t)dt for s=2,3,4,... and g1=01.5g(t)dt. The instantaneous reproduction number Rt quantifies the average number of secondary infections caused by a single primary infection at time t. Eq 1 provides a robust foundation for understanding the transmission dynamics within a single region, which we refer to as the national model.

2.2 Extension of the framework with a spatial component

Central to this study is the advancement of the existing epidemiological framework through the integration of a spatially explicit component, which systematically incorporates human mobility data to elucidate the mechanistic role of population movement in shaping the spatiotemporally heterogeneous disease transmission dynamics. A schematic representation of the extended framework is shown in Fig 1. For simplicity, we consider two regions: region 1 and region 2 represented by two different colors. The population in each region comprises susceptible and infected individuals, with a portion of the population commuting between regions. Additionally, we assume that recovery from the disease confers lifetime immunity, as recovered individuals cannot be reinfected. The mobility between two regions mij(ji), represents the proportion of the population of region j that commutes to region i. Hence, in Fig 1, m12 denotes the fraction of the population of region 2 that commutes to region 1, and similarly, m21 represents the reverse flow. The diagonal elements of the mobility matrix mii indicate the fraction of the population that does not commute, staying within their respective regions. The presence of mobile population implies that the infection in any region are influenced not only by local transmission but also by interactions with infected individuals from other connected regions. To illustrate the transmission dynamics in more detail, we focus on infection pathways affecting the population of region 1. As illustrated in Fig 1, the red ovals indicate three different routes through which individuals in region 1 can acquire infection. Susceptibles in region 1 can be exposed to infected individuals within their own region or can acquire infection by visiting region 2 and contract infection there. In summary, infection within any region can occur through three primary pathways: (i) local transmission within the region by its own infected population, (ii) infection through incoming infected commuters from other regions, and (iii) acquisition of infection during visit to another regions. These mobility-driven transmission mechanisms underscore the complexity of disease spread in interconnected populations.

Fig 1. Schematic representation of disease transmission dynamics with mobility.

Fig 1

The diagram illustrates two interconnected regions (Region 1 and Region 2), where individuals are classified as susceptible (S) and infected (I) can move between regions. The colors indicate the region of origin for each individual. The arrows represent mobility flows: m21 (movement from Region 1 to Region 2) and m12 (movement from Region 2 to Region 1). The diagram describes the disease transmission pathways for region 1, highlighted by red ovals: (i) local transmission due to interactions between susceptible and infected individuals of region 1 within region 1, (ii) infected individuals of region 2 commute to region 1, can transmit infection to the susceptible population of region 1, (iii) susceptible individuals from Region 1 who travel to Region 2, acquire the infection in region 2, and return as infected individuals. This framework captures the role of human mobility in shaping spatial disease transmission patterns.

Drawing from these observations, we develop a modeling framework that explicitly incorporates human mobility into disease transmission dynamics. Using Eq 1, as the foundational structure, we extend the model by integrating a spatial component to account for mobility flows between regions. For the modeling framework, we consider a total of M inter-connected regions, and we model the infection dynamics in a region denoted as i, with the number of infections at time t represented as Ii,t, while allowing for population movement across M regions. At time t, the susceptible individuals from region i commuting to another region j are represented as mji(t)St,i. These susceptible individuals are exposed to all the infected population in region j at that time, given by k=1Mmjk(t)τ=0t1Iτ,kgtτ. This interaction between susceptible individuals from region i and infected population in region j results in new infection of region i, albeit occurring in region j. Since individuals from region i can commute to any regions, summing over all regions j yields the total infections in region i that is given by,

It,i=j=1Mmji(t)St,i[l=1Mmjl(t)Nl]·Rt,j[k=1Mmjk(t)τ=0t1Iτ,kgtτ],i=1,2,3,...,M. (2)

Here, the denominator serves as a normalization term, describing the effective population in region j at that time. Rt,j is the reproduction number of region j, indicates the average number of secondary infections occurring from a primary infection in that region. g is the generation time distribution, and Ni is the total population of region i, while St,i represents the number of susceptible individuals left in region i at time t. In the special case of completely isolated regions, the mobility matrix becomes an identity matrix (unity on the diagonal and with zero elsewhere). With this simplification, in the absence of inter-regional mobility, Eq 2 reduces to Eq 1 for each region. In the context of the framework’s time scale, if individuals do not spend an entire day in another region, the framework is flexible enough to adopt that fact. A detailed simulation is provided in S12 Fig. However, in this work, we consider a daily time scale throughout the study, based on the available data.

As the next step, for each region i, the model is fitted to available observational data, such as infection data, case counts, or death data, to estimate the region-specific instantaneous reproduction number Rt. The model can be adjusted based on the nature of the available datasets and the specific context. We demonstrate the utility of our model through two distinct applications. First, we employ simulated daily incidence data as observations to validate the model in a controlled setting. Second, we apply the model to real-world data by analyzing the early spread of SARS-CoV-2 in England, using weekly mortality data as observations. These examples illustrate the flexibility of the framework in accommodating both different types of data and their aggregated nature. This spatially extended framework offers a comprehensive understanding of the impact of human mobility on disease transmission dynamics across interconnected regions. By explicitly incorporating mobility flows into the infection process, the model captures both local and mobility-driven transmission, providing a more realistic representation of pathogen spread in spatially structured populations. Here, we provide a detailed description of three sources of infection.

2.3 Sources of infections

Infections within a region can originate from three different sources. Using Eq 2, we segregate the infected population based on the three different sources of transmission:

  1. Infections driven by own infected population within the region: These infections occur within the region when susceptible individuals contract the disease from the infected population of their own region. This category captures purely local transmission dynamics and can be expressed as:
    It,iin,loc=mii(t)St,i[l=1MmilNl]·Rt,i[mii(t)τ=0t1Iτ,igtτ],i=1,2,3,...,M. (3)
  2. Mobility induced infections within the region: These infections are caused by infected individuals from other regions visiting the region and transmitting the disease to the local susceptible population. This category is defined as infections due to mobility inside the region and can be expressed as,
    It,iin,mob=mii(t)St,i[l=1MmilNl]·Rt,i[k=1,ki,Mmik(t)τ=0t1Iτ,kgtτ],i=1,2,3,...,M. (4)
  3. Mobility induced infections outside the region: These infections occur when individuals from a region commute to other regions, become exposed there, and get infected. This category is defined as infections due to mobility outside the region and can be expressed as :
    It,iout,mob=j=1,jiMmji(t)St,i[l=1MmjlNl]·Rt,j[k=1Mmjk(t)τ=0t1Iτ,kgtτ],i=1,2,3,...,M. (5)

2.4 Simulated data generation

We use a simulated infection dataset to validate the framework. The simulated dataset assumes three regions with total populations of 20,000,000, 10,000,000, and 15,000,000, respectively. These regions are interconnected through a mobility matrix (see Fig 2b), where each column corresponds to the outflow of individuals from a specific region (i.e., the jth column represents outflows from region j), while each row represents the inflows into a region (i.e., the ith row represents inflows into region i). Consequently, the sum of each column equals unity. According to the intrinsic characteristic of the framework, infections are assumed to occur homogeneously within each region governed by the corresponding reproduction numbers, Rt,i,i = 1,2,3 (see Fig 2c). We randomly choose three different countries (IND, GRB, ITA) from the repository [40], and take the time series of Rts for a time period of 350 days. Based on the reproduction numbers, mobility matrix, and using Eq 2, simulated data are generated as shown in Fig 2d. The colors of the reproduction number and incidence data correspond to their respective regions. The generation time distribution is obtained by discretizing the density function g~Gamma(6.5,0.62) [28,41], a SARS-CoV-2 specific generation time distribution. Here, we consider the gamma distribution with mean 6.5 and CV 0.62. However, in real-world scenarios, human mobility, total population, and observational datasets are typically the only available data sources. Using these, the goal is to estimate the instantaneous reproduction numbers Rt, for all regions to understand the epidemic dynamics in the presence of human mobility.

Fig 2. Simulated dataset.

Fig 2

(a) Three regions with the associated mobility flow. (b) The mobility matrix where each element mij denotes the fraction of population of region i commutes to region j. Therefore, the columns show the outflows from the respective region and each row represents the inflow to that specific region. (c) Instantaneous reproduction numbers Rt for the three regions. (d) With the mobility matrix and reproduction numbers, we simulate daily infection data using Eq 2.

2.5 Model adjustments and assumptions for estimating Rt with the simulated data

We define Rt,i as the effective reproduction number for the ith region at time t and is modeled as geometric random walk, where log(Rt,i) is updated according to a normal distribution with the mean log(Rt−1,i) and a standard deviation σi as follows:

log(Rt,i)~𝒩(log(Rt1,i),σi) (6)

Here, R0,i is the reproduction number at t = 0 defined as the average number of secondary cases coming from an infected individual at the start of the outbreak. It is assigned a weak prior log(R0,i)~𝒩(0.1,0.5) to consider a wide range of values. σi~𝒩+(0,0.05) follows a prior of truncated normal distribution (constrained to positive values). For connected Rt estimation, we use the mobility matrix as defined in Fig 2b. We use CmdStan [42] for model fitting, with NUTS sampler. 4 chains are generated in parallel for 500 iterations after 500 warm-up. There are no divergent transitions.

2.6 Model adjustments and assumptions for case study: SARS-CoV-2 in England and North East region of England (2020)

We apply the framework 2 to analyze the initial phases of the COVID-19 pandemic in England during 2020 as a case study. To address the challenge of low reporting rates in the early stages, we use weekly death data to estimate Rt for the nine regions of England: North East, North West, Yorkshire and the Humber, East Midlands, West Midlands, East, London, South East and South West. The mobility matrix (see S3a Fig) is constructed over the nine regions using data from [43], which reports 2011 census estimates for individuals aged 16 years and over in England and Wales with their place of residence and workplace. We calculate the proportions of outflows to obtain the mobility matrix that is compatible with the model 2. During the lockdown periods, we set the mobility matrix to identity (mij=1 if i=j, and 0 otherwise), ignoring inter-regional mobility. The population data are obtained from [44], and weekly death data [45] for each region are shown in S4 Fig. We outline below the framework used to estimate reproduction numbers.

We use the same generation time distribution, g~Gamma(6.5,0.62) [28,41], as in the previous section to simulate infections. For death modeling, we consider the infection-to-death delay distribution π~Gamma(5.1,0.86)+Gamma(17.8,0.45), which combines the infection-to-onset and onset-to-death [28,39,46] delays. Here, the gamma distributions are parametrized as Gamma(mean, CV).

The weekly death data Dw,i for region i in week w is modeled using a negative binomial distribution:

Dw,i~Negative binomial(dw,i,dw,i+dw,i2ψ) (7)

Here, ψ~𝒩+(0,5) [39]. Here, dw,i is the expected number of weekly deaths, calculated as the sum of daily deaths over seven days. Daily deaths dt,i are derived from daily infections It,i using the discrete sum,

dt,i=ifri*τ=0t1Iτ,iπ(tτ), (8)

where ifri*=ifr·N(1,0.1) represent the infection fatality ratio [28] of England with a noise around it corresponding to region i.

To estimate the instantaneous reproduction number Rt,i for each region, we account for temporal changes in human behavior utilizing activity trends derived from the google mobility report [9]. We use five different types of information from Google mobility data: retail and recreation, groceries and pharmacies, parks, transit stations, and workplaces. Here, we define Rt as follows:

Rt,i=R0,i·f(XiPtαt,iϵwt,i), (9)

where, f(x)=2exp(x)/(1+exp(x)), is twice the inverse logit function. R0,i~𝒩(3.28,κ) with κ~𝒩+(0,0.5) for all regions [28,39]. Here, R0,i is the initial reproduction number for region i. In this context, it is the value of the reproduction number before fitting starts. ϵwt,i is the weekly effect that follows a random walk centered at 0, with the week index wt,i=[tt07]+1, where t0 is the first day of seeding.

Xi is the region-specific effect for covariates αt,i, which are the averages of five different dimensions observed from Google mobility data. The implementation of NPIs typically leads to a significant decline in both population mobility and disease transmissibility. However, in many countries, the adoption of social distancing behaviours weakened the correlation between mobility and transmissibility following the relaxation of control measures [4749]. To account for this effect, we introduce the parameter Pt, which captures behavioural changes associated with different lockdown phases, with distinct values x, y, and z representing the periods up to the end of the first lockdown, from the end of the first lockdown to the end of the second lockdown, and the post-second lockdown period, respectively.

We assume that initial seeding begins six weeks prior to the date on which each region in England reaches 10 cumulative reported deaths. From that date, we seed equal initial infections for six consecutive days at each region, modeled as [39]:

I1,i=I2,i=I3,i=I4,i=I5,i=I6,i~Exponential(1τ), (10)
τ~ Exponential(0.01),i=1,2,3,...,M.

By tailoring the model to account for spatial and temporal variations in epidemic trends, the framework provides a robust tool for understanding and managing regional dynamics of SARS-CoV-2 in England in the year 2020. The model fitting is shown in S5 Fig for nine regions of England.

We then apply the framework to a finer spatial scale, focusing to the North East region, which is selected for detailed analysis because nearly 99% of its population does not commute to other regions. This characteristic makes it an ideal area for studying the relationship between human mobility and disease transmission on a more localized scale. The mobility matrix (see S3b Fig) is constructed for the twelve lower-tier local authorities (LTLAs) within the North East region based on the same mobility data [43], ignoring mobility to regions outside of it. We use the same model described in Eq 2 and Sect 2.6, with death data specific to the North East region (see S9 Fig). The model fitting across the 12 LTLAs are illustrated in S10 Fig.

We use CmdStan [42] for model fitting, with NUTS sampler. 4 chains are generated in parallel for 500 iterations after 1000 warm-up. There are no divergent transitions.

3 Results

3.1 Model validation with simulated dataset

To validate the proposed framework (described in Sect 2), we compare the estimated Rts with the ground truth (see Fig 2c) used in the simulation. This validation ensures that the framework reliably captures the transmission dynamics under controlled conditions before applying it to real-world scenarios. Fig 3a3c presents the results for the simulated dataset, showing estimated reproduction numbers for three regions. The disconnected and connected Rt estimates are shown in red and green, representing scenarios without and with inter-regional mobility, respectively. In the disconnected model, mobility-driven infections are ignored, assuming individuals remain within their respective regions and infections occur exclusively within regional boundaries. This simplification produces biased Rt estimates. Table 1 shows the correlation between one-step increments in Rt for the connected (green) and disconnected (red) estimates against the ground truth (blue). The consistently higher correlation between the connected Rt and ground truth supports the observation that disconnected Rt can lead to a severe misleading understanding of transmission patterns.

Fig 3. Estimation of time-varying reproduction number (Rt) using simulated data and contribution of mobility-driven infections.

Fig 3

Top row (a–c): Blue curves represent the true Rt trajectories for three regions, with horizontal dotted lines indicating the epidemic threshold (Rt = 1). Green and red curves show the estimated Rt values with and without incorporating human mobility, respectively. Shaded areas around the curves denote the 95% credible intervals. The deviations between the two estimates highlights the importance of accounting for human mobility in transmission modeling. Middle row (d–f): Stacked plots illustrate the three sources of daily infections. Orange shows the infections caused by the infected population of that same region. Blue shows the infections that generated within the region by infected incoming population form the other regions. Purple reflects mobility-induced infections acquired outside the region by the outgoing susceptibles. Bottom row (g–i): The proportion of infections from three different sources. The colors are considered same as defined for the Fig 3(d)–3(f). These figures represent the considerable number of mobility driven infections in the disease transmission dynamics.

Table 1. Correlation between one-step increments (RtRt1) of true and estimated Rts.

Connected Rt Disconnected Rt
True Rt Region 1 0.89 0.84
Region 2 0.60 0.50
Region 3 0.80 0.69

To further investigate the relationship between mobility-driven infections and overall transmission dynamics, Fig 3d3i shows the absolute number and proportion of infections arising from three distinct sources. These figures indicate that a considerable proportion of infections is influenced by mobility. This effect becomes particularly important when Rt varies across regions and the epidemic behaves differently from one region to another, an effect overlooked by the disconnected Rt.

Table 2 produces a quantitative support that shows a positive correlation between the residuals of disconnected Rt with respect to the ground truth and the proportion of mobility-driven new infections. When the mobile population gets fewer infections in other regions than they would without commuting, disconnected Rt underestimates (residual negative), and if the number of such infections is higher, it overestimates the epidemic (residual positive). For example, around day 100, the disconnected Rt underestimates the true Rt in Region 2. Around that time, as 15% population of region 2 commutes to regions 1 and 3, where reproduction numbers are lower than region 2, it reduces mobility driven infection outside the region, and as well as the total number of infections in region 2. However, disconnected Rt fails to capture this mobility-driven lowering effect outside the region and leads to an underestimation of the true epidemic situation. On the other hand, around day = 220 in region 2, by failing to account for that mobility-driven increasing infection outside the region, disconnected Rt overestimates the epidemic situation. Therefore, disconnected Rt underestimates or overestimates an epidemic when mobility reduces or increases infections compared to a no-mobility scenario. Further discussion of the effects of mobility and regional epidemic conditions on transmission dynamics is provided in S1 and S2 Figs.

Table 2. Correlation between the residuals (disconnected Rt− ground truth) and the proportion of mobility-driven infections.

Residuals (Estimated disconnected Rt - Ground truth)
Proportion of mobility driven infection Region 1 0.73
Region 2 0.94
Region 3 0.94

These results validate the effectiveness and necessity of incorporating human mobility into the model to accurately estimate reproduction numbers and better understand the epidemic situation. This approach is essentially critical in scenarios involving spatial heterogeneity in transmission dynamics.

3.2 Case study: Instantaneous reproduction number estimates over the regions of England

In Fig 4, we present the estimates of the connected and disconnected instantaneous reproduction number (Rt) (mean and 95% credible interval) for the nine regions of England, comparing results with and without account for mobility. The vertical dotted lines indicate the start and end of the first and second lockdowns. We observe that the connected and disconnected estimated Rts are nearly identical across all regions. The absolute difference between connected and disconnected Rts are shown in S8 Fig. After an early discrepancy for a few regions (East, London, South East) maximum mean absolute difference reaches 0.25. This suggests that at regional scale, mobility does not have a significant impact on infections after the early transmission phase. Upon examining the mobility matrix (see S3a Fig), it is observed that the fraction of the population commuting between regions is very small relative to the total population of each region. This makes the regions of England treatable as disjoint areas, where the disconnected model is sufficient for Rt estimation at this spatial scale. Transmission at the regional level can be treated as homogeneous spreading within regions, with minimal influence from inter-regional mobility. This suggests that, when the disease is already established in all regions, mobility can be reasonably excluded from models at such spatial scales. The absolute number and the proportion of daily infections originating from three distinct sources across the 9 regions are shown in S6 and S7 Figs. The Rhat statistics regarding the estimated Rts are shown in S15 Fig.

Fig 4. Estimated Rt over different regions of England.

Fig 4

The red curve shows the disconnected Rt which is the reproduction number estimated separately for each region without considering mobility across them. Green curve is the connected Rt that is the estimated reproduction number with the considered mobility matrix showed in S3a Fig. Each curve is shown with its corresponding 95% credible interval. The vertical dotted lines show the first and second lockdown period, and the horizontal black dashed line is the threshold for reproduction number that is Rt = 1.

3.3 Findings on higher spatial resolution: LTLAs of North East region

To assess whether the estimates provide any additional or different insights when applied at a higher spatial resolution, where the mobility is considerably greater than the previous case. The high proportion of mobile populations between the LTLAs of North East region has a notable impact on disease transmission, as evidenced by Fig 5. Significant differences in the estimated reproduction numbers are observed in LTLAs such as Hartlepool, Middlesbrough, and North and South Tyneside. The absolute difference between connected and disconnected Rts are shown in S11 Fig. In Middlesbrough, the difference reaches 1.09(0.7-1.3) in the second week of September. Similarly, for North Tyneside it reaches 1.15(1.03-1.2). These variations highlight that a considerable number of infections arise from movement between LTLAs, as shown in Fig 6, and the estimated disconnected Rts fail to accurately capture these transmission dynamics, often resulting in underestimation or overestimation of the disease transmission. This underscores the importance of incorporating intra-regional mobility into models to better understand localized disease transmission patterns.

Fig 5. Estimated Rt over the LTLAs of North East region.

Fig 5

The green and red curve shows the estimation of real-time reproduction number with and without considering mobility respectively. The dotted vertical lines represents the lockdown period and black horizontal line shows the threshold for reproduction number at Rt = 1. Red and green bands show the 95% uncertainty interval corresponding to the disconnected and connected Rt estimates. The mobility matrix is shown in S3b Fig.

Fig 6. Daily infections from three different sources over the LTLAs of North East region.

Fig 6

Orange color represents the infections occur within the LTLA due to the infected population of that same LTLA. Blue and purple depict the mobility induced infections happen within and outside the LTLA. The dotted vertical lines indicate two lockdown periods while mobility driven infections go down and most of the infections are generated by the local transmission. Here, the lines represent the mean of the estimates and band shows the 95% uncertainty around it.

In Figs 6 and 7, we examine the absolute number and the proportion of daily infections originating from three distinct sources across the 12 LTLAs, providing direct evidence of a substantial portion of infections arising from mobility. In particular, areas such as Hartlepool, Middlesbrough, Newcastle upon Tyne, and North Tyneside show mobility-driven infections contributing to approximately 50% of the total cases, with the proportion rising to as high as 75% in North Tyneside. During the lockdown periods, the reduction in mobility led to a marked decrease in mobility-driven infections, resulting in transmission dynamics being predominantly driven by local spread. These findings underline the crucial role of inter-LTLAs mobility in shaping transmission dynamics and emphasize the importance of accounting for such mobility in public health interventions and resource allocation strategies. Additionally, in S13 and S14 Figs, we present an analysis showing that the main findings are robust to the choice of time scale.

Fig 7. Proportion of daily infections from three different sources across the LTLAs of the North East region.

Fig 7

Orange, blue, and purple indicate the proportions of daily infections attributed to three distinct sources. The dotted vertical lines represent two lockdown periods, during which nearly all infections resulted from local transmission.

4 Discussion

In this study, we developed a spatially extended epidemiological framework that improves the renewal equation based disease transmission model by incorporating human mobility. Unlike approaches that treat mobility as a reductive factor of behavioral changes within a single region [27,28], we explicitly accounted for incoming and outgoing infected populations, offering a more comprehensive representation of spatial transmission dynamics. The results highlight the substantial role of mobile population in shaping the dynamics of infectious disease transmission and in influencing the estimation of time-varying reproduction numbers (Rt). While some recent studies have attempted to integrate mobility flows into transmission models [38], further investigation is needed to address key aspects such as the contribution of different sources of infection and the role of spatial scales. Through applications to both simulated data and real-world SARS-CoV-2 death data in England, our study addresses these limitations and demonstrates that overlooking inter-regional mobility can lead to significant biases in epidemic assessment and incorrect depiction of disease transmission trends.

Using simulated data, we validated our framework by comparing estimated Rt values with known ground truth trajectories. In scenarios with heterogeneous epidemic intensities across regions, we observed that disconnected models (which ignores mobility) often tend to overestimate or underestimate the true Rt, as they fail to account for mobility-driven infections that redistribute transmission risk across regions. When mobility is not accounted for, estimated reproduction numbers (Rt) of each region tend to closely follow the rise or fall in total infections in that region. In such cases, increasing trend in infections simply lead to higher disconnected Rt estimates, and similarly, when infections decline, disconnected Rt estimates also decrease. It does not account for the infections that happen by the infected population from other regions due to mobility. In contrast, the connected model incorporated a mobility matrix that captured human mobility flow, hence the estimated Rt aligned closely with the ground truth and revealing the key contribution of mobility-driven infections to the observed epidemic patterns. The decomposition of infection sources into local, incoming, and outgoing mobility-driven transmissions provided more insights about the effect of mobility on the transmission process. These results proves the importance of accounting for human movement to improve the precision of epidemic models and real-time surveillance.

We applied this framework to SARS-CoV-2 weekly death data across nine regions of England, our framework revealed that mobility had a limited impact at regional scale, when disease is already established in every region. At this spatial scale (regional level), the proportion of the population commuting between regions was minimal, and transmission occurred primarily within each region. Consequently, disconnected models provided comparable estimates of Rt, suggesting that for large administrative units with limited inter-regional mobility, the added complexity of mobility integration may not be essential for real-time epidemic tracking. However, the situation changed markedly at a different spatial resolution. Focusing on the 12 Lower Tier Local Authorities (LTLAs) within the North East region, we observed substantial inter-LTLA mobile population and, correspondingly, a clear difference between connected and disconnected Rt estimates. In certain LTLAs such as North Tyneside and Middlesbrough, mobility-driven infections accounted for up to 75% of total infections at some time points. During lockdown periods, reductions in mobility were reflected in a shift back toward local transmission dominance. These findings underscore that the impact of mobility on transmission and the utility of mobility-informed modeling may depend on the considered spatial scale of the analysis.

Our framework provides a flexible and computationally efficient alternative to individual-based models that require extensive data and computational resources. By incorporating real-world human mobility flows, this model supports a more detailed understanding of how regional transmission trends interact and evolve over time. In the context of spatially heterogeneous epidemics, the impact of population movement on transmission dynamics becomes more pronounced, where differences in reproduction numbers across regions can trigger outbreaks in low-risk areas through population movement. This is especially relevant in real-world contexts, where such disparities are common during the early phases of an epidemic. As the epidemic progresses and spreads more uniformly across regions, local variations may still emerge due to differences in policy implementation, healthcare capacity, demographic profiles, or socio-economic factors. For public health policymakers, these insights are extremely useful. Accurate estimation of spatially connected Rt values can guide targeted interventions, such as localized lockdowns or vaccination prioritization, especially in regions where mobility plays a substantial role in disease spread. Failing to account for these dynamics may result in missed opportunities to suppress transmission or, conversely, in unnecessary restrictions in regions not driving epidemic growth.

A key strength of this framework lies in its generalizability. It can be used for different diseases as the disease-specific epidemiological parameters such as serial interval, incubation period, infection-to-death delay are adapted within the framework. It can be applied to different mobility settings, as an example in this study we considered a high and a low inter-regional mobility settings. The ability to integrate various types of observational data like cases data, hospitalization data, mortality data and even with temporally aggregated data, makes this framework extremely valuable during the early phases of an epidemic or in settings with limited case reporting, and researchers need to rely on the alternative data sources.

A significant challenge when working with mobility-induced transmission models is the limited availability of mobility data. Various datasets serve as proxy for human mobility with its advantages and limitations, at this stage in our case study, we relied on publicly available data to mitigate data constraints. However, these data are often aggregated and may not reflect real-time changes in movement patterns, such as those induced by behavioral shifts or NPIs. Future work could enhance this model by integrating dynamic mobility data, such as mobile phone-based metrics, and extending it to capture other heterogeneities, such as age structure or vaccination coverage. Additionally, specifying a particular mobility threshold that impacts disease transmission is beyond the scope of this work, as we obtained that transmission dynamics are jointly shaped by the mobility and epidemic situations across different regions. Furthermore, the choice of spatial resolution requires careful attention, as at very fine spatial scales, the assumption of homogeneous mixing within each region may be violated.

5 Conclusions

This study highlights the importance of incorporating human mobility explicitly within the model to better understand its impact on disease transmission dynamics. The incidence of infections does not solely depend on local outbreaks in a region with commuting populations, instead, our framework demonstrates that commuting populations significantly contribute to the emergence of new infections in connected regions. Consequently, to gain a comprehensive understanding of the epidemic situation in any area, integrating human mobility into the model is essential to mitigate biases that may arise from mobility patterns.

Supporting information

S1 Fig. Impact of mobility in a three-region system with identical epidemic conditions.

Each region has an equal population of 1,000,000, the mobility matrices and Rt values are specified in the figure. From left to right, we consider the scenarios with decreasing outgoing mobility and increasing non-commuting population. In the rightmost panel of S1 Fig the regions are disconnected (no inter-regional travel), so the mobility matrix is an identity matrix (all off-diagonal elements are zero). S1d-f Fig shows daily infections by region. Since the epidemic conditions are homogeneous, the incidence trajectories are identical across regions even when outflows differ. However, mobility redistributes the location of infection acquisition. S1g-o Fig shows that, although daily case counts are equal for the three regions, infections occur in different destination regions depending on the mobility pattern.

(TIFF)

pcbi.1013642.s001.tif (1.8MB, tif)
S2 Fig. Impact of mobility in a three-region system with heterogeneous epidemic conditions.

Each region has an equal population of 1,000,000, the mobility matrices and Rt values are specified in the figure. From left to right, we consider the scenarios with decreasing outgoing mobility and increasing non-commuting population. In the rightmost panel of S2 Fig the regions are disconnected (no inter-regional travel), so the mobility matrix is an identity matrix (all off-diagonal elements are zero). S2d–f Fig shows daily infections by region. As the epidemic conditions are heterogeneous over regions, the incidence trajectories are different based on the mobility and local epidemic condition. For example, region 2 is with a controlled situation with Rt<1, however, in S2d–e Fig, there is an outbreak because of the mobility-driven importation, and in the absence of any mobile population it fails to generate any outbreak (See S2f Fig). S2g–o Fig shows the different sources of infections occur in different destination regions depending on the mobility pattern. These simulations underscore that both human mobility and regional epidemic status jointly shape disease transmission dynamics. Mobility has a significant impact in heterogeneous epidemic scenarios, where differences in transmission intensity across regions can lead to outbreaks in low-risk areas through population movement. This is particularly relevant in real-world settings, where such heterogeneity is common during the early stages of an epidemic. As the epidemic becomes widespread and more uniform across regions, local disparities can appear due to the differences in policy implementation, healthcare capacity, demographic characteristics, or socio-economic conditions.

(TIFF)

S3 Fig. Mobility matrix.

In this figure, the mobility matrices illustrate the commuting patterns across the regions of England (see S3a Fig) and Lower Tier Local Authorities (LTLAs) within the North East region (see S3b Fig). Each column in the matrix represents the fraction of a region’s (or LTLA’s) population that commutes to other regions (LTLAs), signifying the outflow from the corresponding region (LTLA) indicated in the figure. On the other hand, the rows depict the inflow to any given region (LTLA) from all other regions (LTLAs). Notably, the diagonal elements (indicated by green box) represent the fraction of the non-commuting population, those who remain within their own region (LTLA). As each column reflects the distribution of a region’s (LTLA’s) population across all destinations, the sum of each column is equal to one. In S3a Fig, the mobility data for the regions of England reveals that a significant majority of the population does not commute to other regions on a daily basis for work or other purposes. This may be the reason for the lack of observable impact at this broader spatial resolution. In contrast, S3b Fig details a substantial number of commuters among LTLAs within the North East region of England. The data indicates that approximately 15% to 25% of individuals are commuting to other LTLAs, which appears to significantly influence transmission dynamics.

(TIFF)

pcbi.1013642.s003.tif (1.1MB, tif)
S4 Fig. Weekly death data over the nine regions of England.

Vertical lines show the lockdown periods in England.

(TIFF)

pcbi.1013642.s004.tif (799.3KB, tif)
S5 Fig. Model Fitting with the connected and disconnected model for nine regions of England.

The green (red) curve is the fitting with the connected (disconnected) model and 95% credible intervals are shown around the curve. Orange dots are the weekly deaths observed for each region. Dotted vertical lines show the lockdown periods.

(TIFF)

pcbi.1013642.s005.tif (937.4KB, tif)
S6 Fig. Estimated daily infections across nine regions of England from three different sources.

The colored segments illustrate the contributions from each source: orange indicates infections generated by the own infected population within their own region, blue denotes infections acquired locally due to visits of infected individuals from other regions, and purple represents infections contracted outside the region as a result of travel. Vertical lines show the lockdown periods.

(TIFF)

pcbi.1013642.s006.tif (945.8KB, tif)
S7 Fig. Proportion of daily infections for nine regions of.

The proportion of daily infections coming from three different sources and vertical dotted lines represent the lockdown periods.

(TIFF)

pcbi.1013642.s007.tif (744.4KB, tif)
S8 Fig. Absolute residuals between connected and disconnected Rts (shown in Fig 4 in the main text) for 9 regions of England.

Blue dots and red crosses show the mean and median of the absolute differences at each time point. Grey bars show the corresponding 95% credible intervals.

(TIFF)

pcbi.1013642.s008.tif (954.4KB, tif)
S9 Fig. Weekly death data over the twelve LTLAs of North East region.

Vertical lines show the lockdown periods in England.

(TIFF)

pcbi.1013642.s009.tif (1,018.6KB, tif)
S10 Fig. Model Fitting with the connected and disconnected model for twelve LTLA’s of North East region.

The green (red) curve is the fitting with the connected (disconnected) model and 95% credible intervals are shown around the curve. Orange dots are weekly deaths observed for each LTLA. Dotted vertical lines show the lockdown periods.

(TIFF)

pcbi.1013642.s010.tif (1.3MB, tif)
S11 Fig. Absolute residuals between connected and disconnected Rts (shown in Fig 5 in the main text) for 12 LTLAs of North East region of England.

Blue dots and red crosses denote the mean and median of the absolute differences at each time point. Grey bars show the corresponding 95% credible intervals.

(TIFF)

pcbi.1013642.s011.tif (1.6MB, tif)
S12 Fig. Flexibility of the framework in adapting different time-scales.
We consider three regions with populations of 15,00,000, 5,00,000, 10,00,000, respectively. The reproduction numbers are Rt(1)=1.1, Rt(2)=1.7, Rt(3)=1.1, where Rt(i) represents the reproduction number at time t for region i. The mobility matrices are considered as follows:
Cday=[Region1Region 2Region 3Region 10.70.070.21Region 20.250.850.09Region 30.050.080.7]
Cnight=[Region1Region 2Region 3Region 1100Region 2010Region 3001]
where, Cij denotes the fraction of population of region j commute to region i.

In S12a Fig, we consider a daily time-scale, where the mobility matrix is represented by Cday and remains constant throughout the unit time (a full day). In S12b Fig, we consider a finer time scale as half-day intervals. For the first half of the day the mobility matrix is considered as Cday and for the second half of the day the mobility matrix is Cnight to reflect that people typically return to their home regions at night. The generation time distribution is also adjusted accordingly.

Here, Rt(2)>Rt(1),Rt(3) and therefore, in S12b Fig, when population spends half of the day in their home regions, infections increase in region 2 and decrease in region 1 and region 3 in comparison to S12a Fig.

This shows the flexibility of this framework in adapting different time scales.

(TIFF)

pcbi.1013642.s012.tif (439.2KB, tif)
S13 Fig. Effect of time scale on the estimated Rt over the regions of England.

We consider two time scales: “day" (same as connected Rt as in Fig 4 in the main text), which are represented by the green curves in S13a-i Fig, and “day-night" where unit time is half of a day and represented by the orange curves, and disconnected Rts are shown by the red curves. In the “day-night" case, the first half of a day uses the mobility matrix as shown in S3a Fig, and for the second half we consider the identity matrix to account for the fact that people return to their home region at night. No such difference appears in the results based on the time-scale. We quantify this by calculating absolute residuals |Rt,i(day)Rt,i(daynight)| between green and orange curves (see S13a1-i1 Fig). Here, blue dots and purple crosses are the mean and median of the residual distribution at each time point, respectively. The corresponding error bars are represented in light blue color.

(TIFF)

pcbi.1013642.s013.tif (2.3MB, tif)
S14 Fig. Effect of time scale on the estimated Rt for the LTLAs of North East region.

We consider two time scales: “day" (same as connected Rt as in Fig 5 in the main text), which are represented by the green curves in S14a-l Fig, “day-night", where unit time is half of a day and represented by the orange curves, and disconnected Rts are shown by the red curves. In the “day-night" case, the first half of a day uses the mobility matrix as shown in S3b Fig, and for the second half we consider the identity matrix to account for the fact that people return to their home region at night. We quantify this by calculating absolute residuals |Rt,i(day)Rt,i(daynight)| between green and orange curves (see S14a1-l1 Fig). Here, blue dots and purple crosses are the mean and median of the residual distribution at each time point, respectively. The corresponding error bars are represented in light blue color. Therefore, in general results obtained using daily time scale seem robust when comparing them to a potentially more realistic time scale (day-night).

(TIFF)

pcbi.1013642.s014.tif (3.4MB, tif)
S15 Fig. Distribution of Rhat statistics corresponding to the estimated Rts for the 9 regions of England.

Values less than 1.01 indicate MCMC convergence.

(TIFF)

pcbi.1013642.s015.tif (511.2KB, tif)

Acknowledgments

We are thankful to Zhi Ling for helping with computational understanding with STAN. We acknowledge MLGH Network for its role in facilitating scientific exchange that inspired aspects of this research.

Data Availability

All the codes and data related to this study are available at https://github.com/mlgh-sg/Spatial_transmission and https://doi.org/10.5281/zenodo.17077496.

Funding Statement

SM and MR acknowledge support from National Research Foundation, Singapore, under its NRF FELLOWSHIP (NRF-NRFF15 - 2023 - 0010) awarded to SM. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Belik V, Geisel T, Brockmann D. Natural human mobility patterns and spatial spread of infectious diseases. Phys Rev X. 2011;1(1). doi: 10.1103/physrevx.1.011001 [DOI] [Google Scholar]
  • 2.Buckee C, Noor A, Sattenspiel L. Thinking clearly about social aspects of infectious disease transmission. Nature. 2021;595(7866):205–13. doi: 10.1038/s41586-021-03694-x [DOI] [PubMed] [Google Scholar]
  • 3.Riley S. Large-scale spatial-transmission models of infectious disease. Science. 2007;316(5829):1298–301. doi: 10.1126/science.1134695 [DOI] [PubMed] [Google Scholar]
  • 4.Balcan D, Colizza V, Gonçalves B, Hu H, Ramasco JJ, Vespignani A. Multiscale mobility networks and the spatial spreading of infectious diseases. Proc Natl Acad Sci U S A. 2009;106(51):21484–9. doi: 10.1073/pnas.0906910106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Zhang J, Litvinova M, Liang Y, Wang Y, Wang W, Zhao S, et al. Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China. Science. 2020;368(6498):1481–6. doi: 10.1126/science.abb8001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Klamser PP, Zachariae A, Maier BF, Baranov O, Jongen C, Schlosser F, et al. Inferring country-specific import risk of diseases from the world air transportation network. PLoS Comput Biol. 2024;20(1):e1011775. doi: 10.1371/journal.pcbi.1011775 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Russell TW, Wu JT, Clifford S, Edmunds WJ, Kucharski AJ, Jit M, et al. Effect of internationally imported cases on internal spread of COVID-19: a mathematical modelling study. Lancet Public Health. 2021;6(1):e12–20. doi: 10.1016/S2468-2667(20)30263-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Charaudeau S, Pakdaman K, Boëlle P-Y. Commuter mobility and the spread of infectious diseases: application to influenza in France. PLoS One. 2014;9(1):e83002. doi: 10.1371/journal.pone.0083002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Google. COVID-19 community mobility report. 2020. https://www.google.com/covid19/mobility/
  • 10.Xiong C, Hu S, Yang M, Luo W, Zhang L. Mobile device data reveal the dynamics in a positive relationship between human mobility and COVID-19 infections. Proc Natl Acad Sci U S A. 2020;117(44):27087–9. doi: 10.1073/pnas.2010836117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kermack WO. A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society of London Series A, Containing Papers of a Mathematical and Physical Character. 1927;115(772):700–21.
  • 12.Barabási A-L, Bonabeau E. Scale-free networks. Sci Am. 2003;288(5):60–9. doi: 10.1038/scientificamerican0503-60 [DOI] [PubMed] [Google Scholar]
  • 13.González MC, Hidalgo CA, Barabási A-L. Understanding individual human mobility patterns. Nature. 2008;453(7196):779–82. doi: 10.1038/nature06958 [DOI] [PubMed] [Google Scholar]
  • 14.Bansal S, Grenfell BT, Meyers LA. When individual behaviour matters: homogeneous and network models in epidemiology. J R Soc Interface. 2007;4(16):879–91. doi: 10.1098/rsif.2007.1100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Keeling MJ, Eames KTD. Networks and epidemic models. J R Soc Interface. 2005;2(4):295–307. doi: 10.1098/rsif.2005.0051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Colizza V, Vespignani A. Epidemic modeling in metapopulation systems with heterogeneous coupling pattern: theory and simulations. J Theor Biol. 2008;251(3):450–67. doi: 10.1016/j.jtbi.2007.11.028 [DOI] [PubMed] [Google Scholar]
  • 17.Chinazzi M, Davis JT, Ajelli M, Gioannini C, Litvinova M, Merler S, et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science. 2020;368(6489):395–400. doi: 10.1126/science.aba9757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Tizzoni M, Bajardi P, Poletto C, Ramasco JJ, Balcan D, Gonçalves B, et al. Real-time numerical forecast of global epidemic spreading: case study of 2009 A/H1N1pdm. BMC Med. 2012;10:165. doi: 10.1186/1741-7015-10-165 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ball F, Britton T, House T, Isham V, Mollison D, Pellis L, et al. Seven challenges for metapopulation models of epidemics, including households models. Epidemics. 2015;10:63–7. doi: 10.1016/j.epidem.2014.08.001 [DOI] [PubMed] [Google Scholar]
  • 20.Allen LJ. An introduction to stochastic epidemic models. Mathematical epidemiology. Springer. 2008. p. 81–130.
  • 21.DeAngelis DL. Individual-based models and approaches in ecology: populations, communities and ecosystems. CRC Press; 2018.
  • 22.Hu T, Wang S, She B, Zhang M, Huang X, Cui Y, et al. Human mobility data in the COVID-19 pandemic: characteristics, applications, and challenges. International Journal of Digital Earth. 2021;14(9):1126–47. doi: 10.1080/17538947.2021.1952324 [DOI] [Google Scholar]
  • 23.Shi L, Chi G, Liu X, Liu Y. Human mobility patterns in different communities: a mobile phone data-based social network approach. Annals of GIS. 2015;21(1):15–26. doi: 10.1080/19475683.2014.992372 [DOI] [Google Scholar]
  • 24.Barbosa H, Barthelemy M, Ghoshal G, James CR, Lenormand M, Louail T, et al. Human mobility: models and applications. Physics Reports. 2018;734:1–74. doi: 10.1016/j.physrep.2018.01.001 [DOI] [Google Scholar]
  • 25.Simini F, González MC, Maritan A, Barabási A-L. A universal model for mobility and migration patterns. Nature. 2012;484(7392):96–100. doi: 10.1038/nature10856 [DOI] [PubMed] [Google Scholar]
  • 26.Schläpfer M, Dong L, O’Keeffe K, Santi P, Szell M, Salat H, et al. The universal visitation law of human mobility. Nature. 2021;593(7860):522–7. doi: 10.1038/s41586-021-03480-9 [DOI] [PubMed] [Google Scholar]
  • 27.Bhatt S, Ferguson N, Flaxman S, Gandy A, Mishra S, Scott JA. Semi-mechanistic Bayesian modelling of COVID-19 with renewal processes. Journal of the Royal Statistical Society Series A: Statistics in Society. 2023;186(4):601–15. doi: 10.1093/jrsssa/qnad030 [DOI] [Google Scholar]
  • 28.Flaxman S, Mishra S, Gandy A, Unwin HJT, Mellan TA, Coupland H, et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature. 2020;584(7820):257–61. doi: 10.1038/s41586-020-2405-7 [DOI] [PubMed] [Google Scholar]
  • 29.Parag KV, Thompson RN, Donnelly CA. Are epidemic growth rates more informative than reproduction numbers?. J R Stat Soc Ser A Stat Soc. 2022;:10.1111/rssa.12867. doi: 10.1111/rssa.12867 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Parag KV, Cori A, Obolski U. When is the R= 1 epidemic threshold meaningful?. medRxiv. 2024;:2024–10.
  • 31.Abbott S, Hellewell J, Thompson RN, Sherratt K, Gibbs HP, Bosse NI, et al. Estimating the time-varying reproduction number of SARS-CoV-2 using national and subnational case counts. Wellcome Open Res. 2020;5:112. doi: 10.12688/wellcomeopenres.16006.1 [DOI] [Google Scholar]
  • 32.Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am J Epidemiol. 2013;178(9):1505–12. doi: 10.1093/aje/kwt133 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Gostic KM, McGough L, Baskerville EB, Abbott S, Joshi K, Tedijanto C, et al. Practical considerations for measuring the effective reproductive number, Rt. PLoS Comput Biol. 2020;16(12):e1008409. doi: 10.1371/journal.pcbi.1008409 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Nash RK, Bhatt S, Cori A, Nouvellet P. Estimating the epidemic reproduction number from temporally aggregated incidence data: a statistical modelling approach and software tool. PLoS Comput Biol. 2023;19(8):e1011439. doi: 10.1371/journal.pcbi.1011439 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Ogi-Gittins I, Hart WS, Song J, Nash RK, Polonsky J, Cori A, et al. A simulation-based approach for estimating the time-dependent reproduction number from temporally aggregated disease incidence time series data. Epidemics. 2024;47:100773. doi: 10.1016/j.epidem.2024.100773 [DOI] [PubMed] [Google Scholar]
  • 36.Gorji H, Stauffer N, Lunati I. Emergence of the reproduction matrix in epidemic forecasting. J R Soc Interface. 2024;21(216):20240124. doi: 10.1098/rsif.2024.0124 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Green W, Ferguson N, Cori A. Inferring the reproduction number using the renewal equation in heterogeneous epidemics. J R Soc Interface. 2022;19(188):20210429. doi: 10.1098/rsif.2021.0429 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Trevisin C, Bertuzzo E, Pasetto D, Mari L, Miccoli S, Casagrandi R, et al. Spatially explicit effective reproduction numbers from incidence and mobility data. Proc Natl Acad Sci U S A. 2023;120(20):e2219816120. doi: 10.1073/pnas.2219816120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Unwin HJT, Mishra S, Bradley VC, Gandy A, Mellan TA, Coupland H, et al. State-level tracking of COVID-19 in the United States. Nat Commun. 2020;11(1):6189. doi: 10.1038/s41467-020-19652-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Reproduction numbers for simulated data are chosen arbitrarily for three countries (IND, GRB, ITA) from the repository. [cited 2023 Oct 1]. https://github.com/covid-19-Re/dailyRe-Data
  • 41.Bi Q, Wu Y, Mei S, Ye C, Zou X, Zhang Z, et al. Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. Lancet Infect Dis. 2020;20(8):911–9. doi: 10.1016/S1473-3099(20)30287-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Stan Development Team. Stan Modeling Language Users Guide and Reference Manual. 2024. https://mc-stan.org/docs/2_36/reference-manual/index.html
  • 43.Census 2011 travel to work matrix, England and Wales. https://www.nomisweb.co.uk/census/2011/wf02ew
  • 44.National population projections: 2020 -based interim. https://www.ons.gov.uk/peoplepopulationandcommunity/populationandmigration/populationprojections/bulletins/nationalpopulationprojections/2020basedinterim
  • 45.Death registrations and occurrences by local authority and health board. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/causesofdeath/datasets/deathregistrationsandoccurrencesbylocalauthorityandhealthboard
  • 46.Verity R, Okell LC, Dorigatti I, Winskill P, Whittaker C, Imai N, et al. Estimates of the severity of coronavirus disease 2019 : a model-based analysis. Lancet Infect Dis. 2020;20(6):669–77. doi: 10.1016/S1473-3099(20)30243-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Alessandretti L. What human mobility data tell us about COVID-19 spread. Nat Rev Phys. 2022;4(1):12–3. doi: 10.1038/s42254-021-00407-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Nouvellet P, Bhatia S, Cori A, Ainslie KEC, Baguelin M, Bhatt S, et al. Reduction in mobility and COVID-19 transmission. Nat Commun. 2021;12(1):1090. doi: 10.1038/s41467-021-21358-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Gozzi N, Perra N, Vespignani A. Comparative evaluation of behavioral epidemic models using COVID-19 data. Proc Natl Acad Sci U S A. 2025;122(24):e2421993122. doi: 10.1073/pnas.2421993122 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1013642.r001

Decision Letter 0

Denise Kühnert

24 Jun 2025

PCOMPBIOL-D-25-00829

Incorporating human mobility to enhance epidemic response and estimate real-time reproduction numbers

PLOS Computational Biology

Dear Dr. Roy,

Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

​Please submit your revised manuscript within 60 days Aug 24 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below.

* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter

We look forward to receiving your revised manuscript.

Kind regards,

Nicola Perra

Academic Editor

PLOS Computational Biology

Denise Kühnert

Section Editor

PLOS Computational Biology

Journal Requirements:

1) Please ensure that the CRediT author contributions listed for every co-author are completed accurately and in full.

At this stage, the following Authors/Authors require contributions: Mousumi Roy, Hannah E. Clapham, and Swapnil Mishra. Please ensure that the full contributions of each author are acknowledged in the "Add/Edit/Remove Authors" section of our submission form.

The list of CRediT author contributions may be found here: https://journals.plos.org/ploscompbiol/s/authorship#loc-author-contributions

2) We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type u2018LaTeX Source Fileu2019 and leave your .pdf version as the item type u2018Manuscriptu2019.

3) Please upload all main figures as separate Figure files in .tif or .eps format. For more information about how to convert and format your figure files please see our guidelines: 

https://journals.plos.org/ploscompbiol/s/figures

4) Some material included in your submission may be copyrighted. According to PLOSu2019s copyright policy, authors who use figures or other material (e.g., graphics, clipart, maps) from another author or copyright holder must demonstrate or obtain permission to publish this material under the Creative Commons Attribution 4.0 International (CC BY 4.0) License used by PLOS journals. Please closely review the details of PLOSu2019s copyright requirements here: PLOS Licenses and Copyright. If you need to request permissions from a copyright holder, you may use PLOS's Copyright Content Permission form.

Please respond directly to this email and provide any known details concerning your material's license terms and permissions required for reuse, even if you have not yet obtained copyright permissions or are unsure of your material's copyright compatibility. Once you have responded and addressed all other outstanding technical requirements, you may resubmit your manuscript within Editorial Manager. 

Potential Copyright Issues:

i) Figure 1. Please confirm whether you drew the images / clip-art within the figure panels by hand. If you did not draw the images, please provide (a) a link to the source of the images or icons and their license / terms of use; or (b) written permission from the copyright holder to publish the images or icons under our CC BY 4.0 license. Alternatively, you may replace the images with open source alternatives. See these open source resources you may use to replace images / clip-art:

- https://commons.wikimedia.org

- https://openclipart.org/.

5) We note that your Data Availability Statement is currently as follows: "All relevant data are within the manuscript and its Supporting Information files.". Please confirm at this time whether or not your submission contains all raw data required to replicate the results of your study. Authors must share the “minimal data set” for their submission. PLOS defines the minimal data set to consist of the data required to replicate all study findings reported in the article, as well as related metadata and methods (https://journals.plos.org/plosone/s/data-availability#loc-minimal-data-set-definition). 

For example, authors should submit the following data:  

1) The values behind the means, standard deviations and other measures reported;

2) The values used to build graphs;

3) The points extracted from images for analysis.. 

Authors do not need to submit their entire data set if only a portion of the data was used in the reported study. 

If your submission does not contain these data, please either upload them as Supporting Information files or deposit them to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of recommended repositories, please see https://journals.plos.org/plosone/s/recommended-repositories.  

If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. If data are owned by a third party, please indicate how others may request data access. 

6) Please amend your detailed Financial Disclosure statement. This is published with the article. It must therefore be completed in full sentences and contain the exact wording you wish to be published.

1) State what role the funders took in the study. If the funders had no role in your study, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

2) If any authors received a salary from any of your funders, please state which authors and which funders.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Authors:

Please note that one of the reviews is uploaded as an attachment.

Reviewer #1: This work presents a new mathematical approach including mobility between regions to the computation of the net reproduction number. It presents results from a simulation study and from applying the methodology to real data from SARS-CoV-2 in England.

Overall, I found the study interesting. The new equation proposed is quite straightforward and easy to understand, and at the same time it aims to tackle the mobility issue in the computation of the transmissibility in a certain area, which I personally did not see before.

Major suggestion:

However, this study doesn’t take into account the fact that people potentially do not spend all day in another region different from the one where their residence is. An option would be to divide the day in segments in which the matrix “m” changes. Perhaps it would be interesting to see how much this affects the results. Also / Alternatively, it would be nice to justify how your model somehow can account for the fact that the presence of some individuals in other regions only lasts a fraction of the day (week).

Please, find below more detailed comments:

- Abstract: “This analysis also investigates the spatial scalability of the framework, indicating that lower spatial resolution can diminish the effect of inter-regional mobility on the disease transmission, and we conclude that utilizing a finer spatial scale is advantageous on the basis of data availability and computational resources to obtain a better picture of the detailed transmission dynamics.” - Isn’t this the usual result? Is it possible to quantify somehow “how much better / worse” results are with a certain increase / decrease of resolution? Or explain it in terms of the significance of the mobility network?

- Introduction, line 6: SARS-CoV-2.

- Line 7: NPI, to be mentioned first the meaning of the acronym: “non-pharmaceutical interventions (NPIs)”.

- Line 18: models… lead…

- Consider adding parenthesis in equation 2 for easier readability.

- I assumed the “time” in the model is “daily” (discrete nature of the sum), but that could be a bit unrealistic w.r.t. the mobility patterns. I understand that each time step could be anything, in theory? Could it be specified? Human mobility does not usually affect the whole day.

- A bit of repetitions in the text of the methods section: e.g. lines 184-185 explaining the meaning of the diagonal elements of the “m” matrix (people who stay in their region) was already implied before (line 143). Repetition again in lines 206-207.

- Personally, section 2.5 is not fully clear to me. You simulated Rt and not the cases in this experiment? And then the cases are generated from the mobility-inclusive renewal equation. Perhaps an extra line in the prior section specifying Rt will be simulated (assumed), and an extra line in this section explaining the choice to simulate Rt would improve the readability and clarity of the work.

- I miss a short explanation of the choices made in section 2.6 (some distributions, mean choices, etc.). Can be in the Appendix and referenced here.

- For example, line 254: “The 95% credible intervals representing the uncertainty, are shown as the shaded bands…” – shouldn’t all these kind of explanations be restricted to the caption of the figure?

- Results in Figure 3, aren’t they to be expected? The epidemic was simulated using the same equations that now predict Rt better (i.e. connected model). Perhaps results could be presented a bit differently: not in terms of “how much better this model fits w.r.t. to not considering mobility” (expected) but in terms of the risks of computing Rt without mobility and therefore misreading the real transmissibility of the epidemic – pointing out at specific scenarios when this happens (over/under estimation).

- Sentence in rows 280-282 is a clear example of the point above.

- Lines 288-296: again, shouldn’t it be just in the caption of the figure? + Some of those panels described in these lines were already mentioned before. As a reader, I already went through those graphics and now it feels like an unnecessary / misplaced / repetitive explanation.

- Section 3.2: slightly repetitive / long to present a single and clear result: no significant differences observed when mobility affects only a small % of the studied population.

- Section 3.3: Personally, I see this section as one of the key results of this work. Perhaps I missed it, but is the mobility matrix different during lockdown periods? I believe it should change when lockdown was in place. From Figures 6 and 7, it seems mobility was not considered (mij = 0 for i!=j) during lockdown? Please specify it in the text how were lockdowns dealt with (full lockdown, reduced mobility, …).

- Lines 352-355: in the caption.

- The discussion in terms of “finer resolutions” (line 369, for example) shouldn’t it be more accurate saying: when > xx% of the population commutes between studied regions. I understand getting a specific percentage as a threshold might not be possible / easy, but perhaps the text would benefit from a discussion in terms of “presence / absence of significant mobility” and not only in terms of “resolution”. It has been better described in the discussion section already. Also, because, if the “scale is too fine”, the underlying assumption of homogenous mixing inside each region would become problematic. Perhaps add this as a limitation in the discussion.

- Slight “double sense” that came to my mind: finer scale in terms of more regions or in terms of smaller regions? I bet at a scale in which mobility is significant. Just try to be clear on that.

- Lines 452-453: “within” alone (no “into”). // In regions with commuting populations, …

- Lines 461-463: add as limitation in the discussion.

Reviewer #2: the review has been uploaded as an attachment

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No: References to key data sources are missing

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

Figure resubmission:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. If there are other versions of figure files still present in your submission file inventory at resubmission, please replace them with the PACE-processed versions.

Reproducibility:

To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Attachment

Submitted filename: roy_pcb_review.pdf

pcbi.1013642.s016.pdf (102.8KB, pdf)
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1013642.r003

Decision Letter 1

Denise Kühnert

5 Oct 2025

  1. PCOMPBIOL-D-25-00829R1

Incorporating human mobility to enhance epidemic response and estimate real-time reproduction numbers

PLOS Computational Biology

Dear Dr. Roy,

Thank you for submitting your manuscript to PLOS Computational Biology. After careful consideration, we feel that it has merit but does not fully meet PLOS Computational Biology's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript within 30 days Dec 05 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at ploscompbiol@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pcompbiol/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

* A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. This file does not need to include responses to formatting updates and technical items listed in the 'Journal Requirements' section below.

* A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

* An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, competing interests statement, or data availability statement, please make these updates within the submission form at the time of resubmission. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Nicola Perra

Academic Editor

PLOS Computational Biology

Denise Kühnert

Section Editor

PLOS Computational Biology

Journal Requirements:

If the reviewer comments include a recommendation to cite specific previously published works, please review and evaluate these publications to determine whether they are relevant and should be cited. There is no requirement to cite these works unless the editor has indicated otherwise.

1) We have noticed that you have uploaded Supporting Information files, but you have not included a list of legends. Please add a full list of legends for your Supporting Information files after the references list.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Thank you for the detailed responses and the throughout revision of my comments. I’m in general satisfied with the changes made on response to my initial comments, and I would suggest it for publication. Only a couple of additional comments:

Fig R3 shows in some cases non-negligible differences on the estimation of Rt. Although is true that, in general, Rt presents a similar trend by considering as time unit either 1 or ½ a day, differences of up to ~0.7 were observed, which can lead to quite different transmission outcomes. Also, finding somewhat different Rt’s when considering different mobility patterns seems reasonable, and somewhat the aim of your work. Perhaps it would be better to say that in the absence of mobility data at a finer scale than “daily”, results seem still robust when comparing them to a potentially “more realistic scenario” (day-night).

Perhaps I am missing something, but section 2.5 is still not clear to me. How are you exactly estimating Rt? Via an MCMC or ABC or…? I miss it to be clearer in this section how the fitting is done, other than the priors and “jumps”. From what I read in page 9, you are using the software CmdStan for model fitting? Try to be organized and clear on the methodological details.

Reviewer #2: I would like to thank the authors for successfully addressing all of my previous points. Methods are now clearly explained and results are supported by quantitative evidence.

I have a couple of (very) minor suggestions on novel additions to the introduction:

1) Lines 20-22: in more advanced compartmental models we can relax, at least partially, the assumption of homogeneous mixing via strategies such as contact matrices. As such, I think it would be more correct to say

"Compartmental mechanistic models [11], in the absence of a spatial component and more advanced mixing assumptions (e.g., contact matrices) may lead to biased estimation due to their assumption of homogeneous mixing, a condition that rarely reflects the real-world mixing pattern."

2) Similarly for the metapopulation model. It is not necessary to assume homogeneous mixing within subpopulations. As such, that claim can be removed or relaxed. I also think the following should be either removed or changed “...approach, it is widely used to study the 29 mechanism behind an outbreak, rather than to evaluate the real time transmission 30 dynamics”. There are notable examples of metapopulation approaches used to study real-time dynamics of infectious diseases, especially at the beginning of emerging outbreaks when it is crucial to estimate importation routes and probabilities, see for example: https://www.science.org/doi/10.1126/science.aba9757, https://pmc.ncbi.nlm.nih.gov/articles/PMC3585792/

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

Figure resubmission:

While revising your submission, we strongly recommend that you use PLOS’s NAAS tool (https://ngplosjournals.pagemajik.ai/artanalysis) to test your figure files. NAAS can convert your figure files to the TIFF file type and meet basic requirements (such as print size, resolution), or provide you with a report on issues that do not meet our requirements and that NAAS cannot fix.

After uploading your figures to PLOS’s NAAS tool - https://ngplosjournals.pagemajik.ai/artanalysis, NAAS will process the files provided and display the results in the "Uploaded Files" section of the page as the processing is complete. If the uploaded figures meet our requirements (or NAAS is able to fix the files to meet our requirements), the figure will be marked as "fixed" above. If NAAS is unable to fix the files, a red "failed" label will appear above. When NAAS has confirmed that the figure files meet our requirements, please download the file via the download option, and include these NAAS processed figure files when submitting your revised manuscript.

Reproducibility:

To enhance the reproducibility of your results, we recommend that authors of applicable studies deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1013642.r005

Decision Letter 2

Denise Kühnert

21 Oct 2025

Dear Dr Roy,

We are pleased to inform you that your manuscript 'Incorporating human mobility to enhance epidemic response and estimate real-time reproduction numbers' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Nicola Perra

Academic Editor

PLOS Computational Biology

Denise Kühnert

Section Editor

PLOS Computational Biology

***********************************************************

I am happy to accept the manuscript. The new version of the paper without blue highlights has some formatting issues (references are missing) that should be solved in the next steps

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1013642.r006

Acceptance letter

Denise Kühnert

PCOMPBIOL-D-25-00829R2

Incorporating human mobility to enhance epidemic response and estimate real-time reproduction numbers

Dear Dr Roy,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

For Research, Software, and Methods articles, you will receive an invoice from PLOS for your publication fee after your manuscript has reached the completed accept phase. If you receive an email requesting payment before acceptance or for any other service, this may be a phishing scheme. Learn how to identify phishing emails and protect your accounts at https://explore.plos.org/phishing.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Judit Kozma

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Impact of mobility in a three-region system with identical epidemic conditions.

    Each region has an equal population of 1,000,000, the mobility matrices and Rt values are specified in the figure. From left to right, we consider the scenarios with decreasing outgoing mobility and increasing non-commuting population. In the rightmost panel of S1 Fig the regions are disconnected (no inter-regional travel), so the mobility matrix is an identity matrix (all off-diagonal elements are zero). S1d-f Fig shows daily infections by region. Since the epidemic conditions are homogeneous, the incidence trajectories are identical across regions even when outflows differ. However, mobility redistributes the location of infection acquisition. S1g-o Fig shows that, although daily case counts are equal for the three regions, infections occur in different destination regions depending on the mobility pattern.

    (TIFF)

    pcbi.1013642.s001.tif (1.8MB, tif)
    S2 Fig. Impact of mobility in a three-region system with heterogeneous epidemic conditions.

    Each region has an equal population of 1,000,000, the mobility matrices and Rt values are specified in the figure. From left to right, we consider the scenarios with decreasing outgoing mobility and increasing non-commuting population. In the rightmost panel of S2 Fig the regions are disconnected (no inter-regional travel), so the mobility matrix is an identity matrix (all off-diagonal elements are zero). S2d–f Fig shows daily infections by region. As the epidemic conditions are heterogeneous over regions, the incidence trajectories are different based on the mobility and local epidemic condition. For example, region 2 is with a controlled situation with Rt<1, however, in S2d–e Fig, there is an outbreak because of the mobility-driven importation, and in the absence of any mobile population it fails to generate any outbreak (See S2f Fig). S2g–o Fig shows the different sources of infections occur in different destination regions depending on the mobility pattern. These simulations underscore that both human mobility and regional epidemic status jointly shape disease transmission dynamics. Mobility has a significant impact in heterogeneous epidemic scenarios, where differences in transmission intensity across regions can lead to outbreaks in low-risk areas through population movement. This is particularly relevant in real-world settings, where such heterogeneity is common during the early stages of an epidemic. As the epidemic becomes widespread and more uniform across regions, local disparities can appear due to the differences in policy implementation, healthcare capacity, demographic characteristics, or socio-economic conditions.

    (TIFF)

    S3 Fig. Mobility matrix.

    In this figure, the mobility matrices illustrate the commuting patterns across the regions of England (see S3a Fig) and Lower Tier Local Authorities (LTLAs) within the North East region (see S3b Fig). Each column in the matrix represents the fraction of a region’s (or LTLA’s) population that commutes to other regions (LTLAs), signifying the outflow from the corresponding region (LTLA) indicated in the figure. On the other hand, the rows depict the inflow to any given region (LTLA) from all other regions (LTLAs). Notably, the diagonal elements (indicated by green box) represent the fraction of the non-commuting population, those who remain within their own region (LTLA). As each column reflects the distribution of a region’s (LTLA’s) population across all destinations, the sum of each column is equal to one. In S3a Fig, the mobility data for the regions of England reveals that a significant majority of the population does not commute to other regions on a daily basis for work or other purposes. This may be the reason for the lack of observable impact at this broader spatial resolution. In contrast, S3b Fig details a substantial number of commuters among LTLAs within the North East region of England. The data indicates that approximately 15% to 25% of individuals are commuting to other LTLAs, which appears to significantly influence transmission dynamics.

    (TIFF)

    pcbi.1013642.s003.tif (1.1MB, tif)
    S4 Fig. Weekly death data over the nine regions of England.

    Vertical lines show the lockdown periods in England.

    (TIFF)

    pcbi.1013642.s004.tif (799.3KB, tif)
    S5 Fig. Model Fitting with the connected and disconnected model for nine regions of England.

    The green (red) curve is the fitting with the connected (disconnected) model and 95% credible intervals are shown around the curve. Orange dots are the weekly deaths observed for each region. Dotted vertical lines show the lockdown periods.

    (TIFF)

    pcbi.1013642.s005.tif (937.4KB, tif)
    S6 Fig. Estimated daily infections across nine regions of England from three different sources.

    The colored segments illustrate the contributions from each source: orange indicates infections generated by the own infected population within their own region, blue denotes infections acquired locally due to visits of infected individuals from other regions, and purple represents infections contracted outside the region as a result of travel. Vertical lines show the lockdown periods.

    (TIFF)

    pcbi.1013642.s006.tif (945.8KB, tif)
    S7 Fig. Proportion of daily infections for nine regions of.

    The proportion of daily infections coming from three different sources and vertical dotted lines represent the lockdown periods.

    (TIFF)

    pcbi.1013642.s007.tif (744.4KB, tif)
    S8 Fig. Absolute residuals between connected and disconnected Rts (shown in Fig 4 in the main text) for 9 regions of England.

    Blue dots and red crosses show the mean and median of the absolute differences at each time point. Grey bars show the corresponding 95% credible intervals.

    (TIFF)

    pcbi.1013642.s008.tif (954.4KB, tif)
    S9 Fig. Weekly death data over the twelve LTLAs of North East region.

    Vertical lines show the lockdown periods in England.

    (TIFF)

    pcbi.1013642.s009.tif (1,018.6KB, tif)
    S10 Fig. Model Fitting with the connected and disconnected model for twelve LTLA’s of North East region.

    The green (red) curve is the fitting with the connected (disconnected) model and 95% credible intervals are shown around the curve. Orange dots are weekly deaths observed for each LTLA. Dotted vertical lines show the lockdown periods.

    (TIFF)

    pcbi.1013642.s010.tif (1.3MB, tif)
    S11 Fig. Absolute residuals between connected and disconnected Rts (shown in Fig 5 in the main text) for 12 LTLAs of North East region of England.

    Blue dots and red crosses denote the mean and median of the absolute differences at each time point. Grey bars show the corresponding 95% credible intervals.

    (TIFF)

    pcbi.1013642.s011.tif (1.6MB, tif)
    S12 Fig. Flexibility of the framework in adapting different time-scales.
    We consider three regions with populations of 15,00,000, 5,00,000, 10,00,000, respectively. The reproduction numbers are Rt(1)=1.1, Rt(2)=1.7, Rt(3)=1.1, where Rt(i) represents the reproduction number at time t for region i. The mobility matrices are considered as follows:
    Cday=[Region1Region 2Region 3Region 10.70.070.21Region 20.250.850.09Region 30.050.080.7]
    Cnight=[Region1Region 2Region 3Region 1100Region 2010Region 3001]
    where, Cij denotes the fraction of population of region j commute to region i.

    In S12a Fig, we consider a daily time-scale, where the mobility matrix is represented by Cday and remains constant throughout the unit time (a full day). In S12b Fig, we consider a finer time scale as half-day intervals. For the first half of the day the mobility matrix is considered as Cday and for the second half of the day the mobility matrix is Cnight to reflect that people typically return to their home regions at night. The generation time distribution is also adjusted accordingly.

    Here, Rt(2)>Rt(1),Rt(3) and therefore, in S12b Fig, when population spends half of the day in their home regions, infections increase in region 2 and decrease in region 1 and region 3 in comparison to S12a Fig.

    This shows the flexibility of this framework in adapting different time scales.

    (TIFF)

    pcbi.1013642.s012.tif (439.2KB, tif)
    S13 Fig. Effect of time scale on the estimated Rt over the regions of England.

    We consider two time scales: “day" (same as connected Rt as in Fig 4 in the main text), which are represented by the green curves in S13a-i Fig, and “day-night" where unit time is half of a day and represented by the orange curves, and disconnected Rts are shown by the red curves. In the “day-night" case, the first half of a day uses the mobility matrix as shown in S3a Fig, and for the second half we consider the identity matrix to account for the fact that people return to their home region at night. No such difference appears in the results based on the time-scale. We quantify this by calculating absolute residuals |Rt,i(day)Rt,i(daynight)| between green and orange curves (see S13a1-i1 Fig). Here, blue dots and purple crosses are the mean and median of the residual distribution at each time point, respectively. The corresponding error bars are represented in light blue color.

    (TIFF)

    pcbi.1013642.s013.tif (2.3MB, tif)
    S14 Fig. Effect of time scale on the estimated Rt for the LTLAs of North East region.

    We consider two time scales: “day" (same as connected Rt as in Fig 5 in the main text), which are represented by the green curves in S14a-l Fig, “day-night", where unit time is half of a day and represented by the orange curves, and disconnected Rts are shown by the red curves. In the “day-night" case, the first half of a day uses the mobility matrix as shown in S3b Fig, and for the second half we consider the identity matrix to account for the fact that people return to their home region at night. We quantify this by calculating absolute residuals |Rt,i(day)Rt,i(daynight)| between green and orange curves (see S14a1-l1 Fig). Here, blue dots and purple crosses are the mean and median of the residual distribution at each time point, respectively. The corresponding error bars are represented in light blue color. Therefore, in general results obtained using daily time scale seem robust when comparing them to a potentially more realistic time scale (day-night).

    (TIFF)

    pcbi.1013642.s014.tif (3.4MB, tif)
    S15 Fig. Distribution of Rhat statistics corresponding to the estimated Rts for the 9 regions of England.

    Values less than 1.01 indicate MCMC convergence.

    (TIFF)

    pcbi.1013642.s015.tif (511.2KB, tif)
    Attachment

    Submitted filename: roy_pcb_review.pdf

    pcbi.1013642.s016.pdf (102.8KB, pdf)
    Attachment

    Submitted filename: Response to Reviewers.pdf

    pcbi.1013642.s017.pdf (4.1MB, pdf)
    Attachment

    Submitted filename: Response to reviewer2.pdf

    pcbi.1013642.s018.pdf (212.3KB, pdf)

    Data Availability Statement

    All the codes and data related to this study are available at https://github.com/mlgh-sg/Spatial_transmission and https://doi.org/10.5281/zenodo.17077496.


    Articles from PLOS Computational Biology are provided here courtesy of PLOS

    RESOURCES