Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2021 Oct 8;113:47–54. doi: 10.1016/j.ijid.2021.10.007

Predicting the effective reproduction number of COVID-19: inference using human mobility, temperature, and risk awareness

Sung-mok Jung 1,2, Akira Endo 3, Andrei R Akhmetzhanov 4, Hiroshi Nishiura 1,
PMCID: PMC8498007  PMID: 34628020

Abstract

Objectives

The effective reproduction number (Rt) has been critical for assessing the effectiveness of countermeasures during the coronavirus disease 2019 (COVID-19) pandemic. Conventional methods using reported incidences are unable to provide timely Rt data due to the delay from infection to reporting. Our study aimed to develop a framework for predicting Rt in real time, using timely accessible data — i.e. human mobility, temperature, and risk awareness.

Methods

A linear regression model to predict Rt was designed and embedded in the renewal process. Four prefectures of Japan with high incidences in the first wave were selected for model fitting and validation. Predictive performance was assessed by comparing the observed and predicted incidences using cross-validation, and by testing on a separate dataset in two other prefectures with distinct geographical settings from the four studied prefectures.

Results

The predicted mean values of Rt and 95% uncertainty intervals followed the overall trends for incidence, while predictive performance was diminished when Rt changed abruptly, potentially due to superspreading events or when stringent countermeasures were implemented.

Conclusions

The described model can potentially be used for monitoring the transmission dynamics of COVID-19 ahead of the formal estimates, subject to delay, providing essential information for timely planning and assessment of countermeasures.

Keywords: COVID-19, effective reproduction number, regression model, mobility, temperature, Japan

1. Introduction

The first confirmed case of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection was reported in Japan on January 15, 2020; since then, the transmission of coronavirus disease 2019 (COVID-19) has continuously affected the entire country. As the incidence of COVID-19 began to surge, a state of emergency was declared by the national government on April 16, requesting the voluntary reduction of physical contact, which likely helped to suppress the epidemic (Jung et al., 2021). However, resuming socioeconomic activities in late May led to a resurgence of cases. Although a temporal decline in incidence was observed, the country experienced the third wave from late October 2020. In response, prefectural governments with a large number of cases requested bars and restaurants to curtail their operational hours from late November (Ministry of Health, Labour and Welfare, 2020a). Despite these measures, the spread of disease continued, leading the national government to declare a second state of emergency on January 7, 2021, asking citizens to refrain from non-essential outings (Cabinet Relations Office, 2021).

As part of evaluation of countermeasures, the effective reproduction number (Rt), defined as the expected number of secondary cases arising from a single primary case at calendar time t, is widely used for monitoring trends in community transmission (Nishiura et al., 2009). However, timely and accurate estimation of Rt using incidence data remains challenging. First, to precisely link the timing of a control measure and resulting changes in the transmission trend, it is vital to estimate Rt as a function of the infection time (Gostic et al., 2020). Since infection times are rarely observed in practice, they need to be estimated while accounting for empirical delay distributions. Moreover, the estimation of Rt is further complicated because of right truncation with respect to the time interval from infection to reporting. In real-time practice, the number of recent infections is underestimated due to cases that are already infected but not yet reported. Considering the empirically observed time delay of around 9 days from infection to reporting in Japan, the estimated Rt from reported incidences is likely to be biased, at least within 9 days from the latest reporting date.

Considering that SARS-CoV-2 transmission is facilitated by human-to-human contact, digital proxies of human mobility patterns can provide an important route to inferring the directly unobservable transmission patterns as a function of time. Indeed, various datasets of mobility patterns have started to become widely available during the ongoing COVID-19 pandemic, and have been used to monitor the time-dependent patterns of physical distancing (Buckee et al., 2020; Kishore et al., 2020; Leung et al., 2021; Nouvellet et al., 2021; Xiong et al., 2020). Moreover, published studies have reported that temperature is inversely associated with COVID-19 transmission (Li et al., 2020; Pequeno et al., 2020; Qi et al., 2020; Smith et al., 2021; Ujiie et al., 2020; Wang et al., 2021). These data are often more readily accessible than case counts (which are typically subject to delays of ∼ 9 days), and thus may enable near real-time assessment of interventions if they can be used to predict Rt. Furthermore, quantified risk awareness can also help to predict more accurate Rt values, because induced adherence to personal protective behaviors (e.g. wearing a mask or washing hands) can reduce virus transmissions (West et al., 2020).

Accumulated evidence suggests that integrating human mobility with temperature and risk awareness reflects contact patterns as a function of time. Thus, the integrative model could provide an opportunity for the timely prediction of Rt during the ongoing COVID-19 pandemic. Our study aimed to develop a simple statistical framework to predict values of Rt, using key driving factors of COVID-19 transmission that can be used before a formal estimate relying on reported case counts is available.

2. Methods

2.1. Empirical datasets in Japan

2.1.1. Epidemiological data

The information on confirmed cases of COVID-19 in six prefectures in Japan — Tokyo, Osaka, Aichi, Hokkaido, Fukuoka, and Okinawa — was retrieved from a publicly available data source, from January 16, 2020 through February 15, 2021 (Ministry of Health, Labour and Welfare, 2020b). The data for the six prefectures were categorized into two types: ‘training data’ and ‘test data’. The training data were used for the model training and cross-validation, while the test data were from regions not included in the training data, and were used to evaluate the predictive performance of the trained model in different geographical settings. To ensure that a sufficient number of cases was included in the training data, four prefectures (Tokyo, Osaka, Aichi, and Hokkaido), accounting for more than 200 cases before the first state of emergency, were selected for the training data. The test data comprised those for the other two prefectures (Fukuoka and Okinawa), which are geographically distinct from those in the training data. According to the history of travelling abroad within 7 days prior to the illness onset, all cases were classified as either imported or domestic. In total, 217 258 confirmed COVID-19 cases, including 166 imported cases, were included in the study. Within the training data, 32 130 and 150 706 cases were assigned to the period for model training (‘training period’) and cross-validation (‘validation period’), respectively, while 22 379 cases were included in the test data.

2.1.2. Reconstruction of COVID-19 incidence according to infection time

Of the 217 258 total cases, were 114 277 (53%) were confirmed COVID-19 cases with unknown illness onset dates. Thus, to estimate Rt as a function of infection time, a back-projection was conducted in two steps. First, the missing illness onset dates for these cases were back-projected from the date of laboratory confirmation, with a right-truncated time interval distribution from illness onset to laboratory confirmation, in each prefecture (Jung et al., 2021). Second, infection times were back-projected for all reported COVID-19 cases, from either the observed or back-projected illness onset date, using the incubation period distribution (Linton et al., 2021). The R package ‘Surveillance’ (Höhle, 2007) was used for nonparametric back-projection.

2.1.3. Explanatory variable data

Human mobility patterns and temperatures were hypothesized as driving factors for COVID-19 transmission, and so those datasets for the abovementioned time period were collected. Google community mobility reports (hereafter Google mobility) were used to capture the human mobility patterns (Google, 2021). Google mobility data provide six different categories (‘retail and recreation’, ‘grocery and pharmacy’, ‘parks’, ‘transit stations’, ‘workplaces’, and ‘residential’) of changes in human mobility, relative to the average for each category on the same day of the week in the pre-pandemic period (i.e. January 3 to February 6, 2020). Mobility patterns relating to ‘retail and recreation’ were used in our analyses, based on our domain knowledge that this category likely represents mobility in the close-contact settings associated with COVID-19 transmission (Cazelles et al., 2021). Daily temperature data were retrieved from the Japan Meteorological Agency (Japan Meteorological Agency, 2021) by selecting one representative observatory close to the location of the government office for each prefecture. To extract the overall trend for these time-series, without being influenced by daily noise, both datasets were smoothed by taking a 7-day moving average.

2.2. Model and statistical analysis

A prediction model for Rt, integrating human mobility, temperature, and risk awareness, was designed and evaluated using the following steps. First, three candidate regression models were fitted to the reconstructed COVID-19 incidences during the training period via the renewal process. The performances of these candidate models were then compared by cross-validation against the estimated Rt from data in the validation period, and the best model was selected. Last, the predictive performance of the best-ranked model was evaluated using the separate test data, by comparing estimated Rt values with the predicted values produced by the trained model to determine the applicability of the model to other geographic settings.

2.2.1. Model for predicting the effective reproduction number

Three simple regression models that incorporated different combinations of explanatory factors for COVID-19 transmission were proposed: only Google mobility (Model 1), Google mobility and temperature (Model 2), and Google mobility, temperature, and risk awareness over COVID-19 (Model 3). In each model, corresponding variables were included in a log-linear regression form. The Rt in the model with all three factors (Model 3) was formulated as follows:

Rt,i=R0,iexp(ami(t)+bki(t)+cmin(wlimit,i,wi(t))), (1)

where R0,i is the baseline effective reproduction number in prefecture i. R0,i was assumed to be similar for all four prefectures of the training data group, following a gamma distribution with a mean R¯ and fixed coefficient of variation (CV) of 0.5 (Park et al., 2020). The value of CV was fixed due to the small number of prefectures, but a varied CV in the range 0.25–1 produced similar results. Covariates mi(t) and ki(t) are smoothed values of Google mobility and temperature at calendar time t and in prefecture i, respectively. wi(t) is the degree of risk awareness at calendar time t and in prefecture i; this was graded by assuming that it was linearly associated with the smoothed number of newly reported COVID-19 cases, following a positive association between confirmed cases and risk perception found in a UK study, using longitudinal data (Schneider et al., 2021). The effect of this variable wi(t) was capped at a predefined upper limit (wlimit,i), corresponding to the government definition of ‘highest alert level’ incidence in Japan (i.e. 25 confirmed cases per 100 000 population in a week) (Cabinet Relations Office, 2020). According to the definition, a daily number of cases giving an upper limit for each prefecture i (wlimit,i) was specified as 497 in Tokyo, 315 in Osaka, 270 in Aichi, 188 in Hokkaido, 182 in Fukuoka, and 52 in Okinawa. The Rt values in Models 1 and 2 were specified for models whose coefficients for non-included variables in Equation (1) were fixed at 0.

A likelihood function was constructed for the proposed regression models of Rt (Models 1–3), based on the renewal process, and the corresponding parameters of each regression model were estimated by fitting them to the COVID-19 incidence during the training period. The expected number of daily reported domestic cases cdomestic,i(t) at calendar time t in a given prefecture i was calculated using the equation:

E(cdomestic,i(t))=Rt,iτ=1t1ctotal,i(tτ)g(τ)Fi(Tt)Fi(Tt+τ), (2)

where ctotal,i(t) denotes the total (imported + domestic) daily number of COVID-19 cases at time t reported in prefecture i. g(·) is the probability mass function (PMF) of generation time (Nishiura et al., 2020). To account for the right truncation, the cumulative mass function, Fi(·), for the time delay from infection to report was calculated by convoluting the PMF of the incubation period and of the time interval from illness onset to reporting in prefecture i (see Section 2 - 1, 2). The training period used covered the first pandemic wave in Japan (March 15 to May 1, 2020) and part of the second wave (July 15 to August 31, 2020). The intra-wave period from May 1 to 15 July 15, 2020 was excluded because of the low count numbers observed during that period. For parameter estimation, a Poisson likelihood was used:

L(θ;cdomestic,i(t))=t=1Texp(E(cdomestic,i(t)))(E(cdomestic,i(t)))cdomestic,i(t)cdomestic,i(t)!, (3)

where the set θ includes all or some of the parameters (i.e., R0,i, R¯, a, b, and c) specific for each of the three proposed models. The maximum likelihood method was employed, with 95% confidence intervals (CIs) for each parameter derived from 10 000 samples from a Laplace approximation normal distribution.

2.2.2. Model selection through cross-validation

To select the best model among those proposed, a cross-validation was conducted by comparing the predicted and estimated ‘ground-truth’ Rt during the validation period (from September 1, 2020 to January 31, 2021) in the four prefectures. Each regression model produced predictive Rt values for the validation period using the explanatory variables based on the estimated parameters in Method 2-2-1. The Rt values estimated from incidence data via the renewal process (Equation (2)) were used as the ground-truth. Rt was estimated as a free time-dependent parameter from the COVID-19 incidences during the validation period using the following likelihood function:

L(Rt,i;cdomestic,i(t))=t=1Texp(E(cdomestic,i(t)))(E(cdomestic,i(t)))cdomestic,i(t)cdomestic,i(t)!. (4)

The 95% CIs were derived using the profile likelihood method. Considering the right-truncation in the recent reported incidences, the estimated Rt for the latest 15 days (February 1–15, 2020) were excluded from the cross-validation. Furthermore, to smooth out abrupt fluctuations (e.g. superspreading events) in the estimated Rt values, a 7-day moving average was taken. Next, a predictive performance of each model (i.e. a comparison between the predicted and estimated Rt during the validation period) was quantitatively assessed using four different measurements: bias, root-mean-square error (RMSE), ranked probability score, and Dawid-Sebastiani score (Funk et al., 2019), and the best model was selected. In addition, the number of predicted infections using the conditional forecasting method (i.e. forecasting a future incidence based on the predicted Rt and empirically reported incidences in the past) was also compared against the back-projected incidence by infection dates.

2.2.3. Evaluation of the predictive performance using the test data

Lastly, a predictive performance of the finally selected model was evaluated using the test data, indicating a potential applicability of the proposed model to other geographical settings. Accordingly, the Rt values in two test data prefectures (Fukuoka and Okinawa) were predicted from July 15, 2020 through January 31, 2021, relying only on the trained model, and were compared against the estimated Rt from the renewal process. In addition, the predicted number of infections using the conditional forecasting method were compared with the empirical data in each of the two prefectures.

3. Results

Figure 1 shows the epidemic curve of COVID-19 by reporting date in the four selected training data prefectures, along with the time trend of Google mobility and temperature. Substantial reductions in Google-based mobility were observed in all regions during the first and second state of emergency, consistent with only small numbers of reported cases by the end of the declaration. The human mobility patterns tended to show abrupt increases on consecutive national holidays and, accordingly, the number of reported cases increased slightly after roughly 9 days, consistent with the empirical time delay from infection to reporting in Japan.

Figure 1.

Figure 1

Epidemiological curves for COVID-19 by date of report, along with trends of Google mobility and temperature in Tokyo, Osaka, Aichi, and Hokkaido, Japan.

The number of COVID-19 cases by date of report (yellow bars) from March 15, 2020 to January 12, 2021 in the four training data prefectures of Japan: (A) Tokyo, (B) Osaka, (C) Aichi, and (D) Hokkaido. Red lines indicate trends of changes in human mobility patterns (the ‘retail and recreation’ category in Google community mobility reports) compared with those in the pre-pandemic period (January 3 to February 6, 2020). Blue lines show trends for the daily temperature in each region; both mobility and temperature were smoothed using 7-day moving averages. In each figure, two red-shaded areas indicate the time periods for the first and second nationwide states of emergency, respectively, while purple-shaded areas show the time periods when stringent restrictions on bars and restaurants were implemented by each prefectural government right before the second state of emergency.

Table 1 shows the estimated parameters from the data in the training period and summarizes the predictive performance of proposed models during the validation period. Among the three models, the model that incorporated Google mobility, temperature, and risk awareness (Model 3) was selected as the best model, based on the Dawid-Sebastiani score, while the model that accounted for only Google mobility (Model 1) showed the best performance with regard to RMSE, bias, and ranked probability score. The predicted time trend of Rt using Model 1 (an almost linear trend close to the value of 1; Figure S1) was less informative compared with Model 3, indicating that Model 1 would be not suitable for time trend analysis. Thus, Model 3 was selected as the best model. Using Model 3, the baseline value R0,i was estimated at 4.40 (95% CI 4.07–4.73) for Tokyo, 2.85 (95% CI 2.62–3.07) for Osaka, 2.20 (95% CI 2.02–2.38) for Aichi, and 1.99 (95% CI: 1.84–2.15) for Hokkaido. The regression coefficient accounting for Google mobility was estimated to be 0.03 per percentage point (95% CI 0.03–0.03), while temperature and risk awareness were negatively associated with estimated coefficients of −0.02 per degree Celsius (95% CI −0.02 to −0.01) and −0.12 per 100 reported cases (95% CI −0.15 to −0.10), respectively. All coefficients were statistically significant.

Table 1.

Summary of the estimated parameters and predictive performances of the proposed models

Model Parameter Value Bias RMSE Ranked probability score Dawid-Sebastiani score
Model 1 R0 (Tokyo) 1.74 (1.66–1.82) 0.28 1.65 168.94 479 953
R0 (Osaka) 1.37 (1.32–1.41)
R0 (Aichi) 1.15 (1.11–1.19)
R0 (Hokkaido) 1.33 (1.25–1.41)
Google mobility 0.02 (0.01–0.02)
Model 2 R0 (Tokyo) 4.39 (4.05–4.72) -1.06 2.29 264.41 328 849
R0 (Osaka) 3.23 (3.02–3.44)
R0 (Aichi) 2.59 (2.43–2.75)
R0 (Hokkaido) 2.09 (1.95–2.23)
Google mobility 0.03 (0.03–0.03)
Temperature −0.03 (−0.03 to −0.03)
Model 3 R0 (Tokyo) 4.40 (4.07–4.73) 0.55 1.77 171.60 147 027
R0 (Osaka) 2.85 (2.62–3.08)
R0 (Aichi) 2.20 (2.02–2.38)
R0 (Hokkaido) 1.99 (1.84–2.15)
Google mobility 0.03 (0.03–0.03)
Temperature −0.02 (−0.02 to −0.01)
Risk awareness −0.12 (−0.15 to −0.10)

Model 1: Google mobility; Model 2: Google mobility and temperature; Model 3: Google mobility, temperature, and risk awareness; RMSE: root-mean-square error

Each value describes the estimated parameters of three different models and their 95% confidence intervals derived by maximum likelihood estimation and Laplace approximation normal distribution. Predictive performances of each model were compared using three measures (root-mean-square error, ranked probability score, and Dawid-Sebastiani score) and by summing estimates of four regions in Japan (prefectures for the training data — Osaka, Tokyo, Aichi, and Hokkaido).

Figure 2 shows the comparison of estimated and predicted Rt for the four training data prefectures, using the best-ranked Model 3. Although the model was fitted to cases over a relatively short period of time, the predicted values and 95% CIs captured the temporal patterns of the estimated Rt in all the prefectures well, showing clear signs of Rt rising above 1 at the beginning of the second and third waves of the epidemic. However, with the implementation of stringent countermeasures (e.g. requesting reduced opening hours for night-life spots) from November 2020, the predicted Rt showed larger deviations from the estimated values, suggesting a diminished predictive performance. The predicted Rt values using Model 1 (Figure S1) and Model 2 (Figure S2) were less consistent with the estimated Rt.

Figure 2.

Figure 2

Comparison of estimated and predicted effective reproduction numbers for COVID-19, using the model accounting for Google mobility, temperature, and risk awareness.

Comparison between the estimated and predicted effective reproduction numbers (Rt) for COVID-19. The model (Model 3; considering Google mobility, temperature, and risk awareness of COVID-19) was applied to the four training data prefectures of Japan: (A) Tokyo, (B) Osaka, (C) Aichi, and (D) Hokkaido. Yellow bars represent reconstructed numbers of COVID-19 cases by infection time; grey bars show the number of predicted incidences using the conditional forecasting method. Blue lines and shaded areas indicate the estimated Rt values and their 95% confidence intervals using the renewal process and profile likelihood, while purple lines and shaded areas indicate the predicted Rt values and their 95% confidence intervals derived from the Laplace approximation normal distribution. Red shaded areas indicate the training period (i.e. March 15 to May 1, 2020 and July 15 to August 31, 2020) used for estimating the parameters of the model.

Figure 3 shows the test performance of the best model (Model 3), by comparing the estimated and predicted Rt in Fukuoka and Okinawa (test data). With the default value of R0,i offset to 2.50, the inferred values of Rt were consistent with the overall trend of the ground-truth Rt values, indicating the applicability of the proposed model to other geographical settings.

Figure 3.

Figure 3

Comparison of estimated and predicted effective reproduction numbers for COVID-19, using the best model and two other geographic settings (test data) in Japan

(A–B) The number of COVID-19 cases by date of report (yellow bars) from March 15, 2020 to January 12, 2021 in two of the test data prefectures: (A) Fukuoka and (B) Okinawa. Red lines indicate trends of changes in human mobility patterns (the ‘retail and recreation’ category in Google community mobility reports) and the green lines show trends in daily temperatures for each region. Both mobility and temperature were smoothed using 7-day moving averages. (C–D) Comparison between the estimated and predicted effective reproduction number (Rt) for COVID-19, using the best model (Model 3; considering Google mobility, temperature, and risk awareness) and the test data: (C) Fukuoka and (D) Okinawa. Yellow bars represent reconstructed numbers of COVID-19 cases by infection time; grey bars show the number of predicted incidences using the conditional forecasting method. Blue lines and shaded areas indicate the estimated Rt values and their 95% confidence intervals, using the renewal process and profile likelihood, while purple lines and shaded areas are the predicted Rt values and their 95% confidence intervals from the best model and its estimated parameters.

4. Discussion

Our study proposed a simple regression model for predicting the real-time Rt of COVID-19, accounting for human mobility, temperature, and risk awareness. Our analysis suggested that the human mobility pattern was positively associated with COVID-19 transmission, while temperature and risk awareness were negatively associated.

These findings indicate that the reduction in socioeconomic activities and increased level of risk awareness may be linked to a reduction in transmission, highlighting the potential of social distancing interventions and risk communication for controlling the COVID-19 epidemic (Anderson et al., 2020; Heydari et al., 2021). The inverse association between temperature and COVID-19 transmission was also in line with other published papers (Li et al., 2020; Ma et al., 2021; Smith et al., 2021). This finding could be explained by two possible mechanisms. First, cold temperature induces behavioral changes and increases indoor contact, which is associated with the transmission risk of SARS-CoV-2 (McClymont and Hu, 2021). Second, the virus enjoys greater survivability in cold environments (Riddell et al., 2020), as has been the case for other human coronaviruses (Chan et al., 2011; van Doremalen et al., 2013). Although the cumulative number of reported cases during the summer season was higher than that of the winter season in 2020 (Figure 1), this observation should be approached with a caution, considering the mixture of multiple factors, including the increase in physical contact after the lifting of the initial population-wide restrictions). Indeed, the majority of infections in the summer season were reported to be associated with night/town activities (Ministry of Health, Labour and Welfare, 2020c).

Values of Rt predicted by the simple linear regression model fitted to the very limited data points aligned well with the estimates empirically obtained from the time series of COVID-19 incidence, showing a clear emerging signal (Rt> 1) for the second and third waves in Japan. This performance of the proposed model suggests that our framework can provide a plausible proxy of the latest Rt for COVID-19, using readily accessible data, which conventional methods relying on reported incidences are not able to generate in a timely manner, due to the inherent delays. Timely assessment of Rt is essential to inform public health policies, for example those aiming to bring the epidemic under control before the hospital and intensive care unit occupancies reach full capacity. Although abrupt changes in Rt values — presumably induced by temporary local surges of cases (e.g. clusters in hospitals and nursing homes) — could not be fully captured by the proposed model, it was still able to provide a timely signal of changes in Rt before the formal estimates become available.

Despite the overall good performance of the proposed model, our framework over- or underestimated Rt when stringent interventions (e.g. reduced opening hours for restaurants and bars from November 2020) were in place. Although the inclusion of mobility patterns associated with retail and recreation, to represent the physical mixing in high-risk settings, was considered a reasonable choice, such data — with limited temporal and spatial resolution — may not fully reflect the detailed social contact patterns.

It has been suggested that the transmission of COVID-19 involves substantial individual variations, characterized by a highly dispersed offspring distribution (Endo et al., 2020). Thus, stringent control measures were imposed, primarily on settings regarded as high risk (e.g. nightclubs, bars, and restaurants). The resulting changes in detailed contact patterns in those places may not have been fully reflected by the simple summary data for human mobility. Moreover, the digital proxies for human mobility patterns were suggested to be not very informative regarding changes in densities of individuals within high-risk places; this metric may play a crucial role in SARS-CoV-2 transmission (Chang et al., 2021). These limitations of the mobility data may account for the temporary deviations in the prediction.

Our study had some additional limitations. First, the relationship between the number of COVID-19 cases and the degree of risk awareness may change in the long run. Indeed, a decrease in adherence to non-pharmaceutical interventions was reported in the USA from April to November, 2020 (Crane et al., 2021), in spite of the continuously increasing number of reported COVID-19 cases. Second, the upper limit for the effect of risk awareness was rather arbitrarily chosen, and not necessarily theoretically justified. It was assumed that risk awareness affects the transmission risk via personal behavioral changes that are not reflected in the changes in mobility (e.g. wearing a mask or avoiding crowded places during outside visits). It is likely that there is a certain limit to the risk reduction achieved by such behavioral changes, which we incorporated in our model as a prespecified cap. Third, with the roll-out of COVID-19 vaccines, the proposed model might become insufficient for predicting Rt, due to the herd immunity effect conferred by vaccination. Lastly, more accurate prediction may require an extended model that accounts for age-stratified transmission dynamics (e.g. age-specific susceptibility), along with age-specific mobility patterns.

5. Conclusion

Our study suggests that human mobility, temperature, and risk awareness can be integrated into the renewal process to provide timely predictions of the effective reproduction number during the ongoing COVID-19 transmission — i.e. ahead of the formal empirical estimates, which are subject to delays. This would generate essential information for timely planning and assessment of epidemic control measures.

Funding

This research was mainly supported by the Environment Research and Technology Development Fund (JPMEERF20S11804) of the Environmental Restoration and Conservation Agency of Japan. S-m.J received funding from the Japan Society for the Promotion of Science (JSPS) KAKENHI (20J2135800). A.E. is financially supported by the Nakajima Foundation. H.N. received funding from Health and Labor Sciences Research Grants (19HA1003, 20CA2024, and 20HA2007); the Japan Agency for Medical Research and Development (AMED; JP19fk0108104, JP20fk0108140 and JP20fk0108535s0101); JSPS KAKENHI (17H04701 and 21H03198); the GAP fund program of Kyoto University; the Japan Science and Technology Agency (JST) CREST program (JPMJCR1413); and the SICORP program (JPMJSC20U3 and JPMJSC2105). This study was also supported by German Federal Ministry of Health (BMG) COVID-19 research and development funding to the World Health Organization. The funders were not involved in the collection, analysis, or interpretation of the data, the writing of the manuscript, or the decision to submit the work for publication.

Ethical approval and consent to participate

Our study analyzed publicly available data (Ministry of Health, Labour and Welfare, 2021). As such, the datasets used in our study were de-identified and fully anonymized in advance, and the analysis of publicly available data without identity information did not require ethical approval.

Availability of data and materials

The time-series data on COVID-19 were publicly available on the websites of the Ministry of Health, Labour and Welfare (Ministry of Health, Labour and Welfare, 2021). The relevant code for this research work is stored in the GitHub: https://github.com/SungmokJung/Prediction_Rt_COIVD19.

Consent for publication

Not applicable

Conflicts of interest

A.E. received research funding from Taisho Pharmaceutical Co., Ltd.

Declaration of interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

The authors would like to acknowledge the local health centers and the Infectious Disease Surveillance sections of Tokyo Metropolitan Government, Osaka Prefecture, Aichi Prefecture, Hokkaido Prefecture, Fukuoka Prefecture, and Okinawa Prefecture for their work in making time-series data publicly available in real time. The authors also thank Edanz Group (https://en-author-services.edanzgroup.com/ac) for editing a draft of this manuscript.

Footnotes

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.ijid.2021.10.007.

Appendix. Supplementary materials

mmc1.docx (12.9MB, docx)

References

  1. Anderson RM, Heesterbeek H, Klinkenberg D, Hollingsworth TD. How will country-based mitigation measures influence the course of the COVID-19 epidemic? Lancet. 2020;395:931–934. doi: 10.1016/S0140-6736(20)30567-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Buckee CO, Balsari S, Chan J, Crosas M, Dominici F, Gasser U, et al. Aggregated mobility data could help fight COVID-19. Science. 2020;368:145–146. doi: 10.1126/science.abb8021. [DOI] [PubMed] [Google Scholar]
  3. Cabinet Relations Office. Ongoing topics 2021. https://japan.kantei.go.jp/ongoingtopics/_00038.html (accessed February 20, 2021).
  4. Cabinet Relations Office. Advisory committee for COVID-19 2020. https://www.cas.go.jp/jp/seisaku/ful/bunkakai/kongo_soutei_taisaku.pdf (accessed September 19, 2021).
  5. Cazelles B, Comiskey C, Nguyen-Van-Yen B, Champagne C, Roche B. Parallel trends in the transmission of SARS-CoV-2 and retail/recreation and public transport mobility during non-lockdown periods. Int J Infect Dis. 2021;104:693–695. doi: 10.1016/j.ijid.2021.01.067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Chan KH, Peiris JSM, Lam SY, Poon LLM, Yuen KY, Seto WH. The effects of temperature and relative humidity on the viability of the SARS coronavirus. Adv Virol. 2011 doi: 10.1155/2011/734690. 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chang S, Pierson E, Koh PW, Gerardin J, Redbird B, Grusky D, et al. Mobility network models of COVID-19 explain inequities and inform reopening. Nature. 2021;589:82–87. doi: 10.1038/s41586-020-2923-3. [DOI] [PubMed] [Google Scholar]
  8. Crane MA, Shermock KM, Omer SB, Romley JA. Change in reported adherence to nonpharmaceutical interventions during the COVID-19 pandemic, April-November 2020. JAMA. 2021;325:883–885. doi: 10.1001/jama.2021.0286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. van Doremalen N, Bushmaker T, Munster VJ. Stability of Middle East respiratory syndrome coronavirus (MERS-CoV) under different environmental conditions. Euro Surveill. 2013;18 doi: 10.2807/1560-7917.es2013.18.38.20590. [DOI] [PubMed] [Google Scholar]
  10. Endo A, Abbott S, Kucharski AJ, Funk S. Estimating the overdispersion in COVID-19 transmission using outbreak sizes outside China. Wellcome Open Res. 2020;5:67. doi: 10.12688/wellcomeopenres.15842.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Funk S, Camacho A, Kucharski AJ, Lowe R, Eggo RM, Edmunds WJ. Assessing the performance of real-time epidemic forecasts: a case study of Ebola in the Western Area region of Sierra Leone, 2014–15. PLOS Comput Biol. 2019;15 doi: 10.1371/journal.pcbi.1006785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Google. COVID-19 Google community mobility reports 2021. https://www.google.com/covid19/mobility/ (accessed April 20, 2021).
  13. Gostic KM, McGough L, Baskerville EB, Abbott S, Joshi K, Tedijanto C, et al. Practical considerations for measuring the effective reproductive number, Rt. PLOS Comput Biol. 2020;16 doi: 10.1371/journal.pcbi.1008409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Heydari ST, Zarei L, Sadati AK, Moradi N, Akbari M, Mehralian G, et al. The effect of risk communication on preventive and protective behaviours during the COVID-19 outbreak: mediating role of risk perception. BMC Public Health. 2021;21:54. doi: 10.1186/s12889-020-10125-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Höhle M. Surveillance: an R package for the monitoring of infectious diseases. Comput Stat. 2007;22:571–582. doi: 10.1007/s00180-007-0074-8. [DOI] [Google Scholar]
  16. Japan Meteorological Agency. Climatological data 2021. http://www.data.jma.go.jp/gmd/risk/obsdl/index.php (accessed February 20, 2021).
  17. Jung S, Endo A, Kinoshita R, Nishiura H. Projecting a second wave of COVID-19 in Japan with variable interventions in high-risk settings. R Soc Open Sci. 2021;8 doi: 10.1098/rsos.202169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Kishore N, Kiang M V, Engø-Monsen K, Vembar N, Schroeder A, Balsari S, et al. Measuring mobility to monitor travel and physical distancing interventions: a common framework for mobile phone data analysis. Lancet Digit Heal. 2020;2:e622–e628. doi: 10.1016/S2589-7500(20)30193-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Leung K, Wu JT, Leung GM. Real-time tracking and prediction of COVID-19 infection using digital proxies of population mobility and mixing. Nat Commun. 2021;12:1501. doi: 10.1038/s41467-021-21776-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Li H, Xu X-L, Dai D-W, Huang Z-Y, Ma Z, Guan Y-J. Air pollution and temperature are associated with increased COVID-19 incidence: a time series study. Int J Infect Dis. 2020;97:278–282. doi: 10.1016/j.ijid.2020.05.076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Linton NM, Akhmetzhanov AR, Nishiura H. Correlation between times to SARS-CoV-2 symptom onset and secondary transmission undermines epidemic control efforts. MedRxiv. 2021 doi: 10.1101/2021.08.29.21262512. 2021.08.29.21262512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Ma Y, Pei S, Shaman J, Dubrow R, Chen K. Role of meteorological factors in the transmission of SARS-CoV-2 in the United States. Nat Commun. 2021;12:3602. doi: 10.1038/s41467-021-23866-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. McClymont H, Hu W. Weather variability and COVID-19 transmission: a review of recent research. Int J Environ Res Public Health. 2021;18:396. doi: 10.3390/ijerph18020396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Ministry of Health Labour and Welfare. Latest information. 2021 https://www.mhlw.go.jp/stf/houdou/houdou_list_202102.html (accessed April 16, 2021) [Google Scholar]
  25. Ministry of Health, Labour and Welfare. Requests for reducing operation hours from November 2020 2020a. https://www.mhlw.go.jp/stf/newpage_16945.html.
  26. Ministry of Health, Labour and Welfare. Updates on COVID-19 in Japan 2020b. https://www.mhlw.go.jp/content/10900000/000620718.pdf (accessed March 25, 2021).
  27. Ministry of Health, Labour and Welfare. Evaluation report for the latest COVID-19 infections 2020c. https://www.mhlw.go.jp/content/10900000/000649236.pdf (accessed May 11, 2021).
  28. Nishiura H, Klinkenberg D, Roberts M, Heesterbeek JAP. Early epidemiological assessment of the virulence of emerging infectious diseases: a case study of an influenza pandemic. PLoS One. 2009;4 doi: 10.1371/journal.pone.0006852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Nishiura H, Linton NM, Akhmetzhanov AR. Serial interval of novel coronavirus (COVID-19) infections. Int J Infect Dis. 2020 doi: 10.1016/j.ijid.2020.02.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Nouvellet P, Bhatia S, Cori A, Ainslie KEC, Baguelin M, Bhatt S, et al. Reduction in mobility and COVID-19 transmission. Nat Commun. 2021;12:1090. doi: 10.1038/s41467-021-21358-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Park SW, Bolker BM, Champredon D, Earn DJD, Li M, Weitz JS, et al. Reconciling early-outbreak estimates of the basic reproductive number and its uncertainty: framework and applications to the novel coronavirus (SARS-CoV-2) outbreak. J R Soc Interface. 2020;17 doi: 10.1098/rsif.2020.0144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Pequeno P, Mendel B, Rosa C, Bosholn M, Souza JL, Baccaro F, et al. Air transportation, population density and temperature predict the spread of COVID-19 in Brazil. PeerJ. 2020;8:e9322. doi: 10.7717/peerj.9322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Qi H, Xiao S, Shi R, Ward MP, Chen Y, Tu W, et al. COVID-19 transmission in mainland China is associated with temperature and humidity: a time-series analysis. Sci Total Environ. 2020;728 doi: 10.1016/j.scitotenv.2020.138778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Riddell S, Goldie S, Hill A, Eagles D, Drew TW. The effect of temperature on persistence of SARS-CoV-2 on common surfaces. Virol J. 2020;17:145. doi: 10.1186/s12985-020-01418-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Schneider CR, Dryhurst S, Kerr J, Freeman ALJ, Recchia G, Spiegelhalter D, et al. COVID-19 risk perception: a longitudinal analysis of its predictors and associations with health protective behaviours in the United Kingdom. J Risk Res. 2021;24:294–313. [Google Scholar]
  36. Smith TP, Flaxman S, Gallinat AS, Kinosian SP, Stemkovski M, Unwin HJT, et al. Temperature and population density influence SARS-CoV-2 transmission in the absence of nonpharmaceutical interventions. Proc Natl Acad Sci. 2021;118 doi: 10.1073/pnas.2019284118. e2019284118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Ujiie M, Tsuzuki S, Ohmagari N. Effect of temperature on the infectivity of COVID-19. Int J Infect Dis. 2020;95:301–303. doi: 10.1016/j.ijid.2020.04.068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Wang J, Tang K, Feng K, Lin X, Lv W, Chen K, et al. Impact of temperature and relative humidity on the transmission of COVID-19: a modelling study in China and the United States. BMJ Open. 2021;11 doi: 10.1136/bmjopen-2020-043863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. West R, Michie S, Rubin GJ, Amlôt R. Applying principles of behaviour change to reduce SARS-CoV-2 transmission. Nat Hum Behav. 2020;4:451–459. doi: 10.1038/s41562-020-0887-9. [DOI] [PubMed] [Google Scholar]
  40. Xiong C, Hu S, Yang M, Luo W, Zhang L. Mobile device data reveal the dynamics in a positive relationship between human mobility and COVID-19 infections. Proc Natl Acad Sci. 2020;117 doi: 10.1073/pnas.2010836117. 27087 LP –27089. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.docx (12.9MB, docx)

Data Availability Statement

The time-series data on COVID-19 were publicly available on the websites of the Ministry of Health, Labour and Welfare (Ministry of Health, Labour and Welfare, 2021). The relevant code for this research work is stored in the GitHub: https://github.com/SungmokJung/Prediction_Rt_COIVD19.


Articles from International Journal of Infectious Diseases are provided here courtesy of Elsevier

RESOURCES