Abstract
Widespread growth signatures in COVID‐19 confirmed case counts are reported, with sharp transitions between three distinct dynamical regimes (exponential, superlinear, and sublinear). Through analytical and numerical analysis, a novel framework is developed that exploits information in these signatures. An approach well known to physics is applied, where one looks for common dynamical features, independently from differences in other factors. These features and associated scaling laws are used as a powerful tool to pinpoint regions where analytical derivations are effective, get an insight into qualitative changes of the disease progression, and infer the key infection parameters. The developed framework for joint analytical and numerical analysis of empirically observed COVID‐19 growth patterns can lead to a fundamental understanding of infection progression under strong control measures, applicable to outbursts of both COVID‐19 and other infectious diseases.
Keywords: dynamical growth patterns, infections disease modeling, physics and society, scaling of epidemics growth
Widespread growth signatures are reported for COVID‐19 confirmed case counts, with sharp transitions between distinct dynamical regimes. These signatures provide important quantitative information for understanding the disease spread and inferring key infection progression parameters, which can lead to a more fundamental understanding of infection progression under strong control measures.
COVID‐19 pandemic introduced unprecedented worldwide social distancing measures.[ 1 ] While interventions such as quarantine or vaccination have been extensively studied in quantitative epidemiology, effects of social distancing are not well understood,[ 2 , 3 , 4 ] and when addressed, they have been studied only numerically. Unique opportunity to understand these effects has been provided by COVID‐19 tracing through confirmed case counts, active cases and fatalities, in a variety of countries with different demographic and environmental conditions.[ 5 , 6 ] We here show that focusing on analytical and numerical derivations in distinct epidemics growth regimes, is a novel and effective approach in revealing infection progression mechanisms that may be a valuable alternative to detailed numerical simulations.
We start by introducing our COVID‐19 dynamics model. We then extract COVID‐19 count data[ 7 ] and select those countries that systematically trace not only confirmed cases and fatalities, but also active cases (Andorra, Austria, Czechia, Croatia, Cuba, Germany, Israel, New Zealand, Switzerland and Turkey), which allows tight constraint of numerical analysis. We observe three characteristic growth regimes in confirmed case counts, show that our model is well constrained by these regimes for a wide range of countries, and provide an intuitive explanation behind the emergence of such regimes. Our analytical results for the characteristic (inflection and maximum) points of the infective curve will allow to i) explain the nearly constant value of the scaling exponent in the superlinear regime of confirmed counts; ii) understand the relation between the duration of this regime and strength of social distancing; iii) pinpoint changes in the reproduction number from outburst to extinguishing the infection, and iv) constrain the main parameter quantifying the effect of social distancing by analyzing scaling of the infection growth with time in the sublinear regime. The obtained constraints provide a basis for successful analysis of countries that did not continuously track the active cases (here demonstrated for France, Italy, Spain, United Kingdom, and Serbia). We will finally present the key infection parameters inferred through combined analytical and numerical analysis.
We develop a mechanistic model (nonlinear and nonhomogeneous), which takes into account gradual introduction of social distancing (as relevant for most countries’ response), in addition to other important infection progression mechanisms. We start from standard compartments for epidemiological models, that is, susceptible (), exposed (), infective (), and recovered ().[ 2 , 3 , 4 ] To account for social distancing and observable quantities, we introduce additional compartments: protected ()—where individuals effectively move from susceptible category due to social distancing; total number of diagnosed (confirmed and consequently quarantined) cases (), active cases (), and fatalities (). , , and correspond to directly observable (measured) quantities, but are indirect observables of , as only part of infective individuals gets diagnosed, due to a large number of mild/asymptomatic cases.[ 8 ]
We implement the model deterministically, as COVID‐19 count numbers are very high wherever reasonable testing capacities are employed. This makes model analytically tractable, and allows robust parameter inference through combination of analytically derived expressions and tightly constrained numerical analysis, as we show below. Our analysis is applied separately to each country, as the effect of social distancing, initial numbers of infected and exposed cases, diagnosis/detection efficiency and transmission rates may be different. However, within a given country, we do not take into account different heterogeneities−demographic, spatial, population activity, or seasonality effects.[ 2 , 9 , 10 ] Alternatively, global dynamical properties of the outbreak can be analyzed in a probabilistic framework employing partial differential equations in an age‐structured model.[ 11 , 12 ] These can readily be included in our model, but would lead to model structure which is not analytically tractable, so these extensions are left for future work.
Given this, the model equations are:
(1) |
(2) |
(3) |
where is the total population number; ‐the transmission rate; ‐inverse of the latency period; ‐inverse of the infectious period; ‐inverse of the detection/diagnosis period; ‐detection efficiency; —the recovery rate; ‐the mortality rate. Social distancing is included through Equation (1) (second equation), which represents the rate at which the population moves (on average) from susceptible to protected category. The term corresponds to a sigmoidal dependence (similar to Fermi–Dirac function, in quantitative biology known as the Hill function[ 13 ]). Time determines the half‐saturation, so that well before the social distancing is negligible, while well after the rate of transition to the protected category approaches . Parameter (the Hill constant) determines how rapidly the social distancing is introduced, that is, large leads to rapid transition from OFF to ON state, and vice versa.[ 13 ] Equation (3) considers that only a fraction of the infected is diagnosed, so that takes into account the diagnosis and the subsequent quarantine process.
To make the problem analytically tractable, we approximate the Hill function in the first relation of Equation (1) by unit step function, so that after the second term in Equation (1) becomes and dominates over the first term, that is, . We checked that this approximation agrees well with full‐fledged numerical simulations (Figure 1D and Supporting Information). In all comparisons with analytical results, numerical analysis is done with the full model, allowing an independent check of both analytical derivations and employed approximations. Under this assumption, Equations (1) and (2) reduce to:
(4) |
We next introduce two time regions: I) and II) and solve Equations (4) separately within these regions, where corresponding solutions are denoted as and . As in the above expressions always appear together, we further denote .
For , we take , and restrict to dominant (positive) Jacobian eigenvalue, leading to the exponential regime:
(5) |
By shifting , is determined by
(6) |
Equation (6) is highly nontrivial, due to variable coefficient (). By substituting variable it can be shown that Equation (6) reduces to transformed form of Bessel differential equation:[ 14 ]
(7) |
whose general solution for noninteger is given by:
(8) |
where represents Bessel function of the first kind, and are arbitrary constants. In our case , , while is indeed noninteger. If we return to variable, taking into account the following relation between standard and modified () Bessel functions of the first kind:[ 15 , 16 ] , the general solution of Equation (6) reads:
(9) |
To determine , , we use the following boundary conditions: and , where the first derivative in region II has the following expression:
(10) |
In obtaining the expression above, the following identities were frequently used:15, 16 ]
(11) |
After derivations, where the following relation[ 16 ]
(12) |
together with and the identity relating modified Bessel function of the first and second kind are used,[ 15 , 16 ] we finally obtain a surprisingly simple result:
(13) |
where is the modified Bessel function of the second kind.
At maximum and inflection points, and , respectively. After extensive simplification of the results, this leads to (, where is the basic reproduction number in the absence of social distancing:[ 6 , 17 ])
(14) |
(15) |
Equations (14) and (15) have to be solved numerically, but, as and are constants, we, interestingly, obtain that solutions will depend only on . Since, for the analysis of superlinear and sublinear regimes, only the left inflection point and the maximum are important, we will further omit the second solution of Equation (15) (Equation (14) has one solution), and denote , (these two solutions are presented as upper and lower curves on Figure 2C, respectively), so that the effective reproduction numbers at inflection and maximum points ( and ) are:
(16) |
From this follows the length of superlinear regime (between inflection and maximum points):
(17) |
We further Taylor expand around the inflection point:
(18) |
In the superlinear regime , where is the scaling exponent and marks the beginning of this regime. By Taylor expanding around , using Equations (18) and (3):
(19) |
which is always larger than 1, as expected for the superlinear regime. As is localized toward the beginning of the regime, we estimate , where .
Finally, to provide analytical constrain on , we Taylor expand around the maximum:
(20) |
As , we see that the quadratic term in Equation (20) is always negative, that is, curve enters sublinear regime around maximum of the infection. By fitting to in this regime, and by using Equation (20) together with Equation (3), we obtain:
(21) |
which allows to directly constrain .
We first numerically analyze outburst dynamics in the countries that continuously updated[ 18 ] three observable categories (, , and ). For a large majority of countries active cases were either not tracked or were not continuously updated, so the analysis is done for ten countries listed in the outline above.
In the exponential regime, the analytical closed‐form solution is given by Equation (5). From this, and the initial slope of curve (once the number of counts are out of the stochastic regime), can be directly determined, while the corresponding eigenvector sets the ratio of to . The intercept of the initial exponential growth of at sets the product of and . and can also be readily constrained, as from Equation (3), they depend only on integrals of the corresponding counts; here note that . Also,[ 17 , 19 , 20 ] day and day, characterize fundamental infectious process, which we assume not to change between different countries.
Only parameters related with the intervention measures () are left to be inferred numerically, leading to tightly constrained numerical results. For this, we individually performed joint fit to all three observable quantities () for each country. The errors are estimated through Monte‐Carlo[ 21 , 22 ] simulations, assuming that count numbers follow Poisson distribution.
Representative numerical results are shown in Figure 1 for Germany, while other countries are shown in the Supporting Information. In Figure 1A–C (and Supporting Information) we see a good agreement of our numerical analysis with all three classes of the case counts. In Figure 1D, we see sharp transitions between the three growth patterns indicated in the figure: i) exponential growth, observed as a straight line in log–linear plot in Figure 1E; ii) superlinear growth, a straight line in log–log plot in Figure 1F; iii) sublinear growth, a straight line in linear–log plot in Figure 1G.
Transition between the growth patterns can be qualitatively understood from Equation (3), and curve in Figure 1D. The exponential growth has to break after the inflection point of , that is, once the maximum of its first derivative ( in Figure 1D) is reached. In the superlinear regime, confirmed counts case () curve is convex (), so this regime breaks once (dashed blue curve) becomes negative. Equivalently, curve becomes concave (enters sublinear regime) once the maximum of the is reached. Note that the growth of can reemerge if the social distancing measures are alleviated. Our model can account for this by allowing transition from protected back to susceptible category, which is out of the scope of this study, but may improve the agreement with the data at later times (see Figure 1A–C). In addition to this numerical/intuitive understanding, we also showed that we analytically reproduce the emergence of these growth regimes (Equations (6), (14), (15)). Can we also analytically derive the parameters that characterize these regimes?
The exponential regime is straightforward to explain, as described above. The superlinear regime is in between the left inflection point and the maximum of , so that infective numbers grow, but with a decreasing rate. While the derivations are straightforward in the exponential regime, they are highly non‐trivial during the subsequent subexponential (superlinear and sublinear) growth. As the superlinear regime spans the region between the left inflection point () and the maximum (), its duration is given by Equation (17), with dependence, so that weak measures lead to protracted superlinear growth (see Figure 2A). This tendency is also confirmed by independent numerical analysis in Figure 2A, where for each individual country we numerically infer and extract the length of the superlinear regime. Therefore, the duration of the superlinear regime indicates the effectiveness of introduced social distancing.
The scaling exponent of the superlinear regime is given by Equation (19), and shown in Figure 2B, where we predict that all countries are roughly in the same range of (surprisingly, weakly dependent on ), despite significant differences in the applied measures, demographic and environmental factors. This result is (independently from our model) confirmed from case count numbers (the slope in Figure 1F, and equivalently for other countries, see Figure 2B).
How the effective reproduction number changes during this regime, that is, between the left inflection point and the maximum of ? quantifies the average number of secondary cases per infectious case, so that signifies disease outburst, while for the disease starts to be eliminated from the population.[ 17 ] The Equation (16) provides expressions for (at the inflection point) and (at the maximum). Interestingly, from Figure 2C, we observe that and do not depend on and are, respectively, significantly larger and smaller than 1, which shows that transition from infection outburst to extinguishing happens during the superlinear growth. Consequently, the steepness of change over the superlinear regime significantly increases (larger change over smaller time interval, see Figure 2C) with the measure strength.
Finally, in the sublinear regime, in a wide vicinity of maximum (which marks the beginning of the sublinear growth) leading non‐linear term of is cubic (, with negative prefactor). This is consistent with the expansion of around , which has leading negative quadratic () dependence (see Equations (3) and (20)). The ratio between the prefactors in expansion is given by Equation (21), from which we see that can be directly constrained, as shown in Figure 2D. For the ten countries with consistent tracking of , and , we independently numerically determined and compared it with analytical results coming from Equation (21), obtaining an excellent agreement between our derivations and numerical results. The obtained values should be understood as an effective epidemic containment measure—that is, estimating the true result of the introduced measures, which can be used to evaluate the practical effectiveness of the official policies.
To demonstrate how constraining can aid numerical analysis in the cases when is not continuously tracked, we next analyze five additional countries listed in the outline above, so that altogether our study covers majority of COVID‐19 hotspots, which (at the time of this analysis) are close to saturation in confirmed counts. Furthermore, in the specific cases of UK and Italy, where we analytically obtained both very low and very constrained (), we chose five times larger parameter span in in the numerical analysis, to confirm that these low values are indeed preferred by the exhaustive numerical search. For example, the finally obtained for Italy ( and UK (), together with previously obtained agreements shown in Figure 2A–C, strongly confirm that the observed growth patterns provide invaluable information for successful analysis of the infection progression data.
To further illustrate this, the synergy of analytical derivations and numerical analysis presented above enables us to, directly from the publicly available data, infer key infection parameters necessary to assess epidemics risks (provided in Table S1, Supporting Information). We estimate these parameters by the same model/analysis, for a number of diverse countries, allowing their direct comparison. In Figure 3 , we show together case fatality rate (CFR), infected fatality rate (IFR) and infection attack rate (AR).[ 17 , 24 ] CFR is the number of fatalities per confirmed cases. CFR can, in principle, be inferred directly from the data, but since different countries are in different phases of infection, we project forward the number of confirmed cases until a saturation is reached for each country, from which we calculate CFR. IFR (crucial parameter for assessing the risks for infection progression under different scenarios) is the number of fatalities per total number of infected cases, which is a genuine model estimate, due to the unknown total number of infected cases. AR (necessary for understanding the virus recurrence risk) is also determined from our model and provides an estimate of the fraction of the total population that got infected and possibly resistant.
From Figure 3, we see that CFR takes very different values for different countries, from below (New Zealand) to above (France). On the other hand, IFR is consistent with a constant value (the dashed red line in the figure) of . In distinction to IFR, AR also takes diverse values for different countries, ranging from to as high as (though with large errorbars). Although diverse, these AR values are well bellow the classical herd immunity threshold of 60–70%
To summarize, we here developed a novel quantitative framework through which we showed that: i) The emergence of three distinct growth regimes in COVID‐19 case counts can be reproduced both analytically and numerically. ii) Typically, a brief superlinear regime is characterized by a sharp transition from outburst to extinguishing the infection, where effective reproduction number changes from much larger to much smaller than one; more effective measures lead to shorter superlinear growth, and to a steeper change of the effective reproduction number. iii) Scaling exponent of the superlinear regime is surprisingly uniform for countries with diverse environmental and demographic factors and epidemics containment policies; this highly non‐trivial empirical result is well reproduced by our model. iv) Scaling prefactors in the sublinear regime contain crucial information for analytically constraining infection progression parameters, so that they can be straightforwardly extracted through numerical analysis. Interestingly, we found that the number of COVID‐19 fatalities per total number of infected is highly uniform across diverse analyzed countries, in distinction to other (highly variable) infection parameters, and about twice higher than commonly quoted for influenza (0.3–0.4% compared to 0.1–0.2%), which may be valuable for direct assessment of the epidemics risks.
While state‐of‐the‐art approach in epidemiological modeling uses computationally highly demanding numerical simulations, the results above demonstrate a shift of paradigm toward simpler, but analytically tractable models, that can both explain common dynamical features of the system and be used for straightforward and highly constrained parameter inference. This shift is based on a novel framework that relates universal growth patterns with characteristic points of the infective curve, followed by analytical derivations in the vicinity of these points, in an approach akin to those in a number of physics problems. The framework presented here can be, in principle, further extended toward, for example, including stochastic effects or different heterogeneities such as age‐structure. However, these are non‐trivial tasks, and it remains to be seen to what extent the analytical results can be obtained in those more complex models. Overall, as our approach does not depend on any COVID‐19 specifics, the developed framework can also be readily applied to potential outbursts of future infections.
Conflict of Interest
The authors declare no conflict of interest.
Supporting information
Acknowledgements
This work was supported by the Ministry of Education, Science and Technological Development of the Republic of Serbia.
Djordjevic M., Djordjevic M., Ilic B., Stojku S., Salom I., Understanding Infection Progression under Strong Control Measures through Universal COVID‐19 Growth Signatures. Global Challenges 2021, 5, 2000101. 10.1002/gch2.202000101
Contributor Information
Magdalena Djordjevic, Email: magda@ipb.ac.rs.
Marko Djordjevic, Email: dmarko@bio.bg.ac.rs.
Data Availability Statement
The data used in this study are openly available in Worldometer at https://www.worldometers.info/coronavirus/, reference number [7]. Parameters inferred through the analysis are available in the supplementary material of this article.
References
- 1.WHO report. https://www.who.int/emergencies/diseases/novel‐coronavirus‐2019/situation.
- 2. Diekmann O., Heesterbeek H., Britton T., Mathematical Tools for Understanding Infectious Disease Dynamics, Princeton University Press, Princeton, NJ: 2012. [Google Scholar]
- 3. Martcheva M., An Introduction to Mathematical Epidemiology, Springer, Berlin: 2015. [Google Scholar]
- 4. Keeling M. J., Rohani P., Modeling Infectious Diseases in Humans and Animals, Princeton University Press, Princeton, NJ: 2011. [Google Scholar]
- 5. Tian H., Liu Y., Li Y., Wu C. H., Chen B., Kraemer U. G., Li B., Cai J., Xu B., Yang Q., Wang B., Yang P., Cui Y., Song Y., Zheng P., Wang Q., Bjornstad O. N., Yang R., Grenfell B. T., Pybus O. G., Dye C., Science 2020, 368, 638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Chowell G., Sattenspiel L., Bansal S., Viboud C., Physics of Life Reviews 2016, 18, 66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Worldometer 2020. COVID‐19 Coronavirus Pandemic. https://www.worldometers.info/coronavirus/ (accessed: June 2020).
- 8. Day M., BMJ: British Medical Journal 2020, 368, m1165. [DOI] [PubMed] [Google Scholar]
- 9. Wong G. N., Weiner Z. J., Tkachenko A. V., Elbanna A., Maslov S., Goldenfeld N., arXiv:200602036, 2020.
- 10. Dormand J. R., Prince P. J., J. Comput. Appl. Math. 1980, 6, 19. [Google Scholar]
- 11. Vilar J. M., Saiz L.. medRxiv 2020.11.26.20239434, 2020.
- 12. Grassly N. C., Fraser C., Nat. Rev. Microbiol. 2008, 6, 477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Phillips R., Kondev J., Theriot J., Garcia H., Physical Biology of the Cell, Garland Science, New York, NY: 2012. [Google Scholar]
- 14. Bowman F., Introduction to Bessel Functions, Dover Publications, New York, NY: 1958. [Google Scholar]
- 15. Zwillinger D., Standard Mathematical Tables and Formulae, CRC Press, Boca Raton, FL: 1995. [Google Scholar]
- 16. Abramowitz M., Stegun T. A., Handbook of Mathematical Functions (Ed: Abramowitz M.), Dover Publications, New York, NY: 1972. [Google Scholar]
- 17. Bar‐On Y. M., Flamholz A. I., Phillips R., Milo R., eLife 2020, 9, e57309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Dong E., Du H., Gardner L., Lancet Infect. Dis. 2020, 20, 533.32087114 [Google Scholar]
- 19. Li R., Pei S., Chen B., Song Y., Zhang T., Yang W., Shaman J., Science 2020, 368, 489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. He X., Lau E. H. Y., Wu P., Deng X., Wang J., Hao X., Lau Y. C., Wong J. Y., Guan Y., Tan X., Mo X., Chen Y., Liao B., Chen W., Hu F., Zhang Q., Zhong M., Wu Y., Zhao L., Zhang F., Cowling B. J., Li F., Leung G. M., Nat. Med. 2020, 26, 672. [DOI] [PubMed] [Google Scholar]
- 21. Press W. H., Flannery B. P., Teukolsky S. A., Vetterling W. T., Numerical Recipes: The Art of Scientific Computing, Cambridge University Press, Cambridge: 1986. [Google Scholar]
- 22. Cunningham R. W., Comput. Phys. 1993, 7, 570. [Google Scholar]
- 23. Djordjevic M., Djordjevic M., Salom I., Rodic A., Zigic D., Milicevic O., Ilic B., arXiv:2005.09630, 2020.
- 24. Eubank S., Eckstrand I., Lewis B., Venkatramanan S., Marathe M., Barrett C. L., Bull. Math. Biol. 2020, 82, 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Britton T., Ball F., Trapman P., arXiv:2005.03085, 2020.
- 26. Havers F. P., Reed C., Lim T., Montgomery J. M., Klena J. D., Hall A. J., Fry A. M., Cannon D. L., Chiang C.‐F., Gibbons A., Krapiunaya I., Morales‐Betoulle M., Roguski K., Ata Ur Rasheed M., Freeman B., Lester S., Mills L., Carroll D. S., Owen S. M., Johnson J. A., Semenova V., Blackmore C., Blog D., Chai S. J., Dunn A., Hand J., Jain S., Lindquist S., Lynfield R., Pritchard S., et al., JAMA Intern. Med. 2020, 180, 1576. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data used in this study are openly available in Worldometer at https://www.worldometers.info/coronavirus/, reference number [7]. Parameters inferred through the analysis are available in the supplementary material of this article.