Predicting COVID-19 in very large countries: The case of Brazil

V C Parro; M L M Lafetá; F Pait; F B Ipólito; T N Toporcov

doi:10.1371/journal.pone.0253146

. 2021 Jul 1;16(7):e0253146. doi: 10.1371/journal.pone.0253146

Predicting COVID-19 in very large countries: The case of Brazil

V C Parro ^1,^*,^#, M L M Lafetá ^1,^#, F Pait ², F B Ipólito ¹, T N Toporcov ³

Editor: Thippa Reddy Gadekallu⁴

PMCID: PMC8248665 PMID: 34197489

Abstract

This work presents a practical proposal for estimating health system utilization for COVID-19 cases. The novel methodology developed is based on the dynamic model known as Susceptible, Infected, Removed and Dead (SIRD). The model was modified to focus on the healthcare system dynamics, rather than modeling all cases of the disease. It was tuned using data available for each Brazilian state and updated with daily figures. A figure of merit that assesses the quality of the model fit to the data was defined and used to optimize the free parameters. The parameters of an epidemiological model for the whole of Brazil, comprising a linear combination of the models for each state, were estimated considering the data available for the 26 Brazilian states. The model was validated, and strong adherence was demonstrated in most cases.

Introduction

In December 2019, the new coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-Cov-2) was first identified in Wuhan, China. On March 11, the World Health Organization designated COVID-19 as a pandemic. As of August 2020, more than 23 million COVID-19 cases and 80,000 deaths had been reported worldwide [1]. In Brazil, the virus was first identified in São Paulo city on February 26, and the first death occurred in March in Rio de Janeiro. The new cases detected at the beginning of the pandemic largely coincided with Brazilian cities with airport access, with approximately 2 million Brazilians exposed in approximately 20 weeks. As of the 7th week after the first case, the virus had spread to cities without airports, probably via road transport, increasing the population at risk. Within 5 weeks, according to Wesley Cotta/Ministry of Health data [2], all Brazilian states had registered active cases of the disease. In addition to the different characteristics of the states, which have HDI index values ranging from 0.631 for the state of Alagoas to 0.824 for the state of Distrito Federal, different measures were taken to achieve social isolation, implying different courses of the pandemic.

Brazil, with a population of approximately 200 million inhabitants, is composed of 26 states and the federal district. Approximately 30 years ago, the country implemented a universal and decentralized health system (Sistema Único de Saúde) [3]. There was instability in the federal management of the health crisis caused by the pandemic, with a number of changes in the Ministry of Health during a short period of time. States independently made several important decisions for controlling the epidemic, leading to high heterogeneity in the non-pharmacological measures taken to mitigate the pandemic. The utilization of the health system and decisions about isolation guidelines served as a guide for most of the official communications from states (26) and municipalities (5570) in the Brazilian press [4].

Internationally, machine learning has been widely used to predict disease behaviour, to forecast demand for health services, to plan and evaluate measures to reopen quarantined sites [5–7], and also for medical diagnoses [8]. The choice of the best model to forecast demand has been discussed in the literature and remains controversial. COVID-19 is a new disease, and its transmission dynamics and natural history are not yet completely clear; in addition, there is a variable proportion of asymptomatic and mildly symptomatic cases. Those are not notified to health authorities, and although individuals affected with milder cases do not need treatment, they may transmit the disease [9]. The modeling challenge is even greater in large countries with significant inequality and a heterogenous evolution of the pandemic, such as Brazil. This country has high-quality data on COVID-19, which originate from the epidemiological surveillance of acute influenza and respiratory syndromes, and are available electronically.

We propose a modification of the Susceptible, Infected, Removed, and Dead (SIRD) [10–13] model to describe the dynamics of health system usage based on reported cases only, and not on the epidemic as a whole [14]. A comparison between models applicable to CoViD19 can be found in [15], where the authors test eight empirical functions, four methods of statistical inference, and five dynamic models built from variations of the SIR model, all of them with data from the epidemic in China. In their work, the models are compared using the Akaike information criterion (AIC), mean square error (MSE) and robustness index, allowing assessment of overestimation and underestimation. In the specific case of dynamic models, they establish a cost-benefit relationship between model complexity and predictive capacity.

The simplicity of the SIRD model compared to more complete and sophisticated models [16, 17] makes it easier to tune its parameters and simplifies its use by public agents in management and public communication. Only a portion of those infected by COVID-19 will use the health system. It is known that the peak of the SIRD model is dependent on the population considered. This led us to consider a weighting of the total population to estimate the utilization of the public health system. The proposal was validated by applying the model to each Brazilian state. Tuning the modified SIRD model for each state permits a comparative assessment of its main parameters, infection rate and removal rate, in addition to an assessment of the basic reproduction rate R₀ and the effective reproduction rate R_t, all of them relevant parameters for public health management and decision making [18]. The model can be used together with solutions for tracking individuals with the purpose of monitoring the epidemic in contagion and geographic location [19].

The global model for all of Brazil was obtained from a linear combination of the estimated active cases for each state. The main contribution of the novel model developed is the demonstration that the data from Brazil as a whole does not follow a simple SIRD model, and the prediction that the epidemic would intensify in the second half of the year due to the natural risk associated with its presence in different states and locations. We have proposed an algorithm that describes this behaviour. Additional important contributions are the possibility of using the model estimates to predict the infection rate and the reproducibility index for the whole country. These indices, which can be reliably estimated from our model, are important for public management and can be easily communicated to society.

The model

In this section, the machine learning problem is introduced based upon the so-called SIRD model. With this particular model structure, an optimization algorithm using a heuristic search is introduced into the learning algorithm. Additionally, a data-driven optimization technique is introduced as a solution to the susceptible data unavailability problem by introducing a degree of freedom to the algorithm. To start the optimization problem structure, consider the SIRD differential model described by equations Eqs (1)–(4):

\begin{matrix} \frac{d S (t)}{d t} = \frac{- β I (t) S (t)}{P}, \end{matrix}

(1)

\begin{matrix} \frac{d I (t)}{d t} = \frac{β I (t) S (t)}{P} - γ I (t) - μ I (t), \end{matrix}

(2)

\begin{matrix} \frac{d R (t)}{d t} = γ I (t), \end{matrix}

(3)

\begin{matrix} \frac{d D (t)}{d t} = μ I (t), \end{matrix}

(4)

where β, γ and μ are the average number of contacts per person per period of time, the inverse of the number of days required for a person to pass from the infected to the recovered state, and the average number of deaths per period of time. The continuous time series of the susceptible, infected, removed and death are represented respectively by S(t), I(t), R(t) and D(t). Considering that the existing data sets are usually sampled uniformly, there is an advantage of using the discrete representation of the SIRD model, which can be achieved by Eqs (5)–(8), where Δt is the sample time of the data-sets, P is the total population that should be considered, and the discrete time series of the susceptible, infected, removed and death are represented respectively by S(k), I(k), R(k), and D(k).

\begin{matrix} S (k + 1) = S (k) + Δ t (- β I (k) S (k) / P), \end{matrix}

(5)

\begin{matrix} I (k + 1) = I (k) + Δ t (β I (k) S (k) / P - γ I (k) - μ I (k)), \end{matrix}

(6)

\begin{matrix} R (k + 1) = R (k) + Δ t γ I (k), \end{matrix}

(7)

\begin{matrix} D (k + 1) = D (k) + Δ t μ I (k), \end{matrix}

(8)

From model Eqs (5)–(8), determining the mean squared error e(k + 1) of the model from the provided data is straightforward. Therefore it is possible to consider the error equation with an aggregated value for each component, such as the maximum value of each component, resulting in the weighted error given by Eq (9). The complete set of data for each model variable is represented by the vectors S, I, R, and D:

\begin{matrix} e_{p} (k) = \frac{{(S (k) - \tilde{S} (k))}^{2}}{‖ S ‖} + \frac{{(I (k) - \tilde{I} (k))}^{2}}{‖ I ‖} + \frac{{(R (k) - \tilde{S} (k))}^{2}}{‖ R ‖} + \frac{{(D (k) - \tilde{S} (k))}^{2}}{‖ D ‖}, \end{matrix}

(9)

where the components with the upper tilde are the output components of the differential model (5)–(8) for a particular set of parameters β, γ and μ, e.g., the component $\tilde{I} (k)$ is the simulation of Eq (6) for a particular set of parameters at k time instants from the initial sample. From that, it is possible to compute the mean squared error using Eq (10), where N is the number of data samples.

\begin{matrix} MSE = \frac{\sum_{k = 0}^{N} e (k)}{N} \end{matrix}

(10)

The MSE defined could be employed as the cost function for the data-driven problem, but due to high amplitude difference of the model components mean values, the cost function must take into consideration a weighting parameter. This parameter is used to attribute the same importance to the error of each component of the model. This equalizes the importance of all components on the cost function, and enhances the backward and forward stability of the optimization search space.

Notice that to simulate the components, $\tilde{S} (k)$ , $\tilde{I} (k)$ , $\tilde{R} (k)$ , and $\tilde{D} (k)$ , the learning algorithm must solve Eqs (5)–(8) for a particular set of parameters. This is straightforward provided that the initial conditions I(0), R(0), and D(0) are known, as they are related to the size of the population P by

\begin{matrix} P = S (k) + I (k) + R (k) + D (k) . \end{matrix}

(11)

Definition of susceptible component

The fraction of susceptible individuals is usually not available on data-sets. It is usually computed from Eq (11), using the components I(k), R(k), D(k), and the estimated population size P. But this assumption is not quite accurate, as the entire population, P, cannot be considered susceptible, specially in case of COVID-19. The model we propose will fit the data-set when the information provided by is actually the number of people who visited a health care facility, and are then tracked by the data. In the next section, we will discuss an algorithm capable of computing the influence of the susceptible component into the cost function.

The susceptible component is subject to imprecision because it depends on environmental aspects such as isolation, disease health impact, targeted people, and even climate conditions. Several studies suggest that it should not be used in the optimization problem. This is problematic: to predict the behavior of the epidemic, it is necessary to consider the initial value of the susceptible components in (11), the value of the considered population size. For example, we could determine the susceptible component value at the time where (2) is zero. So for that can write (2) when t = t_p resulting in

\begin{matrix} \frac{d I (t_{p})}{d t} = \frac{β S (t_{p}) I (t_{p})}{P} - γ I (t_{p}) - μ I (t_{p}) ≜ 0 \end{matrix}

(12)

where t_p is the instant of t where the peak occurs. This leads to the condition

\begin{matrix} S (t_{p}) = \frac{γ + μ}{β} P \end{matrix}

(13)

which shows that the peak moment t_p depends on the correct selection of the population size, P. Given a susceptible component value computed from (11) and a population size P, the parameters β, μ and γ in (13) are bounded by this particular representation. Now consider that the correct initial value of the susceptible component, S(0), is not the total population, but actually a proportion of it, λP. This happens in scenarios where part of the population is immune or are not impacted by the disease symptoms, and therefore are only carriers. In this particular case it is possible to rewrite (13) as

\begin{matrix} S (t_{p}) = \frac{γ + μ}{β} λ P . \end{matrix}

(14)

Any distortion of the susceptible component, or of the population value, will be acknowledged by the new degree of freedom of the model, λ. The previous parameters are guided by their particular components, (2)–(4), and the existent data-set. The susceptible component can be computed by considering λ from (15). Considering the characteristics of CoViD 19, where not all infectious people seek the health care system, and that our interest is to model the dynamics of the people who use to the health care system, and specially estimating the peak S(t_p), the weighting of the total population λP allows the SIRD model to represent this dynamic:

\begin{matrix} λ P = S (k) + I (k) + R (k) + D (k) . \end{matrix}

(15)

Optimization problem

The optimization problem can be structured in the form

\begin{matrix} {arg}_{{β, γ, μ, λ} \in Ω} min \frac{\sum_{k = 0}^{N} e_{p} (k)}{N} \end{matrix}

(16)

with e_p(k) representing the weighted error at sample k, given by (9), and the component data reference of S(k) being obtained by (15). In the usual formulation of heuristic optimization algorithms, the arguments must be bounded by the search space Ω.

The search argument boundaries determination is straightforward as each parameter presents a physical interpretation in the model.

β is the amount of people one contagious individual infects per time unit;
γ⁻¹ is the amount of time that an infected individual takes to recover;
μ is the proportion of infected people that dies per time unit;
λ is the proportion of the population considered as initially susceptible.

Limits for each of these parameters are given by common sense. Even better is to obtain them numerically, considering the influence of the basic reproduction number, R₀. For this model, R₀ can be obtained from the relation

\begin{matrix} R_{0} = \frac{β λ}{γ} . \end{matrix}

(17)

The basic reproduction number measures the average number of people one contagious person will infect during the contamination period. When R₀ > 1, one person will infect more than one other, and therefore the disease will be capable of self-sustaining growth. Conversely, when R₀ < 1, the disease by itself will not become epidemic. There is a vast literature concerning the value of R₀ for the most common epidemics. We propose the following approach for solving the optimization problem (16): instead of searching for the set of arguments {β, γ, μ, λ}, we search for {R₀, D, μ, λ}, where D = γ⁻¹. The search for R₀ and D is better conditioned then the search for β and γ, as R₀ directly define the existence of the epidemic.

The optimization problem can thus be rewritten in the form

\begin{matrix} {arg}_{{R_{0}, D, μ, λ} \in \bar{Ω}} min \frac{\sum_{k = 0}^{N} e_{p} (k)}{N} \end{matrix}

(18)

where $\bar{Ω}$ is the new search space, considering R₀ and D.

Validation for Brazilian states

Brazil is a large country, and there are several cultural and environmental aspects that make its states diverse. Each state can be treated as an isolated epidemic environment, and we can fit a model for each individual state. Fig 1 shows the data and the model predicted for two distinct Brazilian states, Maranhão and São Paulo. Using all data to fit each model, it is possible to see that in scenarios where the data were collected rigorously and strict isolation was implemented, such as in Maranhão, the algorithm was able to fit the data pattern with high fidelity. Even in scenarios where the data were not collected properly, such as São Paulo, the model was able to visualize the main pattern of the data.

The modified SIRD model exhibited strong adherence to the data for most states with R2 values between 0.99 and 0.82. Fig 1 illustrates the adjustment of the model for the date 7/21/20 for the state of Maranhão, which has a population of approximately 7 million inhabitants (approximately 20 inhabitants per km²) and declared 12 days of lockdown in May 2020; and the state of São Paulo, which has a population of approximately 46 million inhabitants (approximately 166 inhabitants per km²). The basic reproducibility index estimated for Maranhão was R₀ = 3.52 and the basic reproducibility index estimated for São Paulo was R₀ = 4.78.

In Fig 2, the R² values for all states are presented for each scenario: searching for {R₀, D, μ, λ} and searching for {R₀, D, μ} with the previous setting λ = 1. For each state, R² is smaller when λ is selected by the model, i.e., the extra degree of freedom is considered in the population size. Without the reduction of the population considered, there is no set of values for the argument {R₀, D, μ} that can properly fit the data set analysed. Note that this fitness performance of the algorithm is only possible due to the new degree of freedom introduced represented by the parameter λ.

Results from the model

The global model for Brazil is determined via the linear combination of state models. The epidemic curve of active cases, estimated on June 21, 2020, can be analysed in Fig 3. The peak observed in June 2020 is strongly influenced by the peak in the state of São Paulo. The decay of the curve followed by support indicates that Brazil is expected to have a stable number of active cases until September or October 2020 and an increasing number of active cases until the first quarter of 2021. An important finding is that application of the SIRD model to Brazil as a whole (dotted in the figure) results in a different prediction for the case dynamics, indicating control of the epidemic in October 2020. The simulation was carried out on 06/27/2020 and the result clearly demonstrates that the model for Brazil does not follow the SIR model as well. The proposed model shows more realistic behaviour about the duration of the epidemic. Some models mistakenly predicted the end of the epidemic at the end of August 2020 [20].

In Fig 4, the predicted value for use of the Brazilian health system is presented, based on the average proportion of the population that will attend hospitals and health centres throughout the epidemic. In Fig 4, this value is shown as more data were provided to the model, i.e., as the epidemic progressed in the country. The value becomes close to the current (most recent) value of 1.0% of the population on 5/17/20, approximately 1.5 months before the first peak, when the predicted use was 0.8% of the country’s population.

It can also be seen in Fig 4 that the growth of the number of individuals seeking health care and its future predictions have approximately linear behaviour, implying a constant growth rate, which indicates that the number of new cases is stable. This behavior is consistent when analyzed in the light of new cases and especially the number of deaths, approximately constant, which tends to be more reliable. According to the model, this rate should drop starting in mid-October, as shown in Fig 3.

Results by state

One of the challenges of public health management was described as “flattening” the epidemic curve, a way to openly communicate the strategies chosen for this purpose. Fig 5 illustrates the peaks in Brazil, with the first state to reach the peak being Pernambuco and the last states to suffer from the acute phase of the epidemic being Mato Grosso and Mato Grosso do Sul, which are located in the central west, and Paraná, which is located in the south. In Fig 6, the estimated parameters are presented for the ten best adjusted states for the first scenario. From the table, it is possible to see that most values of R₀ stay in this range, except for SC (Santa Catarina), indicating that the inclusion of the λ parameter helps with the parameter bias usually observed in direct optimisations, where it is required that λ = 1.

Fig 6 — (a) Distribution of the estimated recovery rate in days: − D − ${\bar{X}}_{D} = 17.97 \pm σ_{D} = 3.41$ . (b) Distribution of the estimated basic reproductive rate: R₀ − ${\bar{X}}_{R_{0}} = 2.9 \pm σ_{R_{0}} = 0.9$ .

Considering the SIRD model for each state, it is observed that the recovery rate D for individuals who accessed the health system is approximately 17 days and that the basic reproduction rate R₀ is approximately 2.9. Two other relevant parameters are the average mortality rate μ, which is close to 0.8%, which implies an estimated number of deaths of 128, 000 in August 2020. The rate of use of the average health system λ is on the order of 0.6% and may reach 1.0%, which implies an expectation that 2 million people will seek care in the health system. From Fig 6, it is possible to verify some other relevant features. The first is related to the comparison analysis of the parameter D. In the data, a recovered person is not a person considered to no longer be contagious, but rather a person who has been cleared by the hospital as recovered from the disease. Therefore, the state transfer dynamics, from infected to recovered, maps the time in which a person needs to receive health care until they are considered no longer affected by the disease and thus cured. That is why the estimated parameters for D have an average value of 17.97, when it is well known that a person is only contagious during their first week of symptoms. From the data, it was found that COVID-19 has a consistent value for the daily death rate, which has an average value of 0.6%. Compared with the model results, the state that is most off is RJ, with an estimated value of 2.0%. Notice however that this state that has the worst value for R² in Fig 2. This particular problem was caused by a lack of rigorous data collection. The data contain many outliers, and during a long period of time, the data-set was not updated. The synthesis of the results, based on the models by state, can be analysed in Fig 7.

Fig 7 — (a) Distribution of the estimated recovery rate in days: D − ${\bar{X}}_{D} = 17.97 \pm σ_{D} = 3.41$ . (b) Distribution of the estimated basic reproductive rate: R₀ − ${\bar{X}}_{R_{0}} = 2.9 \pm σ_{R_{0}} = 0.9$ . (c) Distribution of the estimated mortality rate: μ − ${\bar{X}}_{μ} = 0.8 % \pm σ_{μ} = 0.2 %$ . (d) Distribution of the proportionality rate of the estimated population: λ − ${\bar{X}}_{λ} = 0.6 % \pm σ_{λ} = 0.4 %$ .

Space-time analysis

To calculate the R_t values for each Brazilian state, we used the predicted number of infected individuals, considering a window of w days. The infection series can be predicted from the likelihood function considering a Poisson process [21, 22]. This procedure can be applied to both raw data and data predicted by the model. The results are illustrated in Fig 8 using a window w = 5 days displaced by 1 day, normalised by the z-score method. A second method, based on the extraction of the β transmission rate, was applied for each state, resulting in a third estimate for control of the epidemic [23]. The results are illustrated in Fig 8. This last method, although probably affected by the fluctuations in the updated data in addition to social isolation factors, is shown to be correlated with the other two. The estimate of the R_t rate from the model works as a filter when compared to the rate obtained from the data window, where the estimate tends to the mean the data and seems suitable for use in the prediction of contagion behavior.

Fig 8 — (a) Maranhão estimated R(t). (b) São Paulo estimated R(t).

As noted in Figs 1 and 8, there is a rapid recovery of the model from variations in the data. In the specific case, the variation was caused by the variation of the data sources and their consolidation, but it is expected that the model recovery will be equally rapid if the change comes from real cases. In this sense, the rates estimated from data and from the model can be used in a combined way for decision making, since they are interpreted on a 1- or 2-week horizon to observe their effects.

When we analyse the temporal and geographical progression of the virus, representing the effective reproductive index R_t for each moment and each state, as shown in Fig 9, a considerable portion of the states are still observed to have indices greater than 1. Other states show a decline but are in the opening process. According to the graphs, it is visible that the epidemic started in the northern and southeastern regions. Then it spread progressively throughout the country but did not progress equally in each state. The southeastern region reaches its peak weeks later than the northern region. Compared to the northern region, the southeastern region maintains high R_t values at present, showing less control of the epidemic. Note that at the moment, there are still three states that have R_t values greater than 1.0, i.e., the epidemic is still growing. An example of this behaviour is the state of Paraná. As previously mentioned, according to state models, Brazil will experience a second smaller peak in September 2020. This second peak has an amplitude mostly determined by infection numbers from the state of Paraná, which has the potential to reach values equivalent to those of São Paulo (predominant state in the amplitude of the first peak).

Fig 9 — Starting in March 2020 (upper left corner) and concluding in September 2020 (lower right corner). The maps were generated in python using the Plotly library [24].

Conclusion and future directions

In conclusion, our modified SIRD model allowed the estimation of the COVID-19 epidemic model for the whole Brazil and may be used in other very large countries, such as the USA, India and Russia. The results obtained and observations during the training and tuning process are compatible with other works [25]. When predicting the future of the pandemic in those countries, it is important that local variation in epidemic stage is accounted in the model to provide accurate results. The use of the composite model to understand the epidemic in Brazil allowed for a more realistic modeling, regarding the predictions of the use of the health system, as well as the average control parameters of the epidemic. We also found that COVID-19 peaked in Brazilian states during periods in which the peak of respiratory diseases also used to occur. At the time of the writing of this paper, June 2020, the values of R₀ and R_t higher than 1 found for Brazilian States and the high values predicted until the last quarter of 2020 suggests that non-pharmacological measures would be needed for months, what turned out to be true. Another aspect that the model brought as evidence of its predictive capacity was the stable level between the months of June and October, with the beginning of a decline in cases after a considerable period of stable number of cases.

As a management element for a country of continental dimensions such as Brazil, the model proved to be effective in providing information to support decision making in the public and private spheres. However, its predictive capacity can be expanded considering aspects of closure and opening of economically active segments of society, such as the proposal studied in [26]. One feature of the model is that it needs to be updated as new data become available daily to improve its estimates. Limitations to be addressed refer to the characterisation of the initial state of the epidemic, prediction of the possibility of new waves, and the inclusion of vaccination processes. In order to improve its capacity, increasing the complexity, we can analyse the impact of the quarantine as in the model tuned in Dynamic-Susceptible-Exposed-Infective-Quarantined (D-SEIQ) [27].

As part of the dissemination strategy for this work, we created a repository that allows interested parties to access all the code described in python as well as the preliminary tuning results. We intend to improve communication with the inclusion of information about vaccination, risk analysis as proposed in [28, 29], and also modifications of the model in the case of incorporating the latency aspect in the transfer, which has gained greater understanding in the light of new observations [30].

Data Availability

The dataset and documentation for this study are available on Kaggle. Dataset: https://www.kaggle.com/marcelolafeta/epidemicmodelsbrazilstateanalysis. Documentation: https://www.kaggle.com/marcelolafeta/covid-brazil-local-predictions-heuristic-learning.

Funding Statement

This work is supported by Instituto Mauá de Tecnologia.

References

1. Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time; 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Cota W. Monitoring the number of COVID-19 cases and deaths in Brazil at municipal federative units level. Scielo Preprints. 2020. [Google Scholar]
3. Massuda A, Hone T, Leles FAG, De Castro MC, Atun R. The Brazilian health system at crossroads: Progress, crisis and resilience. BMJ Global Health. 2018;3(4):1–8. doi: 10.1136/bmjgh-2018-000829 [DOI] [PMC free article] [PubMed] [Google Scholar]
4. de Souza WM, Fletcher Buss L, Candido DdS, Carrera JP, Li S, Zarebski AE, et al. Epidemiological and clinical characteristics of the early phase of the COVID-19 epidemic in Brazil. medRxiv—Imperial College London. 2020;4(April):19. doi: 10.1101/2020.04.25.20077396 [DOI] [PubMed] [Google Scholar]
5.Adam D. Special report: The simulations driving the world’s response to COVID-19; 2020. [DOI] [PubMed]
6. Nikolopoulos K, Punia S, Schäfers A, Tsinopoulos C, Vasilakis C. Forecasting and planning during a pandemic: COVID-19 growth rates, supply chain disruptions, and governmental decisions. European Journal of Operational Research. 2020. doi: 10.1016/j.ejor.2020.08.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Jo H, Son H, Jung SY, Hwang HJ. Analysis of COVID-19 spread in South Korea using the SIR model with time-dependent parameters and deep learning. medRxiv. 2020; p. 2020.04.13.20063412. 10.1101/2020.04.13.20063412 [DOI]
8. Bhattacharya S, Kumar P, Maddikunta R, Pham Qv. Deep learning and medical image processing for coronavirus (COVID-19) pandemic: A survey. 2020;(January). 2020 Sustain Cities Soc. 2021. Feb;65:102589, Epub 2020. 2020. doi: 10.1016/j.scs.2020.102589 [DOI] [PMC free article] [PubMed] [Google Scholar]
9. Ahmad A, Garhwal S, Ray SK, Kumar G, Malebary SJ, Barukab OM. The Number of Confirmed Cases of Covid-19 by using Machine Learning: Methods and Challenges. Archives of Computational Methods in Engineering. 2020(0123456789). doi: 10.1007/s11831-020-09472-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Chen YC, Lu PE, Chang CS, Liu TH. A Time-dependent SIR model for COVID-19 with Undetectable Infected Persons. ArXiv. 2020; p. 1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Tuckwell HC, Williams RJ. Some properties of a simple stochastic epidemic model of SIR type. Mathematical Biosciences. 2007;208(1):76–97. doi: 10.1016/j.mbs.2006.09.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Weiss Sir Ronald Ross H. The SIR model and the Foundations of Public Health. MATerials MATemàtics Volum. 2013;17(3):1887–1097. [Google Scholar]
13. Huang W, Provan G. An improved state filter algorithm for SIR epidemic forecasting. Frontiers in Artificial Intelligence and Applications. 2016;285(August):524–532. doi: 10.3233/978-1-61499-672-9-524 [DOI] [Google Scholar]
14. Bonneville F, Wallinga J, Fiocco M. BAYESIAN FORECASTING OF Specialisation: Statistical Science STATISTICAL SCIENCE Basis huisstijl. BMJ Global Health. 2019. [Google Scholar]
15. Yang W, Zhang D, Peng L, Zhuge C, Hong L. Rational evaluation of various epidemic models based on the COVID-19 data of China. ArXiv. 2020; p. 1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Giordano G, Blanchini F, Bruno R, Colaneri P, Di Filippo A, Di Matteo A, et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Springer; US; 2020; 26. 10.1038/s41591-020-0883-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
17. D’Arienzo M, Coniglio A. Assessment of the SARS-CoV-2 basic reproduction number, R0, based on the early phase of COVID-19 outbreak in Italy. Biosafety and Health. 2020;2(2):57–59. doi: 10.1016/j.bsheal.2020.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Adam D. A guide to R—the pandemic’s misunderstood metric; 2020. [DOI] [PubMed] [Google Scholar]
19.Manoj M, Srivastava G, Somayaji SRK, Gadekallu TR, Maddikunta PKR, Bhattacharya S. An Incentive Based Approach for COVID-19 planning using Blockchain Technology. 2020 IEEE Globecom Workshops, GC Wkshps 2020—Proceedings. 2020. 10.1109/GCWkshps50303.2020.9367469 [DOI]
20.Filho LR, Lichtenthäler DG. A dynamic model for Covid-19 in Brazil. Medrxiv. 2020; p. 1–10. 10.1101/2020.05.10.20097550 [DOI]
21. Chowell G, Hyman JM, Bettencourt LMA, Castillo-Chavez C. Mathematical and statistical estimation approaches in epidemiology. Mathematical and Statistical Estimation Approaches in Epidemiology. 2009; p. 1–363. doi: 10.1007/978-90-481-2313-1_1 [DOI] [Google Scholar]
22. Nishiura H. Correcting the actual reproduction number: A simple method to estimate R0 from early epidemic growth data. International Journal of Environmental Research and Public Health. 2010;7(1):291–302. doi: 10.3390/ijerph7010291 [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Pollicott M, Wang H, Weiss H. Extracting the time-dependent transmission rate from infection data via solution of an inverse ODE problem. Journal of Biological Dynamics. 2012;6(2):509–523. doi: 10.1080/17513758.2011.645510 [DOI] [PubMed] [Google Scholar]
24.Inc PT. Collaborative data science; 2015. https://plot.ly.
25. Roda WC, Varughese MB, Han D, Li MY. Why is it difficult to accurately predict the COVID-19 epidemic? Infectious Disease Modelling. 2020;5:271–281. doi: 10.1016/j.idm.2020.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
26. Liu M, Thomadsen R, Yao S. Forecasting the spread of COVID-19 under different reopening strategies. Scientific Reports. 2020;10(1):1–8. doi: 10.1038/s41598-020-77292-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
27. Sun J, Chen X, Zhang Z, Lai S, Zhao B, Liu H, et al. Forecasting the long-term trend of COVID-19 epidemic using a dynamic model. Scientific Reports. 2020; p. 1–10. doi: 10.1038/s41598-020-78084-w [DOI] [PMC free article] [PubMed] [Google Scholar]
28. Tang B, Bragazzi NL, Li Q, Tang S, Xiao Y, Wu J. An updated estimation of the risk of transmission of the novel coronavirus (2019-nCov). Infectious Disease Modelling. 2020;5:248–255. doi: 10.1016/j.idm.2020.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
29. Chan YWD, Flasche S, Lam TLT, Leung MHJ, Wong ML, Lam HY, et al. Transmission dynamics, serial interval and epidemiology of COVID-19 diseases in Hong Kong under different control measures. Wellcome Open Research. 2020;5:91. doi: 10.12688/wellcomeopenres.15896.2 [DOI] [Google Scholar]
30. Liu Z, Magal P, Seydi O, Webb G. A COVID-19 epidemic model with latency period. Infectious Disease Modelling. 2020;5(11811530272):323–337. doi: 10.1016/j.idm.2020.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0253146.r001

Decision Letter 0

Seyedali Mirjalili

8 Dec 2020

PONE-D-20-34349

Predicting COVID-19 in very large countries: the case of Brazil

PLOS ONE

Dear Dr. Parro,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jan 22 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Seyedali Mirjalili

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.Thank you for stating the following in the Acknowledgments Section of your manuscript:

"The authors would like to thank the Instituto Maua de Tecnologia for funding this work."

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

"The author(s) received no specific funding for this work."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

3. We noted in your submission details that a portion of your manuscript may have been presented or published elsewhere. (preprint on Research square.)

Please clarify whether this [conference proceeding or publication] was peer-reviewed and formally published. If this work was previously peer-reviewed and published, in the cover letter please provide the reason that this work does not constitute dual publication and should be included in the current manuscript.

4. We note that Figure 9 in your submission contain map images which may be copyrighted. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For these reasons, we cannot publish previously copyrighted maps or satellite images created using proprietary data, such as Google software (Google Maps, Street View, and Earth). For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission:

(1) You may seek permission from the original copyright holder of Figure 9 to publish the content specifically under the CC BY 4.0 license.

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

(2) f you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

The following resources for replacing copyrighted map figures may be helpful:

USGS National Map Viewer (public domain): http://viewer.nationalmap.gov/viewer/

The Gateway to Astronaut Photography of Earth (public domain): http://eol.jsc.nasa.gov/sseop/clickmap/

Maps at the CIA (public domain): https://www.cia.gov/library/publications/the-world-factbook/index.html and https://www.cia.gov/library/publications/cia-maps-publications/index.html

NASA Earth Observatory (public domain): http://earthobservatory.nasa.gov/

Landsat: http://landsat.visibleearth.nasa.gov/

USGS EROS (Earth Resources Observatory and Science (EROS) Center) (public domain): http://eros.usgs.gov/#

Natural Earth (public domain): http://www.naturalearthdata.com/

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors presented a logical modified version of numerical model of susceptible, Infected, Removed and dead (SIRD) model to estimate health system use for COVID-19 cases. This realistic machine learning (ML) approach is not only suitable for Brazil but also for other large countries like USA, India and Russia. Moreover, the developed system is accessible on an open platform and the best part is that it is possible to receive a synthesis report, synchronized model and python source code once a standard CSV file is submitted. Optimization algorithm using heuristic search is also introduced into the learning algorithm.

Overall, great work has been presented by the authors. The beauty of the proposed work is its simplicity that is a boost for the use by public agents in management and communication. When compared with other models, it is simpler to tune the model’s parameters. It allows all relevant parameters for public management and decision making such as its main parameters, infection rate, removal rate, basic reproduction rate R_0 and effective reproduction rate R_t to do a comparative assessment. The authors successfully validated the model on 26 Brazilian states.

Comments:

The topic is interesting however the study has few flaws and requires minor revisions in order to improve the study in the following ways:

Graphical/Tabular performance analysis of the proposed work with other existing ML models

Inclusion of limitation of the model

Conclusion may be further elaborated (page 18)

Check on the repetition of the same line twice where ∆t and P is mentioned (2nd paragraph, page 9)

Reviewer #2: The work presents a new proposal for estimating health system use for COVID-19 cases for the whole Brazil. The paper seems good, well-described and structured. However, the comments and the requirements given below should be addressed before accepting the paper

1 There are some grammar and spelling mistakes in the paper, the language quality should be enhanced.

2 In section 2 on page 3 (Section the model), the sentence (where ∆t is the sample time of the data-sets, and P is the total population that should be considered for each case study) is repeated. The repeated sentence must be removed.

3 All symbols in equations should be clarified.

4 The paper needs to add a review section (Related works) and include some well-established works on predicting COVID-19 that done until now.

5 Add some suggestions for further works and change the title of "conclusion" section to "conclusion and future directions".

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Yassine Meraihi

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Jul 1;16(7):e0253146. doi: 10.1371/journal.pone.0253146.r002

Author response to Decision Letter 0

25 Feb 2021

All reviewers questions were discussed in the document Response to reviewers .pdf

Attachment

Submitted filename: Response to Reviewers.pdf

Click here for additional data file.^{(147.3KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0253146.r003

Decision Letter 1

Thippa Reddy Gadekallu

11 Apr 2021

PONE-D-20-34349R1

Predicting COVID-19 in very large countries: the case of Brazil

PLOS ONE

Dear Dr. Parro,

Based on the reviewer's and my own suggestions, I recommend major revisions for this paper.

Please submit your revised manuscript by May 26 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Thippa Reddy Gadekallu

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Reviewer #2: The work presents a new proposal for estimating health system use for COVID-19 cases for the whole Brazil. The paper seems good, well-described and structured. I accept the paper

Reviewer #3: 1. The English language in the paper is loose in some instances. The paper needs a thorough proofread.

2. There are several long sentences in the paper. The authors can break them down into smaller sentences.

3. List out the main contributions of the paper.

4. Some of the recent works such as the following can be discussed in the paper: "Deep learning and medical image processing for coronavirus (COVID-19) pandemic: A survey, An Incentive Based Approach for COVID-19 planning using Blockchain Technology".

5. Discuss about the limitations of the present work.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: Yes: Yassine Meraihi

Reviewer #3: No

PLoS One. 2021 Jul 1;16(7):e0253146. doi: 10.1371/journal.pone.0253146.r004

Author response to Decision Letter 1

14 May 2021

All answers and comments were included at the Response to reviewers file.

Attachment

Submitted filename: Response to Reviewers.pdf

Click here for additional data file.^{(76.6KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0253146.r005

Decision Letter 2

Thippa Reddy Gadekallu

31 May 2021

Predicting COVID-19 in very large countries: the case of Brazil

PONE-D-20-34349R2

Dear Dr. Parro,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Thippa Reddy Gadekallu

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #3: Yes

**********

6. Review Comments to the Author

Reviewer #3: The authors have addressed all the comments and incorporated all my suggestions. The paper can be accepted for publication.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #3: No

PLoS One. doi: 10.1371/journal.pone.0253146.r006

Acceptance letter

Thippa Reddy Gadekallu

23 Jun 2021

PONE-D-20-34349R2

Predicting COVID-19 in very large countries: the case of Brazil

Dear Dr. Parro:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Thippa Reddy Gadekallu

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Attachment

Submitted filename: Response to Reviewers.pdf

Click here for additional data file.^{(147.3KB, pdf)}

Attachment

Submitted filename: Response to Reviewers.pdf

Click here for additional data file.^{(76.6KB, pdf)}

Data Availability Statement

[pone.0253146.ref001] 1. Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time; 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref002] 2. Cota W. Monitoring the number of COVID-19 cases and deaths in Brazil at municipal federative units level. Scielo Preprints. 2020. [Google Scholar]

[pone.0253146.ref003] 3. Massuda A, Hone T, Leles FAG, De Castro MC, Atun R. The Brazilian health system at crossroads: Progress, crisis and resilience. BMJ Global Health. 2018;3(4):1–8. doi: 10.1136/bmjgh-2018-000829 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref004] 4. de Souza WM, Fletcher Buss L, Candido DdS, Carrera JP, Li S, Zarebski AE, et al. Epidemiological and clinical characteristics of the early phase of the COVID-19 epidemic in Brazil. medRxiv—Imperial College London. 2020;4(April):19. doi: 10.1101/2020.04.25.20077396 [DOI] [PubMed] [Google Scholar]

[pone.0253146.ref005] 5.Adam D. Special report: The simulations driving the world’s response to COVID-19; 2020. [DOI] [PubMed]

[pone.0253146.ref006] 6. Nikolopoulos K, Punia S, Schäfers A, Tsinopoulos C, Vasilakis C. Forecasting and planning during a pandemic: COVID-19 growth rates, supply chain disruptions, and governmental decisions. European Journal of Operational Research. 2020. doi: 10.1016/j.ejor.2020.08.001 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref007] 7.Jo H, Son H, Jung SY, Hwang HJ. Analysis of COVID-19 spread in South Korea using the SIR model with time-dependent parameters and deep learning. medRxiv. 2020; p. 2020.04.13.20063412. 10.1101/2020.04.13.20063412 [DOI]

[pone.0253146.ref008] 8. Bhattacharya S, Kumar P, Maddikunta R, Pham Qv. Deep learning and medical image processing for coronavirus (COVID-19) pandemic: A survey. 2020;(January). 2020 Sustain Cities Soc. 2021. Feb;65:102589, Epub 2020. 2020. doi: 10.1016/j.scs.2020.102589 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref009] 9. Ahmad A, Garhwal S, Ray SK, Kumar G, Malebary SJ, Barukab OM. The Number of Confirmed Cases of Covid-19 by using Machine Learning: Methods and Challenges. Archives of Computational Methods in Engineering. 2020(0123456789). doi: 10.1007/s11831-020-09472-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref010] 10. Chen YC, Lu PE, Chang CS, Liu TH. A Time-dependent SIR model for COVID-19 with Undetectable Infected Persons. ArXiv. 2020; p. 1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref011] 11. Tuckwell HC, Williams RJ. Some properties of a simple stochastic epidemic model of SIR type. Mathematical Biosciences. 2007;208(1):76–97. doi: 10.1016/j.mbs.2006.09.018 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref012] 12. Weiss Sir Ronald Ross H. The SIR model and the Foundations of Public Health. MATerials MATemàtics Volum. 2013;17(3):1887–1097. [Google Scholar]

[pone.0253146.ref013] 13. Huang W, Provan G. An improved state filter algorithm for SIR epidemic forecasting. Frontiers in Artificial Intelligence and Applications. 2016;285(August):524–532. doi: 10.3233/978-1-61499-672-9-524 [DOI] [Google Scholar]

[pone.0253146.ref014] 14. Bonneville F, Wallinga J, Fiocco M. BAYESIAN FORECASTING OF Specialisation: Statistical Science STATISTICAL SCIENCE Basis huisstijl. BMJ Global Health. 2019. [Google Scholar]

[pone.0253146.ref015] 15. Yang W, Zhang D, Peng L, Zhuge C, Hong L. Rational evaluation of various epidemic models based on the COVID-19 data of China. ArXiv. 2020; p. 1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref016] 16. Giordano G, Blanchini F, Bruno R, Colaneri P, Di Filippo A, Di Matteo A, et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Springer; US; 2020; 26. 10.1038/s41591-020-0883-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref017] 17. D’Arienzo M, Coniglio A. Assessment of the SARS-CoV-2 basic reproduction number, R0, based on the early phase of COVID-19 outbreak in Italy. Biosafety and Health. 2020;2(2):57–59. doi: 10.1016/j.bsheal.2020.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref018] 18. Adam D. A guide to R—the pandemic’s misunderstood metric; 2020. [DOI] [PubMed] [Google Scholar]

[pone.0253146.ref019] 19.Manoj M, Srivastava G, Somayaji SRK, Gadekallu TR, Maddikunta PKR, Bhattacharya S. An Incentive Based Approach for COVID-19 planning using Blockchain Technology. 2020 IEEE Globecom Workshops, GC Wkshps 2020—Proceedings. 2020. 10.1109/GCWkshps50303.2020.9367469 [DOI]

[pone.0253146.ref020] 20.Filho LR, Lichtenthäler DG. A dynamic model for Covid-19 in Brazil. Medrxiv. 2020; p. 1–10. 10.1101/2020.05.10.20097550 [DOI]

[pone.0253146.ref021] 21. Chowell G, Hyman JM, Bettencourt LMA, Castillo-Chavez C. Mathematical and statistical estimation approaches in epidemiology. Mathematical and Statistical Estimation Approaches in Epidemiology. 2009; p. 1–363. doi: 10.1007/978-90-481-2313-1_1 [DOI] [Google Scholar]

[pone.0253146.ref022] 22. Nishiura H. Correcting the actual reproduction number: A simple method to estimate R0 from early epidemic growth data. International Journal of Environmental Research and Public Health. 2010;7(1):291–302. doi: 10.3390/ijerph7010291 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref023] 23. Pollicott M, Wang H, Weiss H. Extracting the time-dependent transmission rate from infection data via solution of an inverse ODE problem. Journal of Biological Dynamics. 2012;6(2):509–523. doi: 10.1080/17513758.2011.645510 [DOI] [PubMed] [Google Scholar]

[pone.0253146.ref024] 24.Inc PT. Collaborative data science; 2015. https://plot.ly.

[pone.0253146.ref025] 25. Roda WC, Varughese MB, Han D, Li MY. Why is it difficult to accurately predict the COVID-19 epidemic? Infectious Disease Modelling. 2020;5:271–281. doi: 10.1016/j.idm.2020.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref026] 26. Liu M, Thomadsen R, Yao S. Forecasting the spread of COVID-19 under different reopening strategies. Scientific Reports. 2020;10(1):1–8. doi: 10.1038/s41598-020-77292-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref027] 27. Sun J, Chen X, Zhang Z, Lai S, Zhao B, Liu H, et al. Forecasting the long-term trend of COVID-19 epidemic using a dynamic model. Scientific Reports. 2020; p. 1–10. doi: 10.1038/s41598-020-78084-w [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref028] 28. Tang B, Bragazzi NL, Li Q, Tang S, Xiao Y, Wu J. An updated estimation of the risk of transmission of the novel coronavirus (2019-nCov). Infectious Disease Modelling. 2020;5:248–255. doi: 10.1016/j.idm.2020.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0253146.ref029] 29. Chan YWD, Flasche S, Lam TLT, Leung MHJ, Wong ML, Lam HY, et al. Transmission dynamics, serial interval and epidemiology of COVID-19 diseases in Hong Kong under different control measures. Wellcome Open Research. 2020;5:91. doi: 10.12688/wellcomeopenres.15896.2 [DOI] [Google Scholar]

[pone.0253146.ref030] 30. Liu Z, Magal P, Seydi O, Webb G. A COVID-19 epidemic model with latency period. Infectious Disease Modelling. 2020;5(11811530272):323–337. doi: 10.1016/j.idm.2020.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Predicting COVID-19 in very large countries: The case of Brazil

V C Parro

M L M Lafetá

F Pait

F B Ipólito

T N Toporcov

Roles

Abstract

Introduction

The model

Definition of susceptible component

Optimization problem

Validation for Brazilian states

Fig 1. Comparison of predictions using the estimated models, with each state real data.

Fig 2. R2 value comparing the predictions with the real data of each state and the reliability metric that shows the proportion of the data that is reliable to use on the learning algorithm.

Results from the model

Fig 3. Comparison of predictions using the estimated models with real data for each state.

Fig 4. Prediction of the proportion of the population to attend to health care systems in Brazil.

Results by state

Fig 5. Peak evolution of the epidemic in the Brazilian states illustrated in its temporal sequence of occurrence.

Fig 6. Distributions of the estimated parameters D, R0, μ and λ, for all Brazilian states based on the state that presented the first peak.

Fig 7. Distributions of the estimated parameters D, R0, μ e λ, accumulated since the beginning of the pandemic, for all Brazilian states.

Space-time analysis

Fig 8. Comparison of standard scaled Rt from different estimate algorithms.

Fig 9. Estimated values of Rt for each state during the epidemic period in Brazil.

Conclusion and future directions

Data Availability

Funding Statement

References

Decision Letter 0

Seyedali Mirjalili

Roles

Author response to Decision Letter 0

Decision Letter 1

Thippa Reddy Gadekallu

Roles

Author response to Decision Letter 1

Decision Letter 2

Thippa Reddy Gadekallu

Roles

Acceptance letter

Thippa Reddy Gadekallu

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Fig 2. R² value comparing the predictions with the real data of each state and the reliability metric that shows the proportion of the data that is reliable to use on the learning algorithm.

Fig 6. Distributions of the estimated parameters D, R₀, μ and λ, for all Brazilian states based on the state that presented the first peak.

Fig 7. Distributions of the estimated parameters D, R₀, μ e λ, accumulated since the beginning of the pandemic, for all Brazilian states.

Fig 8. Comparison of standard scaled R_t from different estimate algorithms.

Fig 9. Estimated values of R_t for each state during the epidemic period in Brazil.