Leveraging dynamics informed neural networks for predictive modeling of COVID-19 spread: a hybrid SEIRV-DNNs approach

Cheng Cheng; Elayaraja Aruchunan; Muhamad Hifzhudin Noor Aziz

doi:10.1038/s41598-025-85440-1

. 2025 Jan 15;15:2043. doi: 10.1038/s41598-025-85440-1

Leveraging dynamics informed neural networks for predictive modeling of COVID-19 spread: a hybrid SEIRV-DNNs approach

Cheng Cheng ¹, Elayaraja Aruchunan ², Muhamad Hifzhudin Noor Aziz ^1,^✉

PMCID: PMC11735935 PMID: 39814760

Abstract

A dynamics informed neural networks (DINNs) incorporating the susceptible-exposed-infectious-recovered-vaccinated (SEIRV) model was developed to enhance the understanding of the temporal evolution dynamics of infectious diseases. This work integrates differential equations with deep neural networks to predict time-varying parameters in the SEIRV model. Experimental results based on reported data from China between January 1, and December 1, 2022, demonstrate that the proposed dynamics informed neural networks (DINNs) method can accurately learn the dynamics and predict future states. Our proposed hybrid SEIRV-DNNs model can also be applied to other infectious diseases such as influenza and dengue, with some modifications to the compartments and parameters in the model to accommodate the related control measures. This approach will facilitate improving predictive modeling and optimizing public health intervention strategies.

Keywords: COVID-19, Transmission dynamics, Neural networks, DINNs

Subject terms: Infectious diseases, Computational science

Introduction

In the rapidly evolving COVID-19 pandemic, traditional epidemiological models have struggled to account for dynamic factors such as new variants and shifting public health policies^1,2. The emergence of the novel coronavirus SARS-CoV-2, the causative agent of COVID-19, was initially identified in December 2019 in Wuhan, Hubei province, China³. The virus quickly precipitated a global health emergency. Within a month of the first reported case, the virus had manifested in several countries, compelling the World Health Organization (WHO) to declare it a public health emergency of international concern by the end of January 2020⁴. As the virus disseminated globally, its trajectory was punctuated by the emergence of several variants. The Alpha variant (B.1.1.7), first identified in the United Kingdom, marked the beginning of a perilous phase of rapid global spread⁵. Then the Delta variant (B.1.617.2) detected in India became predominant, its heightened transmissibility exacerbating the pandemic’s second wave⁶. The relentless evolution of the virus saw further iterations with the Beta (B.1.351) and Gamma (P.1) variants, each contributing to the complex epidemiological landscape. However, the Omicron variant (B.1.1.529), first reported in South Africa in November 2021, underscored the pandemic’s unpredictable nature with its multitude of mutations and potential for immune evasion⁷. Over the span of 3 years, these variants have challenged public health interventions, testing the resilience of global health systems and the adaptability of scientific responses⁸.

The application of mathematical models in the COVID-19 pandemic has been paramount in dissecting the mechanisms of spread, forecasting epidemiological trends, evaluating the efficacy of control interventions, and optimizing containment strategies. Central to these models are data fitting and parameter estimation, which are foundational yet critical components for understanding and simulating the dynamics of disease transmission^9,10. Amid the real-world scenario of shifting control policies, vaccination rollouts, and the emergence of new viral variants, the temporality of parameters becomes a significant consideration. Analyzing factors that influence public health behaviours, such as individuals’ intentions to receive COVID-19 vaccine boosters, is crucial. Understanding these factors can guide efforts to increase booster vaccination rates, which remains essential for effective pandemic control¹¹. Capturing the temporal variability in disease transmissibility is essential to accurately assess the effectiveness of non-pharmaceutical interventions (NPIs) or vaccination campaigns¹². Traditional mechanism-based models typically predefine specific transmission (or contact) rate functions to simulate a declining trend, alongside another distinct function that captures the increasing trend to track significant events that enhance or mitigate the impact of NPIs or the vaccination timelines¹³.

However, to accommodate multiple epidemic trajectories, models require the integration of more specific functions and parameters. The limited information available in datasets and constraints in the speed of response to adverse indicators undoubtedly pose a substantial challenge to data fitting and parameter estimation, simultaneously amplifying the issue of parameter unidentifiability. This remains a critical issue that the field currently faces¹⁴. The complexity of modeling is further exacerbated by the need to continuously adapt and fine-tune models to reflect the evolving landscape of the pandemic. As new data emerge and our understanding of the virus deepens, iterative model refinements are necessary. This iterative process includes incorporating the impact of behavioural changes, government-imposed restrictions, and biological factors such as immunity waning and reinfection rates¹⁵. The real-time recalibration of models is vital to maintain their relevance and predictive accuracy as the pandemic progresses. Moreover, researchers have turned to advanced computational techniques and machine learning algorithms to enhance the robustness of parameter estimation. These methodologies allow for the exploration of vast parameter spaces and can identify patterns that might not be immediately evident through traditional means¹⁶.

Deep neural networks (DNNs) have permeated a wide array of disciplines, revolutionizing approaches in data mining, natural language processing, pattern recognition, medical diagnostics, fault diagnosis, and genomics^17–21. Within infectious disease modeling, DNNs present distinct advantages, effectively capturing the complex, nonlinear patterns associated with disease transmission. Neural networks have shown superior accuracy over traditional univariate time series models in forecasting healthcare metrics, such as COVID-19 case numbers and mortality²². A study in Pakistan found that artificial neural networks outperformed other time series models in predicting COVID-19 deaths and recoveries, highlighting the accuracy and usefulness of neural network models in healthcare forecasting²³. A pivotal attribute of DNNs is their universal approximation property, which allows them to approximate the state variables of mechanistic models, such as differential equations, and their time-dependent parameters²⁴. The adoption of DNNs in the domain of infectious disease modeling offers a paradigm shift from traditional analysis, providing a robust framework for capturing the complex biological, behavioral, and social factors that drive epidemic spread²⁵. This is particularly relevant in the context of COVID-19, where the virus’s transmission dynamics are influenced by a myriad of factors including governmental policies, community responses, and the intrinsic properties of the virus itself²⁶. Moreover, the integration of DNNs allows for a more nuanced understanding of the temporal evolution of the pandemic, accommodating shifts in virus dynamics due to mutations, and changes in population behavior due to interventions such as lockdowns and mask mandates²⁷. Such an approach is invaluable in informing the strategic deployment of resources, timing of intervention measures, and the potential necessity for vaccine deployment and modification in the face of viral evolution²⁸. The DINNs model’s capacity for real-time learning from data can provide forecasts that are critical for decision-makers, offering insights into the efficacy of current interventions and the necessity for their alteration or continuation²⁹.

The central research question this study addresses revolves around the application of DINNs in capturing the complexity of infectious disease spread, particularly in the context of the COVID-19 pandemic. We develop a novel DINNs framework that integrates compartmental models with the predictive power of deep learning to estimate time-dependent epidemiological parameters accurately. This is particularly significant in capturing the dynamic nature of COVID-19 spread as it factors in the temporal changes in policy interventions, behavioral adaptations, and viral evolution.

Mini-review of related works

The intersection of traditional infectious disease modeling and machine learning offers a transformative approach to understanding and predicting the spread of diseases. Traditional epidemiological models, such as the Susceptible-Infected-Recovered (SIR) and its derivatives, have provided foundational insights into disease dynamics by segmenting populations based on disease status³⁰. These compartmental models, which are grounded in differential equations, have been instrumental in forecasting disease trajectories and assessing the potential impact of public health interventions³¹. However, a significant limitation of SIR and SEIR models is their assumption of static parameters, which makes them ill-suited for capturing real-time changes in transmission dynamics due to evolving factors such as policy interventions, vaccination rollouts, and the emergence of new viral variants^32,33. These models often fail to adapt to the dynamic nature of pandemics, thereby reducing their predictive accuracy and applicability in rapidly changing environments.

The advent of machine learning, especially deep learning techniques¹⁸, has markedly improved conventional epidemic models through sophisticated parameter estimation and long-term trend forecasting. Deep neural networks (DNNs) are essential for modelling intricate transmission dynamics, facilitating the estimate of time-dependent parameters that are difficult to capture using traditional methods³⁴. Utilising their layered design, DNNs proficiently capture non-linear correlations in extensive datasets³⁵, hence enhancing the calibration and forecasting precision of epidemiological models³⁶. Furthermore, ensemble approaches that amalgamate many machine learning models have exhibited enhanced robustness and accuracy in predicting infectious disease outbreaks³⁷. The integration of mechanistic models with deep neural networks (DNNs) encompasses parameter inference, whereby sophisticated methods like Bayesian optimisation enable a thorough investigation of parameter spaces, hence improving model calibration to empirical data³⁸. This hybrid modelling technique enhances the prediction capability of conventional models and delivers profound insights into epidemic dynamics, therefore furnishing more dependable decision-support tools for public health managers³⁹.

Deep neural networks (DNNs) has significantly advanced infectious disease modeling by enabling direct data-driven approaches that capture complex biological processes relying on predefined equations⁴⁰. Recent studies have demonstrated the efficacy of recurrent neural networks (RNNs) and long short-term memory (LSTM) networks in modeling the temporal dynamics of COVID-19, effectively predicting its spread by learning from sequential case data⁴¹. DNNs excel in identifying intricate, non-linear relationships within large datasets, thereby enhancing the precision and predictive capabilities of epidemiological analyses⁴². Unlike traditional compartmental models such as the susceptible-exposed-infected-recovered (SEIR) framework, which assume homogeneous population mixing and often overlook complex social behaviors and networks⁴³, DNNs can account for heterogeneities and the stochastic nature of real-world interactions that drive disease transmission⁴⁴. Additionally, DNNs facilitate parameter estimation in complex disease dynamics, providing estimates for transmission and recovery rates, as well as the impacts of interventions⁴⁵. Real-time prediction models continuously update with new data, enabling the assessment of public health measures as they are implemented⁴⁶. Their ability to integrate diverse data sources-including clinical data, mobility patterns, and social media trends-allows for a comprehensive analysis of factors influencing disease spread⁴⁷. The flexibility of DNN architectures permits their adaptation to various modeling tasks, from forecasting case numbers to evaluating non-pharmaceutical interventions⁴⁸.

This study proposes a novel approach that leverages the power of machine learning, specifically deep neural networks (DNNs), to unravel the intricate epidemiological dynamics often obscured in the study of COVID-19 spread. The proposed method, termed as SEIRV-dynamics-informed neural networks, incorporates prior knowledge from the SEIRV model into the loss functions of DNNs as a physical regularizer. This innovative approach extends beyond the conventional SIR model by including additional compartments for exposed individuals (E) and vaccinated populations (V), enabling a more comprehensive representation of the complex dynamics at play. To assess the efficacy of the SEIRV-dynamics-informed neural networks, the method is first tested against synthetic data generated from the SEIRV model under various scenarios. Subsequently, the approach is validated using real-world COVID-19 data reported from China between January 1, 2022, and December 1, 2022, sourced from the COVID-19 Data Repository maintained by Johns Hopkins University. The incorporation of the SEIRV model’s ordinary differential equations (ODEs) into the DNN’s learning process allows for an efficient and accurate approximation of the underlying complex dynamics.

Methodology

Dynamic model

The SEIRV model is an epidemiological model that extends the classic SEIR model by incorporating vaccination dynamics to account for the mitigating effects of vaccine deployment in a population. Each compartment in the model represents a different stage of how the disease spreads through a population. The susceptible (S) group includes people who could potentially catch the infection. The exposed (E) group consists of those who have come into contact with the virus but are not yet able to spread it. The infectious (I) group includes those who can transmit the disease to others. The recovered (R) group represents people who have had the infection, recovered, and are now immune. Finally, the vaccinated (V) group includes individuals who have been vaccinated, which lowers their chance of getting infected. The addition of the vaccinated class allows public health officials and researchers to simulate and understand the impact of vaccination strategies on the spread of an infectious disease. Recent research employed the SEIRV model to assess the influence of vaccination on the COVID-19 pandemic⁴⁹. The corresponding differential equations for the SEIRV model are given by Eq. (1):

In these equations, Inline graphic represents the total population and is defined as the sum of all compartmental populations, . The parameter denotes the transmission rate of the disease, is the rate at which exposed individuals become infectious, is the recovery rate, is the vaccination rate, causing vaccinated individuals to become susceptible again⁵⁰.

Figure 1 presents the flowchart of the SEIRV model, illustrating the dynamic interactions among the different compartments: Susceptible (S), Exposed (E), Infectious (I), Recovered (R), and Vaccinated (V). Vaccination (V) serves as a critical mechanism in shaping population dynamics, facilitating the transition of individuals from the Susceptible (S) compartment to the Vaccinated (V) compartment at a rate Inline graphic . By reducing the number of individuals vulnerable to infection, vaccination significantly curtails the transmission potential of the disease.

Fig. 1 — Flowchart of the SEIRV epidemiological model. This diagram illustrates the core components and dynamic transitions between compartments in the SEIRV model, including susceptible (S), exposed (E), infected (I), recovered (R), and vaccinated (V) populations. Arrows between the compartments represent how individuals transition due to disease transmission, recovery, or vaccination.

DNN architecture

The architecture of deep neural networks (DNNs) is inspired by the biological neural networks that constitute animal brains. A DNN is composed of multiple layers of artificial neurons or nodes, each designed to perform specific transformations on the data. The typical architecture includes an input layer, several hidden layers, and an output layer^17,51. The performance of DNNs can be attributed to their depth, which allows the networks to learn complex patterns and hierarchies of features. The deeper layers can create abstract representations of the input data, making DNNs highly effective for tasks such as image and speech recognition, natural language processing, and more^52,53. Combining deep neural networks (DNNs) with the SEIRV epidemiological model can be done to enhance the model’s predictive capability. The universal approximation theorem, which states that a feed-forward network with a single hidden layer containing a finite number of neurons can approximate continuous functions on compact subsets of Inline graphic , justifies the use of DNNs for such tasks⁵⁴.

In our implementation, we utilize a DNN architecture with five hidden layers. The choice of five layers strikes a balance between model complexity and computational efficiency, providing sufficient depth to capture the nonlinear dynamics of the SEIRV epidemiological model without leading to overfitting. For activation functions, we have chosen the Rectified Linear Unit (ReLU) as the primary activation function in our hidden layers. ReLU is favored due to its computational efficiency and ability to mitigate the vanishing gradient problem, which is prevalent in deeper networks when using activation functions like sigmoid or hyperbolic tangent (tanh). The ReLU function allows for faster convergence during training and helps maintain strong gradients, facilitating effective learning in deep architectures⁵⁵. Additionally, we have incorporated L2 regularization to prevent overfitting by penalizing large weights, thus enhancing the generalization capability of our model. We also included dropout layers, which randomly deactivate a subset of neurons during training, further preventing overfitting and promoting a robust model that generalizes well to new data..

The training of DNNs is typically done through backpropagation, an algorithm that calculates the gradient of the loss function with respect to the network’s weights, adjusting them to minimize the loss¹⁸. For optimizing the parameters of the DNNs, algorithms such as stochastic gradient descent (SGD), Adam, and RMSprop are often used. These optimizers adjust the weights to minimize a loss function, typically a form of mean squared error (MSE) when dealing with continuous output like Inline graphic ⁵⁶.

To apply DNNs to the SEIRV model, one might construct a DNN for the SEIRV Model: This network would aim to simulate the dynamics of the SEIRV model itself, learning from the data how the different compartments interact over time. The network would be trained simultaneously with the same objective: to minimize the difference between the predicted values and the actual data on the spread of the disease.

DINNs model

The coupling strategy and parameter-sharing mechanism in integrating deep neural networks (DNNs) with the SEIRV model involves using the DNNs to numerically solve the system of ordinary differential equations (ODEs). Using one set Inline graphic , , , for the state variables. The DNN for solving the SEIRV model, outputs approximations of the compartments, aiming to solve the ODEs given the current estimates of from . This methodology effectively creates a feedback loop. The networks share parameters through this feedback mechanism, leading to a coherent model that aligns with the underlying disease spread dynamics. Figure 2 illustrates the architecture of the Dynamics Informed Neural Networks (DINNs), which comprises an input layer, fully connected hidden layers with nonlinear activation functions, and an output layer. The network updates the state of neurons across layers via weight matrices Inline graphic and biases . Dedicated neural networks are deployed to infer the dynamic parameters, observed states, and unobserved states at each discrete time .

Fig. 2 — Schematic diagram of the dynamics informed neural networks. This model is designed for simultaneous data fitting and solution of ordinary differential equations (ODEs) associated with the SEIRV epidemiological framework and incorporates two deep neural networks (DNNs) to optimize the model parameters and solve the ODEs by minimizing the mean squared error (MSE). The training process persists until either a predefined error threshold is achieved or a maximum number of iterations is completed.

The loss function for the DINNs framework can be designed to comprise two main components: the model loss, which ensures the outputs from the neural network adhere to the SEIRV differential equations, and the data loss, which quantifies the discrepancies between the network outputs and the actual epidemiological data. This dual-component loss function is pivotal for the network to learn both the dynamics of disease transmission and fit the empirical data.

The model loss ( Inline graphic ) ensures that the predictions from the neural networks for the susceptible , exposed (), infected (), recovered () and vaccinated () adhere to the dynamics described by the SEIRV differential equations. The data loss () could be defined using mean squared error (MSE) for various epidemiological data points such as daily new cases ( Inline graphic ), cumulative cases (), daily vaccinations (), and cumulative vaccinations (). The total loss function is then a weighted sum of these two losses:

As described in the model, the data loss calculation is given by which takes into account both the new cases and the vaccination data.

The model loss ( Inline graphic ) for the SEIRV model in the context of a deep neural network would satisfy the following system of differential equations:

The loss function L could be designed to minimize the error between the observed data and the model predictions over time. Typically, this would be achieved by using the sum of squared errors (SSE) between the observed compartmental values and those predicted by the model:

where Inline graphic represents the set of all model parameters. are the observed values of the compartments at time t. are the values predicted by the neural network model at time t. T is the total number of time points at which data are available.

Equation (6) is the model loss can be formulated as the sum of the squares of the residuals for these equations:

where the sum runs over the time steps, and the ellipsis (...) stands for similar terms for the Inline graphic , , , , and compartments.

Experiments and results

This section first outlines the process for generating synthetic data using the SEIRV model, followed by the preprocessing of the reported data from China. Subsequently, the architecture and specifics of the physics-informed neural networks developed for these scenarios are detailed. Finally, the efficacy of the proposed DINNs is evaluated using the testing datasets.

Data preparation

We obtained COVID-19 data for China from the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE)⁵⁷. This dataset includes daily reported cases, daily deaths, cumulative reported cases, and cumulative deaths. Vaccination data were sourced from “Our World in Data”, encompassing daily vaccination numbers as well as cumulative vaccination totals⁵⁸.

The reliability and cleanliness of reported data are critical for the predictability of compartmental models, especially in terms of parameter estimation. However, due to issues such as misreporting and delays in reporting, the data available is considerably noisy. The data reported from China exhibits significant differences between weekdays and weekends, attributed to a substantial reduction in testing volume during weekends. Consequently, the ability to make valuable estimates from the available reported data is limited. To further process outliers and missing values, we use the interquartile range (IQR) method to identify outliers. Data points below Inline graphic or above are considered outliers⁵⁹. To ensure data consistency, outliers and missing data points are processed by averaging the values immediately adjacent to the mean. This method takes into account the information on both sides of the missing value, better balances possible fluctuations, and reduces the deviation during the filling process. Subsequently, to remove dimensionality, we applied min-max normalization to scale the data within the [0,1] range.

where Inline graphic is the normalized value, is the original value, is the minimum value in the dataset, and is the maximum value in the dataset. Prior to data analysis, we preprocessed the dataset by applying a 7-day moving average window to eliminate the serrated variations due to weekday-weekend cycles in the epidemic reporting. As illustrated in Fig. 3, the reported data from different compartments, post-moving window preprocessing, appear significantly smoother and less noisy compared to the original data.

Fig. 3 — Comparison of reported data before and after preprocessing. Each subplot corresponds to cumulative cases, daily cases, and daily vaccinations respectively. Black dots represent the raw, unprocessed data, while the green solid line illustrates the data smoothed via a moving window.

Model training and predictions

We implement the dynamics informed neural networks (DINNs) method using the Python programming language and the TensorFlow framework⁶⁰. Each neural network developed in this study consists of 5 layers. The weights and biases of each layer, denoted as Inline graphic and respectively, are structured as follows: and Table 1 lists the parameters and their corresponding values used in the epidemiological models.

Table 1.

Parameters and their values used in the epidemiological and neural network models.

Parameter	Description	Value
	Rate of transition from exposed to infectious	⁶¹
	Recovery rate of infectious individuals	⁶¹
	Rate at which exposed individuals become infectious	0.1781³⁴
	Vaccination rate	Fitted
	Transmission rate	Fitted
	Vaccine onset period	14 days⁶²
	Incubation period	3.4 days⁶³

Open in a new tab

The neural network architecture employed in our experiments is designed to take a single scalar input, the time Inline graphic . Each hidden layer of the network is characterized by weight matrices , where denotes the position of the start node and indicates the position of the end node. The non-linear activation function applied at each hidden node is the hyperbolic tangent function, defined as 8:

For the output nodes, a sigmoid activation function is utilized to normalize the outputs corresponding to Inline graphic , and . The sigmoid function is given by 9:

The neural network consists of 5 hidden layers, each containing 50 neurons. The Adam optimization algorithm⁵⁶, as implemented in the TensorFlow package, was selected to train the network. The initial learning rate was set to Inline graphic , with training conducted over 10,000 epochs. To mitigate the risk of overfitting and to enhance the generalization capability of the model, a learning rate decay mechanism was incorporated, reducing the learning rate by 95% every 2000 epochs. The entire training process spans approximately 7 min for 10,000 epochs over all the training data.

By utilizing the model, we can predict future trends of the epidemic. The modeling results can provide reliable feedback for future decision-making by authorities. To evaluate the fitting performance of the proposed DINNs method against observed data from January 1, to December 1, 2022, we visualize the training data in Fig. 4. Figure 4a represents the daily data of new infections, while Fig. 4b shows the daily vaccination data. The DINNs model demonstrates significant accuracy in aligning with the general epidemic pattern and well accommodates the observed data points. The fitted curve effectively represents the three major epidemic peaks throughout this era. Nevertheless, the model exhibits minor discrepancies, which are ascribed to abrupt policy alterations or unexpected changes in population behavior that the model parameters cannot entirely include. Notwithstanding these slight discrepancies, the model effectively encapsulates many waves of infections, illustrating its resilience and flexibility to intricate epidemic patterns. Furthermore, as shown in Fig. 5, the value of the loss function ensures the convergence of the proposed DINNs method for the SEIRV partition model.The loss steadily decreases, indicating successful convergence of the DINNs model. Employing a learning rate decay method to diminish the learning rate by 95% every 2000 epochs alleviates the risk of overfitting and enhancing the model’s generalization capability.

Fig. 4 — Fitted curves for (a) daily new infections and (b) daily vaccination data. The realized depiction displays the actual data, whilst the dotted line illustrates the outcome of the fit.

Fig. 5 — Loss function during the training process. The figure illustrates the trajectory of the loss value as the DINNs model undergoes training epochs. A steady decline in the loss indicates successful convergence, while stabilization suggests that the model has effectively minimized prediction errors.

We tested the predictive capability of the proposed method by forecasting the outbreak of COVID-19 in China. The compartmental model based on ordinary differential equations (ODEs) requires the determination of initial conditions and model parameters to make predictions. Since the initial conditions can be obtained from the training data and the model parameters have already been calibrated, we can predict the epidemic dynamics by executing the forward process.

Model evaluation performance

Moreover, to quantitatively evaluate the performance of the proposed DINNs method, we employed five evaluation metrics for fair and effective comparison: mean absolute error (MAE), mean squared error (MSE), root mean squared error (RMSE), mean absolute percentage error (MAPE) and Inline graphic . MAE directly interprets the average error in the same units as the data, making it easy to understand typical prediction deviations. MSE and RMSE are sensitive to large errors, highlighting significant deviations between the model and the actual value, which is crucial for identifying potential outbreaks or sudden changes in vaccination rates. MAPE expresses the prediction accuracy as a percentage, which facilitates comparison of the performance of different scales of infection or vaccination data. The Inline graphic metric assesses the proportion of the variance in the observed data that is explained by the model, providing insight into the overall fit and predictive power. The calculation methods are shown in Eqs. (10), (11), (12), (13), and (14). .

where Inline graphic and represent the actual and predicted values, respectively, and is the total number of data points. The MAE, MSE, RMSE, MAPE, and MAPE values are calculated for the predicted daily new cases and daily vaccinations, providing a comprehensive evaluation of the proposed DINNs method’s performance.

Experimental results as represented in Table 2 show the highly accurate forecasting capability of the proposed method. The findings indicate that the DINNs model has significant predictive accuracy, particularly in the Inline graphic and intervals, characterized by low MAE, MAPE, RMSE, and MSE values, with elevated scores. The RMSE of 0.0118 and MSE of 0.0001 for the infectious (I) category further underscore the model’s precision in representing the dynamics of the infectious population. The RMSE of 0.0129 and MSE of 0.0001 for the vaccine (V) are minimal, underscoring the model’s proficiency in reliably forecasting vaccination trends. The elevated error rate in the susceptible ( Inline graphic ) can be ascribed to the intrinsic difficulty of modeling susceptible populations, influenced by several factors including behavioral modifications, governmental interventions, and disparate immunity rates. Furthermore, Fig. 6 displays scatter plots that compare the predicted versus actual values for the infectious ( Inline graphic ) and vaccinated () compartments. The black dots represent observed data points, while the solid red line signifies the ideal fit where predictions perfectly match observations. The high scores for both compartments highlight a strong correlation between the model’s predictions and the actual data, thereby validating the DINNs model’s effectiveness in accurately capturing the complex dynamics of COVID-19 spread and vaccination efforts. These scatter plots demonstrate that the model’s predictions are consistently close to the real-world data, affirming its reliability for forecasting infectious disease trends.

Table 2.

Metrics for SEIRV compartments.

Compartment	MAE	MAPE (%)	RMSE	MSE
S	0.3087	1.50	0.3738	0.1397	0.8389
I	0.0085	0.12	0.0118	0.0001	0.9973
V	0.0084	0.16	0.0129	0.0001	0.9971

Open in a new tab

Fig. 6 — Scatter plots demonstrating the predicted versus actual results for infectious () and vaccine () compartments. Black dots represent observed values, while the solid red line illustrates the ideal fit line. The scores highlight the goodness of fit to the observed data.

Inline graphic — Scatter plots demonstrating the predicted versus actual results for infectious () and vaccine () compartments. Black dots represent observed values, while the solid red line illustrates the ideal fit line. The scores highlight the goodness of fit to the observed data.

Computational complexity

Given the complexity of the DINNs model, we analyzed its computational complexity. During the training phase, the computational complexity of the DINNs model primarily depends on the depth and width of the neural network. We conducted experiments with models of varying depths and widths to evaluate their computational complexity. Table 3 presents the final loss values for different model configurations. We observed that changes in model depth and width had minimal impact on the final loss values. This indicates that the DINNs model has relatively low computational complexity, making it suitable for training and prediction on large-scale datasets.

Table 3.

Final loss values for different model depths and widths.

Width	Depth
Width	5	10	15
30	8.19e−4	4.36e−4	1.06e−3
40	8.86e−4	3.24e−4	3.21e−4
50	3.06e−4	3.44e−4	3.04e−4

Open in a new tab

To assess the computing requirements of the DINNs method, we contrasted it with other SEIRV model parameter estimate techniques, including the nonlinear least squares method. Table 4 summarizes the comparison between DINNs and traditional SEIRV parameter estimation methods in terms of training time, memory usage, and inference time. The experimental results demonstrate that although DINNs incur higher training time and memory consumption, they exhibit comparable or superior performance during the inference phase. The traditional model’s faster inference time is primarily due to its pre-set parameters, which lack the flexibility to adapt to the dynamic changes of multi-peak epidemics. Conversely, DINNs dynamically estimate parameters, enabling a better fit to the complex epidemic curves observed in real-world scenarios.

Table 4.

Computational Complexity Comparison between DINNs and Traditional SEIRV Parameter Estimation Methods.

Metric	DINN	Traditional model
Training time (s)	431.78	0.10
Memory usage (MB)	255.31	107.34
Inference time (s)	0.0021	0.0023

Open in a new tab

Subsequently, we performed a sensitivity analysis on the parameters Inline graphic and to assess the robustness of the DINNs model to parameter variations. Figure 7 below illustrates the effect of different parameter changes on the final loss values. We found that variations in these parameters resulted in significant fluctuations in the loss values, indicating that the DINNs model is sensitive to parameter changes. Therefore, precise estimation of these parameters is crucial in practical applications to ensure the accuracy and robustness of the model.

Fig. 7 — Quantitative Sensitivity Analysis of DINNs Model Parameters. Effects of , , and Variations on Model Outputs.

The parameter Inline graphic represents the decline in immune effect after vaccination and the further transition to exposure, and the parameter controls the speed at which infected people recover and develop immunity. The parameter represents the rate at which exposed people become infected and further determines the length of the incubation period. A higher value of Inline graphic signifies a more rapid decline in the vaccine’s immunological response, resulting in an increased population of vulnerable individuals, intensifying system instability, complicating model training, and diminishing forecast accuracy. Conversely, augmenting (the recovery rate of infected individuals) and Inline graphic (the conversion rate from the incubation period) can diminish transmission risk, streamline the transmission process, expedite the achievement of equilibrium, and enhance the model’s capacity to accurately depict the dynamics of infectious diseases, thereby increasing precision. Consequently, augmenting Inline graphic and while diminishing can substantially decrease the system’s complexity, allowing the DINNs to more precisely represent the disease transmission process aligned with the biological traits of COVID-19.

Discussions and conclusions

During the COVID-19 pandemic, vaccination and control measures became primary methods for combating the spread of the virus, playing a decisive role in understanding the dynamics of the outbreak in China. Thus, quantifying vaccination rates and estimating their impacts remains challenging. Our study introduces a novel approach called the dynamics informed neural network (DINNs), which combines the advantages of ordinary differential equation (ODE) models with deep learning to enhance the predictive capabilities of traditional compartmental models, such as SEIRV. The DINNs framework leverages the universal approximation properties of deep neural networks to estimate time-varying epidemiological parameters. By coupling two subnetworks-one for inferring key parameters like vaccination rates ( Inline graphic ) and the other for solving SEIRV differential equations-DINNs can understand the temporal evolution of the epidemic from a data-driven perspective.

Notably, the DINNs algorithm addresses some limitations of traditional dynamical models in simulating COVID-19. For instance, in the SEIR model, vaccination rates and transmission rates are typically set as constants^64,65. To fit multi-stage epidemic spreads, intervals are often required for infection peaks, which significantly limits the performance of traditional dynamical models. In contrast, our proposed DINNs model effectively resolves this issue by integrating the universal approximation theorem⁶⁶ of neural networks with traditional dynamical models, and deriving information on contact rates and vaccination rates from daily new case numbers. In our sensitivity analysis, we observed that variations in neural network depth and width had a minimal impact on the final loss values. This suggests that the DINNs model has relatively low computational complexity, making it suitable for training and prediction on large-scale datasets. However, the predictions of the DINNs model are highly sensitive to initial conditions and parameter estimates. Inaccurate or ambiguous beginning data might disseminate mistakes throughout the model, thereby compromising the trustworthiness of long-term forecasts. The model’s success is contingent upon the availability of high-quality, comprehensive data. In reality, data may be missing, delayed, or influenced by reporting biases, which might negatively impact the model’s accuracy and robustness. Furthermore, although the DINNs methodology adeptly captures the dynamics of transmission rates and vaccination impacts, it may inadequately consider other influencing factors, such as behavioral modifications, the emergence of novel variants, or regional disparities, unless supplemented with additional data sources or increased model complexity. Subsequent study has to investigate methods to alleviate these constraints, including the incorporation of real-time data assimilation approaches, the enhancement of the model’s adaptability to novel variations, and the integration of multi-source data to augment the model’s generalizability and robustness. Upon examining the model’s sensitivity to parameter variations, we observe that alterations in these parameters significantly affect the loss value. An elevation in Inline graphic will result in a hastened rate of immune deterioration and heightened system instability; a reduction in and will precipitate an expedited rate of infection and further complicate the model, demonstrating that the DINNs model is highly responsive to parameter variations, aligning with the biological attributes of actual COVID-19. Consequently, precise estimation of these parameters is essential for maintaining the model’s accuracy and resilience in actual applications.

Applying DINNs to COVID-19 data from China demonstrates its exceptional performance in fitting epidemic curves, including cumulative and daily cases, as well as vaccination data. Quantitative evaluation metrics such as MAE, MAPE, MSE, RMSE, and Inline graphic further validate the high accuracy and stability of DINNs. Additionally, during the early stages of the outbreak, DINNs was able to predict the progression of the epidemic by utilizing daily new case numbers, thus guiding vaccination trends. Integrating deep learning into epidemiological models opens new avenues for data-driven decision-making in public health. Traditional statistical models based on partial differential equations often struggle to adapt to increasingly complex real-world scenarios, such as the emergence of new variants and dynamic control strategies. DINNs harnesses the power of machine learning to accurately describe epidemiological parameters and predict epidemic trajectories, offering a promising solution. Graph neural networks (GNNs) can proficiently represent the intricate interrelations across various geographical regions, allowing DINNs to encapsulate the spatial variability of disease transmission. This integration can enhance forecast accuracy by including aspects such as population movement, alterations in regional policy, and local epidemic dynamics. Utilizing the benefits of GNNs for graph-structured data processing, DINNs models can improve their capacity to depict and forecast the spatiotemporal progression of infectious illnesses, thereby offering more nuanced insights for targeted public health actions. Future research may explore combining DINNs with graph neural networks to account for spatiotemporal factors like population movement.

Author contributions

C.C: Designed the study and wrote the main article, E.A and M.H.N.A: Writing-review and editing. All authors reviewed the manuscript.

Data availability

All data used in this study are publicly available and can be accessed from the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE) and Our World in Data^57,58.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Calistri, A., Roggero, P. F. & Palù, G. Chaos theory in the understanding of COVID-19 pandemic dynamics. Gene 148334 (2024). [DOI] [PubMed]
2.Hu, S. et al. Races of small molecule clinical trials for the treatment of COVID-19: An up-to-date comprehensive review. Drug Dev. Res.83, 16–54 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med.382, 727–733 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
4.World Health Organization. Who timeline - COVID-19. https://www.who.int/news/item/27-04-2020-who-timeline---COVID-19 (2020). Accessed: 20 Apr 2024.
5.Andrew, R. Preliminary genomic characterisation of an emergent sars-cov-2 lineage in the UK defined by a novel set of spike mutations. Virological (2020).
6.Dhar, M. S. et al. Genomic characterization and epidemiology of an emerging sars-cov-2 variant in Delhi, India. Science374, 995–999 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Tegally, H. et al. Detection of a sars-cov-2 variant of concern in south Africa. Nature592, 438–443 (2021). [DOI] [PubMed] [Google Scholar]
8.Harvey, W. T. et al. Sars-cov-2 variants, spike mutations and immune escape. Nat. Rev. Microbiol.19, 409–424 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Vespignani, A. et al. Modelling COVID-19. Nat. Rev. Phys.2, 279–281 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Aziz, M. H. N. et al. Modelling the effect of vaccination program and inter-state travel in the spread of COVID-19 in Malaysia. Acta. Biotheor.71, 2 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Zhu, Q., Gao, Y., Hu, Q., Hu, D. & Wu, X. A study on the factors influencing the intention to receive booster shots of the COVID-19 vaccine in China based on the information frame effect. Front. Public Health12, 1258188 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Holmdahl, I. & Buckee, C. Wrong but useful–what COVID-19 epidemiologic models can and cannot tell us. N. Engl. J. Med.383, 303–305 (2020). [DOI] [PubMed] [Google Scholar]
13.Thompson, R. N. et al. Key questions for modelling COVID-19 exit strategies. Proc. R. Soc. B287, 20201405 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Bretó, C., He, D., Ionides, E. L. & King, A. A. Time series analysis via mechanistic models. Ann. Appl. Stat. 319–348 (2009).
15.Kucharski, A. J. et al. Early dynamics of transmission and control of COVID-19: A mathematical modelling study. Lancet. Infect. Dis.20, 553–558 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Giordano, G. et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nat. Med.26, 855–860 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
17.LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature521, 436–444 (2015). [DOI] [PubMed] [Google Scholar]
18.Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (MIT Press, 2016). [Google Scholar]
19.Jordan, M. I. & Mitchell, T. M. Machine learning: Trends, perspectives, and prospects. Science349, 255–260 (2015). [DOI] [PubMed] [Google Scholar]
20.Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature542, 115–118 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Angermueller, C., Pärnamaa, T., Parts, L. & Stegle, O. Deep learning for computational biology. Mol. Syst. Biol.12, 878 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Aljaaf, A. J., Mohsin, T. M., Al-Jumeily, D. & Alloghani, M. A fusion of data science and feed-forward neural network-based modelling of COVID-19 outbreak forecasting in Iraq. J. Biomed. Inform.118, 103766 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Alshanbari, H. M. et al. On the implementation of the artificial neural network approach for forecasting different healthcare events. Diagnostics13, 1310 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst.2, 303–314 (1989). [Google Scholar]
25.Liu, D., Ding, W., Dong, Z. S. & Pedrycz, W. Optimizing deep neural networks to predict the effect of social distancing on COVID-19 spread. Comput. Ind. Eng.166, 107970 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Kermack, W. O. & McKendrick, A. G. A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond. Ser. A Contain. Pap. A Math. Phys. Charact.115, 700–721 (1927). [Google Scholar]
27.Ferguson, N. M. et al. Report 9: Impact of Non-pharmaceutical Interventions (NPIs) to Reduce COVID19 Mortality and Healthcare Demand Vol. 16 (Imperial College London, 2020). [Google Scholar]
28.Moore, S., Hill, E. M., Tildesley, M. J., Dyson, L. & Keeling, M. J. Vaccination and non-pharmaceutical interventions for COVID-19: A mathematical modelling study. Lancet. Infect. Dis.21, 793–802 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Kissler, S. M., Tedijanto, C., Goldstein, E., Grad, Y. H. & Lipsitch, M. Projecting the transmission dynamics of sars-cov-2 through the postpandemic period. Science368, 860–868 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Anderson, R. M. & May, R. M. Infectious Diseases of Humans: Dynamics and Control (Oxford University Press, 1991). [Google Scholar]
31.Hethcote, H. W. The mathematics of infectious diseases. SIAM Rev.42, 599–653 (2000). [Google Scholar]
32.Bozzani, F. M. Incorporating Feasibility in Priority Setting: A Case Study of Tuberculosis Control in South Africa. Ph.D. thesis, London School of Hygiene & Tropical Medicine (2021).
33.Thompson, J. & Wattam, S. Estimating the impact of interventions against COVID-19: From lockdown to vaccination. PLoS ONE16, e0261330 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
34.He, M., Tang, S. & Xiao, Y. Combining the dynamic model and deep neural networks to identify the intensity of interventions during COVID-19 pandemic. PLoS Comput. Biol.19, e1011535 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Samek, W., Montavon, G., Lapuschkin, S., Anders, C. J. & Müller, K.-R. Explaining deep neural networks and beyond: A review of methods and applications. Proc. IEEE109, 247–278 (2021). [Google Scholar]
36.Rahmadani, F. & Lee, H. Hybrid deep learning-based epidemic prediction framework of COVID-19: South Korea case. Appl. Sci.10, 8539 (2020). [Google Scholar]
37.Dietterich, T. G. Ensemble methods in machine learning. In International Workshop on Multiple Classifier Systems 1–15 (Springer, 2000).
38.Tian, Y., Luthra, I. & Zhang, X. Forecasting COVID-19 cases using machine learning models. MedRxiv 2020–07 (2020).
39.Ghafouri-Fard, S. et al. Application of machine learning in the prediction of COVID-19 daily new cases: A scoping review. Heliyon7 (2021). [DOI] [PMC free article] [PubMed]
40.Ning, X., Jia, L., Wei, Y., Li, X.-A. & Chen, F. Epi-dnns: Epidemiological priors informed deep neural networks for modeling COVID-19 dynamics. Comput. Biol. Med.158, 106693 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Chimmula, V. K. R. & Zhang, L. Time series forecasting of COVID-19 transmission in Canada using lstm networks. Chaos Solitons Fractals135, 109864 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Lou, H.-R., Wang, X., Gao, Y. & Zeng, Q. Comparison of Arima model, DNN model and lstm model in predicting disease burden of occupational pneumoconiosis in Tianjin, China. BMC Public Health22, 2167 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Kong, L. et al. Compartmental structures used in modeling COVID-19: A scoping review. Infect. Dis. Poverty11, 72 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Cao, L. & Liu, Q. COVID-19 modeling: A review. ACM Comput. Surv.57, 10.1145/3686150 (2024).
45.Ning, X., Guan, J., Li, X.-A., Wei, Y. & Chen, F. Physics-informed neural networks integrating compartmental model for analysing COVID-19 transmission dynamics. Viruses15, 1749 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Pei, S., Kandula, S., Yang, W. & Shaman, J. Forecasting the spatial transmission of influenza in the united states. Proc. Natl. Acad. Sci.115, 2752–2757 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Zhao, A. P. et al. Ai for science: Predicting infectious diseases. J. Saf. Sci. Resilience (2024).
48.Yeung, A., Roewer-Despres, F., Rosella, L. & Rudzicz, F. Comparison of multiple machine learning-based predictions of growth in COVID-19 confirmed infection cases in countries using non-pharmaceutical interventions and cultural dimensions data: Development and validation (preprint). J. Med. Intern. Res.[SPACE]10.2196/26628 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Annas, S., Pratama, M. I., Rifandi, M., Sanusi, W. & Side, S. Stability analysis and numerical simulation of seir model for pandemic COVID-19 spread in Indonesia. Chaos Solitons Fractals139, 110072 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Watson, O. J. et al. Global impact of the first year of COVID-19 vaccination: A mathematical modelling study. Lancet. Infect. Dis.22, 1293–1302 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw.61, 85–117 (2015). [DOI] [PubMed] [Google Scholar]
52.Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst.25 (2012).
53.Hinton, G. et al. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Process. Mag.29, 82–97 (2012). [Google Scholar]
54.Hornik, K., Stinchcombe, M. & White, H. Multilayer feedforward networks are universal approximators. Neural Netw.2, 359–366 (1989). [Google Scholar]
55.Nair, V. & Hinton, G. E. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML-10) 807–814 (2010).
56.Kingma, D. P. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
57.Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). 2019 novel coronavirus COVID-19 (2019-ncov) data repository. https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data (2022). Accessed on 16 Oct 2022.
58.Our World in Data. Data on COVID-19 (coronavirus) vaccinations. https://github.com/owid/COVID-19-data/tree/master/public/data/vaccinations (2022).
59.Rousseeuw, P. J. & Hubert, M. Robust statistics for outlier detection. Wiley Interdiscipl. Rev. Data Min. Knowl. Discov.1, 73–79 (2011). [Google Scholar]
60.Abadi, M. et al. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467 (2016).
61.He, M., Tang, B., Xiao, Y. & Tang, S. Transmission dynamics informed neural network with application to COVID-19 infections. Comput. Biol. Med.165, 107431 (2023). [DOI] [PubMed] [Google Scholar]
62.Colburn, N. How long does it take for the COVID-19 vaccine to work? Wexner Medical Center Blog (2021). Available online.
63.Wu, Y. et al. Incubation period of COVID-19 caused by unique sars-cov-2 strains: A systematic review and meta-analysis. JAMA Netw. Open5, e2228008 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
64.Chen, Y.-T. Effect of vaccination patterns and vaccination rates on the spread and mortality of the COVID-19 pandemic. Health Policy Technol.12, 100699 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
65.Salcedo-Varela, G. A., Peñuñuri, F., González-Sánchez, D. & Díaz-Infante, S. Synchronizing lockdown and vaccination policies for COVID-19: An optimal control approach based on piecewise constant strategies. Opt. Control Appl. Methods45, 523–543 (2024). [Google Scholar]
66.Voigtlaender, F. The universal approximation theorem for complex-valued neural networks. Appl. Comput. Harmon. Anal.64, 33–61 (2023). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data used in this study are publicly available and can be accessed from the Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE) and Our World in Data^57,58.

[CR1] 1.Calistri, A., Roggero, P. F. & Palù, G. Chaos theory in the understanding of COVID-19 pandemic dynamics. Gene 148334 (2024). [DOI] [PubMed]

[CR2] 2.Hu, S. et al. Races of small molecule clinical trials for the treatment of COVID-19: An up-to-date comprehensive review. Drug Dev. Res.83, 16–54 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med.382, 727–733 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.World Health Organization. Who timeline - COVID-19. https://www.who.int/news/item/27-04-2020-who-timeline---COVID-19 (2020). Accessed: 20 Apr 2024.

[CR5] 5.Andrew, R. Preliminary genomic characterisation of an emergent sars-cov-2 lineage in the UK defined by a novel set of spike mutations. Virological (2020).

[CR6] 6.Dhar, M. S. et al. Genomic characterization and epidemiology of an emerging sars-cov-2 variant in Delhi, India. Science374, 995–999 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Tegally, H. et al. Detection of a sars-cov-2 variant of concern in south Africa. Nature592, 438–443 (2021). [DOI] [PubMed] [Google Scholar]

[CR8] 8.Harvey, W. T. et al. Sars-cov-2 variants, spike mutations and immune escape. Nat. Rev. Microbiol.19, 409–424 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Vespignani, A. et al. Modelling COVID-19. Nat. Rev. Phys.2, 279–281 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Aziz, M. H. N. et al. Modelling the effect of vaccination program and inter-state travel in the spread of COVID-19 in Malaysia. Acta. Biotheor.71, 2 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Zhu, Q., Gao, Y., Hu, Q., Hu, D. & Wu, X. A study on the factors influencing the intention to receive booster shots of the COVID-19 vaccine in China based on the information frame effect. Front. Public Health12, 1258188 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Holmdahl, I. & Buckee, C. Wrong but useful–what COVID-19 epidemiologic models can and cannot tell us. N. Engl. J. Med.383, 303–305 (2020). [DOI] [PubMed] [Google Scholar]

[CR13] 13.Thompson, R. N. et al. Key questions for modelling COVID-19 exit strategies. Proc. R. Soc. B287, 20201405 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Bretó, C., He, D., Ionides, E. L. & King, A. A. Time series analysis via mechanistic models. Ann. Appl. Stat. 319–348 (2009).

[CR15] 15.Kucharski, A. J. et al. Early dynamics of transmission and control of COVID-19: A mathematical modelling study. Lancet. Infect. Dis.20, 553–558 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Giordano, G. et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nat. Med.26, 855–860 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature521, 436–444 (2015). [DOI] [PubMed] [Google Scholar]

[CR18] 18.Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (MIT Press, 2016). [Google Scholar]

[CR19] 19.Jordan, M. I. & Mitchell, T. M. Machine learning: Trends, perspectives, and prospects. Science349, 255–260 (2015). [DOI] [PubMed] [Google Scholar]

[CR20] 20.Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature542, 115–118 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Angermueller, C., Pärnamaa, T., Parts, L. & Stegle, O. Deep learning for computational biology. Mol. Syst. Biol.12, 878 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Aljaaf, A. J., Mohsin, T. M., Al-Jumeily, D. & Alloghani, M. A fusion of data science and feed-forward neural network-based modelling of COVID-19 outbreak forecasting in Iraq. J. Biomed. Inform.118, 103766 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Alshanbari, H. M. et al. On the implementation of the artificial neural network approach for forecasting different healthcare events. Diagnostics13, 1310 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst.2, 303–314 (1989). [Google Scholar]

[CR25] 25.Liu, D., Ding, W., Dong, Z. S. & Pedrycz, W. Optimizing deep neural networks to predict the effect of social distancing on COVID-19 spread. Comput. Ind. Eng.166, 107970 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Kermack, W. O. & McKendrick, A. G. A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond. Ser. A Contain. Pap. A Math. Phys. Charact.115, 700–721 (1927). [Google Scholar]

[CR27] 27.Ferguson, N. M. et al. Report 9: Impact of Non-pharmaceutical Interventions (NPIs) to Reduce COVID19 Mortality and Healthcare Demand Vol. 16 (Imperial College London, 2020). [Google Scholar]

[CR28] 28.Moore, S., Hill, E. M., Tildesley, M. J., Dyson, L. & Keeling, M. J. Vaccination and non-pharmaceutical interventions for COVID-19: A mathematical modelling study. Lancet. Infect. Dis.21, 793–802 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Kissler, S. M., Tedijanto, C., Goldstein, E., Grad, Y. H. & Lipsitch, M. Projecting the transmission dynamics of sars-cov-2 through the postpandemic period. Science368, 860–868 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Anderson, R. M. & May, R. M. Infectious Diseases of Humans: Dynamics and Control (Oxford University Press, 1991). [Google Scholar]

[CR31] 31.Hethcote, H. W. The mathematics of infectious diseases. SIAM Rev.42, 599–653 (2000). [Google Scholar]

[CR32] 32.Bozzani, F. M. Incorporating Feasibility in Priority Setting: A Case Study of Tuberculosis Control in South Africa. Ph.D. thesis, London School of Hygiene & Tropical Medicine (2021).

[CR33] 33.Thompson, J. & Wattam, S. Estimating the impact of interventions against COVID-19: From lockdown to vaccination. PLoS ONE16, e0261330 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.He, M., Tang, S. & Xiao, Y. Combining the dynamic model and deep neural networks to identify the intensity of interventions during COVID-19 pandemic. PLoS Comput. Biol.19, e1011535 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Samek, W., Montavon, G., Lapuschkin, S., Anders, C. J. & Müller, K.-R. Explaining deep neural networks and beyond: A review of methods and applications. Proc. IEEE109, 247–278 (2021). [Google Scholar]

[CR36] 36.Rahmadani, F. & Lee, H. Hybrid deep learning-based epidemic prediction framework of COVID-19: South Korea case. Appl. Sci.10, 8539 (2020). [Google Scholar]

[CR37] 37.Dietterich, T. G. Ensemble methods in machine learning. In International Workshop on Multiple Classifier Systems 1–15 (Springer, 2000).

[CR38] 38.Tian, Y., Luthra, I. & Zhang, X. Forecasting COVID-19 cases using machine learning models. MedRxiv 2020–07 (2020).

[CR39] 39.Ghafouri-Fard, S. et al. Application of machine learning in the prediction of COVID-19 daily new cases: A scoping review. Heliyon7 (2021). [DOI] [PMC free article] [PubMed]

[CR40] 40.Ning, X., Jia, L., Wei, Y., Li, X.-A. & Chen, F. Epi-dnns: Epidemiological priors informed deep neural networks for modeling COVID-19 dynamics. Comput. Biol. Med.158, 106693 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Chimmula, V. K. R. & Zhang, L. Time series forecasting of COVID-19 transmission in Canada using lstm networks. Chaos Solitons Fractals135, 109864 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR42] 42.Lou, H.-R., Wang, X., Gao, Y. & Zeng, Q. Comparison of Arima model, DNN model and lstm model in predicting disease burden of occupational pneumoconiosis in Tianjin, China. BMC Public Health22, 2167 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Kong, L. et al. Compartmental structures used in modeling COVID-19: A scoping review. Infect. Dis. Poverty11, 72 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Cao, L. & Liu, Q. COVID-19 modeling: A review. ACM Comput. Surv.57, 10.1145/3686150 (2024).

[CR45] 45.Ning, X., Guan, J., Li, X.-A., Wei, Y. & Chen, F. Physics-informed neural networks integrating compartmental model for analysing COVID-19 transmission dynamics. Viruses15, 1749 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR46] 46.Pei, S., Kandula, S., Yang, W. & Shaman, J. Forecasting the spatial transmission of influenza in the united states. Proc. Natl. Acad. Sci.115, 2752–2757 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR47] 47.Zhao, A. P. et al. Ai for science: Predicting infectious diseases. J. Saf. Sci. Resilience (2024).

[CR48] 48.Yeung, A., Roewer-Despres, F., Rosella, L. & Rudzicz, F. Comparison of multiple machine learning-based predictions of growth in COVID-19 confirmed infection cases in countries using non-pharmaceutical interventions and cultural dimensions data: Development and validation (preprint). J. Med. Intern. Res.[SPACE]10.2196/26628 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR49] 49.Annas, S., Pratama, M. I., Rifandi, M., Sanusi, W. & Side, S. Stability analysis and numerical simulation of seir model for pandemic COVID-19 spread in Indonesia. Chaos Solitons Fractals139, 110072 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR50] 50.Watson, O. J. et al. Global impact of the first year of COVID-19 vaccination: A mathematical modelling study. Lancet. Infect. Dis.22, 1293–1302 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR51] 51.Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw.61, 85–117 (2015). [DOI] [PubMed] [Google Scholar]

[CR52] 52.Krizhevsky, A., Sutskever, I. & Hinton, G. E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst.25 (2012).

[CR53] 53.Hinton, G. et al. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Process. Mag.29, 82–97 (2012). [Google Scholar]

[CR54] 54.Hornik, K., Stinchcombe, M. & White, H. Multilayer feedforward networks are universal approximators. Neural Netw.2, 359–366 (1989). [Google Scholar]

[CR55] 55.Nair, V. & Hinton, G. E. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML-10) 807–814 (2010).

[CR56] 56.Kingma, D. P. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

[CR57] 57.Johns Hopkins University Center for Systems Science and Engineering (JHU CSSE). 2019 novel coronavirus COVID-19 (2019-ncov) data repository. https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data (2022). Accessed on 16 Oct 2022.

[CR58] 58.Our World in Data. Data on COVID-19 (coronavirus) vaccinations. https://github.com/owid/COVID-19-data/tree/master/public/data/vaccinations (2022).

[CR59] 59.Rousseeuw, P. J. & Hubert, M. Robust statistics for outlier detection. Wiley Interdiscipl. Rev. Data Min. Knowl. Discov.1, 73–79 (2011). [Google Scholar]

[CR60] 60.Abadi, M. et al. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467 (2016).

[CR61] 61.He, M., Tang, B., Xiao, Y. & Tang, S. Transmission dynamics informed neural network with application to COVID-19 infections. Comput. Biol. Med.165, 107431 (2023). [DOI] [PubMed] [Google Scholar]

[CR62] 62.Colburn, N. How long does it take for the COVID-19 vaccine to work? Wexner Medical Center Blog (2021). Available online.

[CR63] 63.Wu, Y. et al. Incubation period of COVID-19 caused by unique sars-cov-2 strains: A systematic review and meta-analysis. JAMA Netw. Open5, e2228008 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR64] 64.Chen, Y.-T. Effect of vaccination patterns and vaccination rates on the spread and mortality of the COVID-19 pandemic. Health Policy Technol.12, 100699 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR65] 65.Salcedo-Varela, G. A., Peñuñuri, F., González-Sánchez, D. & Díaz-Infante, S. Synchronizing lockdown and vaccination policies for COVID-19: An optimal control approach based on piecewise constant strategies. Opt. Control Appl. Methods45, 523–543 (2024). [Google Scholar]

[CR66] 66.Voigtlaender, F. The universal approximation theorem for complex-valued neural networks. Appl. Comput. Harmon. Anal.64, 33–61 (2023). [Google Scholar]

PERMALINK

Leveraging dynamics informed neural networks for predictive modeling of COVID-19 spread: a hybrid SEIRV-DNNs approach

Cheng Cheng

Elayaraja Aruchunan

Muhamad Hifzhudin Noor Aziz

Abstract

Introduction

Mini-review of related works

Methodology

Dynamic model

Fig. 1.

DNN architecture

DINNs model

Fig. 2.

Experiments and results

Data preparation

Fig. 3.

Model training and predictions

Table 1.

Fig. 4.

Fig. 5.

Model evaluation performance

Table 2.

Fig. 6.

Computational complexity

Table 3.

Table 4.

Fig. 7.

Discussions and conclusions

Author contributions

Data availability

Declarations

Competing interests

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases