Cascading failures and resilience optimization of hospital infrastructure systems against the COVID-19

Hongyan Dui; Kaixin Liu; Shaomin Wu

doi:10.1016/j.cie.2023.109158

2023 Mar 17;179:109158. doi: 10.1016/j.cie.2023.109158

Hongyan Dui ^a,^⁎, Kaixin Liu ^a, Shaomin Wu ^b

PMCID: PMC10022470 PMID: 36960126

Abstract

The outbreak of the Coronavirus Disease 2019 (COVID-19) has put the resilience of a country’s healthcare infrastructure to the most severe test. The challenge of taking emergency measures to optimize the supply of medical resources and effectively meet the medical needs of residents is an important issue that needs to be resolved urgently in the prevention and control of public health emergencies. This paper analyzes cascading failures and optimization of the resilience of the hospital infrastructure system (HIS) with the presence of the COVID-19. It proposes a propagation model to describe the COVID-19 infectious process and establishes a cascading failure model of a HIS to analyze its failure mechanism. It also proposes a method for optimizing the resilience of HIS. Then the supplies and demands in maintaining the operations of HIS are studied, and a restoration strategy is obtained. Finally, simulation analysis of the spread of the COVID-19 is carried out to illustrate the applicability of the proposed method.

Keywords: Reliability, Cascading failure, Resilience, Hospital infrastructure system, Supply chain

1. Introduction

1.1. Background

In December 2019, a coronavirus disease, later named COVID-19, was detected in Wuhan, Hubei Province, China, and then began spreading globally. It is transmitted mainly by respiratory droplets and physical contact and is highly contagious. It poses a tremendous threat to people’s lives and health, and causes immense damage to economic and social development. As of 12:02 am, February 28, 2023 (Greenwich time), there were 679,887,320 COVID-19 infected cases and 6,799,660 deaths relating to COVID-19 (Worldometer, 2023). The World Health Organization has listed COVID-19 as a public health emergency of international concern.

As the main part of a healthcare system for responding to public health emergencies, hospital infrastructure systems (HIS’s) are directly responsible for the prevention and control of epidemics. A HIS is a complex system composed of medical staff and various types of medical resources interacting with each other, and can be abstracted as a complex network consisting of all hospitals and their linkage relationships.

At the beginning of the COVID-19 outbreak, hospitals were overwhelmed with COVID-19 patients and HIS’s struggled to cope with the surging medical demand. The outbreak put the resilience of HIS’s to the test, and emergency management tools were therefore necessary for managing the performance and quantity of medical resources. To address the shortcomings and deficiencies revealed by the outbreak, there is a need to conduct failure analysis of HIS’s and investigate post-disaster restoration strategies for HIS’s. This paper serves this purpose.

1.2. Literature reviews

The infectious disease dynamics model (IDD model) is an effective tool for the study of infectious diseases, on which there is an abundance of work (Gao and Wang, 2022, Qian and Ukkusuri, 2021, Kermack and Mckendrick, 1991; Enatsu, Messina, Nakata, Muroya, & Vecchio, 2012). In the literature, there are two main approaches to characterizing the dynamics of infectious diseases: the first uses compartmental models and the second models disease propagation at the individual level over large-scale networks (Qian & Ukkusuri, 2021). In particular, a compartmental model classifies the population under study into several states: susceptible (S), latent (E), infected (I), and recovered (R). The transitions between these states describe the process of virus propagation. A susceptible person may contract the virus from an infected person and become latent, then infected, and may become recovered after being treated. Compartmental models include SIR (susceptible-infected-recovered) models, SIRS (susceptible-infected-recovered-susceptible) models, and SEIR (susceptible-exposed-infected-removed) models, among others. Kermack & Mckendrick (1991) first proposed the SIR model in 1927. They assumed that the number of people in the target area was constant and that people who recovered from the virus would no longer be infected, and divided the target population into three categories: susceptible (S), infected (I), and recovered (R). Enatsu et al. (2012) and Sekiguchi and Ishiwata (2010) studied discrete-time SIRS epidemic models with time delays and non-linear incidence. Using mathematical induction, the comparison principle for differential equations, and suitably constructed Lyapunov functions, they concluded that the disease is persistent when the basic reproduction number is greater than one.

Building on compartmental models, researchers have proposed complex-network virus models that treat individuals as nodes and connections between individuals as edges, in order to study the virus propagation process on both homogeneous and non-homogeneous networks. Gagliardi & Alves (2010) studied the influence of the small-world effect on virus propagation based on Cellular Automata (CA) and concluded that a stronger small-world effect accelerates virus propagation. Wang, Wang, Liu & Li (2014) studied an SIR epidemic model with demographics and time delay on networks. Zhang & Jin (2011) considered an epidemic model on networks with birth and death rates, in which a basic reproductive threshold parameter is defined to characterize the dynamics of an epidemic.

Most of the research on cascading failures in complex networks has focused on quantitative analysis and applied research (Sheu et al., 2020, Whitman et al., 2017). Wang & Xiao (2016) presented a cascade failure model based on an improved ant colony algorithm for a cluster-distributed supply chain network, taking into account the topology of the network, the flexibility of the nodes and the efficiency of the nodes. Zhou, Huang, Coit & Fel (2018) analyzed the process of network cascade failures from the perspectives of load dynamics and node dependence, respectively. Zheng, Gao & Zhao (2007) constructed a cascade failure model for scale-free networks that considers aggregation coefficients and congestion effects, and pointed out the characteristics of network element coefficients with high sensitivity to failures. Rodríguez-Méndez, Ser-Giacomi & Hernández-García (2017) investigated the characteristics of clustering coefficients in the cascade failure process of fluid networks and their impact on the scale of failures. Linkov, Keenan & Trump (2021) reviewed research that applies risk, resilience, and strategy theories to civil, environmental, and public health in the context of COVID-19. Their work enables decision-makers to understand the systemic and sweeping nature of the COVID-19 pandemic. Hynes, Trump, Love & Linkov (2020) pointed out that COVID-19 can reduce the ability of critical systems to withstand shocks and can cause failures in one system to spread to another. Wells, Boden, Tseytlin & Linkov (2022) reviewed the network science literature published between 2010 and 2021 on the resilience of critical infrastructure under compounding failure. Guo et al. (2019) developed a cascade model that takes account of the project’s self-protection mechanism to examine a failure propagation process originating from a single task failure.

There are many studies on maintenance optimization of complex systems (Broek et al., 2021, Broek et al., 2019, Keizer et al., 2017, Zhao et al., 2018), and resilience is an indicator to guide the maintenance of complex systems (Almoghathawi and Barker, 2019). The word “resilience” is originally derived from the Latin word “resiliere”, meaning “to rebound”, and is commonly used to indicate the ability of a system to sustain external and internal disruptions without interrupting the execution of system functions, or, if a function is interrupted, to fully recover it rapidly (Hosseini, Barker & Ramirez-Marquez, 2016). Galaitsi et al. (2020) studied nine concepts that characterize systems facing threats: adaptability, agility, reliability, resilience, resistance, robustness, safety, security, and sustainability. They found that resilience could only manifest when recovery is needed, and thus could complement concepts related to threat impact such as resistance, robustness, safety, and security. Siskos & Burgherr (2022) proposed an elaborate multicriteria decision support framework for the evaluation of electricity supply resilience, based on three major resilience dimensions: “resist”, “restabilise” and “recover”. Ouyang (2017) proposed a mathematical framework to support resilience optimization of interdependent critical infrastructure system under the worst critical infrastructure system. Linkov et al. (2018) proposed a three-tier qualitative analysis framework for resilience assessment. The framework allows regulators to integrate resilience assessments with existing risk assessment protocols. Ransolin, Saurin and Formoso (2020) developed a framework for the integrated modelling of built environment and functional requirements, supporting the analysis of resilient performance.

In the context of HIS’s, resilience refers to the ability of a HIS to recover quickly from the impact of a health event. The continued operation of infrastructure is fundamental to people’s daily life, and optimizing the resilience of HIS’s is essential for the safety and health of the population (Barabadi, Ghiasi & Nouri, 2020). Studies on HIS resilience under emergencies can be found in the literature. Pishnamazzadeh, Sepehri & Ostadi (2020) proposed a model to assess hospital resilience based on a system dynamics approach. The model studied the effect on resilience of four Key Performance Indicators (KPIs) of hospitals: patient satisfaction, patient waiting time, staff burnout and staff satisfaction. Achour, Miyajima, Pascale & Price (2014) assessed the resilience of healthcare institutions under supply disruption, using data from hospitals in the aftermath of the 2011 Tohoku earthquake in Japan for validation. Tariverdi, Fotouhi, Moryadee & Miller-Hooks (2018) proposed a hierarchical modeling concept to quantify the resilience of regional hospital response under disaster, and estimated resilience in terms of total patient waiting time and unserved patients. Zhang, Shi, Huang, Hua & Teunter (2021) studied policies for optimizing the inventory and capital reserves of emergency medical resources under COVID-19. Samsuddin, Takim, Nawawi & Alwee (2018) measured a hospital’s disaster resilience as its ability to resist, absorb, accommodate and recover from the effects of a hazard in a timely and efficient manner. Additionally, they investigated hospital preparedness attributes and resilience indicators and established the relationship between preparedness attributes and hospital resilience.
Hassan and Mahmoud (2021) investigated the combined impact of wildfire and pandemic on a network of hospitals. They combined wildfire data with varying courses of the spread of COVID-19 to evaluate the effectiveness of different strategies for managing patient demand. Li et al. (2020) developed a system dynamics model describing hospital functionality after earthquakes (SD-HFE) to simulate hospital functionalities; the resilience assessment can then be conducted based on the functionality curve. Grimaz, Ruzzene & Zorzini (2021) illustrated the RADAR-HF (Recon Analysis for Detecting the Actual situation and the improvement Requests, applied to Hospital Facilities) method developed for the situational assessment of the physical environment of hospital facilities. Decision makers can use RADAR-HF to define comprehensive modernization strategies with resilience improvements, monitor the condition of facilities, and understand the effectiveness of interventions. Barasa, Mbau and Gilson (2018) performed a systematic review of empirical literature on organizational resilience, and made several observations that were relevant to nurturing the resilience of health systems.

1.3. Knowledge gaps, novelty and contributions

From the above literature review, there are some hospital resilience models that apply resilience theory in hospital management. There are four main categories of hospital resilience models in the literature: models based on a system dynamics approach to studying the relevant factors affecting resilience, models that assess resilience through different perspectives, models for optimizing resilience based on different optimization purposes, and models that develop informative decision support systems. This paper presents a resilience optimization model for hospitals based on the Markov decision process, which integrates the virus propagation process and a hospital cascading failure process. Then a hospital resilience optimization model is proposed, which can determine the restoration strategy of HIS’s at each period and can restore the hospital's ability to serve patients as soon as possible.

It can be seen from the literature review that resilience and COVID-19 have been studied from different perspectives. However, the above studies have some shortcomings. First, studies on virus transmission do not consider individual nodal heterogeneity. Second, the mechanism by which virus propagation affects the HIS state is not considered. Third, a HIS is treated as a two-state system, whereas it is in fact a multi-state system. Fourth, relevant studies did not investigate the performance of restoration strategies for maximizing HIS resilience in the event of cascading failure.

This paper aims to fill these knowledge gaps and makes the following contributions.

(a) We propose a COVID-19 propagation model with node heterogeneity based on the SEIR model. The degree, activity capability, and propagation capability of nodes are incorporated into the virus propagation process, and the resulting node-specific propagation probability is used to study the spread of COVID-19 in the crowd.
(b) A hospital cascading failure model is proposed that uses the hospital outbreak rate as an indicator of hospital cascading failures while taking into account the distribution of patient flow. This model can be used to study how the spread of COVID-19 influences the supply and demand involved in maintaining the operations of a hospital. In addition, the cascading model can assess the loss to the hospital from the patient's perspective.
(c) We apply the theory of resilience to manage HIS’s and propose a quantitative framework for resilience management. We propose a hospital resilience model from the patient's perspective. Hospital resilience is the ratio of the number of patients transferred out of the hospital to the number of patients transferred in over a period of time, which reflects the real-time resilience of the hospital and quantifies the hospital's ability to serve patients unaffected by unexpected events.
(d) The paper considers a hospital as a multi-state (polymorphic) system and proposes an optimization model for HIS resilience based on the Markov process. Real-time restoration strategies can be determined from the resilience optimization model.

1.4. Overview

The remainder of this paper is structured as follows. Section 2 proposes a COVID-19 propagation model considering node heterogeneity and studies the propagation process of the disease. Section 3 proposes a cascading failure model of a HIS under COVID-19. Based on the load model, the cascading failure process of a HIS is portrayed. Section 4 takes a hospital infrastructure network after a node failure as the object to study the resilience optimization of HIS’s under COVID-19. Section 5 takes HIS’s in two districts in a city as an example for simulation verification. Section 6 wraps up the paper and proposes future research. Supporting Information provides supplementary information on the Markov decision process modeling and some original data for simulation.

2. A COVID-19 propagation model based on node heterogeneity

2.1. Model indicators

The crowd is abstracted as a scale-free network, denoted by $G (V, W)$. Residents are abstracted as individual nodes, denoted as $V$, and the connections between residents are abstracted as edges, denoted as $W$. There are $N$ nodes in the scale-free network, $V = \{1, \dots, i, \dots, N\}$, where the element $i$ in $V$ represents the $i$-th node. The adjacency matrix of $G (V, W)$ is ${[W_{ij}]}_{N \times N}$, where $1 \leq i, j \leq N$; $W_{ij} = 1$ if node $i$ is connected to node $j$, and $W_{ij} = 0$ otherwise.

Nodes with different attributes have different actions. Considering the heterogeneity of nodes, the different attributes of nodes are described by a topological structure, activity ability and virus propagation ability of crowd network nodes. The establishment indicators are as follows.

The degree of a node represents the structural centrality of the node and reflects the degree of mutual influence between the node and its neighboring nodes. It is the number of links between a node and other nodes, and can reflect the number of people a person has been in contact with. The degree of node $i$ is defined by:

F_{i} = \sum_{j = 1}^{N} W_{ij}

(1)

where $W_{ij}$ is the adjacency relationship between node $i$ and node $j$ , $N$ represents the number of nodes in $G (V, W)$ , and $F_{i}$ is the degree of node $i$ .

The activity of a node is paroxysmal, and the activity time interval can describe its activity ability (Li, Guo, Gao, Zhang & Zhang, 2018). The higher the level of physical activity of a node, the more likely the node is to participate in the virus spreading process. If the node is a susceptible person with a high level of physical activity, the probability that this person becomes infected is higher. If the node is an infected person, the level of physical activity of the person is directly proportional to the person's ability to infect others. Inactive nodes do not perform any activities, such as spreading viruses and seeking medical treatment. At the end of each time interval, the node has an active time point, at which the node can spread the virus. The activity time interval sequence of node $i$ is $T_{i} = \{t_{i 1}, t_{i 2}, \dots, t_{i q_{i}}\}$, where $t_{iz}$ $(z = 1, 2, \dots, q_{i})$ follows a normal distribution, and $q_{i}$ is the number of elements in $T_{i}$. The average of the activity time intervals of node $i$ is taken as the activity time interval of node $i$. The level of physical activity $A_{i}$ of node $i$ is the negative logarithm of the ratio of this average to the sum of the averages over all nodes, so that shorter intervals give a higher activity level. $A_{i}$ represents the activity capacity of individual node $i$ relative to the population, as shown in Eq. (2),

A_{i} = - \ln \frac{\sum_{z = 1}^{q_{i}} t_{iz} / q_{i}}{\sum_{i = 1}^{N} (\sum_{z = 1}^{q_{i}} t_{iz} / q_{i})}

(2)

The nodes in the crowd network are different, so different nodes are considered to have different virus propagation abilities. For the propagation of COVID-19, the propagation ability $P_{i}$ of node $i$ is assumed constant. Let $P_{i}$ be generated by a normally distributed random variable $P$, i.e., $P \sim N (μ_{p}, σ_{p}^{2})$, truncated to $[0, 1]$ ($P_{i} = 0$ when $P < 0$ and $P_{i} = 1$ when $P > 1$) (Zou, Towsley & Gong, 2004).

The level of physical activity of a node affects its ability to spread the infection. Nodes with a high level of physical activity can promote the spread of the virus more efficiently than those with a lower level of physical activity (Xin, Gao, Wang, Zhen & Li, 2019). The effective propagation ability of node $i$ is $σ_{i}$, as shown in Eq. (3),

σ_{i} = \begin{cases} P_{i}, & A_{i} \geq a \\ 0, & 0 \leq A_{i} < a \end{cases}

(3)

where $a$ is the average of the activity capacity of all nodes, $P_{i}$ is the propagation ability of node $i$ , $A_{i}$ is the activity capacity of node $i$ , and $σ_{i}$ is the effective propagation ability of node $i$ .

In the process of virus transmission, a person contracts COVID-19 with a certain probability, and this probability is related to the number of people the person is exposed to and the effective transmission capacity of that person. The probability of transmission increases with the number of human contacts and the effective transmission ability. Therefore, the propagation probability $α_{i}$ of node $i$ can be expressed by

α_{i} = α_{0} + ε F_{i} σ_{i}

(4)

where $α_{0}$ is a given basic propagation probability, $ε$ is a given parameter, and $0 < α_{i} \leq 1$ .
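The indicators in Eqs. (1)-(4) can be computed directly from an adjacency matrix and the activity interval sequences. The following is a minimal sketch, not the authors' implementation: the function name `node_indicators`, the three-node example network, and all numerical values are hypothetical, and clipping $α_{i}$ at 1 is an assumption made here to respect the stated constraint $0 < α_{i} \leq 1$.

```python
import math


def node_indicators(adjacency, intervals, p, alpha0, eps):
    """Compute degree F_i, activity A_i, effective propagation ability
    sigma_i and propagation probability alpha_i following Eqs. (1)-(4)."""
    n = len(adjacency)
    # Eq. (1): the degree is the row sum of the adjacency matrix.
    degree = [sum(row) for row in adjacency]
    # Mean activity time interval of each node.
    mean_interval = [sum(t) / len(t) for t in intervals]
    total = sum(mean_interval)
    # Eq. (2): negative log of each node's share of the summed mean intervals.
    activity = [-math.log(m / total) for m in mean_interval]
    # a is the average activity capacity over all nodes.
    a = sum(activity) / n
    # Eq. (3): only nodes at or above the average activity propagate.
    sigma = [p[i] if activity[i] >= a else 0.0 for i in range(n)]
    # Eq. (4), clipped at 1 (assumption: the source only states alpha_i <= 1).
    alpha = [min(1.0, alpha0 + eps * degree[i] * sigma[i]) for i in range(n)]
    return degree, activity, sigma, alpha


# Hypothetical three-node path network 0 - 1 - 2 with made-up parameters.
adjacency = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
intervals = [[1.0, 1.0], [2.0, 2.0], [5.0, 5.0]]
p = [0.5, 0.4, 0.9]
degree, activity, sigma, alpha = node_indicators(
    adjacency, intervals, p, alpha0=0.1, eps=0.2
)
print(degree, sigma, alpha)
```

In this example the third node has the longest activity intervals, so its activity falls below the average and its effective propagation ability is zero even though its raw ability $P_{i}$ is the largest.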

2.2. The propagation model

Combining the characteristics of the propagation process of COVID-19, the SEIR propagation model of COVID-19 is established. In the model, nodes have the following five states:

• Susceptible state S: The node has not yet been infected with the virus.
• Latent state L: The node has been infected by the virus and is asymptomatic but contagious.
• Exposed state E: The node has been infected by the virus, is symptomatic and contagious.
• Recovered state R: The node has recovered from COVID-19, is immune to further infection and is not contagious.
• Dead state D: The node has died with COVID-19 and is not contagious.

We take the crowd network as the object to establish a COVID-19 propagation model, as shown in Fig. 1.

Fig. 1 — The COVID-19 propagation model.

Assume that the existence of an edge between nodes is a necessary condition for virus propagation, and that nodes that have moved to either the recovered or the dead state no longer participate in the network propagation process. The state transition rules of nodes are as follows.

In this paper, time is divided into identical periods. The states of individuals and hospital nodes in each period are studied, with $k$ denoting the index of a period. In each period, a susceptible node $i$ is infected by its neighboring nodes with probability $α_{i}$ and then transitions to the latent state. A latent node develops symptoms with probability $β$ and then transitions to the exposed state. An exposed node is cured in the hospital with probability $γ$ and then transitions to the recovered state, or dies of the disease with probability $η$. After transitioning to the recovered or the dead state, the node is removed and does not participate in the propagation process in the crowd network. Considering node heterogeneity, the probability of a node being in each state in period $(k + 1)$ is given by Eqs. (5)-(9).

P_{i}^{S} (k + 1) = S_{i}^{S} (k) (1 - α_{i})

(5)

P_{i}^{L} (k + 1) = S_{i}^{S} (k) α_{i} + S_{i}^{L} (k) (1 - β)

(6)

P_{i}^{E} (k + 1) = S_{i}^{L} (k) β + S_{i}^{E} (k) (1 - γ) (1 - η)

(7)

P_{i}^{R} (k + 1) = S_{i}^{E} (k) γ + S_{i}^{R} (k)

(8)

P_{i}^{D} (k + 1) = S_{i}^{E} (k) η + S_{i}^{D} (k)

(9)

where the propagation probability $α_{i}$ represents the probability of node $i$ spreading the virus. The exposed probability $β$ is the proportion of nodes that change from the latent state to the exposed state per period. The recovered probability $γ$ is the proportion of nodes that change from the exposed state to the recovered state per period. The death probability $η$ is the proportion of nodes that change from the exposed state to the dead state per period. $S_{i} (k + 1) = [S_{i}^{S} (k + 1), S_{i}^{L} (k + 1), S_{i}^{E} (k + 1), S_{i}^{R} (k + 1), S_{i}^{D} (k + 1)]$ is the state vector of node $i$ at the $(k + 1)$-th period, where $S_{i}^{S} (k + 1), S_{i}^{L} (k + 1), S_{i}^{E} (k + 1), S_{i}^{R} (k + 1), S_{i}^{D} (k + 1) \in \{0, 1\}$: an element equal to 1 means that the node is in this state and an element equal to 0 means that it is not. $S_{i}^{S} (k + 1) + S_{i}^{L} (k + 1) + S_{i}^{E} (k + 1) + S_{i}^{R} (k + 1) + S_{i}^{D} (k + 1) = 1$ indicates that the node can be in only one of the five states in the $(k + 1)$-th period.

$P_{i} (k + 1) = [P_{i}^{S} (k + 1), P_{i}^{L} (k + 1), P_{i}^{E} (k + 1), P_{i}^{R} (k + 1), P_{i}^{D} (k + 1)]$ is the vector of probabilities that node $i$ is in each state in the $(k + 1)$-th period. These probabilities are normalized so that they sum to one over the five states, as shown in Eq. (10).

P_{i}^{S} (k) + P_{i}^{L} (k) + P_{i}^{E} (k) + P_{i}^{D} (k) + P_{i}^{R} (k) = 1

(10)

Time is divided into equal periods, and the state at any $(k + 1)$ -th period is then given by:

S_{i} (k + 1) = \mathrm{MultiRealize} [P_{i} (k + 1)]

(11)

where $\mathrm{MultiRealize} [P_{i} (k + 1)]$ randomly realizes the state of node $i$ in the $(k + 1)$-th period according to the probability distribution $P_{i} (k + 1)$.
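One period of the state update can be sketched as follows. This is an illustrative reading of the transition rules, not the authors' code: the names `transition_probs` and `multi_realize` are hypothetical, the recovered and dead states are assumed to be absorbing (a node already in R or D stays there, in line with the description of removed nodes), and the five probabilities are renormalized to satisfy Eq. (10). The infection probability $α_{i}$ is applied directly to a susceptible node, without an explicit check for infected neighbors.

```python
import random

STATES = ("S", "L", "E", "R", "D")


def transition_probs(state, alpha_i, beta, gamma, eta):
    """Return the normalized probability of each state in period k + 1
    for a node whose state in period k is `state`."""
    # One-hot state vector S_i(k).
    s = {x: 1.0 if x == state else 0.0 for x in STATES}
    raw = {
        "S": s["S"] * (1 - alpha_i),
        "L": s["S"] * alpha_i + s["L"] * (1 - beta),
        "E": s["L"] * beta + s["E"] * (1 - gamma) * (1 - eta),
        # Assumption: R and D are absorbing, so their own indicator carries over.
        "R": s["E"] * gamma + s["R"],
        "D": s["E"] * eta + s["D"],
    }
    total = sum(raw.values())
    # Normalization so that the five probabilities sum to one, as in Eq. (10).
    return {x: v / total for x, v in raw.items()}


def multi_realize(probs, rng):
    """Draw one state at random according to `probs`, as in Eq. (11)."""
    weights = [probs[x] for x in STATES]
    return rng.choices(STATES, weights=weights, k=1)[0]


# Hypothetical parameters; simulate one node for a few periods.
rng = random.Random(42)
state = "S"
history = [state]
for _ in range(20):
    probs = transition_probs(state, alpha_i=0.3, beta=0.5, gamma=0.2, eta=0.05)
    state = multi_realize(probs, rng)
    history.append(state)
print("".join(history))
```

Because each node is realized independently from its own probability vector, the same two functions can be applied to every node of the crowd network in each period.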

3. Cascading failure model of HIS’s

3.1. Indicators of cascading failures of HIS’s

Let the hospitals in the city be regarded as nodes, denoted as $H$, and the traffic roads between hospitals be regarded as edges, denoted as $L$. The hospital infrastructure network is then denoted as $U = G (H, L)$. Suppose there are $M$ hospital nodes and the adjacency matrix of $U$ is ${[M_{rs}]}_{M \times M}$, where $M_{rs} = 1$ if hospital nodes $r$ and $s$ are joined by an edge $(r, s \in H)$, and $M_{rs} = 0$ otherwise.

A hospital node is responsible for patients in its catchment area, which is defined as the area where the residents it serves live. The nearest neighbor classification method is used to assign individual nodes to the hospital node in their proximity, as shown in Fig. 2. The number of residents served by hospital node $r$ is $I_{r}$. Patients with COVID-19 symptoms in a hospital catchment area will first choose that hospital for consultation; at this time, they enter the HIS as the load of the hospital node.

Fig. 2 — The crowd in the hospital catchment area.

The hospital admission rate is defined as the proportion of the population from the latent state to the exposed state in the population in the hospital catchment area per period, and it is denoted as $φ_{r}$ . The hospital discharge rate is defined as the proportion of the population that transitions from an exposed state to a recovered state in the population in the hospital catchment area per period, and it is denoted as $ω_{r}$ .

The outbreak rate $μ_{r} (k)$ is an indicator of the hospital load and is defined as the ratio of the hospital admission rate to discharge rate:

μ_{r} (k) = \frac{φ_{r} (k)}{ω_{r} (k)}

(12)

The threshold of the outbreak rate is 1. When the outbreak rate of a hospital node at one or more locations is larger than 1, i.e., $μ_{r} (k) > 1$, the number of admissions is larger than the number of discharges, the total traffic flow of the hospital network rises, and the corresponding hospital node is said to be under attack.

The basic reproduction number quantifies the transmissibility of an infectious disease and is a macroscopic concept that is widely used in infectious disease models. In this model, the basic reproduction number depends on the outbreak rate of a hospital and is directly proportional to it. The outbreak rate is the ratio of hospital admissions to discharges per unit time, representing the average level of outbreaks over time, and this indicator already incorporates the effects of fluctuations in demand.

The state of a hospital is a comprehensive overall effect of the interaction of health staff and various types of health care resources. A visual representation of the state of the hospital is the outbreak rate of the hospital. Under normal circumstances, hospital infrastructure is in equilibrium: the demand and supply of medical resources per unit of time are basically equal, the discharge rate is equal to the admission rate, and the outbreak rate is equal to 1. When the outbreak rate is greater than 1, the supply of medical resources per unit of time is insufficient, reflecting an active outbreak during this period.

The node load $Q_{r}$ and node capacity $C_{r}$ of the hospital are used to describe the workload of a node and working capacity in the process of network failures, respectively. The excess load $d_{r} (k)$ refers to the part of the node load exceeding the node capacity at the $k$ -th period.

d_{r} (k) = Q_{r} (k) - C_{r}

(13)

where $Q_{r} (k)$ is the workload of hospital node $r$ at the $k$ -th period, $C_{r}$ is the working capacity of hospital node $r$ , $d_{r} (k)$ is the part of the node load exceeding the node capacity at the $k$ -th period.

The provision of resources such as medical staff and beds in a hospital needs to account for the construction cost and the needs of the surrounding residents. Therefore, it is assumed that the node capacity is proportional to the number of residents in the catchment area of hospital $r$ (Albert, Jeong & Barabasi, 2001),

C_{r} = I_{r} (1 + ρ_{0}),

(14)

where $ρ_{0}$ is an adjustable parameter that controls the capacity of the node, $ρ_{0} \geq 0$, $I_{r}$ is the number of residents served by hospital node $r$ (the number of elements in the set $R_{r}$), and $C_{r}$ is the capacity of hospital node $r$.

3.2. Process analysis of cascading failures of HIS’s

Consistent with people's tendency to seek care close to home, the first choice of all people at the initial moment is the nearest hospital for the treatment of COVID-19. The crowd moves along the traffic roads between hospitals and has access to information about traffic conditions and hospitals, including the traffic flow on the roads, the remaining capacity of each hospital, and the road structure at a given moment. Under normal circumstances, the number of admissions and the number of discharges per period are the same, and the total traffic volume of the entire hospital network per period is a fixed value. When the outbreak rate of a hospital node at one or more locations exceeds the threshold, the number of admissions exceeds the number of discharges, the total traffic flow of the hospital network rises, and the corresponding hospital node is said to be under attack. In this paper, interventions such as widespread disinfection and epidemic prevention propaganda are not considered. The only actions occurring in the population are daily activities, promptly seeking medical attention when symptoms are detected, and choosing a hospital.

In the process of cascading failures of hospital infrastructure networks, the hospital node has only normal and failed states. The normal state means that the hospital still has free medical resources. The failed state means that the hospital accepts too many patients and the node load $Q_{r}$ exceeds node capacity $C_{r}$ . By comparing the node load and node capacity, the state of the hospital node can be judged. When the node load exceeds the capacity of the node and the number of medical resources is in short supply, the node will fail.

Therefore, the specific process of node failures is as follows. When the outbreak rate of a node is greater than 1, the number of newly increased patients in the hospital is more than that of cured patients, thus generating a load increase. After the hospital receives the load increment, if the load of the node exceeds its capacity, the node fails; otherwise, the node is in a normal state.
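The failure check described above combines Eqs. (12)-(14). A minimal sketch is given below; the function names and the numbers are hypothetical, and returning infinity when the discharge rate is zero is an assumption, since the source does not discuss that case.

```python
def outbreak_rate(admission_rate, discharge_rate):
    """Eq. (12): ratio of the admission rate to the discharge rate."""
    if discharge_rate == 0:
        # Assumption: with no discharges the outbreak rate is unbounded.
        return float("inf")
    return admission_rate / discharge_rate


def node_capacity(residents_served, rho0):
    """Eq. (14): capacity proportional to the residents served, rho0 >= 0."""
    return residents_served * (1 + rho0)


def excess_load(load, capacity):
    """Eq. (13): part of the load exceeding the capacity (negative if spare)."""
    return load - capacity


def has_failed(load, capacity):
    """A hospital node fails when its load exceeds its capacity."""
    return load > capacity


# Hypothetical hospital serving 1000 residents with rho0 = 0.2.
capacity = node_capacity(1000, 0.2)
mu = outbreak_rate(admission_rate=0.06, discharge_rate=0.03)
load = 1250
print(mu > 1, has_failed(load, capacity), excess_load(load, capacity))
```

Here the outbreak rate above 1 signals that the load is growing, while the failure itself is decided only by comparing the accumulated load with the capacity.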

To reflect the actual supply and demand mechanism of the hospital, the load of the failed node is distributed according to the actual situation. When the number of patients admitted by a hospital reaches its saturation point, the hospital will continue to treat those patients that have already been admitted. The traffic road connecting the hospital with other hospitals will not be abandoned. As the hospital does not have extra beds, medical equipment or other resources, new patients cannot be admitted or treated there, and these patients will go to other hospitals for treatment. Those patients who are receiving treatment in the hospital will continue being treated. Therefore, the failed node is not removed, but acts as a transit node that no longer receives load. The failed nodes can send out loads and can also be used as a transit node for other loads to move. Loads within the capacity of the node are received by the node and are no longer involved in the subsequent process. The load in excess of the node's capacity is seen as not being admitted to the hospital and needs to be redistributed.

Patients who have not been admitted by a hospital are more likely to choose a nearby hospital with more remaining capacity as the destination. Considering travel time and remaining capacity together, the attractiveness index of hospital $s$ in case of the failure of node $r$ is proposed as $A_{s}^{r}$ . To maximize the benefit of moving the excess load to other normal nodes for treatment, the redistribution method considering the destination selection is carried out, as shown in Eqs. (15)-(17),

A_{s}^{r} (k) = \frac{C_{s} - Q_{s} (k)}{T_{r \to s}}

(15)

δ_{r \to s} (k) = \frac{A_{s}^{r} (k)}{\sum_{s \in H_{1} (k)} A_{s}^{r} (k)}

(16)

and

\sum_{s \in H_{1} (k)} δ_{r \to s} (k) = 1

(17)

where $A_{s}^{r} (k)$ is the index of attractiveness of node $s$ to failed node $r$ proposed in this paper, $C_{s}$ is the capacity of hospital node $s$ , $Q_{s} (k)$ is the load of hospital node $s$ at the $k$ -th period, $T_{r \to s}$ is the shortest travel time from failed node $r$ to node $s$ , $δ_{r \to s} (k)$ is the ratio of the load traveling from the failed node $r$ to node $s$ to the total excess load of node $r$ , and $H_{1} (k)$ is the set of normal hospital nodes at the $k$ -th period.

All excess loads depart from the currently failed node and satisfy the starting traffic conservation condition.

d_{r \to s} (k) = d_{r} (k) δ_{r \to s} (k)

(18)

and

\sum_{s \in H_{1} (k)} d_{r \to s} (k) = d_{r} (k)

(19)

where $d_{r} (k)$ represents the excess load that needs to be removed from the failed node $r$ at the $k$ -th period. $d_{r \to s} (k)$ represents the excess load that needs to be moved from the failed node $r$ to the node $s$ at the $k$ -th period.
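As an illustration of Eqs. (15)-(19), the following minimal Python sketch computes the attractiveness index, the redistribution ratios and the resulting flows for one failed node. The function name `redistribute_excess`, the dictionary-based inputs and the numbers in the example are assumptions made for illustration; they are not taken from the simulation in this paper.

```python
def redistribute_excess(excess, capacity, load, travel_time):
    """Split the excess load d_r(k) of a failed node r over the normal nodes.

    excess      -- d_r(k), the excess load leaving the failed node r
    capacity    -- {s: C_s} for every normal node s in H_1(k)
    load        -- {s: Q_s(k)} for the same nodes
    travel_time -- {s: T_{r->s}}, shortest travel time from r to s
    """
    # Eq. (15): attractiveness = remaining capacity / travel time.
    attractiveness = {
        s: (capacity[s] - load[s]) / travel_time[s] for s in capacity
    }
    total = sum(attractiveness.values())
    if total <= 0:
        raise ValueError("no normal node has remaining capacity")
    # Eq. (16): redistribution ratios, which sum to 1 as required by Eq. (17).
    shares = {s: a / total for s, a in attractiveness.items()}
    # Eq. (18): flows, which sum to the excess load as required by Eq. (19).
    flows = {s: excess * share for s, share in shares.items()}
    return shares, flows


# Hypothetical example: two normal hospitals receive 30 excess patients.
shares, flows = redistribute_excess(
    excess=30,
    capacity={"s1": 100, "s2": 200},
    load={"s1": 60, "s2": 120},
    travel_time={"s1": 10, "s2": 20},
)
print(shares, flows)
```

In this example the farther hospital has twice the remaining capacity but also twice the travel time, so both hospitals have the same attractiveness and each receives half of the excess load.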

After selecting the destination node for all the excess load, it is necessary to continue selecting the shortest path to the destination node to complete the flow distribution. The BPR impedance function is used to describe the crowding effect. Impedance is related to travel time and road congestion, as shown in Eq. (20).

T_{a} (x_{a}) = T_{a} (0) (1 + ρ_{1} {(\frac{x_{a}}{C_{a}})}^{ρ_{2}})

(20)

where $T_{a} (x_{a})$ is the actual travel time of the selected route section $a$ . $T_{a} (0)$ is the travel time when no one passes by on road section $a$ . $x_{a}$ is the excess load of the selected road section $a$ , $x$ is the set of excess loads for all sections, $x_{a} > 0$ . $C_{a}$ is the traffic capacity of section $a$ . $ρ_{1}$ and $ρ_{2}$ are adjustable parameters.
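Eq. (20) can be evaluated directly. The short Python sketch below does so; the default values 0.15 and 4 for $ρ_{1}$ and $ρ_{2}$ are the values conventionally used with the BPR function and are an assumption here, since this section only states that the two parameters are adjustable.

```python
def bpr_travel_time(free_flow_time, flow, capacity, rho1=0.15, rho2=4.0):
    """Travel time on road section a according to Eq. (20).

    free_flow_time -- T_a(0), travel time on the empty section
    flow           -- x_a, the excess load using the section
    capacity       -- C_a, the traffic capacity of the section
    rho1, rho2     -- adjustable parameters (defaults are assumed, not from the paper)
    """
    if capacity <= 0:
        raise ValueError("capacity must be positive")
    return free_flow_time * (1.0 + rho1 * (flow / capacity) ** rho2)


# Hypothetical section with a 10-minute free-flow time and capacity 100.
for x in (0, 50, 100, 150):
    print(x, round(bpr_travel_time(10.0, x, 100.0), 3))
```

With these assumed parameters the travel time is almost unchanged while the section is lightly used and rises steeply once the load approaches or exceeds the capacity, which is the crowding effect described above.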

Based on Eqs. (15) - (20) and the user balance distribution model, the distribution method is constructed, as shown in Eqs. (21)-(23),

\min Z (x) = \min \sum_{a \in A} \int_{0}^{x_{a}} T_{a} (w) d w

(21)

\sum_{l \in L_{r \to s}} h_{r \to s}^{l} (k) = d_{r \to s} (k)

(22)

and

x_{a} = \sum_{l \in L_{r \to s}} \sum_{r, s \in V} D_{r \to s}^{a, l} (k) h_{r \to s}^{l} (k)

(23)

where $A$ is the set of road sections, and $T_{a} (w)$ is the impedance function on section $a$ . Eq. (21) minimizes the sum, over all sections, of the integral of the impedance function from zero to the section load. $L_{r \to s}$ is the set of feasible routes between nodes $r$ and $s$ , $l$ is one of the routes in $L_{r \to s}$ , and $h_{r \to s}^{l} (k)$ is the excess load of the $l$ -th route between nodes $r$ and $s$ at the $k$ -th period, $h_{r \to s}^{l} (k) \geq 0$ . Eq. (22) indicates that the excess load from node $r$ to node $s$ is the sum of the excess loads on all feasible routes. $D_{r \to s}^{a, l} (k)$ indicates whether the $l$ -th route between nodes $r$ and $s$ uses road section $a$ at the $k$ -th period: if the $l$ -th route contains road section $a$ , then $D_{r \to s}^{a, l} (k) = 1$ ; otherwise, $D_{r \to s}^{a, l} (k) = 0$ .

When the user balance distribution reaches its equilibrium, all the individuals in the excess load choose the route with the shortest travel time. At equilibrium, the travel times of all selected routes are equal and no greater than those of any unselected route. After the load is redistributed, if the load of a receiving node exceeds its capacity, that node fails, and the cascading failure continues.
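The equilibrium condition can be illustrated on the smallest possible case: one failed node $r$ , one destination $s$ , and two parallel routes that each consist of a single road section. For this special case, the minimizer of Eq. (21) is the split at which both routes have the same travel time, unless one route is faster even when it carries the whole load. The Python sketch below finds that split by bisection. The function names, the route data and the BPR parameter values are assumptions for illustration, and the sketch is not the assignment procedure used in the simulation.

```python
def bpr(t0, x, cap, rho1=0.15, rho2=4.0):
    """BPR travel time of Eq. (20) with assumed parameter values."""
    return t0 * (1.0 + rho1 * (x / cap) ** rho2)


def two_route_equilibrium(demand, route1, route2, tol=1e-9):
    """Split `demand` over two parallel routes so that used routes have equal time.

    Each route is a pair (free_flow_time, capacity). Returns (x1, x2).
    """
    def gap(x1):
        # Travel-time difference; it increases monotonically with x1.
        return bpr(route1[0], x1, route1[1]) - bpr(route2[0], demand - x1, route2[1])

    if gap(demand) <= 0:      # route 1 is faster even when it carries everything
        return float(demand), 0.0
    if gap(0.0) >= 0:         # route 2 is faster even when it carries everything
        return 0.0, float(demand)
    lo, hi = 0.0, float(demand)
    while hi - lo > tol:      # bisection on the monotone gap function
        mid = (lo + hi) / 2.0
        if gap(mid) < 0:
            lo = mid
        else:
            hi = mid
    x1 = (lo + hi) / 2.0
    return x1, demand - x1


# Hypothetical routes: route 2 has a longer free-flow time than route 1.
x1, x2 = two_route_equilibrium(100.0, (10.0, 50.0), (15.0, 50.0))
t1, t2 = bpr(10.0, x1, 50.0), bpr(15.0, x2, 50.0)
print(round(x1, 3), round(x2, 3), round(t1, 3), round(t2, 3))
```

Because both routes are used in this example, their travel times coincide at the solution, which is the equal-travel-time property stated above.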

3.3. Cascading failure model of HIS’s

The specific implementation phases of the cascading failure model of a HIS network are as follows.

Phase 1: At the initial moment, a small number of individuals are randomly selected to be set as patients at the latent state and begin to spread the virus. The hospital infrastructure network is established. According to the nearest neighbor classification method, the population is classified to different hospital nodes.

Phase 2: Determine the state of all patient nodes. Patients with an infected state enter their associated hospital node in the catchment area according to the nearest neighbor classification. If there is no failure in the corresponding hospital node, the patient load can enter the associated hospital smoothly. If the associated hospital node has failed, the infected person is regarded as overloaded and enters phase 5.

Phase 3: Whether the hospital node outbreak rate exceeds the threshold or not is judged. When the outbreak rate $μ_{r}$ of one or more hospital nodes exceeds the threshold, the hospital nodes can be regarded as being attacked. If the outbreak rate of all hospital nodes is lower than the threshold, no node will be attacked.

Phase 4: Determine whether an attacked hospital node fails. When a hospital node is attacked, the number of new patients entering the hospital network increases, and this load increment goes to the attacked hospital node $r$ . When the load $Q_{r}$ of the attacked hospital node $r$ exceeds its capacity $C_{r}$ , node $r$ fails. The failed node is processed so that it no longer receives load but retains the ability to transport loads: the load within its capacity is absorbed, and the excess load is redistributed. If the load of every node is less than its capacity, no failed node is generated.

Phase 5: Destination node selection and flow distribution for excess load. The user balance distribution method considering destination selection is used to redistribute the excess load. Select a new hospital node with a shorter arrival time and more remaining capacity as the destination for the excess load, and select the shortest path to the new hospital node to complete the flow distribution.

Phase 6: Whether the failure is terminated is judged. If the load of all nodes after redistribution does not exceed the node capacity, the failure is terminated. If there is a new node whose load is greater than the capacity of the node after redistribution, a new failed node will be generated, the hospital network will be updated, and phase 2 will be returned.

In the above six phases, phase 1 is executed at the initial moment, phase 2 - phase 6 are executed once in each time period.
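Phases 4 to 6 can be sketched as a simple loop. The Python code below is a deliberately reduced illustration: it omits the epidemic propagation of Phases 1 to 3 and the route choice of Eqs. (20)-(23), uses fixed travel times between hospitals, and splits the excess load by the attractiveness index of Eq. (15). The function name `cascade`, the `unserved` counter for excess load that no normal node can take, and all numbers are assumptions for illustration.

```python
def cascade(capacity, load, travel_time, max_rounds=100):
    """Run a simplified overload cascade and return (failed, load, unserved)."""
    load = dict(load)          # work on a copy of the initial loads
    failed = set()
    unserved = 0.0             # excess load that no normal node could absorb
    for _ in range(max_rounds):
        # Phase 4: a node fails when its load exceeds its capacity.
        newly_failed = [r for r in capacity
                        if r not in failed and load[r] > capacity[r]]
        if not newly_failed:
            break              # Phase 6: no new failure, so the cascade stops
        failed.update(newly_failed)
        for r in newly_failed:
            excess = load[r] - capacity[r]
            load[r] = capacity[r]      # load within capacity stays at node r
            # Phase 5: split the excess over normal nodes by attractiveness.
            attract = {s: max(capacity[s] - load[s], 0.0) / travel_time[r][s]
                       for s in capacity if s not in failed}
            total = sum(attract.values())
            if total == 0:
                unserved += excess
                continue
            for s, a in attract.items():
                load[s] += excess * a / total
    return failed, load, unserved


# Hypothetical three-hospital network in which A is initially overloaded.
capacity = {"A": 100, "B": 100, "C": 100}
initial_load = {"A": 150, "B": 90, "C": 50}
travel_time = {"A": {"B": 5, "C": 20},
               "B": {"A": 5, "C": 10},
               "C": {"A": 20, "B": 10}}
failed, final_load, unserved = cascade(capacity, initial_load, travel_time)
print(sorted(failed), final_load, unserved)
```

In this example hospital B is close to A, so it attracts a large share of A's excess load despite having little remaining capacity, is pushed over its own capacity, and fails in the second round. Its excess then moves on to C, which is the cascading effect described in the phases above.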

4. Resilience optimization model for HIS’s

4.1. Restoration of hospital

The analysis in this section is only for failed nodes. Node restoration is defined as the process of bringing a failed node back to a normal state. If the failed node is restored only by increasing the node capacity, the failed node can certainly continue to receive more patients in a short period, and the performance of the node will be improved in a short period. However, its outbreak rate remains unacceptably high, with far more new hospital admissions than new hospital discharges per period. The node load will inevitably exceed the node capacity again within a limited time. Therefore, in order to restore failed nodes, restoration measures to reduce the outbreak rate and increase node capacity should be implemented at the same time. The specific measures are as follows.

1)
Improving the hospital discharge rate $ω_{r}$ by increasing the production of medical resources such as personal protective equipment (PPE) and disinfectants. This type of medical resources will be consumed in a short period and needs to be supplied to hospitals at a high frequency. By increasing the inventory of such resources of the failed node, the outbreak rate can be reduced. The failed node can be gradually restored to a normal state.
2)
Increasing the node capacity of a hospital by requisitioning hotels near the hospital and establishing temporary hospitals. As the capacity of the node increases, the node can accommodate more patients, thereby reducing the number of loads transferred out of the node. The capacity that can be added to each hospital node is a finite fixed value.

Only one node can be restored within a period; no measures are taken on the other nodes. After restoration measures have been performed on a node multiple times, its outbreak rate gradually decreases until it drops below 1, and the node is gradually restored to a normal state.

4.2. The restoration benefits of hospital

When the outbreak rate of hospital node $r$ is greater than 1, the number of new hospital admissions is greater than that of new hospital discharges per period. The difference between the number of new hospital admissions and the number of new hospital discharges is the restoration demand of hospital node $r$ . The restoration demand of hospital node $r$ is denoted as ${Δ D}_{r} (k)$ , which can account for the net increase of patients in a hospital node per period, as shown in Eq. (24).

{Δ D}_{r} (k) = (φ_{r} (k) - ω_{r} (k)) N_{r}

(24)

where $N_{r}$ represents the number of residents in the catchment area of the hospital node $r$ , $φ_{r} (k)$ is the hospital admission rate of node $r$ at the $k$ -th period, $ω_{r} (k)$ is the hospital discharge rate of node $r$ at the $k$ -th period.

Between phases 4 and 5 of cascading failure model in Section 3.3, restoration actions are to be executed on the failed node, i.e., increase the hospital node capacity. There are 3 cases after performing the restoration actions, as shown below.

(1)
Action effect 1 (Fig. 3 ). No restoration measures are implemented on the failed hospital node. All the excess load is transferred to other normal hospitals. The transfer-out load of node $r$ is ${Δ D}_{r} (k) = \sum_{s \in H_{1}} d_{r \to s} (k)$ , where $\sum_{s \in H_{1}} d_{r \to s} (k)$ represents the excess load transferred from node $r$ to the normal nodes at the $k$ -th period.
(2)
Action effect 2 (Fig. 4 ). After performing restoration measures on the failed node, the node has a transfer-out load and a transfer-in load.

After restoration measures are performed on the failed node, the capacity of the failed hospital increases by $Δ C_{r} (k)$ . When the newly added capacity $Δ C_{r} (k)$ is less than $Δ D_{r} (k)$ , the load $d_{ri}$ that cannot be accommodated by node $r$ becomes the transfer-out load to hospital node $i$ . In this case, $Δ Y_{r} (k)$ is the transfer-in load, with $Δ Y_{r} (k) = Δ C_{r} (k)$ , and $\sum_{s \in H_{1}} d_{r \to s} (k)$ represents the excess load transferred from node $r$ to the normal nodes $s$ at the $k$ -th period, so that $\sum_{s \in H_{1}} d_{r \to s} (k) + Δ Y_{r} (k) = Δ D_{r} (k)$ .

(3)
Action effect 3 (Fig. 5 ). After performing restoration measures on the failed node, the node only has a transfer-in load.

After restoration measures are performed on the failed node, its capacity increases. The newly added capacity $Δ C_{r} (k)$ is able to accommodate the full restoration demand $Δ D_{r} (k)$ , and there may be spare capacity that can be used to accommodate the load $d_{s \to r}$ transferred from other failed nodes. In this case, $\sum_{s \in H_{2}} d_{s \to r} (k) + Δ Y_{r} (k) = Δ C_{r} (k)$ , and $Δ Y_{r} (k) = Δ D_{r} (k)$ . $Δ Y_{r} (k)$ is the transfer-in load of hospital node $r$ from itself, and $\sum_{s \in H_{2}} d_{s \to r} (k)$ represents the excess load transferred from the failed nodes $s$ to node $r$ at the $k$ -th period.
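The three action effects amount to a case distinction between the restoration demand of Eq. (24) and the newly added capacity. The Python sketch below makes that distinction explicit. The function name, the returned field names and the numerical rates are assumptions for illustration; in particular, `room_for_transfer_in` is only the spare capacity that could take load from other failed nodes, not the load that actually arrives.

```python
def restoration_effect(admission_rate, discharge_rate, residents, added_capacity):
    """Classify one period of restoration for a failed node r.

    admission_rate, discharge_rate -- phi_r(k) and omega_r(k)
    residents                      -- N_r, residents in the catchment area
    added_capacity                 -- Delta C_r(k), zero if no measure is taken
    """
    demand = (admission_rate - discharge_rate) * residents   # Eq. (24)
    if demand <= 0:
        raise ValueError("no restoration demand: admissions do not exceed discharges")
    if added_capacity <= 0:
        case, self_absorbed = 1, 0.0                    # action effect 1
    elif added_capacity < demand:
        case, self_absorbed = 2, float(added_capacity)  # action effect 2
    else:
        case, self_absorbed = 3, demand                 # action effect 3
    return {
        "case": case,
        "demand": demand,                               # Delta D_r(k)
        "self_absorbed": self_absorbed,                 # Delta Y_r(k)
        "transfer_out": demand - self_absorbed,         # sum of d_{r->s}(k)
        "room_for_transfer_in": max(added_capacity - demand, 0.0),
    }


# Hypothetical node: 10,000 residents, net admission rate of 0.1% per period.
for extra in (0, 6, 15):
    print(extra, restoration_effect(0.004, 0.003, 10_000, extra))
```

With a restoration demand of about 10 patients per period, no added capacity gives action effect 1, 6 added beds give action effect 2 with about 4 patients transferred out, and 15 added beds give action effect 3 with about 5 beds left for patients from other failed nodes.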

A node transfers patients to other nodes, indicating that the node is not capable of receiving all patients within its catchment area and can be considered a loss of performance for that node. If the node is able to meet the access needs of all patients within its catchment area, or even accept patients from other nodes, this can be considered as an increase in performance for that node. Therefore, the loss of performance of a node can be expressed in terms of transfer-out load. The more transfer-out load, the more performance loss. The increase in the performance of a node can be expressed in terms of transfer-in load. The more transfer-in load, the more performance increase.

Resilience theory is used to describe the ability of nodes to cope with emergencies, as well as to quantify the cumulative effect of restoration measures on node performance recovery over previous periods.

In this paper, the hospital node resilience is defined as the ratio of the cumulative performance gain to the performance loss of a node at the $k$ -th period. The greater the ratio, the greater the resilience of the node. The resilience of hospital node $r$ at the $k$ -th period is calculated as shown in Eq. (25).

g^{r} (k) = \sum_{p = 0}^{k} \frac{Δ Y_{r} (p) + \sum_{s \in H_{2}} d_{s \to r} (p)}{\sum_{s \in H_{1}} d_{r \to s} (p)}

(25)

where $k$ is an integer with $k > 0$ , and $g^{r} (k)$ denotes the resilience of hospital node $r$ at the $k$ -th period. $Δ Y_{r} (p)$ represents the transfer-in load from itself at the $p$ -th period, where $p$ is an integer, $p > 0$ . $\sum_{s \in H_{2}} d_{s \to r} (p)$ represents the excess load transferred from the failed nodes $s$ to node $r$ at the $p$ -th period. $\sum_{s \in H_{1}} d_{r \to s} (p)$ represents the excess load transferred from node $r$ to the normal nodes $s$ at the $p$ -th period. From Eq. (25), we can see that hospital resilience $g^{r} (k)$ increases with $Δ Y_{r} (p)$ and $\sum_{s \in H_{2}} d_{s \to r} (p)$ , and decreases with $\sum_{s \in H_{1}} d_{r \to s} (p)$ . The capacity and speed of access to a hospital can affect all three of these indicators. Resilience, as an inherent property of hospitals, can therefore be improved by implementing the two kinds of restoration measures in Section 4.1.
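Eq. (25) is a running sum of per-period ratios, which the following Python sketch computes for a short series. The function name and the example series are assumptions for illustration. The ratio is undefined in a period with no transfer-out load, and this section does not state how such periods are treated, so the sketch simply raises an error in that case rather than guessing a convention.

```python
from itertools import accumulate


def node_resilience(self_absorbed, transfer_in, transfer_out):
    """Return [g(0), g(1), ..., g(k)] for one hospital node, following Eq. (25).

    self_absorbed -- Delta Y_r(p) for p = 0..k
    transfer_in   -- sum over failed nodes s of d_{s->r}(p) for p = 0..k
    transfer_out  -- sum over normal nodes s of d_{r->s}(p) for p = 0..k
    """
    if not (len(self_absorbed) == len(transfer_in) == len(transfer_out)):
        raise ValueError("all three series must cover the same periods")
    ratios = []
    for y, d_in, d_out in zip(self_absorbed, transfer_in, transfer_out):
        if d_out == 0:
            # Eq. (25) divides by the transfer-out load; no convention is assumed.
            raise ValueError("transfer-out load is zero in some period")
        ratios.append((y + d_in) / d_out)
    return list(accumulate(ratios))   # cumulative sum over p = 0..k


# Hypothetical series: restoration starts in period 1.
g = node_resilience(self_absorbed=[0, 6, 8],
                    transfer_in=[0, 0, 0],
                    transfer_out=[10, 4, 2])
print(g)
```

In the example the node absorbs more of its own demand and transfers less out in each period, so each period contributes a larger ratio and the cumulative resilience grows.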

4.3. Resilience optimization of HIS’s based on Markov decision process

In the restoration process, the states of the nodes are not merely normal and failed: the restoration process, with its gradually decreasing outbreak rate, can be discretized into multiple states with different outbreak rates (Zeng, Fang, Zhai & Du, 2021). Let $S (k)$ denote the state of the node at the $k$ -th period, which reflects the restoration degree of the hospital, $S (k) \in S = \{0, 1, 2, \ldots, m\}$ . The larger the value, the larger the outbreak rate of the node. $S (k) = m$ corresponds to the level of the outbreak rate when the node fails, and $S (k) = 0$ means that the node is in a normal state, corresponding to an outbreak rate less than or equal to 1. The node state transition process during the restoration process is shown in Fig. 6 .

Fig. 6 — The process of node state transfer during the restoration process.

In order to optimize the resilience of hospital nodes, the Markov decision process (MDP) approach is used. The MDP takes the form of a quadruplet $\{S, (B (i), i \in S), P, R\}$ (see Supporting Information for details on modeling the Markov decision process).

$R (k) = \{r (i, b), b \in B (i), i \in S\}$ , where $r (i, b)$ is the reward function and $R (k)$ denotes the expected reward received when a node is in state $i$ at the $k$ -th period and takes action $b$ . In this paper, $R (k)$ is defined as the sum of the resilience of all nodes at the $k$ -th period, as shown in Eq. (26),

R (k) = \sum_{r \in H} g^{r} (k) = \sum_{r \in H} \sum_{p = 0}^{k} \frac{Δ Y_{r} (p) + \sum_{s \in H_{2}} d_{s \to r} (p)}{\sum_{s \in H_{1}} d_{r \to s} (p)}

(26)

With the objective of maximizing the sum of the resilience of all nodes, the node restoration strategy is found for each moment, as in Eq. (27),

\max R (k) = \sum_{r \in H} g^{r} (k) = \sum_{r \in H} \sum_{p = 0}^{k} \frac{Δ Y_{r} (p) + \sum_{s \in H_{2}} d_{s \to r} (p)}{\sum_{s \in H_{1}} d_{r \to s} (p)}

(27)

Based on the COVID-19 propagation model, the cascading failure model and the resilience optimization model, we can obtain the Markov reward process-based framework for resilience optimization of HIS’s against COVID-19, as shown in Fig. 7 .

Fig. 7 — The Markov reward process-based framework for resilience optimization of HIS’s against COVID-19.

5. Application

5.1. Data and methods of simulation

Real data from Lucheng District and Ouhai District of Wenzhou City, Zhejiang Province, China, are used as examples for simulation. To avoid disclosing national security information, we name Wenzhou City as City A, Lucheng District as District A, and Ouhai District as District B. There are two districts, $A 1$ and $A 2$ , in City $A$ , for example. There are six hospitals in District $A 1$ and four in District $A 2$ , with the number of hospital beds shown in Table 1S (shown in Supporting Information).

Table 1.

The MDP-based restoration strategies for optimal resilience at the 35-th period.

Step	State of the Hospital 5	State of the Hospital 9	State of the Hospital 10	Strategy (effective at the next step)
0	5	5	5	[0,1,0]
1	5	3	5	[1,0,0]
2	4	3	5	[0,1,0]
3	4	2	5	[1,0,0]
4	2	2	5	[0,0,1]
5	2	2	4	[0,1,0]
6	2	1	4	[0,1,0]
7	2	0	4	[0,0,1]
8	2	0	3	[1,0,0]
9	2	0	3	[1,0,0]
10	1	0	3	[0,0,1]
11	1	0	2	[0,0,1]
12	1	0	2	[1,0,0]
13	0	0	2	[0,0,1]
14	0	0	1	[0,0,1]
15	0	0	0

For the research, the population data of each street in City $A$ was used to reflect the reality of the population distribution. The administrative centers of street settlements were used as a proxy for the center of gravity of the population. 14 street townships are within the area of District $A 1$ , and 13 street townships are within the area of District $A 2$ , as shown in Table 2S (shown in Supporting Information), with data from the 2019 Statistical Yearbook of the Bureau of Statistics of City $A$ .

After collecting the data, an agent-based simulation model is built with the AnyLogic software package to simulate the virus propagation in the population and the cascading failure process of hospital nodes. Pathmind, a SaaS platform that enables businesses to apply reinforcement learning to real-world scenarios without data science expertise, was integrated to enable MDP-driven node resilience optimization. The simulation process is as follows.

Step 1: Constructing a hospital infrastructure network for Districts $A 1$ and $A 2$ in City $A$ , as shown in Fig. 8 . The blue building icons in the GIS map represent hospitals, which are connected to each other by roads.

A network of hospital infrastructure in Districts $A 1$ and $A 2$ in City $A$ is created, using the hospitals as nodes and connecting roads as edges, as shown in Fig. 9 .

Fig. 9 — Hospital infrastructure network in Districts $A 1$ and $A 2$ .

Step 2: Refinement of the agent. To achieve the virus propagation and cascading failure process under the COVID-19 outbreak, the internal attributes and functions of community and hospital agents are set. As shown in Fig. 10 , the left community agent has different area names, area population and contains people agent, which represents the residential population. The right hospital agent has two attributes of node names and node capacity. Four variables are set for the hospital agent: the number of current patients, whether it is in a failed state or not, the hospital admission rate and hospital discharge rate that are set to record the operation of each node. The node load is represented by the current number of patients and is used to determine whether or not the node status is failed in the state diagram on the right.

Fig. 10 — Community agent (left) and Hospital agent (right).

As shown in Fig. 11 , the number of daily contacts, the action capacity and the spread capacity differ for each person within the community agent, as shown by the parameters in Fig. 11. The yellow state diagram on the right indicates the spread of the virus within the population. After going through the susceptible, latent and infectious states, individuals in the infectious state call the function findHos to find the nearest hospital node that has not failed and travel to it along the traffic path. Based on the proposed load allocation model shown in Eqs. (21)-(23), the function findHos in this paper is defined to allocate traffic for failed hospital nodes. The infectious individual reaches the node and begins treatment, and then enters the death state with a certain probability, or the cure state after a delay.

The cascading failure process is mainly reflected by the function findHos. When findHos is called by an infectious individual, all hospital nodes are stored in the set normalHos in Fig. 11 and the nearest node in the set is found. If the nearest hospital node has entered a failed state, it is removed from the set, and the search for the most suitable node in the set continues with the objective of being the closest and smallest. Traffic impedance along the route is not considered here. Once the most suitable node has been found, the function call is complete and the infectious individual travels to that node.
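The search described above can be sketched as follows. The original findHos is a function inside the AnyLogic model; the Python version below is only an illustration of the described logic, with an assumed data layout (a dictionary of hospitals carrying a `failed` flag and a table of travel times). It ranks candidates by travel time alone, ignores traffic impedance as stated above, and does not attempt to reproduce any further selection criterion of the original function.

```python
def find_hos(origin, hospitals, travel_time):
    """Return the nearest hospital that has not failed, or None if all have failed.

    origin      -- identifier of the community where the individual lives
    hospitals   -- {name: {"failed": bool}}
    travel_time -- {origin: {name: travel time from origin to that hospital}}
    """
    normal_hos = list(hospitals)       # start with every hospital node
    while normal_hos:
        nearest = min(normal_hos, key=lambda h: travel_time[origin][h])
        if not hospitals[nearest]["failed"]:
            return nearest
        normal_hos.remove(nearest)     # drop the failed node and keep searching
    return None


# Hypothetical community with three hospitals; the closest one has failed.
hospitals = {"H1": {"failed": True},
             "H2": {"failed": False},
             "H3": {"failed": False}}
travel_time = {"community": {"H1": 3, "H2": 8, "H3": 12}}
print(find_hos("community", hospitals, travel_time))
```

The individual skips the failed hospital H1 and is sent to H2, the nearest hospital that is still in a normal state.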

Step 3: Outputting node attribute data during cascading failures and integrating Pathmind for resilience optimization. Four metrics, namely the admission rate, the discharge rate, the node load, and whether the node has failed, are recorded and output for each hospital node in each period. The data are taken at three randomly selected periods: the 15-th, 35-th and 55-th. The nodes are divided into six states according to the range in which the outbreak rate of the node falls, as shown in Table 3S (shown in Supporting Information).
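The mapping from outbreak rate to state can be sketched with a threshold lookup. Table 3S is not reproduced in this section, so the intermediate thresholds below are hypothetical. Only two facts are taken from the text: state 0 corresponds to an outbreak rate less than or equal to 1, and the nodes reported in state 5 at the 35-th period have an outbreak rate greater than 1.2.

```python
from bisect import bisect_left

# Upper bounds of states 0..4. The values 1.0 and 1.20 follow the text;
# 1.05, 1.10 and 1.15 are hypothetical placeholders for Table 3S.
THRESHOLDS = [1.0, 1.05, 1.10, 1.15, 1.20]


def outbreak_state(rate):
    """Map an outbreak rate to a state in {0, 1, ..., 5}."""
    # bisect_left counts the thresholds strictly below `rate`, so a rate
    # equal to a threshold still belongs to the lower state.
    return bisect_left(THRESHOLDS, rate)


for rate in (0.8, 1.0, 1.02, 1.2, 1.3):
    print(rate, outbreak_state(rate))
```

Under these assumed thresholds, a node with an outbreak rate of 1.3 is in the failed state 5, and it returns to the normal state 0 only once its outbreak rate has dropped to 1 or below.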

The values assigned to the probability $P (j | i, b)$ of transferring to state $j$ after taking action $b$ in state $i$ are given in Supporting Information.

The Pathmind Helper is used to introduce the MDP into AnyLogic. Pathmind Helper is an AnyLogic palette item; it is dropped into the model and used to add MDP functions. Starting from the selected moment, the current state of the node (its outbreak rate) is observed and an action is taken on the node. Only one node is restored in each period, and the marginal benefit of taking the action at each moment is calculated.

Step 4: Uploading the simulation model to the Pathmind cloud. The model is trained with the objective of maximizing the resilience of all nodes to obtain the best action strategy. The strategy trained by Pathmind is then downloaded and the optimum is verified in AnyLogic.

5.2. Analysis of simulation results

The results of the cascading failure are shown in Fig. 12, Fig. 13 and Fig. 14. The black character icons represent infectious individuals; those walking on the road have left their places of residence to seek treatment at a hospital. The red building icons represent failed hospital nodes, the blue ones represent normal hospital nodes, and the yellow ones represent population residences. The number of failed hospital nodes increases with time, from 2 at the 15-th period to 3 at the 35-th period and finally to 5 at the 55-th period, which shows the cascading failure effect.

As can be seen in Fig. 12, Fig. 13 and Fig. 14, if restoration actions are not implemented, a HIS will suffer from serious cascading failures. Therefore, it is necessary to implement restoration actions for the HIS in time. Take the cascading failure case at the 35-th period as an example and calculate the optimal restoration strategy for resilience. The failed nodes at the 35-th period are hospitals 5, 9 and 10. All three nodes are in state 5, with an outbreak rate greater than 1.2. With the objective of maximizing the sum of the resilience of all nodes, the restoration strategy is obtained based on the MDP, as shown in Table 1. The actions [1, 0, 0], [0, 1, 0] and [0, 0, 1] represent restoration measures performed in the current step on hospitals 5, 9 and 10, respectively. The restoration strategy in Table 1 thus answers the question of which node should be repaired at each step. Only one failed node is repaired at each step, and the node state changes at the next step. Since the change of the hospital state is a stochastic process, a total of 15 restoration steps are needed to restore all three failed nodes to the normal state 0. From Table 1, nodes 9, 5, 9, 5, 10, 9, 9, 10, 5, 5, 10, 10, 5, 10 and 10 are repaired at steps 0–14, respectively.
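The correspondence between the strategy vectors in Table 1 and the repaired hospitals can be checked mechanically. The Python sketch below copies the strategy column of Table 1 and decodes each one-hot vector; the function and variable names are introduced here for illustration only.

```python
from collections import Counter

HOSPITALS = (5, 9, 10)   # positions in each strategy vector of Table 1

# Strategy column of Table 1, steps 0 to 14.
STRATEGY = [
    [0, 1, 0], [1, 0, 0], [0, 1, 0], [1, 0, 0], [0, 0, 1],
    [0, 1, 0], [0, 1, 0], [0, 0, 1], [1, 0, 0], [1, 0, 0],
    [0, 0, 1], [0, 0, 1], [1, 0, 0], [0, 0, 1], [0, 0, 1],
]


def repaired_hospital(action):
    """Decode a one-hot action vector into the hospital that is repaired."""
    if sorted(action) != [0, 0, 1]:
        raise ValueError("exactly one node can be restored in a step")
    return HOSPITALS[action.index(1)]


sequence = [repaired_hospital(a) for a in STRATEGY]
counts = Counter(sequence)
print(sequence)
print(dict(counts))
```

The decoded sequence reproduces the repair order listed above. Counting the entries shows that hospital 10 receives six of the 15 restoration steps, hospital 5 receives five and hospital 9 receives four.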

The line graph shown in Fig. 15 provides a visual illustration of the restoration measures for the 35th period, with actions 1, 2 and 3 representing the execution of restoration strategies for hospitals 5, 9 and 10, respectively, in the current step.

The restoration effect after applying the optimal action strategy is shown in Fig. 16, Fig. 17, Fig. 18 , respectively. The red buildings in the figure represent failed nodes, the green ones represent nodes that have been restored to a normal state and the blue ones represent normal nodes that have never failed. It can be seen that hospital 9 is restored to normal, then hospital 5 is restored to normal, and finally hospital 10 is restored to normal.

Fig. 18 — Hospital 10 returns to normal.

After applying the optimal action strategy, a graph illustrating the changes between node states during the MDP-based resilience optimal restoration process is shown in Fig. 19 .

From Fig. 19, we can see the state change of the three failed nodes in the 35-th period. We can see that node 9 is the first to return to its normal state, node 5 is the second to return to its normal state, and node 10 is the last to return to its normal state.

6. Conclusions and future work

6.1. Research content

This paper used the Markov decision process to analyze the resilience of the hospital infrastructure system (HIS) under the attack of the COVID-19. First, a COVID-19 propagation model based on node heterogeneity was developed, and a cascading failure model for HIS’s based on the virus propagation model was developed. Then, based on the virus propagation model and the cascading failure model, a resilience optimization model for HIS’s was established, which provides a framework for restoration of hospital infrastructure in response to public health emergencies. Finally, this paper illustrated the applicability of the model proposed in this paper with a real case, which is beneficial for readers to clearly understand the performance change of HIS before and after the occurrence of an emergency event and how to develop a remediation strategy.

6.2. Managerial implication

The results of this paper can provide a useful reference to people in the emergency management of HIS.

First, if the failed node is not repaired in time, the HIS undergoes cascading failures. Therefore, hospital managers should assess the states of their HIS’s in time and take timely measures such as increasing beds and speeding up access for medical treatment to reduce losses.

Second, managers should focus on the state of the hospital. A hospital is a single node of the HIS on the one hand, and a system of staff, patients and various medical resources interacting with each other on the other hand. There are many indicators to evaluate the state of a HIS from different perspectives. In this paper, we proposed the concept of the outbreak rate to evaluate the hospital states from the patients’ perspective, which can measure the attacks on the HIS’s over a period of time. Therefore, hospital managers should consider not only the number of admitted patients, but also the number of discharged patients. In addition, three scenarios of hospitals after maintenance were discussed to provide a basis for managers to evaluate the actual status of HIS’s.

Third, managers of a HIS should focus on the performance of the hospital infrastructure network when making decisions on restoration. When making maintenance decisions, there are many optimization objectives. Resilience, as an indicator of a system's ability to withstand external risks, can reduce the risks associated with the inevitable disruption of systems. Determining the restoration measures based on the resilience optimization model can ensure the maximum resilience of the HIS, i.e., the maximum capacity of the HIS to serve patients after a disaster. Therefore, managers of HIS’s can manage risk with the goal of optimizing the resilience of the entire system.

6.3. Future work

The study of the propagation characteristics of COVID-19 in this paper did not consider realistic factors such as isolation interventions and information dissemination or the multiple failed states of nodes during cascading failures. Those limitations will be studied in the future. Based on the study, further research can be conducted on the following concerns:

(1)
Introduce isolation interventions into the virus transmission model in the study.
(2)
Consider the state of the hospital as a continuous variable in the study of hospital resilience.
(3)
Introduce importance measures in the study to determine the maintenance priority of different hospitals.
(4)
Study the impact of different types of maintenance measures on hospital resilience.

CRediT authorship contribution

Hongyan Dui: Conceptualization, Methodology, Formal analysis, Funding acquisition, Supervision, Writing - Original draft. Kaixin Liu: Conceptualization, Methodology, Formal analysis, Investigation, Data curation, Writing - Original draft. Shaomin Wu: Methodology, Investigation, Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

The authors gratefully acknowledge the financial support for this research from the National Natural Science Foundation of China (Nos. 72071182, U1904211), the Key Science and Technology Program of Henan Province (No. 222102520019), the Program for Science & Technology Innovation Talents in Universities of Henan Province (No. 22HASTIT022), and the Program for young backbone teachers in Universities of Henan Province (No. 2021GGJS007).

Footnotes

^{Appendix A}

Supplementary data to this article can be found online at https://doi.org/10.1016/j.cie.2023.109158.

Data availability

No data was used for the research described in the article.

References

Achour N., Miyajima M., Pascale F., Price A.D.F. Hospital resilience to natural hazards: Classification and performance of utilities. Disaster Prevention & Management. 2014;23(1):40–52. doi: 10.1108/DPM-03-2013-0057.
Albert R., Jeong H., Barabasi A.L. Erratum: correction: Error and attack tolerance of complex networks. Nature. 2001;409(6819):542. doi: 10.1038/35054111.
Almoghathawi Y., Barker K. Component importance measures for interdependent infrastructure network resilience. Computers & Industrial Engineering. 2019;133:153–164. doi: 10.1016/j.cie.2019.05.001.
Barabadi A., Ghiasi M.H., Nouri A. Qarahasanlou A N. A Holistic View of Health Infrastructure Resilience before and after COVID-19. Archives of Bone and Joint Surgery. 2020;8:262–269. doi: 10.22038/abjs.2020.47817.2360.
Barasa E.W., Mbau R., Gilson L. What is resilience and how can it be nurtured? a systematic review of empirical literature on organizational resilience. International Journal of Health Policy & Management. 2018;7(6). doi: 10.15171/ijhpm.2018.06.
Broek M., Teunter R., Jonge B.D., Veldman J. Joint condition-based maintenance and condition-based production optimization. Reliability Engineering and System Safety. 2019;214. doi: 10.1016/j.ress.2021.107743.
Broek M., Teunter R.H., Jonge B.D., Veldman J. Joint condition-based maintenance and load-sharing optimization for two-unit systems with economic dependency. European Journal of Operational Research. 2021;295(3):1119–1131. doi: 10.1016/j.ejor.2021.03.044.
Enatsu Y., Messina E., Nakata Y., Muroya Y., Vecchio R.A. Global dynamics of a delayed sirs epidemic model with a wide class of nonlinear incidence rates. Journal of Applied Mathematics & Computing. 2012;39:15–34. doi: 10.1007/s12190-011-0507-y.
Gagliardi H.F., Alves D. Small-world effect in epidemics using cellular automata. Mathematical Population Studies. 2010;17(2):79–90. doi: 10.1080/08898481003689486.
Galaitsi S.E., Keisler J.M., Trump B.D., Linkov I. The need to reconcile concepts that characterize systems facing threats. Risk Analysis. 2020;41(1):3–15. doi: 10.1111/risa.13577.
Gao S., Wang H. Scenario prediction of public health emergencies using infectious disease dynamics model and dynamic Bayes. Future Generation Computer Systems. 2022;127:334–346. doi: 10.1016/j.future.2021.09.028.
Grimaz S., Ruzzene E., Zorzini F. Situational assessment of hospital facilities for modernization purposes and resilience improvement. International Journal of Disaster Risk Reduction. 2021;66. doi: 10.1016/j.ijdrr.2021.102594.
Guo N., Guo P., Dong H.Y., Zhao J., Han Q.Y. Modeling and analysis of cascading failures in projects: A complex network approach. Computers & Industrial Engineering. 2019;127:1–7. doi: 10.1016/j.cie.2018.11.051.
Hassan E.M., Mahmoud H.N. Orchestrating performance of healthcare networks subjected to the compound events of natural disasters and pandemic. Nature Communications. 2021;12(1):1338. doi: 10.1038/s41467-021-21581-x.
Hosseini S., Barker K., Ramirez-Marquez J.E. A review of definitions and measures of system resilience. Reliability Engineering and System Safety. 2016;145:47–61. doi: 10.1016/j.ress.2015.08.006.
Hynes W., Trump B.D., Love P., Linkov I. Bouncing forward: A Resilience Approach to dealing with COVID-19 and future systemic shocks. Environment, Systems, Decisions. 2020;40(2):174–184. doi: 10.1007/s10669-020-09776-x.
Keizer M., Teunter R.H., Veldman J. Joint condition-based maintenance and inventory optimization for systems with multiple components. European Journal of Operational Research. 2017;257(1):209–222. doi: 10.1016/j.ejor.2016.07.047.
Kermack W.O., Mckendrick A.G. A contribution to the mathematical theory of epidemics. Bulletin of Mathematical Biology. 1991;53(1–2):57–87. doi: 10.1016/S0092-8240(05)80042-4.
Li X., Guo J., Gao C., Zhang L.Y., Zhang Z.L. A hybrid strategy for network immunization. Chaos Solitons & Fractals. 2018;106:214–219. doi: 10.1016/j.chaos.2017.11.029.
Li Z., Li N., Cimellaro G.P., Fang D. System Dynamics Modeling-Based Approach for Assessing Seismic Resilience of Hospitals: Methodology and a Case in China. Journal of Management in Engineering. 2020;36(5):04020050. doi: 10.1061/(ASCE)ME.1943-5479.0000814.
Linkov I., Fox-Lent C., Read L., Allen C.R., Arnott J.C., Bellini E.…Hyde I. Tiered Approach to Resilience Assessment. Risk Analysis. 2018;38(9):1772–1780. doi: 10.1111/risa.12991.
Linkov, I., Keenan, J. M. & Trump, B. D. (2021). COVID-19: systemic risk and resilience. Springer, Amsterdam, The Netherlands. doi: 10.1007/978-3-030-71587-8.
Ouyang M. A mathematical framework to optimize resilience of interdependent critical infrastructure systems under spatially localized attacks. European Journal of Operational Research. 2017;262(3):1072–1084. doi: 10.1016/j.ejor.2017.04.022.
Pishnamazzadeh M., Sepehri M.M., Ostadi B. An Assessment Model for Hospital Resilience according to the Simultaneous Consideration of Key Performance Indicators: A System Dynamics Approach. Perioperative Care and Operating Room Management. 2020;20. doi: 10.1016/j.pcorm.2020.100118.
Qian X., Ukkusuri S.V. Connecting urban transportation systems with the spread of infectious diseases: A trans-seir modeling approach. Transportation Research Part B Methodological. 2021;145(3):185–211. doi: 10.1016/j.trb.2021.01.008.
Ransolin N., Saurin T.A., Formoso C.T. Integrated modelling of built environment and functional requirements: Implications for resilience. Applied Ergonomics. 2020;88. doi: 10.1016/j.apergo.2020.103154.
Rodríguez-Méndez V., Ser-Giacomi E., Hernández-García E. Clustering coefficient and periodic orbits in flow networks. Chaos. 2017;27(3):35–46. doi: 10.1063/1.4971787.
Samsuddin N.M., Takim R., Nawawi A.H., Alwee S.N.A.S. Disaster Preparedness Attributes and Hospital's Resilience in Malaysia. Procedia Engineering. 2018;212:371–378. doi: 10.1016/j.proeng.2018.01.048.
Sekiguchi M., Ishiwata E. Global dynamics of a discretized sirs epidemic model with time delay. Journal of Mathematical Analysis & Applications. 2010;371(1):195–202. doi: 10.1016/j.jmaa.2010.05.007.
Sheu S.-H., Liu T.-H., Zhang Z.-G., Tsai H.-N. Optimum replacement policy for cumulative damage models based on multi-attributes. Computers & Industrial Engineering. 2020;139. doi: 10.1016/j.cie.2019.106206.
Siskos E., Burgherr P. Multicriteria Decision Support for the Evaluation of Electricity Supply Resilience: Exploration of Interacting Criteria. European Journal of Operational Research. 2022;298(2):611–626. doi: 10.1016/j.ejor.2021.07.026.
Tariverdi M., Fotouhi H., Moryadee S., Miller-Hooks E. Health care system disaster-resilience optimization given its reliance on interdependent critical lifelines. Journal of Infrastructure Systems. 2018;25(1):04018044. doi: 10.1061/(ASCE)IS.1943-555X.0000465.
Wang J.R., Wang J.P., Liu M.X., Li Y.W. Global stability analysis of an SIR epidemic model with demographics and time delay on networks. Physica A-Statistical Mechanics & Its Applications. 2014;410:268–275. doi: 10.1016/j.physa.2014.05.011.
Wang Y.C., Xiao R.B. An ant colony-based resilience approach to cascading failures in cluster supply network. Physica A. 2016;462:150–166. doi: 10.1016/j.physa.2016.06.058.
Wells E.M., Boden M., Tseytlin I., Linkov I. Modeling critical infrastructure resilience under compounding threats: A systematic literature review. Progress in Disaster Science. 2022;15. doi: 10.1016/j.pdisas.2022.100244.
Whitman M.G., Barker K., Johansson J., Darayi M. Component importance for multi-commodity networks: Application in the Swedish railway. Computers & Industrial Engineering. 2017;112:274–288. doi: 10.1016/j.cie.2017.08.004.
Worldometer. (2023). Retrieved from https://www.worldometers.info/coronavirus/. Accessed at 12:02, Feb 28, 2023 (Greenwich time).
Xin Y., Gao C., Wang Z., Zhen X.Y., Li X.H. Discerning Influential Spreaders in Complex Networks by Accounting the Spreading Heterogeneity of the Nodes. IEEE Access. 2019;9:92070–92078. doi: 10.1109/ACCESS.2019.2927775.
Zeng Z.G., Fang Y.P., Zhai Q., Du S. A Markov reward process-based framework for resilience analysis of multistate energy systems under the threat of extreme events. Reliability Engineering & System Safety. 2021;209(7). doi: 10.1016/j.ress.2021.107443.
Zhang J.P., Jin Z. The analysis of an epidemic model on networks. Applied Mathematics & Computation. 2011;217(17):7053–7064. doi: 10.1016/j.amc.2010.09.063.
Zhang W.J., Shi X.L., Huang A.Q., Hua G.W., Teunter R.H. Optimal stock and capital reserve policies for emergency medical supplies against epidemic outbreaks. European Journal of Operational Research. 2021. ISSN 0377-2217. doi: 10.1016/j.ejor.2021.06.026.
Zhao X., Wu C., Wang S., Wang X. Reliability analysis of multi-state k-out-of-n: G system with common bus performance sharing. Computers & Industrial Engineering. 2018;124:359–369. doi: 10.1016/j.cie.2018.07.034.
Zheng J.F., Gao Z.Y., Zhao X.M. Clustering and congestion effects on cascading failures of scale-free networks. EPL (Europhysics Letters). 2007;79(5):46–56. doi: 10.1209/0295-5075/79/58002.
Zhou J., Huang N., Coit D.W., Felder F.A. Combined effects of load dynamics and dependence clusters on cascading failures in network systems. Reliability Engineering and System Safety. 2018;170(1):116–126. doi: 10.1016/j.ress.2017.10.008.
Zou, C. C., Towsley, D. & Gong, W. (2004). Email worm modeling and defense. 13th International Conference on Computer Communications and Networks (IEEE Cat. No.04EX969). doi: 10.1109/ICCCN.2004.1401687.
