PLOS ONE. 2019 Jan 15;14(1):e0209819. doi: 10.1371/journal.pone.0209819

Intervention on default contagion under partial information in a financial network

Yang Xu 1,*
Editor: Fenghua Wen2
PMCID: PMC6333338  PMID: 30645587

Abstract

We study the optimal interventions of a regulator (a central bank or government) on the illiquidity default contagion process in a large, heterogeneous, unsecured interbank lending market. The regulator has only partial information on the interbank connections and aims to minimize the fraction of final defaults with minimal interventions. We derive analytical results for the asymptotic optimal intervention policy and the asymptotic magnitude of default contagion in terms of the network characteristics. We extend the results of Amini, Cont and Minca's work to incorporate interventions and adopt the dynamics of Amini, Minca and Sulem's model to build heterogeneous networks with degree sequences and initial equity levels drawn from arbitrary distributions. Our results show that the optimal intervention policy is "monotonic" with respect to the intervention cost, the closeness to invulnerability, and connectivity. The regulator should prioritize interventions on banks that are systemically important or close to invulnerability. Moreover, once the regulator has intervened on a bank, it should continue to do so. Our simulation results show good agreement with the theoretical results.

Introduction


The systemic risk in financial networks has drawn increasing interest from regulators and researchers, especially after the Asian financial crisis in the late 1990s and the more recent economic recession during 2007-2009. Financial institutions (hereafter, banks) are connected to form an interbank network that allows liquidity reallocation between banks, in that banks with liquidity surpluses can lend to banks with liquidity deficits. However, the interbank network may also introduce aggregate liquidity shortage and default contagion. A liquidity shock such as a run can cause some banks to default, leading to losses at their creditor banks through interbank connections, which may in turn result in losses at those banks' creditors. In the financial crisis of 2007-2009 the interbank market became dysfunctional because market participants perceived heightened counterparty risk and liquidity risk [1], and the severe reduction in transaction volume was a major contributing factor to the collapse of many banks. When the interbank market is stressed or freezes, the central bank, as the lender of last resort, has to provide extensive short term liquidity support. For example, the Federal Reserve in the US established many facilities, including the traditional discount window, the Term Auction Facility (TAF), the Primary Dealer Credit Facility (PDCF), and the Term Securities Lending Facility (TSLF). The central bank or government also recapitalizes banks and provides risk capital in the form of a bailout. Naturally, we ask the following questions: Should the regulator (the central bank or government) intervene if the bankruptcy of one or several banks has occurred or is imminent? What is the optimal intervention policy of the regulator based on the measurable features of the network and the banks, such as the degrees of connectivity and the levels of capitalization? How much improvement can the optimal strategy achieve regarding the fraction of the market protected from defaults?

To answer these questions we study the uncollateralized interbank funding market, where the majority of interbank loans are overnight. The connections are constantly changing, so the regulator may not know all the connections exactly over time. After some liquidity incident, e.g. runs on a few banks, some banks default, which initiates the default contagion process. During the process, the regulator has to intervene as banks default to prevent the contagion from spreading. Due to the system-wide panic, no bank wants to extend new loans to other banks, but the banks are still obliged to pay back their current loans, which fall due within the time frame of the model. So it is reasonable to fix the in and out degrees of the banks and assume that the connections no longer change after the inception of default contagion. In principle, the regulator could find out the connections between the banks by communicating with them; however, the contagion process happens so fast that the regulator may not have ample time to do so. In other words, the regulator will have to intervene while the connections between the banks are unknown. So the regulator has only partial information: in the beginning the regulator only knows the initially defaulted banks and the number of connections of each bank, but the connections between the banks are unknown. Every time a default occurs the regulator learns the connections of the defaulted bank (i.e. the banks that are affected by the default), represented by the connections being revealed after the default.

We set up a probability space under which the financial network is generated by a uniform matching of the in and out degrees (a configuration model). A directed link in the network represents one unit of loan. In the following we may use "bank" and "node" interchangeably. Before an external shock to the system, each node has a positive equity level (the difference between total assets and liabilities), which indicates how many defaulted loans from its debtors a node can withstand before it defaults. In other words, it is the "distance to default". After an external shock, some nodes in the system default initially and we set their equity levels to zero. We adopt the dynamics of the model in [2, 3]. When a node defaults, it defaults on all of its loans. We assume a zero recovery rate of the loan, i.e. the creditor receives zero value from the loan, which is the most realistic assumption for short term default as suggested in [4]. We assume there is a time span between a node's default and the time its creditor records the loan as a loss (by writing the loan down from its balance sheet). We model these time spans by independent and identically distributed exponential random variables. After the affected node records the defaulted loan, it may request interventions from the regulator. If the regulator decides to intervene, by replacing the defaulted loan or by infusing one unit of equity, the equity level of the affected node stays the same; otherwise its equity level decreases by one. Once the equity level reaches zero, the node defaults. We assume that once a bank has defaulted it cannot become liquid again within the time horizon of the model, because it is very unlikely for a bank that has declared default to gain enough capital in the short term considered in the model.

We emphasize that one essential feature of the model is partial information. Because it takes time and effort for the regulator to find out the exact connections in the interbank market and the default contagion process may progress very rapidly, the regulator may have to intervene even before it is able to figure out the connections, in order to take early actions and save the maximum number of banks from bankruptcy. During the contagion process, the regulator only knows the default set (the set of defaulted nodes) with some out-links revealed, while the other out-links remain hidden until the affected nodes record the defaulted loans. More generally, unlike the complete information assumption in other theoretical models, the partial information setting aligns better with reality, as pointed out by [5]: "Interbank exposure data are never publicly available, and in many countries nonexistent even for central regulators".

Our methodology is illustrated in Fig 1. The regulator's goal is to minimize the number of final defaulted nodes with the minimum number of interventions. Thus we obtain a stochastic control problem $\min_{\mu^n}\mathrm{obj}^n(Gr^n, \mu^n)$ where the objective function depends on the graph $Gr^n$ and the intervention sequence $\mu^n$, shown in (1). We aim to solve it for the optimal intervention sequence $\mu^{n*}$ and thus obtain the optimal objective function value $\mathrm{obj}^n(Gr^n, \mu^{n*})$, shown in (2). However, solving the problem with the usual dynamic programming approach runs into intractability because of the fast expansion of the state space, as pointed out in [3], especially for a heterogeneous network. We take an alternative approach based on the fact that, under some regularity conditions, the objective function converges as n → ∞. We solve the asymptotic optimal control problem $\min_{\mu}\mathrm{obj}(Gr, \mu)$ in (3), where obj, Gr and μ are the limit forms of $\mathrm{obj}^n$, $Gr^n$ and $\mu^n$, respectively, and obtain the optimal intervention $\mu^*$ and the objective function $\mathrm{obj}(Gr, \mu^*)$ in (4), which allow us to construct the optimal intervention sequence $\mu^{n*}$ for a finite n through $\mu^*$ and approximate $\mathrm{obj}^n(Gr^n, \mu^{n*})$ with $\mathrm{obj}(Gr, \mu^*)$. The results of our numerical experiments validate the approximation for networks with sizes close to those of real financial networks.

Fig 1. The methodology: Approximation of the finite network with the infinite network.

Relations to previous literature

Our work is closely related to the current literature on the role of the central bank as the lender of last resort, including providing liquidity through loans, recapitalizing banks and bank bailouts. These studies differ in their perspectives and the focuses of their models. The influential Diamond & Dybvig model [6] of market panics and bank runs has two Nash equilibria: depositors withdraw only for their real expenditure needs, or a bank run occurs. [7] extend the Diamond & Dybvig model to a financial network of four banks to study default contagion. The interbank network is formed to allocate liquidity among the banks to satisfy regional liquidity demands. In a complete market where the banks exchange deposits, or in a disconnected market, no contagion occurs, while in an incomplete market where the banks do not exchange deposits with all other banks, high connectivity may entail contagion. The role of the central bank is thus to complete the market. [8] introduce a similar model but assume the depositors are uncertain about where they have to consume. In their model the central bank acts as a "crisis manager": when a bank is to be liquidated, the central bank has to organize the bypass of the defaulting bank in the payment network and provide liquidity to the banks that depend on the defaulting bank. [9] consider that fire sales of banking assets occur when a large number of banks default, and investors outside the banking sector, who are inefficient users, may end up purchasing the liquidated assets. To avoid this allocation inefficiency, the regulator may bail out the banks directly or provide liquidity to surviving banks to purchase defaulting banks. [10] argue that the government should bail out banks in distress because it can provide liquidity more efficiently than private investors. [11] consider three forms of regulator intervention: buying equity, purchasing assets and providing debt guarantees to alleviate debt overhang in a financial market where the regulator has limited information and resources. All these works discuss the optimal interventions based on equilibrium analysis. In contrast, our model emphasizes that the interbank market is a complex network and focuses on how the regulator should make intervention decisions under the network dynamics. Moreover, some papers study the related problem of banks in the interbank network bailing each other out, such as [12] and [13].

In addition to the theoretical studies, empirical studies abound. [14] analyze the data on the interbank transactions derived from the main euro area payment system and find that the European Central Bank took the role of the overnight unsecured interbank market in liquidity provision to the banks during the global financial crisis in 2008-2010. [15] analyze the daily transaction data and find that in 2008 counterparty risk plays a more important role than liquidity hoarding in reducing liquidity and increasing the cost of finance in the federal funds market in the US. By analyzing supervisory data of Germany, [16] find that regulatory interventions decrease liquidity creation while capital support does not affect it. [17] discuss the relations between liquidity regulations and the lender of last resort practice and argue that they are complementary rather than conflicting tools.

Our work is also related to the strand of literature on systemic risk and default contagion in the financial networks without considering regulator interventions. These works focus on understanding the dependence of the default contagion on various features of the financial network and the banks within it, including the degrees of connectivity, the equity levels and so on. Similarly, there are mainly two types of literature: empirical and theoretical. The empirical studies conduct statistical analyses on the interbank markets using data on interbank lending as far as they are available and provide an overview of the structural characteristics of the interbank network in different countries ([4, 18, 19]). The theoretical studies model the financial network with network models but differ in their assumptions about the network structure and approaches: some focus on “stylized” networks whose structures are hypothetical ([7, 20, 21, 22]) while others rely on simulations ([4, 23]). Among them, [24] and [25] propose random network models that allow more realistic and heterogeneous structures. [26] survey theoretical works on contagion and systemic risk in financial networks and categorize them according to different topics including network connectivity, bank heterogeneity, uncertainty in financial markets, and portfolio composition of the banks.

Among the works discussed in this section, our paper is most closely related to [24] and [2, 3]. [24] study the magnitude of default contagion in a heterogeneous network with a given degree sequence and an arbitrary distribution of weights and derive analytical expressions for the asymptotic fractions of defaults in terms of the network characteristics. Our work incorporates interventions into a model proven to be equivalent to theirs. Thus if there are no interventions, the asymptotic fraction of final defaults is the same as in [24]. [2] consider a stylized core-periphery financial network as an intermediary that provides liquidity to fund projects in the outside economy but may also incur contagion when the banks hoard liquidity. The regulator intervenes by providing loans to defaulting banks. [3] consider a similar core-periphery model where the regulator intervenes by injecting equity. We adopt the dynamics of their model, which constructs the default set under interventions through a configuration model, because the configuration model can be adapted to the contagion process [27]. But we differ from [2, 3] in two important ways. First, our model is a more general heterogeneous random network with degree sequences and initial equity levels drawn from arbitrary distributions. More importantly, [2, 3] focus on the benefits and costs of connectivity in the presence of the regulator and draw conclusions mainly from numerical studies, while we focus on the optimal intervention policy and its relations to the network characteristics. Mathematically, we have addressed two major difficulties arising from considering interventions on a general complex network: interventions introduce discontinuity into the asymptotic process, so the main supporting theorem used in [24] is no longer directly applicable; moreover, the high dimensional optimal control problem we obtain later is well known among control theorists to be difficult to solve, especially analytically. We give analytical formulations for the asymptotic optimal intervention policy as well as the asymptotic number of interventions and final defaulted banks. The asymptotic results provide a good approximation to real financial networks, which are heterogeneous and have several hundred to a thousand banks, thanks to the fast convergence behavior of our results.

Contributions

The main contribution is that we propose a new approach to determine the optimal intervention strategy on contagion in a large, heterogeneous financial network: to be specific, we derive the asymptotic optimal intervention strategy as the size of the network tends to infinity and then show that it is a good approximation of the optimal intervention strategy for a real financial network. This new approach has the advantage of avoiding the stylized models of equilibrium analysis and the intractability of the dynamic programming approach in the previous literature, and it enables us to obtain analytical results. In light of this, we derive rigorous asymptotic results for the regulator's optimal strategy and the fraction of final defaulted banks under the dynamics of the default contagion process in a heterogeneous network with a degree sequence and initial equity levels that can be drawn from arbitrary distributions. The analytical expressions are presented in terms of measurable features of the network. For convergence of the results, we assume the network is sparse as in [24], which is supported by empirical studies of real financial networks ([28, 29]).

The key insights of our findings of the optimal strategies are summarized in the following. We should only consider intervening on a bank when it records the loss of a defaulted loan and is very close to default. The optimal intervention policy depends strongly on the intervention cost. The smaller the intervention cost, the more interventions are implemented. Moreover, the optimal intervention policy is “monotonic” with respect to the measurable features of the network. We should not intervene on the banks with out degrees in a certain range regardless of their other features; for those banks worth interventions, the larger the sum of initial equity and accumulative interventions received, the earlier we should begin intervention on them; the time to start intervention on a node is also “monotonic” in its in and out degrees. Interestingly, once we begin intervening on a node, we keep intervening on it every time it records the loss of a defaulted loan. By comparing the fractions of final defaults under no interventions and the optimal intervention policy, we are able to quantify the improvement made by interventions in terms of the network features. This gives guidance for the maximum impact the regulator can have to offset the effects of default contagion.

The paper is organized as follows. We set up the model and introduce the stochastic control problem (SCP) in section Model description and dynamics. In section The asymptotic control problem we formulate the asymptotic control problem Eq (9), which gives the limit of the objective function of the SCP as the size of the network goes to infinity, and present the necessary conditions for the optimal intervention policy, which lead to the main theorems. In section Numerical experiments we show the results of the numerical experiments that validate the approximation of Eq (4) by Eq (9). We present all the proofs in Appendix A: Proofs, the two theorems used in the proofs in Appendix B: Wormald's theorem and Appendix C: Extended Pontryagin maximum principle, and a list of notations in Appendix D: Preliminary list of notations.

Model description and dynamics

Basic setup

We consider default contagion in an unsecured interbank lending market under short term illiquidity risk, based on the model of [2]. Due to the system-wide panic, no bank wants to extend new loans to other banks, but the banks are still obliged to pay back their current loans, which fall due within the time frame of the model, so we fix the in and out degrees of the banks in the network, denoted as $(d^-(v), d^+(v))_{v\in[n]}$, where $[n] = \{1, \ldots, n\}$ is the set of nodes. Let $m = \sum_{v\in[n]} d^-(v) = \sum_{v\in[n]} d^+(v)$.

Then we model the financial network with the prescribed degree sequence $(d^-(v), d^+(v))_{v\in[n]}$ as an unweighted directed network $([n], E_n)$, where $E_n$ denotes the set of links. A directed link $(v, w)\in E_n$ represents that v borrows one unit of loan from w, i.e. v is obliged to repay w one unit of loan. We allow multiple loans between two nodes and also self-links, representing internal loans between different departments of a bank.

Now we set up a probability space $(\mathcal{G}_{n,m}, \mathbb{P})$ where $\mathcal{G}_{n,m}$ is the set of networks on n nodes with at most m directed links. The random financial network with m directed links lives in this probability space, and under $\mathbb{P}$ the law of the random link set $E_n$ is determined as follows. We start with n unconnected nodes and assign node v $d^-(v)$ in-half-links and $d^+(v)$ out-half-links. An in-half-link represents an offer of a loan and an out-half-link a demand for a loan. Then the m in-half-links and m out-half-links are matched uniformly so that the borrowers and lenders are determined. The resulting random network is called the configuration model.

The uniform matching of the in and out-half-links allows us to construct the random network sequentially: at every step we can choose any out half-link by any rule and choose the in-half-link uniformly over all unconnected in-half-links to form a directed link. This is because conditional on any subset of connected links, the unconnected links also follow the uniform distribution. Moreover, the conditional law of unconnected links only depends on the number of connected links, not the matching history. Additionally we can restrict the matching to choosing only the out-half-links from the defaulted nodes so that we can model the development of the default set with their revealed out-links.
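To make the sequential construction concrete, the following minimal Python sketch (not from the paper; all function names and the toy degrees are illustrative) draws one realization of the directed configuration model by uniformly matching out-half-links to in-half-links.

```python
import random

def sample_configuration_model(d_in, d_out, seed=None):
    """Illustrative sketch: uniformly match out-half-links to in-half-links to draw
    one directed multigraph with the prescribed degrees. Assumes sum(d_in) == sum(d_out);
    multi-edges and self-loops are allowed, as in the model."""
    rng = random.Random(seed)
    # one entry per half-link, labelled by the node that owns it
    in_stubs = [v for v, d in enumerate(d_in) for _ in range(d)]
    out_stubs = [v for v, d in enumerate(d_out) for _ in range(d)]
    rng.shuffle(in_stubs)                      # uniform matching
    # edge (v, w): v borrows one unit from w, i.e. v owes w
    return list(zip(out_stubs, in_stubs))

# toy four-node example (the degrees here are hypothetical)
edges = sample_configuration_model(d_in=[1, 1, 2, 2], d_out=[2, 2, 1, 1], seed=0)
print(edges)
```

Because the in-half-links are shuffled uniformly and consumed one at a time, the same sample can equivalently be generated link by link, which is the sequential viewpoint used by the contagion process.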

Remark 1. As a result of the uniform connection of in and out-half-links, a node gets selected with the probability proportional to the number of its unconnected in-half-links. The rationale is that when the regulator searches for the lender of a defaulting loan, a bank that lends out more loans to other banks is more likely to be the lender and be affected by the defaulting loan.

Then we endow each node $v\in[n]$ with its initial equity level $e(v)\in\mathbb{N}_0 \equiv \{0, 1, 2, \ldots\}$, which represents the number of defaulted loans v can tolerate until v defaults, so it is the "distance to default". Next, after the system receives some external shock, some nodes default and the system begins to evolve. Define time 0 as the time right after the shock. Let $(\mathcal{G}_k)_{0\le k\le m}$ be the filtration on the probability space $(\mathcal{G}_{n,m}, \mathbb{P})$ which models the arrival of new information, i.e. the link revealed at each step. Because this implies that the remaining equity of the selected node will decrease by one, $(\mathcal{G}_k)_{0\le k\le m}$ also models the default contagion at the same time. Note in the following the network with the set of revealed links evolves in the space $\mathcal{G}_{n,m}$ as a result of the contagion process.

Initial condition

Define the set of initially defaulted nodes $\mathcal{D}_0 \equiv \{v\in[n] : e(v) = 0\}$ and the σ-algebra representing the information available initially $\mathcal{G}_0 \equiv \sigma\big((d^-(v), d^+(v), e(v))_{v\in[n]}\big)$. Given the degree pairs $(d^-(v), d^+(v))_{v\in[n]}$ for all nodes, envision that for a node $v\in[n]$ there are $d^-(v)$ in-half-links, each representing a loan another node is obliged to pay v, and $d^+(v)$ out-half-links, each representing a loan v is obliged to pay another node. Let $c_k^v$ be the sum of the initial equity and the accumulative number of interventions on node v and $l_k^v$ be the number of revealed in-links of node v at step k, so $c_0^v = e(v)$, $l_0^v = 0$. An example of the initial condition of a four-node network is illustrated in Fig 2 with $\mathcal{D}_0 = \{1, 2\}$.

Fig 2. Financial network before default contagion occurs. $(d^-(v), d^+(v))$ are the degrees and $e(v)$ is the initial equity of node v. The nodes in the initial default set $\mathcal{D}_0 = \{1, 2\}$ are marked in blue.

Dynamics

We adopt the dynamics from [2]. At the kth step, for $k\in[1, m]$, if the out-links of nodes in $\mathcal{D}_{k-1}$ have not all been revealed, then a new link is revealed following the rule: an out-half-link of any node in $\mathcal{D}_{k-1}$ is picked by any rule and then connected uniformly to another unconnected in-half-link. Let $(V_k, W_k)$ be a pair of random variables denoting the link from node $V_k$ to node $W_k$ being revealed at step k. We say that $W_k$ is selected at step k. Assume $(V_k, W_k) = (v, w)$; then the uniformity in connecting the half-links gives the probability of w being selected conditional on $\mathcal{G}_{k-1}$ as

$$\mathbb{P}(W_k = w\mid\mathcal{G}_{k-1}) = \frac{\text{number of } w\text{'s unrevealed in-links at } k-1}{\text{total number of unrevealed in-links at } k-1} = \frac{d^-(w) - l_{k-1}^w}{m - (k-1)}. \quad (1)$$

So a node is selected with the probability proportional to the number of its unrevealed (unconnected) in-half-links. After a directed link (v, w) is revealed, then proceed with the following steps:

  • Update $\mathcal{G}_k = \sigma(\mathcal{G}_{k-1} \cup \{(v,w)\})$.

  • Update the number of revealed in-links of the selected node: $l_k^w = l_{k-1}^w + 1$ and $l_k^\eta = l_{k-1}^\eta$ for $\eta \ne w$.

  • Determine the intervention $\mu_k \in \{0,1\}$, $\mathcal{G}_k$-measurable at step k, for the selected node w. Note that $c_k^\eta \le l_k^\eta$ indicates that node η has defaulted by step k. Because we do not intervene on defaulted nodes, $\mu_k = 0$ if $c_{k-1}^w \le l_{k-1}^w$.

  • Update $c_k^w = c_{k-1}^w + \mu_k$ and $c_k^\eta = c_{k-1}^\eta$ for $\eta \ne w$.

  • Update the default set $\mathcal{D}_k$. If $c_k^w \le l_k^w$ and $w \notin \mathcal{D}_{k-1}$, then $\mathcal{D}_k = \mathcal{D}_{k-1} \cup \{w\}$; otherwise $\mathcal{D}_k = \mathcal{D}_{k-1}$.

If all out-links from $\mathcal{D}_k$ have been revealed, the process ends and we let the process end time be $T_n = k$; otherwise we repeat the process. Define $D_{T_n}$ as the number of defaulted nodes by the process end time $T_n$. In Fig 3 we show the first step of the dynamics for the network in Fig 2. The link (1, 4) is revealed (connected) and node 4 is selected with probability 2/6. Because node 4 is liquid, the regulator needs to decide whether to intervene. Node 4 remains liquid if it receives one intervention; otherwise it defaults and is included in the default set.

Fig 3. Dynamics at step one. Node 4 is selected with probability 2/6 and a link between 1 and 4 is revealed. If one intervention is applied, node 4 remains liquid; otherwise its equity level decreases by one and node 4 defaults.

Remark 2. The notion of partial information is reflected by the fact that there are always unrevealed links before the process ends and the regulator cannot make decisions depending on knowledge of them. The regulator can only be certain that if the remaining equity is nonpositive ($c_k^\eta \le l_k^\eta$) then the node η has defaulted by step k. On the other hand, every out-half-link from the default set represents a defaulted loan which may impact any node in the network at a later time. So all the currently liquid vulnerable nodes are subject to default at a later time, and the regulator should take this into account when making intervention decisions at each step.

Let $R_k$ be the accumulative number of interventions by step k and $D_k = |\mathcal{D}_k|$ be the number of defaults at step k. The regulator aims to minimize the number of defaulted nodes by $T_n$ with the minimum number of interventions, so we define the objective function as a linear combination of the (scaled) numbers of interventions and defaults by the end of the process $T_n$:

$$J^n = \mathbb{E}\left(K\,\frac{R_{T_n}}{n} + \frac{D_{T_n}}{n}\ \Big|\ \mathcal{G}_0\right), \quad (2)$$

where K > 0 is the relative "cost" of an intervention. Further, by the definition of $c_{T_n}^v$ and noting that a node has defaulted by the end of the process if $c_{T_n}^v \le l_{T_n}^v$, i.e. the number of defaulted loans reaches the total of the initial equity level and the number of interventions received by $T_n$, we can express $R_{T_n}$ and $D_{T_n}$ as

$$R_{T_n} = \sum_{v\in[n]}\left(c_{T_n}^v - e(v)\right), \qquad D_{T_n} = \sum_{v\in[n]}\mathbf{1}\left(c_{T_n}^v \le l_{T_n}^v\right). \quad (3)$$

Now we define the stochastic optimal control problem as

$$\min_{\mu\in U} J^n, \quad (4)$$

where $\mu = (\mu_k)_{1\le k\le m}$, $\mu_k\in\{0,1\}$, and U contains all $(\mathcal{G}_k)_{0\le k\le m}$-adapted processes μ.
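The following minimal Python sketch (our own illustration, not the paper's code; the policy interface and all names are assumptions) simulates one run of the contagion dynamics described above and evaluates the realized objective $K\,R_{T_n}/n + D_{T_n}/n$ of Eq (2) for a given intervention rule.

```python
import random

def simulate_contagion(d_in, d_out, e0, policy, K=1.0, seed=None):
    """Sketch of the contagion dynamics with interventions.
    policy(k, n, state) returns 0 or 1 for the selected node, where
    state = (in degree, out degree, c, l) with l already incremented."""
    rng = random.Random(seed)
    n, m = len(e0), sum(d_in)
    c = list(e0)                     # initial equity plus interventions received
    l = [0] * n                      # revealed in-links
    out_stubs = {v: d_out[v] for v in range(n)}     # unrevealed out-half-links
    in_stubs = [v for v, d in enumerate(d_in) for _ in range(d)]
    rng.shuffle(in_stubs)            # uniform matching, revealed one link at a time
    defaulted = {v for v in range(n) if c[v] <= 0}
    R, k = 0, 0
    while any(out_stubs[v] > 0 for v in defaulted) and k < m:
        v = next(v for v in defaulted if out_stubs[v] > 0)   # any rule is allowed
        out_stubs[v] -= 1
        w = in_stubs.pop()           # uniformly chosen unconnected in-half-link
        k += 1
        l[w] += 1
        if w not in defaulted:       # no intervention on already defaulted nodes
            mu = policy(k, n, (d_in[w], d_out[w], c[w], l[w]))
            c[w] += mu
            R += mu
            if c[w] <= l[w]:
                defaulted.add(w)
    return K * R / n + len(defaulted) / n

# usage example: intervene whenever the recorded loss would otherwise cause a default
obj = simulate_contagion([1, 1, 2, 2], [2, 2, 1, 1], [0, 1, 2, 1],
                         policy=lambda k, n, s: int(s[2] - s[3] == 0), K=0.5, seed=1)
print(obj)
```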

The asymptotic control problem

Assumptions and definitions

We assume that a bank cannot become liquid again once it has defaulted, thus we cannot save defaulted banks. This assumption is reasonable in the setting of default contagion in a stressed network and a short time window. Nor do we intervene on invulnerable nodes, because they never default but intervening on them will only prevent us from saving the banks that are very close to default especially when the interventions are costly.

In the model description we only intervene on the node that is selected at each step. Now we show that even if the regulator intervenes on multiple nodes and applies more than one unit of credit at a time, it cannot do better.

Proposition 1. For the stochastic control problem Eq (4), we only need to consider intervening on a node that, when selected, has only one unit of equity remaining.

We see that proposition 1 implies it is never optimal to intervene on a node if it is not selected or has more than one unit of equity remaining when selected. Let (i, j, c, l) be the state of a node, meaning it has in and out degrees (i, j), a sum of initial equity and interventions received equal to c, and l revealed in-links. Note that by definition l ≤ i. We characterize nodes by their states because nodes with the same state have the same probability of being selected at each step and are statistically the same in influencing other nodes. Note in particular:

  1. c = 0 denotes that the node has defaulted initially.

  2. c − l denotes the remaining equity or "distance to default", i.e. the number of times of being selected before a node defaults without interventions. Thus c ≤ l means the node has defaulted.

  3. Because l ≤ i by definition, i < c implies that a node is invulnerable, i.e. even if all loans lent out to the counterparties are written down from the balance sheet, the node still has positive remaining equity. On the contrary, 0 < c ≤ i denotes that the node has the possibility to default, i.e. it is vulnerable.

  4. In the beginning of the contagion process, all nodes have l = 0, i.e. are in states of the form (i, j, c, 0).

Then we define the state of the system at each step. Note that the number of nodes that have defaulted initially (c = 0) or are invulnerable (i < c) in the beginning will not change throughout the process, so we only need to keep track of the nodes that are initially vulnerable (0 < c ≤ i) and currently liquid; if needed, we can always calculate the number of defaulted nodes at any time in the process. Further note that the possible states throughout the process for nodes that are vulnerable in the beginning and liquid at a later step are

$$\Gamma \equiv \{(i,j,c,l) : 0\le i,\ 0\le j,\ 0\le l < c\le i \ \text{or}\ c=i+1,\ l=i\}. \quad (5)$$

Note in particular that the state (i, j, i + 1, i) results when a node in state (i, j, i, i − 1) is selected, receives one intervention, and thus becomes invulnerable.
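As a small illustration (our own snippet, not from the paper), the possible states of Eq (5) for a single (i, j) pair can be enumerated directly:

```python
# Illustrative sketch: enumerate the state space Gamma of Eq (5) for one (i, j) pair.
def gamma_states(i, j):
    states = [(i, j, c, l) for c in range(1, i + 1) for l in range(c)]  # 0 <= l < c <= i
    states.append((i, j, i + 1, i))  # invulnerable state reached by a last-moment intervention
    return states

print(gamma_states(2, 1))
# [(2, 1, 1, 0), (2, 1, 2, 0), (2, 1, 2, 1), (2, 1, 3, 2)]
```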

Definition 1. (State variable $S_k$) Let $S_k^{i,j,c,l}$ denote the number of nodes that are vulnerable initially and are in state (i, j, c, l) at step k, for k = 0, …, m, and let $S_k \equiv (S_k^{i,j,c,l})_{(i,j,c,l)\in\Gamma}$ be the state of the system. Note in the following we may use α to represent $(i,j,c,l)\in\Gamma$ and write $S_k^\alpha$ instead of $S_k^{i,j,c,l}$ to simplify the notation.

Recall that m = m(n) is the total in (or out) degree of the network, which is also the maximum number of steps of the process. Throughout this paper we follow the convention that the superscript (usually a multi-index) denotes the state and the subscript denotes the time (discrete or continuous), e.g. $s_\tau^{i,j,c,l}$, $u_\tau^{i,j,c,c-1}$, $s_t^{i,j,c,l}$, $w_t^{i,j,c,l}$ and $u_t^{i,j,c,c-1}$ in the following. Then we define the empirical probability of the in, out degrees and initial equity levels.

Definition 2. (Empirical probability) Define the empirical probability of the triplet (in degree, out degree, initial equity level) as

$$P_n(i,j,c) = \frac{1}{n}\left|\{v\in[n] : d^-(v)=i,\ d^+(v)=j,\ e(v)=c\}\right|. \quad (6)$$

Note that $\sum_{c\ge 0} P_n(i,j,c) = \frac{1}{n}\left|\{v\in[n] : d^-(v)=i,\ d^+(v)=j\}\right|$ represents the empirical probability of the in and out degree pair (i, j).

Previously we used $W_k$ to denote the selected node at step k. Now, with a slight abuse of notation, let $W_k$ denote the state of the selected node at step k, k = 1, …, m, so $W_k\in\Gamma^+ \equiv \{(i,j,c,l) : 0\le i,\ 0\le j,\ 0\le c,\ 0\le l\le i\}$. We consider a Markovian control policy $G_n = (g_1^{(n)}(S_0, W_1), \ldots, g_m^{(n)}(S_{m-1}, W_m))$ where $g_{k+1}^{(n)} : \mathbb{N}_0^{|\Gamma|}\times\Gamma^+ \to \{0,1\}$ specifies the intervention at step k + 1 on the selected node with state $W_{k+1}$ given the state $S_k$, and the superscript (n) shows the dependence on n.

Letting $P_n = (P_n(i,j,c))_{i,j,0\le c\le i}$, we rewrite $J^n$ as $J_{G_n}(P_n)$, $R_{T_n}$ as $R_{T_n}(G_n, P_n)$ and $D_{T_n}$ as $D_{T_n}(G_n, P_n)$ in Eq (4) based on $G_n$ and $P_n$, so

$$R_{T_n}(G_n,P_n) = \sum_{k=1}^{T_n} g_k^{(n)}(S_{k-1}, W_k), \qquad D_{T_n}(G_n,P_n) = n\sum_{i,j}P_n(i,j,0) + n\sum_{i,j,1\le c\le i}P_n(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}S_{T_n}^{i,j,c,l} = n\sum_{i,j,0\le c\le i}P_n(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}S_{T_n}^{i,j,c,l}. \quad (7)$$

Note that the first equality for $D_{T_n}(G_n,P_n)$ holds because the nodes that default at the end of the process consist of two parts: the nodes that have defaulted initially, i.e. $n\sum_{i,j}P_n(i,j,0)$, and the nodes that are vulnerable initially and default during the process, i.e. $n\sum_{i,j,1\le c\le i}P_n(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}S_{T_n}^{i,j,c,l}$.

Assumption 1. Consider a sequence $([n], E_n)$ of random networks, indexed by the size of the network n. For each $n\in\mathbb{N}$, $(d^-(v))_{v\in[n]}$ and $(d^+(v))_{v\in[n]}$ are sequences of nonnegative integers with $\sum_{v\in[n]}d^-(v) = \sum_{v\in[n]}d^+(v)$ and such that, for some probability distribution p on $\mathbb{N}_0^3$ independent of n with $\lambda \equiv \sum_{i,j,c} i\,p(i,j,c) = \sum_{i,j,c} j\,p(i,j,c) < \infty$, the following holds:

  1. Pn(i, j, c) → p(i, j, c) ∀ i, j, c ≥ 0 as n → ∞.

  2. $\sum_{v\in[n]}\left[(d^-(v))^2 + (d^+(v))^2\right] = O(n)$.

Note that the second assumption implies by uniform integrability that $\frac{m(n)}{n}\to\lambda$ as n → ∞; recall that $m(n) \equiv \sum_{v\in[n]}d^-(v) = \sum_{v\in[n]}d^+(v)$. Since $k\le m(n)$, for large n it holds that $\frac{k}{n}\le\frac{m(n)}{n}\le\lambda+1$. Assumption 1 essentially implies that the network is sparse, which is supported by many empirical studies of the structure of financial networks [24].

Remark 3. We previously defined the vector $P_n$ to include only $P_n(i,j,c)$ in the range $0\le c\le i$, i.e. the fractions of initially defaulted and vulnerable nodes. Accordingly define $p \equiv (p(i,j,c))_{i,j,0\le c\le i}$, i.e. the vector p only includes $p(i,j,c)$ in the range $0\le c\le i$.

Next we present our assumptions on the control functions gk(n).

Assumption 2. Define $\Phi \equiv \{(i,j,c,c-1) : 0\le i,\ 0\le j,\ 1\le c\le i\}$. Let $G_n = (g_1^{(n)}, \ldots, g_m^{(n)})$ be a control policy (a sequence of control functions) for the contagion process on a network of size n, where n is large enough such that $\frac{m(n)}{n}\le\lambda+1$. Assume that

$$g_{k+1}^{(n)}(s, w) = \begin{cases} u_{\frac{k}{n}}^{i,j,c,c-1} & \text{if } w = (i,j,c,c-1)\in\Phi,\\ 0 & \text{otherwise,} \end{cases} \quad (8)$$

for $0\le k\le m-1$. Here $u_\tau^{i,j,c,c-1} = u^{i,j,c,c-1}(\tau)$, where $u^{i,j,c,c-1} : [0,\lambda+1]\to\{0,1\}$ is a piecewise constant function on $[0,\lambda+1]$, i.e. there is a partition of the interval into a finite set of intervals such that $u^{i,j,c,c-1}$ is constant 0 or 1 on each interval. Let $u = (u^\beta)_{\beta\in\Phi}$ and let Π contain all piecewise constant vector functions u on $[0,\lambda+1]$.

Note that Φ includes the possible states having distance to default equal to one, and $\Phi\subset\Gamma$. Further, $g_{k+1}^{(n)}(s,w) = 0$ for w ∉ Φ follows from proposition 1. In the following we may use β to represent $(i,j,c,c-1)\in\Phi$ and write $u_\tau^\beta$ instead of $u_\tau^{i,j,c,c-1}$ to simplify the notation.

Remark 4. By this assumption the function u is independent of the state and is only a function of time. This implies that the control function $g_{k+1}^{(n)}(s,w)$ depends on the scaled time $\frac{k}{n}$ and the state of the currently selected node w, but not on the state s. We will show that it suffices to consider such control policies $G_n$ later, after proposition 3, because given a function u we can predict a deterministic process to which the scaled stochastic contagion process converges in probability at any time as the size of the network n → ∞. Moreover, this type of control policy is the one that can be solved for in the optimal control problem Eq (39) we will introduce later.

In summary, assumption 1 assumes the convergence of the empirical probabilities of the in and out degrees and the initial equity. On the other hand, assumption 2 requires that the control functions depend only on the scaled time and the state of the currently selected node. These two assumptions allow us to define the following asymptotic control problem by ensuring that the limits in the objective function are well defined.

Definition 3. For a sequence of networks with Pn and Gn satisfying assumption 1 and assumption 2, respectively, define the asymptotic control problem as

$$\min_{u\in\Pi}\lim_{n\to\infty} J_{G_n}(P_n) = \min_{u\in\Pi}\lim_{n\to\infty}\left[K\,\frac{\mathbb{E}R_{T_n}(G_n,P_n)}{n} + \frac{\mathbb{E}D_{T_n}(G_n,P_n)}{n}\right]. \quad (9)$$

In the following we will show the limits in Eq (9) are well defined by applying Wormald’s theorem [30].

Dynamics of the default contagion process with interventions

Recall that Rk is the accumulative number of interventions up to step k, so

$$R_0 = 0, \qquad R_k = \sum_{\ell=1}^{k} g_\ell^{(n)}(S_{\ell-1}, W_\ell) = \sum_{\ell=1}^{k}\sum_{\beta\in\Phi}\mathbf{1}(W_\ell=\beta)\,u_{\frac{\ell-1}{n}}^{\beta}. \quad (10)$$

We shall show that $(S_k, R_k)_{k=0,\ldots,m}$ is a controlled Markov chain given a control policy $G_n$. In Fig 4 we illustrate, for a fixed (i, j) pair, the state space as well as the transition relations between the states.

Fig 4. The state space for a fixed (i, j) pair, 0 ≤ i, 0 ≤ j, and the transition relations between the states.

To describe the transition probabilities, assume the state of the selected node at step k + 1 is $W_{k+1} = (i, j, c, l)$, for k = 0, …, m − 1. There are three possibilities:

  1. The selected node has defaulted, i.e. $c\le l$, or the node is invulnerable, i.e. $c>i$; then $S_{k+1} = S_k$, $R_{k+1} = R_k$.

  2. The selected node is vulnerable but has a "distance to default" greater than one, i.e. $c - l \ge 2$; then the node is selected with probability $\frac{(i-l)S_k^{i,j,c,l}}{m-k}$ and
    $$S_{k+1}^{i,j,c,l} = S_k^{i,j,c,l} - 1,\qquad S_{k+1}^{i,j,c,l+1} = S_k^{i,j,c,l+1} + 1,\qquad R_{k+1} = R_k, \quad (11)$$
    while the other entries of $S_{k+1}$ are the same as those of $S_k$.

  3. The selected node has a "distance to default" of one, i.e. $c - l = 1$; then the node is selected with probability $\frac{(i-c+1)S_k^{i,j,c,c-1}}{m-k}$ and, by assumption 2,
    $$S_{k+1}^{i,j,c,c-1} = S_k^{i,j,c,c-1} - 1,\qquad S_{k+1}^{i,j,c+1,c} = S_k^{i,j,c+1,c} + g_{k+1}^{(n)}(S_k, (i,j,c,c-1)) = S_k^{i,j,c+1,c} + u_{\frac{k}{n}}^{i,j,c,c-1},\qquad R_{k+1} = R_k + u_{\frac{k}{n}}^{i,j,c,c-1}, \quad (12)$$
    while the other entries of $S_{k+1}$ are the same as those of $S_k$.

Let $(\mathcal{F}_k)_{k=0,\ldots,m}$ be the natural filtration of $S_k$, $\Delta S_k^\alpha = S_{k+1}^\alpha - S_k^\alpha$, $\alpha\in\Gamma$, and $\Delta R_k = R_{k+1} - R_k$; it follows that

$$\begin{aligned}
\mathbb{E}[\Delta S_k^{i,j,c,0}\mid\mathcal{F}_k] &= -\frac{i\,S_k^{i,j,c,0}}{m-k} && \text{for } 1\le c\le i,\\
\mathbb{E}[\Delta S_k^{i,j,c,l}\mid\mathcal{F}_k] &= \frac{(i-l+1)S_k^{i,j,c,l-1}}{m-k} - \frac{(i-l)S_k^{i,j,c,l}}{m-k} && \text{for } 3\le c\le i,\ 1\le l\le c-2,\\
\mathbb{E}[\Delta S_k^{i,j,c,c-1}\mid\mathcal{F}_k] &= \frac{(i-c+2)S_k^{i,j,c-1,c-2}}{m-k}\,u_{\frac{k}{n}}^{i,j,c-1,c-2} + \frac{(i-c+2)S_k^{i,j,c,c-2}}{m-k} - \frac{(i-c+1)S_k^{i,j,c,c-1}}{m-k} && \text{for } 2\le c\le i,\\
\mathbb{E}[\Delta S_k^{i,j,i+1,i}\mid\mathcal{F}_k] &= \frac{S_k^{i,j,i,i-1}}{m-k}\,u_{\frac{k}{n}}^{i,j,i,i-1},\\
\mathbb{E}[\Delta R_k\mid\mathcal{F}_k] &= \sum_{(i,j,c,c-1)\in\Phi}\frac{(i-c+1)S_k^{i,j,c,c-1}}{m-k}\,u_{\frac{k}{n}}^{i,j,c,c-1}. \quad (13)
\end{aligned}$$

Convergence of the default contagion process with interventions

Based on the dynamics of the contagion process under interventions described in the previous section, we will show that the state variable Sk, the accumulative interventions Rk, the number of defaults Dk and the number of unrevealed out-links from the default set Dk- (defined later) after being scaled by n all converge to a deterministic process which depends on the solution of the system of ODEs we will present now. Then we are able to show that the stochastic control problem Eq (4) converges to the asymptotic control problem Eq (9).

Definition 4. (ODEs of $s_\tau$) Given a piecewise constant vector function $u = (u^\beta)_{\beta\in\Phi}$ on $[0,\lambda]$, i.e. $u\in\Pi$, define the system of ordinary differential equations (ODEs) for $s_\tau = (s_\tau^\alpha)_{\alpha\in\Gamma}$ as

$$\begin{aligned}
\frac{d s_\tau^{i,j,c,0}}{d\tau} &= -\frac{i\,s_\tau^{i,j,c,0}}{\lambda-\tau} && \text{for } 1\le c\le i,\\
\frac{d s_\tau^{i,j,c,l}}{d\tau} &= \frac{(i-l+1)s_\tau^{i,j,c,l-1}}{\lambda-\tau} - \frac{(i-l)s_\tau^{i,j,c,l}}{\lambda-\tau} && \text{for } 3\le c\le i,\ 1\le l\le c-2,\\
\frac{d s_\tau^{i,j,c,c-1}}{d\tau} &= \frac{(i-c+2)s_\tau^{i,j,c-1,c-2}}{\lambda-\tau}\,u_\tau^{i,j,c-1,c-2} + \frac{(i-c+2)s_\tau^{i,j,c,c-2}}{\lambda-\tau} - \frac{(i-c+1)s_\tau^{i,j,c,c-1}}{\lambda-\tau} && \text{for } 2\le c\le i,\\
\frac{d s_\tau^{i,j,i+1,i}}{d\tau} &= \frac{s_\tau^{i,j,i,i-1}}{\lambda-\tau}\,u_\tau^{i,j,i,i-1}. \quad (14)
\end{aligned}$$

The ODEs can be expressed in the form $\frac{ds_\tau}{d\tau} = h(\tau, s_\tau; u_\tau)$ where $h = (h^\alpha)_{\alpha\in\Gamma}$.
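As an illustration of how the ODE system can be used numerically, the sketch below (our own code; the forward-Euler scheme, the restriction to a single (i, j) class and all names are illustrative choices, not the paper's method) integrates Eq (14) under a given control u and compares one entry with the closed form Eq (18) in the no-intervention case.

```python
import math

def integrate_class(i, j, p_ijc, lam, u, tau_end, steps=20000):
    """Forward-Euler sketch for the ODE system (14) restricted to one (i, j) class.
    p_ijc[c] ~ p(i, j, c) for 0 <= c <= i; u(tau, c) in {0, 1} plays the role of
    u_tau^{i,j,c,c-1}. All names are illustrative."""
    s = {(c, l): (p_ijc[c] if l == 0 else 0.0) for c in range(1, i + 1) for l in range(c)}
    s[(i + 1, i)] = 0.0
    dtau = tau_end / steps
    for step in range(steps):
        tau = step * dtau
        denom = lam - tau
        ds = {}
        for (c, l) in s:
            if c == i + 1:                          # fed only by intervened (i, i-1) nodes
                ds[(c, l)] = s[(i, i - 1)] * u(tau, i) / denom
                continue
            flow = -(i - l) * s[(c, l)] / denom     # selected nodes leave state (c, l)
            if l >= 1:
                flow += (i - l + 1) * s[(c, l - 1)] / denom
            if l == c - 1 and c >= 2:               # intervened (c-1, c-2) nodes arrive here
                flow += (i - c + 2) * s[(c - 1, c - 2)] * u(tau, c - 1) / denom
            ds[(c, l)] = flow
        for key in s:
            s[key] += dtau * ds[key]
    return s

# sanity check against the binomial closed form (18) with no interventions
i, j, lam, tau = 3, 2, 2.5, 1.0
p_ijc = [0.0, 0.02, 0.03, 0.05]
s = integrate_class(i, j, p_ijc, lam, u=lambda t, c: 0, tau_end=tau)
closed_form = p_ijc[3] * math.comb(3, 1) * (1 - tau / lam) ** 2 * (tau / lam)
print(s[(3, 1)], closed_form)   # the two values should be close
```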

For what is needed below we analyze the solutions of the ODEs in definition 4 for a subinterval of [0, λ] on which uτ is a constant vector function.

Proposition 2. Let $s_\tau = (s_\tau^\alpha)_{\alpha\in\Gamma}$ satisfy the system of ordinary differential equations in definition 4 with the initial conditions $s_{\tau_1} = s_1 \equiv (s_1^\alpha)_{\alpha\in\Gamma}$ and assume $u_\tau$ is a constant vector function $u_\tau = b \equiv (b^\beta)_{\beta\in\Phi}$ on the interval $[\tau_1, \tau_2)\subseteq[0,\lambda)$, where each $b^\beta\in\{0,1\}$ is a constant; then the solution $s_\tau$ on $[\tau_1, \tau_2)$ is

$$s_\tau^{i,j,c,l} = \left(\frac{\lambda-\tau}{\lambda-\tau_1}\right)^{i-l}\sum_{r=0}^{l}s_1^{i,j,c,r}\binom{i-r}{l-r}\left(1-\frac{\lambda-\tau}{\lambda-\tau_1}\right)^{l-r}\quad\text{for } 2\le c\le i,\ 0\le l\le c-2, \quad (15)$$
$$s_\tau^{i,j,c,c-1} = \left(\frac{\lambda-\tau}{\lambda-\tau_1}\right)^{i-c+1}\sum_{r=0}^{c-1}\sum_{q=r+1}^{c}\left(\prod_{k=q}^{c-1}b^{i,j,k,k-1}\right)s_1^{i,j,q,r}\binom{i-r}{c-1-r}\left(1-\frac{\lambda-\tau}{\lambda-\tau_1}\right)^{c-1-r}\quad\text{for } 1\le c\le i, \quad (16)$$
$$s_\tau^{i,j,i+1,i} = s_1^{i,j,i+1,i} + \sum_{r=0}^{i-1}\sum_{q=r+1}^{i}\left(\prod_{k=q}^{i}b^{i,j,k,k-1}\right)s_1^{i,j,q,r}\left(1-\frac{\lambda-\tau}{\lambda-\tau_1}\right)^{i-r}, \quad (17)$$

where the empty product $\prod_{k=c}^{c-1}b^{i,j,k,k-1} \equiv 1$. As a direct result, if we take the initial condition $s_{\tau_1}^{i,j,c,l} = p(i,j,c)\,\mathbf{1}(l=0)$ for $(i,j,c,l)\in\Gamma$ at $\tau_1 = 0$, it follows that

$$s_\tau^{i,j,c,l} = p(i,j,c)\binom{i}{l}\left(1-\frac{\tau}{\lambda}\right)^{i-l}\left(\frac{\tau}{\lambda}\right)^{l}\quad\text{for } 2\le c\le i,\ 0\le l\le c-2. \quad (18)$$

Remark 5. We discuss some properties of $s_\tau$. Observe that the ODEs are "separable" in that $s_\tau^{i,j,c,l}$ only depends on the entries of $s_\tau$ and p with the same (i, j). Fix an (i, j) pair and define $\Gamma^{i,j} \equiv \{(c, l) : 0\le l < c\le i \text{ or } c=i+1, l=i\}$. If $u_\tau$ is a constant vector of 1's on $[\tau_1, \tau_2)\subseteq[0,\lambda)$, then we can show after some algebra that for any $\tau\in[\tau_1, \tau_2)$,

$$\sum_{(c,l)\in\Gamma^{i,j}}s_\tau^{i,j,c,l} = \sum_{(c,l)\in\Gamma^{i,j}}s_{\tau_1}^{i,j,c,l}. \quad (19)$$

If there exists some $c_0$ such that $1\le c_0\le i$ and $u_\tau^{i,j,c_0,c_0-1} = 0$, then $\sum_{(c,l)\in\Gamma^{i,j}}s_\tau^{i,j,c,l} < \sum_{(c,l)\in\Gamma^{i,j}}s_{\tau_1}^{i,j,c,l}$. Since the initial condition is $s_0^{i,j,c,l} = p(i,j,c)\,\mathbf{1}(l=0)$ for $1\le c\le i$, it follows that

$$\sum_{(c,l)\in\Gamma^{i,j}}s_\tau^{i,j,c,l} \le \sum_{1\le c\le i}p(i,j,c). \quad (20)$$

In the following our goal is to approximate $\frac{R_k}{n}$ and $\frac{D_k}{n}$ as n → ∞ given a function u. However, the number of variables depends on n, so we need to bound the terms associated with large in or out degrees. Fix ϵ > 0; by assumption 1 we have that

$$\lambda = \sum_{i,j,c}i\,p(i,j,c) = \sum_{i,j,c}j\,p(i,j,c) < \infty, \quad (21)$$

then there exists an integer Mϵ such that

$$\sum_{i\ge M_\epsilon}\sum_{j,c}i\,p(i,j,c) + \sum_{j\ge M_\epsilon}\sum_{i,c}j\,p(i,j,c) < \epsilon, \quad (22)$$

so letting $i\vee j = \max\{i, j\}$, we have

$$\sum_{i\vee j\ge M_\epsilon,\,c}j\,p(i,j,c) = \sum_{i\ge M_\epsilon}\sum_{j<M_\epsilon}\sum_{c}j\,p(i,j,c) + \sum_{i\ge M_\epsilon}\sum_{j\ge M_\epsilon}\sum_{c}j\,p(i,j,c) + \sum_{i<M_\epsilon}\sum_{j\ge M_\epsilon}\sum_{c}j\,p(i,j,c) \le \sum_{i\ge M_\epsilon}\sum_{j<M_\epsilon}\sum_{c}i\,p(i,j,c) + \sum_{i\ge M_\epsilon}\sum_{j\ge M_\epsilon}\sum_{c}j\,p(i,j,c) + \sum_{i<M_\epsilon}\sum_{j\ge M_\epsilon}\sum_{c}j\,p(i,j,c) < \epsilon. \quad (23)$$

We can prove similarly that there exists an integer $L_\epsilon$ such that $\sum_{i\vee j\ge L_\epsilon,\,c}i\,p(i,j,c) < \epsilon$, but without loss of generality we write $M_\epsilon$ instead of $L_\epsilon$ in what follows. Moreover, by assumption 1, as n → ∞,

$$\sum_{i,j,c}i\,P_n(i,j,c) = \sum_{i,j,c}j\,P_n(i,j,c) \to \lambda < \infty, \quad (24)$$

so for n large enough, we can show that

$$\sum_{i\vee j\ge M_\epsilon,\,c}j\,P_n(i,j,c) < \epsilon, \qquad \sum_{i\vee j\ge M_\epsilon,\,c}i\,P_n(i,j,c) < \epsilon. \quad (25)$$

So we define the integer Mϵ formally.

Definition 5. Given any ϵ > 0, define Mϵ as the integer such that

$$\sum_{i\vee j\ge M_\epsilon,\,c}i\,p(i,j,c) < \epsilon, \qquad \sum_{i\vee j\ge M_\epsilon,\,c}j\,p(i,j,c) < \epsilon. \quad (26)$$

Accordingly, define

$$\Gamma_\epsilon \equiv \{(i,j,c,l) : i\vee j<M_\epsilon,\ 0\le l<c\le i \text{ or } c=i+1, l=i\},\qquad \Phi_\epsilon \equiv \{(i,j,c,c-1) : i\vee j<M_\epsilon,\ 0\le c\le i\},\qquad \hat{\lambda} \equiv \lambda - \epsilon, \quad (27)$$

where $a\vee b = \max\{a, b\}$.

Next we show that the scaled state variables $S_k$ and $R_k$ converge in probability to the solution of the ODEs in definition 4 given the function u. The difficulty in proving proposition 3 arises from the fact that the right-hand sides of the ODEs for $s_\tau$ in definition 4 are discontinuous due to interventions, so the auxiliary Wormald's theorem in Appendix B: Wormald's theorem is not directly applicable and needs to be adapted.

Proposition 3. Consider a sequence of networks with initial conditions $(P_n)_{n\ge1}$ satisfying assumption 1 and let $(G_n)_{n\ge1}$ be the sequence of control policies for the contagion process on the sequence of networks, with $(G_n)_{n\ge1}$ satisfying assumption 2 with the function $u = (u^\beta)_{\beta\in\Phi_\epsilon}$; then

$$\sup_{0\le k\le n\hat{\lambda}}\left|\frac{S_k^\alpha}{n} - s_{\frac{k}{n}}^\alpha\right| = O\left(n^{-\frac{1}{4}}\right), \qquad \sup_{0\le k\le n\hat{\lambda}}\left|\frac{\tilde{R}_k}{n} - \tilde{r}_{\frac{k}{n}}\right| = O\left(n^{-\frac{1}{4}}\right), \quad (28)$$

with probability $1 - O\left(n^{\frac{1}{4}}\exp\left(-n^{\frac{1}{4}}\right)\right)$ and $\alpha\in\Gamma_\epsilon$, where $s_\tau = (s_\tau^\alpha)_{\alpha\in\Gamma}$ is the solution of the ODEs in definition 4 with the initial conditions $s_0^{i,j,c,l} = p(i,j,c)\,\mathbf{1}(l=0)$ and

$$\tilde{R}_0 = 0, \qquad \tilde{R}_k = \sum_{\ell=1}^{k}\sum_{\beta\in\Phi_\epsilon}\mathbf{1}(W_\ell=\beta)\,u_{\frac{\ell-1}{n}}^{\beta}, \quad (29)$$

and

$$\tilde{r}_\tau = \int_0^{\tau}\sum_{(i,j,c,c-1)\in\Phi_\epsilon}\frac{(i-c+1)\,s_t^{i,j,c,c-1}}{\lambda-t}\,u_t^{i,j,c,c-1}\,dt. \quad (30)$$

From proposition 3 we see that, given $(P_n)_{n\ge1}$ and $(G_n)_{n\ge1}$ satisfying assumption 1 and assumption 2 respectively, the scaled stochastic process $\frac{S_k}{n}$ converges to the deterministic process $s_{\frac{k}{n}}$ for any k in $[0, n\hat{\lambda}]$. This justifies assumption 2 on the control policy, because given a control policy $G_n$ depending on the function u, we can predict with high probability the scaled stochastic contagion process at any time k.

Next we discuss the convergence of the scaled number of defaults $\frac{D_k}{n}$ and the process end time $\frac{T_n}{n}$. Note that in definition 6, definition 7 and proposition 4 it is not required that $i\vee j < M_\epsilon$.

Definition 6. Define $D_k^-$ as the number of unrevealed out-links from the default set at step k.

Recall that $D_k$ denotes the number of defaulted nodes at step k, which consists of two parts: the nodes that have defaulted initially, $n\sum_{i,j}P_n(i,j,0)$, and those that are vulnerable initially and default by step k, i.e. $n\sum_{i,j,1\le c\le i}P_n(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}S_k^{i,j,c,l}$; thus

$$D_k = n\sum_{i,j}P_n(i,j,0) + n\sum_{i,j,1\le c\le i}P_n(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}S_k^{i,j,c,l} = n\sum_{i,j,0\le c\le i}P_n(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}S_k^{i,j,c,l}. \quad (31)$$

Similarly, among all defaulted nodes at step k, the nodes with out degree j consist of two parts: the nodes that have defaulted initially, $n\sum_{i}P_n(i,j,0)$, and those nodes that are vulnerable initially and default by step k, $n\sum_{i,1\le c\le i}P_n(i,j,c) - \sum_{i,\,0\le l<c\le i \text{ or } c=i+1,l=i}S_k^{i,j,c,l}$; thus

$$D_k^- = \sum_{j}j\left(n\sum_{i}P_n(i,j,0) + n\sum_{i,1\le c\le i}P_n(i,j,c) - \sum_{i,\,0\le l<c\le i \text{ or } c=i+1,l=i}S_k^{i,j,c,l}\right) - k = n\sum_{i,j,0\le c\le i}j\,P_n(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}j\,S_k^{i,j,c,l} - k. \quad (32)$$

Correspondingly we make the following definitions to approximate $\frac{D_k}{n}$ and $\frac{D_k^-}{n}$ as n → ∞.

Definition 7. Define

$$d_\tau = \sum_{i,j,0\le c\le i}p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}s_\tau^{i,j,c,l}, \qquad d_\tau^- = \sum_{i,j,0\le c\le i}j\,p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}j\,s_\tau^{i,j,c,l} - \tau. \quad (33)$$

Proposition 4. Based on definition 6 and definition 7, it follows that

$$\sup_{0\le k\le n\hat{\lambda}}\left|\frac{D_k^-}{n} - d_{\frac{k}{n}}^-\right| \le o_p(1) + 2\epsilon, \qquad \sup_{0\le k\le n\hat{\lambda}}\left|\frac{D_k}{n} - d_{\frac{k}{n}}\right| \le o_p(1) + 2\epsilon. \quad (34)$$

To summarize the results we have so far: we have shown in proposition 3 and proposition 4 that the state variable $S_k$, the accumulative interventions $R_k$, the number of defaults $D_k$ and the number of unrevealed out-links from the default set $D_k^-$, after being scaled by n, all converge to a deterministic process which depends on the solution of the ODEs in definition 4. This convergence applies to any k before $n\hat{\lambda}$. By definition 6, $T_n = \min\{0\le k\le m : D_k^- = 0\}$. Additionally define $\tau_f = \inf\{0\le\tau\le\lambda : d_\tau^- = 0\}$. Next we show that when $\frac{T_n}{n}$ converges in probability to $\tau_f$, then $\frac{R_{T_n}}{n}$ and $\frac{D_{T_n}}{n}$ also converge in probability to the corresponding deterministic variables, $r_{\tau_f}$ and $d_{\tau_f}$, which in light of the boundedness of $\frac{R_{T_n}}{n}$ and $\frac{D_{T_n}}{n}$ further implies convergence in expectation; thus the limits in Eq (9) are well defined.
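A minimal sketch of how $\tau_f$ can be located numerically from Eq (33), assuming per-class solutions $s_\tau$ are available (for example from the integration sketch above); the function names and the grid search are illustrative assumptions, not the paper's method:

```python
def d_and_dminus(s_by_class, p, tau):
    """Sketch of Eq (33). p[(i, j, c)] ~ p(i, j, c) for 0 <= c <= i and
    s_by_class[(i, j)][(c, l)] ~ s_tau^{i,j,c,l}; both are assumed precomputed."""
    d = sum(p.values()) - sum(v for cls in s_by_class.values() for v in cls.values())
    d_minus = sum(j * q for (i, j, c), q in p.items()) \
        - sum(j * v for (i, j), cls in s_by_class.items() for v in cls.values()) - tau
    return d, d_minus

def find_tau_f(s_of_tau, p, lam, grid=2000):
    """Locate tau_f = inf{tau : d_tau^- = 0} on a grid; s_of_tau(tau) is assumed to
    return the per-class solution at scaled time tau."""
    for k in range(grid + 1):
        tau = lam * k / grid
        _, d_minus = d_and_dminus(s_of_tau(tau), p, tau)
        if d_minus <= 0:
            return tau
    return lam
```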

Proposition 5. Consider a sequence of networks with initial conditions $(P_n)_{n\ge1}$ satisfying assumption 1 and let $(G_n)_{n\ge1}$ be the sequence of control policies for the contagion processes on the sequence of networks, with $(G_n)_{n\ge1}$ satisfying assumption 2 with the function u. If $\tau_f = \lambda$, or $\tau_f < \lambda$ and $\frac{d}{d\tau}d_{\tau_f}^- < 0$, it follows that as n → ∞,

$$\frac{R_{T_n}(G_n,P_n)}{n}\ \xrightarrow{p}\ r_{\tau_f}(u,p), \qquad \frac{D_{T_n}(G_n,P_n)}{n}\ \xrightarrow{p}\ d_{\tau_f}(u,p), \quad (35)$$

where

$$r_{\tau_f} = \int_0^{\tau_f}\sum_{(i,j,c,c-1)\in\Phi}\frac{(i-c+1)\,s_t^{i,j,c,c-1}}{\lambda-t}\,u_t^{i,j,c,c-1}\,dt. \quad (36)$$

Further it follows that as n → ∞,

$$\frac{\mathbb{E}R_{T_n}(G_n,P_n)}{n}\to r_{\tau_f}(u,p), \qquad \frac{\mathbb{E}D_{T_n}(G_n,P_n)}{n}\to d_{\tau_f}(u,p). \quad (37)$$

Under the conditions in proposition 5, the asymptotic control problem Eq (9) becomes

$$\min_{u\in\Pi}\ K\cdot r_{\tau_f}(u,p) + d_{\tau_f}(u,p). \quad (38)$$

In the following let $u_\tau = (u_\tau^\beta)_{\beta\in\Phi}$ and $u = (u_\tau)_{\tau\in[0,\lambda]}$.

Substituting the expressions of $r_{\tau_f}(u,p)$ and $d_{\tau_f}(u,p)$ in Eqs (36) and (33) respectively into Eq (38), and putting together the system of ordinary differential equations of $s_\tau = (s_\tau^\alpha)_{\alpha\in\Gamma}$, i.e. $\frac{d}{d\tau}s_\tau = h(\tau, s_\tau; u_\tau)$ in definition 4, as well as the condition that determines $\tau_f$, i.e. $d_{\tau_f}^- = 0$, we attain the following deterministic optimal control problem.

$$\begin{aligned}
\min_{u,\,\tau_f}\quad & K\cdot r_{\tau_f}(u,p) + d_{\tau_f}(u,p)\\
\text{s.t.}\quad & \frac{d}{d\tau}s_\tau = h(\tau, s_\tau; u_\tau)\\
& s_0^{i,j,c,l} = p(i,j,c)\,\mathbf{1}(l=0)\\
& d_{\tau_f}^- = 0\\
& u_\tau^\beta \in \{0,1\}\ \ \forall\beta\in\Phi\\
& \tau_f \in [0,\lambda),
\end{aligned} \quad (39)$$

where $\frac{d}{d\tau}s_\tau = h(\tau, s_\tau; u_\tau)$ is defined as in definition 4 and

$$r_{\tau_f}(u,p) = \int_0^{\tau_f}\sum_{i,j,1\le c\le i}\frac{(i-c+1)\,s_\tau^{i,j,c,c-1}}{\lambda-\tau}\,u_\tau^{i,j,c,c-1}\,d\tau,\qquad d_{\tau_f}(u,p) = \sum_{i,j,0\le c\le i}p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}s_{\tau_f}^{i,j,c,l},\qquad d_{\tau_f}^- = \sum_{i,j,0\le c\le i}j\,p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma}j\,s_{\tau_f}^{i,j,c,l} - \tau_f. \quad (40)$$

Some difficulties arise because Eq (39) is an infinite dimensional optimal control problem. In light of assumption 1, it suffices to solve a finite dimensional problem to approximate the objective function of the infinite dimensional problem. First we define the finite dimensional optimal control problem.

Definition 8. (FOCP) For ϵ > 0, recall $M_\epsilon$ as in definition 5. Define the finite dimensional optimal control problem (FOCP) as Eq (39) with the indexes (i, j) restricted to $i\vee j < M_\epsilon$.

Remark 6. The restriction of (i, j) to $i\vee j < M_\epsilon$ indicates that we use only $p(i,j,c)$, $i\vee j < M_\epsilon$, $0\le c\le i$ in the calculation. It is equivalent to setting $p(i,j,c) = 0$ for $i\vee j\ge M_\epsilon$, $0\le c\le i$ while keeping $p(i,j,c)$ for $i\vee j< M_\epsilon$, $0\le c\le i$ unchanged, which implies that asymptotically nodes with $i\vee j\ge M_\epsilon$ are all invulnerable. By the solution of the ODEs in proposition 2, it implies that $s_\tau^\alpha = 0$ for $\alpha\in\Gamma\setminus\Gamma_\epsilon$. Note that we use the tilde sign with variables to indicate that the indexes (i, j) are in the range $i\vee j< M_\epsilon$, for example $\tilde{r}_{\tau_f}$, $\tilde{d}_{\tau_f}$ and $\tilde{d}_{\tau_f}^-$.

We have the following lemma regarding the objective functions of the infinite and finite dimensional optimal control problems.

Lemma 1. Let $\zeta(u, \tau_f, p) \equiv K\,r_{\tau_f}(u,p) + d_{\tau_f}(u,p)$ be the objective function for the infinite dimensional problem Eq (39) and $\tilde{\zeta}(u, \tau_f, p) \equiv K\,\tilde{r}_{\tau_f}(u,p) + \tilde{d}_{\tau_f}(u,p)$ for (FOCP). Let $(u^*, \tau_f^*)$ and $(\tilde{u}, \tilde{\tau}_f)$ be the optimal solutions for the infinite dimensional problem Eq (39) and (FOCP), respectively; then for the same p we have that

$$\left|\tilde{\zeta}(\tilde{u}, \tilde{\tau}_f, p) - \zeta(u^*, \tau_f^*, p)\right| < (K+1)\,\epsilon. \quad (41)$$

By lemma 1 we only need to solve the finite dimensional optimal control problem (FOCP) in definition 8. Because ϵ can be arbitrarily small, we can approximate the objective function of the infinite dimensional problem to any precision. Given p for (FOCP), Pontryagin's maximum principle provides the necessary conditions for the optimal control $\tilde{u}$ and end time $\tilde{\tau}_f$. We can obtain the optimal asymptotic number of interventions $\tilde{r}_{\tilde{\tau}_f}$ and fraction of final defaults $\tilde{d}_{\tilde{\tau}_f}$, which lead to the main results of our work. In the next section we focus on solving (FOCP) and suppress the tilde sign on the variables for notational convenience.

Necessary conditions for the optimal control problem

In the following we solve the finite dimensional optimal control problem (FOCP) in definition 8. Throughout this section we understand that the degrees are in the bounded range $i\vee j < M_\epsilon$ unless specified otherwise. We also suppress the tilde sign for notational convenience. Let $t = t(\tau) \equiv -\ln(\lambda-\tau)$, $t_0 \equiv t(0) = -\ln\lambda$ and $t_f \equiv -\ln(\lambda-\tau_f)$. Note that $t(\tau)$ is a strictly increasing function of τ and so is the inverse function $\tau = \tau(t)$. We remark that we assume in the following that $\tau < \lambda$, which implies $t_f < \infty$, but later we can see that the solutions of $s_\tau$, $u_\tau$ and $w_\tau$ do allow $\tau = \lambda$. Then we can reformulate the optimal control problem Eq (39) into an autonomous one, i.e. one in which the differential equations of the system dynamics do not depend on time explicitly. Let $u_t = (u_t^\beta)_{\beta\in\Phi}$ and $u = (u_t)_{t\ge t_0}$ (note that previously $u = (u_\tau)_{\tau\in[0,\lambda]}$; additionally, we allow an arbitrary starting time $t_0$ here).

$$\begin{aligned}
\min_{u,\,t_f}\quad & K\cdot r_{t_f}(u,p) + d_{t_f}(u,p) && (42)\\
\text{s.t.}\quad & \frac{d}{dt}s_t = h(s_t; u_t)\\
& s_{t_0}^{i,j,c,l} = p(i,j,c)\,\mathbf{1}(l=0)\\
& d_{t_f}^- = 0\\
& u_t^\beta \in \{0,1\},\ \forall\beta\in\Phi_\epsilon\\
& t_f \in [0,\infty), && (43)
\end{aligned}$$

where $\frac{d}{dt}s_t = h(s_t; u_t)$ denotes the system of differential equations

$$\begin{aligned}
\frac{ds_t^{i,j,c,0}}{dt} &= -i\,s_t^{i,j,c,0} && \text{for } 1\le c\le i,\\
\frac{ds_t^{i,j,c,l}}{dt} &= (i-l+1)\,s_t^{i,j,c,l-1} - (i-l)\,s_t^{i,j,c,l} && \text{for } 3\le c\le i,\ 1\le l\le c-2,\\
\frac{ds_t^{i,j,c,c-1}}{dt} &= (i-c+2)\,s_t^{i,j,c-1,c-2}\,u_t^{i,j,c-1,c-2} + (i-c+2)\,s_t^{i,j,c,c-2} - (i-c+1)\,s_t^{i,j,c,c-1} && \text{for } 2\le c\le i,\\
\frac{ds_t^{i,j,i+1,i}}{dt} &= s_t^{i,j,i,i-1}\,u_t^{i,j,i,i-1}, \quad (44)
\end{aligned}$$

and

$$r_{t_f}(u,p) = \int_{t_0}^{t_f}\sum_{i,j,1\le c\le i}(i-c+1)\,s_t^{i,j,c,c-1}\,u_t^{i,j,c,c-1}\,dt, \quad (45)$$
$$d_{t_f}(u,p) = \sum_{i,j,0\le c\le i}p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma_\epsilon}s_{t_f}^{i,j,c,l}, \quad (46)$$
$$d_{t_f}^- = \sum_{i,j,0\le c\le i}j\,p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma_\epsilon}j\,s_{t_f}^{i,j,c,l} - \tau_f = \sum_{i,j,0\le c\le i}j\,p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma_\epsilon}j\,s_{t_f}^{i,j,c,l} - \lambda\left(1-e^{t_0-t_f}\right). \quad (47)$$

Note that Eq (47) follows from $\frac{1}{\lambda} = e^{t_0}$ and thus $\tau_f = \lambda - e^{-t_f} = \lambda\left(1 - \frac{1}{\lambda}e^{-t_f}\right) = \lambda\left(1 - e^{t_0-t_f}\right)$.

To determine the necessary conditions for the optimal terminal time $t_f^*$ and optimal control $u_t^*$ in Eq (42), we need the Extended Pontryagin Maximum Principle in Appendix C: Extended Pontryagin maximum principle. Then we put together the objective function Eq (43) and the necessary conditions to construct the optimization problem Eq (86) we will introduce later.

Applying the Extended Pontryagin Maximum Principle to the optimal control problem Eq (42) yields the following necessary conditions of optimality. Note that in order to simplify notation, we suppress the asterisk "*" in the following. In other words, we use $s_t$, $u_t$, $w_t$, $v$, $t_f$ instead of $s_t^*$, $u_t^*$, $w_t^*$, $v^*$, $t_f^*$ to denote the optimal values.

Proposition 6. (Necessary conditions of optimality) Let (st, ut)t∈[t0, tf] be the optimal state and control trajectory of Eq (42) where tf is the optimal terminal time. Define the Hamiltonian function

$$H(s_t, u_t, w_t) = \sum_{i,j,1\le c\le i}w_t^{i,j,c,0}\left(-i\,s_t^{i,j,c,0}\right) + \sum_{i,j,2\le c\le i,\,1\le l\le c-1}w_t^{i,j,c,l}\left[(i-l+1)\,s_t^{i,j,c,l-1} - (i-l)\,s_t^{i,j,c,l}\right] + \sum_{i,j,2\le c\le i+1}\left(K + w_t^{i,j,c,c-1}\right)(i-c+2)\,s_t^{i,j,c-1,c-2}\,u_t^{i,j,c-1,c-2}, \quad (48)$$

then there exists a piecewise continuously differentiable vector function $w_t = (w_t^\alpha)_{\alpha\in\Gamma}\in\hat{C}^1[t_0,\infty)^{\Gamma}$ and a scalar $v\in\mathbb{R}$ such that the following conditions hold:

  1. The optimal control $u_t$ satisfies that $\forall t\in[t_0, t_f]$, $1\le c\le i$, if $s_t^{i,j,c,c-1} > 0$,
    $$u_t^{i,j,c,c-1} = \begin{cases} 0 & \text{if } w_t^{i,j,c+1,c} > -K,\\ 1 & \text{if } w_t^{i,j,c+1,c} < -K,\\ 0 \text{ or } 1 & \text{if } w_t^{i,j,c+1,c} = -K, \end{cases} \quad (49)$$
    and if $s_t^{i,j,c,c-1} = 0$,
    $$u_t^{i,j,c,c-1} = 0 \text{ or } 1. \quad (50)$$
  2. For $2\le c\le i$, $0\le l\le c-2$,
    $$\frac{d}{dt}w_t^{i,j,c,l} = (i-l)\left(w_t^{i,j,c,l} - w_t^{i,j,c,l+1}\right), \quad (51)$$
    and for $1\le c\le i$,
    $$\frac{d}{dt}w_t^{i,j,c,c-1} = (i-c+1)\left(w_t^{i,j,c,c-1} - \left(K + w_t^{i,j,c+1,c}\right)u_t^{i,j,c,c-1}\right), \quad (52)$$
    and
    $$\frac{d}{dt}w_t^{i,j,i+1,i} = 0. \quad (53)$$

    We denote the set of ordinary differential equations for $w_t$ as $\frac{d}{dt}w_t = h(w_t; u_t)$.

  3. $H(s_t, u_t, w_t)$ is a constant for $t\in[t_0, t_f]$.

  4. Transversal conditions
    $$w_{t_f}^{i,j,c,l} = vj - 1 \quad \forall (i,j,c,l)\in\Gamma_\epsilon, \quad (54)$$
    $$H(s_{t_f}, u_{t_f}, w_{t_f}) = -v\,e^{-t_f}, \quad (55)$$
    $$d_{t_f}^- = \sum_{i,j,0\le c\le i}j\,p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma_\epsilon}j\,s_{t_f}^{i,j,c,l} - \lambda\left(1-e^{t_0-t_f}\right) = 0. \quad (56)$$

Remark 7. (Singular control policy) Observe that if the coefficient of $u_t^{i,j,c,c-1}$ in the Hamiltonian function $H(s_t, u_t, w_t)$ in Eq (48), i.e. $(i-c+1)\left(K + w_t^{i,j,c+1,c}\right)s_t^{i,j,c,c-1}$, vanishes, then both $u_t^{i,j,c,c-1} = 0$ and $u_t^{i,j,c,c-1} = 1$ satisfy the conditions of the Extended Pontryagin Maximum Principle in Appendix C: Extended Pontryagin maximum principle, i.e. minimizing $H(s_t, u_t, w_t)$. In other words, the Extended Pontryagin Maximum Principle cannot determine the optimal control $u_t^{i,j,c,c-1}$ in this case. Moreover, since $i - c + 1 > 0$, if $\left(K + w_t^{i,j,c+1,c}\right)s_t^{i,j,c,c-1} = 0$ can be sustained over some interval $(\theta_1, \theta_2)\subset[t_0, t_f]$, then $u_t^{i,j,c,c-1}$ can be 0 or 1 at any time on $(\theta_1, \theta_2)$ and switch arbitrarily often between 0 and 1. In the terminology of optimal control theory, the control $u_t$ is "singular" on $(\theta_1, \theta_2)$ and the corresponding portion of the state trajectory $s_t$ on $(\theta_1, \theta_2)$ is called a singular arc. Further note that $\left(K + w_t^{i,j,c+1,c}\right)s_t^{i,j,c,c-1} = 0$, $t\in(\theta_1, \theta_2)$, implies two cases: $s_t^{i,j,c,c-1} = 0$, or $s_t^{i,j,c,c-1} > 0$ and $w_t^{i,j,c+1,c} = -K$, on $(\theta_1, \theta_2)$. We can show that in the first case any feasible control function $u_t^{i,j,c,c-1}$ will not affect the other entries of $s_t$, and that the second case only occurs when $c = i$ and $(\theta_1, \theta_2) = (t_0, t_f)$.

Solutions of the necessary conditions

Throughout this section we understand that the degrees are in the bounded range $i\vee j < M_\epsilon$ unless specified otherwise. It is well known in the control theory community that solving for the optimal $(s_t, u_t, w_t)$ from the necessary conditions presented in proposition 6 is difficult, especially analytically. In what follows, we solve for the optimal $(s_t, u_t, w_t)$ in three steps.

First, solve the two-point boundary value problem (TPBVP) consisting of the differential equations for $s_t$ in Eq (42) and $w_t$ in condition (2) of proposition 6, where for $s_t$ the boundary values are given at $t = t_0$ and for $w_t$ at $t = t_f$, as follows.

$$\frac{d}{dt}s_t = h(s_t; u_t),\quad s_{t_0}^{i,j,c,l} = p(i,j,c)\,\mathbf{1}(l=0)\ \ \forall(i,j,c,l)\in\Gamma_\epsilon,\qquad \frac{d}{dt}w_t = h(w_t; u_t),\quad w_{t_f}^{i,j,c,l} = vj - 1\ \ \forall(i,j,c,l)\in\Gamma_\epsilon, \quad (57)$$

and additionally the optimal control policy $u_t$ satisfies Eq (50) in condition 1 of proposition 6. We solve for $(s_t, u_t, w_t)$ in terms of the auxiliary variables $(v, t_f, t_s)$, and in the following we refer to these expressions in terms of $(v, t_f, t_s)$ as the solutions of $(s_t, u_t, w_t)$.

Second, because the optimal (st, ut, wt) satisfies the two equations in the necessary conditions of proposition 6, i.e. the Hamiltonian function Eq (55) at t = tf and the terminal condition Eq (56), while minimizing the objective function Eq (43), we have the following optimization problem for (st, ut, wt):

$$\begin{aligned}
\min_{s_t, u_t, w_t}\quad & K\cdot r_{t_f}(u,p) + d_{t_f}(u,p) && (59)\\
\text{s.t.}\quad & H(s_{t_f}, u_{t_f}, w_{t_f}) = -v\,e^{-t_f} && (60)\\
& d_{t_f}^- = 0, && (61)
\end{aligned} \qquad (58)$$

where

$$r_{t_f}(u,p) = \int_{t_0}^{t_f}\sum_{i,j,1\le c\le i}(i-c+1)\,s_t^{i,j,c,c-1}\,u_t^{i,j,c,c-1}\,dt,\qquad d_{t_f}(u,p) = \sum_{i,j,0\le c\le i}p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma_\epsilon}s_{t_f}^{i,j,c,l},\qquad d_{t_f}^- = \sum_{i,j,0\le c\le i}j\,p(i,j,c) - \sum_{(i,j,c,l)\in\Gamma_\epsilon}j\,s_{t_f}^{i,j,c,l} - \lambda\left(1-e^{t_0-t_f}\right). \quad (62)$$

After substituting (st, ut, wt) expressed in terms of (v, tf, ts) into the optimization problem (58), we are able to solve the optimal (v, tf, ts) based on which we can calculate the optimal (st, ut, wt).

Third, we change the variables from (v, tf, ts) to (v, y, z) for further simplification and attain the optimization problem Eq (86) later.

Now we carry out the first step. The main difficulty of solving problem Eq (57) comes from the fact that $w_t$ and $s_t$ depend on $u_t$, which in turn depends on $w_t$ and $s_t$ recursively through Eq (50). To disentangle the recursive dependence, the idea is to analyze the properties of $s_t$, based on which we can derive either the properties of $w_t$ or the explicit solutions of $w_t$, depending on the sign of $vj - 1 + K$. Then by Eq (50) we attain the optimal control policy $u_t$, which leads to the solution of $s_t$.

It turns out that we only need $w_{t_f}$ as well as $u_t$ and $s_t$ in (58). From problem Eq (57) we know that $w_{t_f}^\beta = vj - 1$ for $\beta\in\Gamma_\epsilon$. For $u_t$ and $s_t$, we give their solutions directly below due to the limited space of the paper; the solutions of $u_t$ and $s_t$ can be verified by substituting them into problem Eq (57).

Lemma 2. The optimal control policy $u_t$ in terms of the variables $(v, t_f, t_s)$ is given below.

For $1\le c\le i$, except the case $c = i$ and $vj - 1 = -K$, $\forall t\in[t_0, t_f]$,

$$u_t^{i,j,c,c-1} = \mathbf{1}\left(t \ge t^{i,j,c}\right), \quad (63)$$

where

$$t^{i,j,c} = \begin{cases} t_f & \text{if } vj-1 \ge -K,\\ t_f + \ln\left(1 + \frac{K+vj-1}{(i-c)K}\right) & \text{if } vj-1 < -K \text{ and } 1 \le c < i + \frac{K+vj-1}{K}\left(1-e^{t_0-t_f}\right),\\ t_0 & \text{otherwise.} \end{cases} \quad (64)$$

If $vj - 1 = -K$, $\forall t\in[t_0, t_f]$,

$$u_t^{i,j,i,i-1} = \mathbf{1}(t\ge t_s)\quad\text{for some } t_s\in[t_0, t_f]. \quad (65)$$

The following is the solution for st.

Lemma 3. Letting p(i, j, i + 1) = 0, under the optimal control policy in lemma 2, we have the following solutions of st for the two-point boundary value problem (TPBVP).

  1. For 2 ≤ c ≤ i, 0 ≤ l ≤ c − 2, and for c = 1, l = 0,
    s_t^{i,j,c,l} = p(i,j,c) \binom{i}{l} (e^{t_0−t})^{i−l} (1 − e^{t_0−t})^{l}. (66)
  2. If vj − 1 < −K, consider t ∈ [t^{i,j,h}, t^{i,j,h−1}) for some 1 ≤ h ≤ i, where
    t^{i,j,h} = { t_f, if h = 0;  t_f + ln(1 + (K + vj − 1)/((i − h)K)), if 1 ≤ h < i + (K + vj − 1)/(K(1 − e^{t_0−t_f}));  t_0, otherwise. (67)
    If 1 ≤ c < h,
    s_t^{i,j,c,c−1} = p(i,j,c) \binom{i}{c−1} (e^{t_0−t})^{i−c+1} (1 − e^{t_0−t})^{c−1}. (68)
    If h ≤ c ≤ i + 1,
    s_t^{i,j,c,c−1} = \binom{i}{c−1} (e^{t_0−t})^{i−c+1} Σ_{m=h}^{c} p(i,j,m) Σ_{n=0}^{m−1} \binom{c−1}{n} (1 − e^{t_0−t^{i,j,m}})^{n} (e^{t_0−t^{i,j,m}} − e^{t_0−t})^{c−1−n}. (69)
  3. If vj − 1 > −K, for 1 ≤ c ≤ i + 1 and t ∈ [t0, tf],
    s_t^{i,j,c,c−1} = p(i,j,c) \binom{i}{c−1} (e^{t_0−t})^{i−c+1} (1 − e^{t_0−t})^{c−1}. (70)
  4. If vj − 1 = −K, for 1 ≤ c ≤ i and t ∈ [t0, tf],
    s_t^{i,j,c,c−1} = p(i,j,c) \binom{i}{c−1} (e^{t_0−t})^{i−c+1} (1 − e^{t_0−t})^{c−1}, (71)
    and
    s_t^{i,j,i+1,i} = p(i,j,i) 1(t ≥ t_s) [(1 − e^{t_0−t})^{i} − (1 − e^{t_0−t_s})^{i}], (72)
    where ts ∈ [t0, tf].

Since Eqs (56) and (55) require the state variable value particularly at t = tf, we can apply lemma 3 at t = tf to obtain stf. Next we proceed to the second step, i.e. we substitute (st, ut, wtf) in terms of (v, tf, ts) into the optimization program (58), which leads to the following results.

Proposition 7. Based on the solutions of the optimal st in lemma 3 (particularly stf), ut in lemma 2 and w_{t_f}^{α} = vj − 1, ∀α ∈ Γϵ, letting

t^{i,j,c} = { t_f, if K + vj − 1 ≥ 0 or c = 0;  t_f + ln(1 + (K + vj − 1)/((i − c)K)), if K + vj − 1 < 0 and 1 ≤ c < i + (K + vj − 1)/(K(1 − e^{t_0−t_f}));  t_0, otherwise, (73)

the Hamiltonian equation Eq (60) at t = tf is equivalent to

Σ_j max{−K, vj − 1} Σ_i i Σ_{c=1}^{i} p(i,j,c) Σ_{m=c}^{i} \binom{i−1}{m−1} (e^{t_0−t_f})^{i−m+1} Σ_{n=0}^{c−1} \binom{m−1}{n} (1 − e^{t_0−t^{i,j,c}})^{n} (e^{t_0−t^{i,j,c}} − e^{t_0−t_f})^{m−1−n} = vλ e^{t_0−t_f}. (74)

The terminal condition Eq (61) is equivalent to

Σ_i Σ_j j { Σ_{c=0}^{i} p(i,j,c) Σ_{n=c}^{i} \binom{i}{n} (1 − e^{t_0−t^{i,j,c}})^{n} (e^{t_0−t^{i,j,c}})^{i−n} − 1(vj − 1 = −K) p(i,j,i) [(1 − e^{t_0−t_f})^{i} − (1 − e^{t_0−t_s})^{i}] } = λ(1 − e^{t_0−t_f}). (75)

And the objective function Eq (59) becomes

K · r_{t_f}(u, p) + d_{t_f}(u, p) = K Σ_i Σ_j { Σ_{c=1}^{i} p(i,j,c) Σ_{m=c}^{i} Σ_{n=0}^{c−1} (m − c + 1) \binom{i}{m} \binom{m}{n} (e^{t_0−t_f})^{i−m} (1 − e^{t_0−t^{i,j,c}})^{n} (e^{t_0−t^{i,j,c}} − e^{t_0−t_f})^{m−n} + 1(vj − 1 = −K) p(i,j,i) [(1 − e^{t_0−t_f})^{i} − (1 − e^{t_0−t_s})^{i}] } + Σ_j Σ_i { Σ_{c=0}^{i} p(i,j,c) Σ_{n=c}^{i} \binom{i}{n} (1 − e^{t_0−t^{i,j,c}})^{n} (e^{t_0−t^{i,j,c}})^{i−n} − 1(vj − 1 = −K) p(i,j,i) [(1 − e^{t_0−t_f})^{i} − (1 − e^{t_0−t_s})^{i}] }. (76)

For the third step, we further simplify the expressions in proposition 7. Define

y ≔ 1 − e^{t_0−t_f} = τ_f/λ,  z ≔ 1 − e^{t_0−t_s},  x^{i,j,c,c−1} ≔ 1 − e^{t_0−t^{i,j,c}} = { y, if K + vj − 1 ≥ 0 or c = 0;  1 − (1 − y)(i − c)K/((i − c + 1)K + vj − 1), if K + vj − 1 < 0 and 1 ≤ c < i + (K + vj − 1)/(Ky);  0, otherwise, (77)

where the first equality follows from t0 = −ln λ and tf = −ln(λ − τf). Because t0 ≤ ts ≤ tf and the function 1 − e^{t_0−t} is increasing in t, we have 0 ≤ z ≤ y ≤ 1. As a result, we can substitute the new variables (y, v, z) into the objective function Eq (59), the Hamiltonian equation Eq (60) and the terminal condition Eq (61). Moreover, we add the definition of x^{i,j,c,c−1} and the constraint 0 ≤ z ≤ y ≤ 1. We then obtain a new optimization problem, defined as Eq (86) in section Main results. After solving Eq (86) for (y*, v*, z*), we can calculate u_t^* and s_t^* (or u_τ^* and s_τ^* after changing the time index) in order to present theorem 2 and theorem 3.

Main results

Contagion process with no interventions

We first present the theorem for the case when no interventions are provided in the contagion process. For ϵ > 0, recall that Mϵ is defined as in definition 5 and note that all indexes (i, j) are in the range i ∨ j < Mϵ in what follows.

Definition 9. (I function) Define a function I: [0, 1] → [0, 1] as

I(y) ≔ (1/λ) Σ_{i∨j<Mϵ} j Σ_{c=0}^{i} p(i,j,c) P(Bin(i, y) ≥ c), (78)

where Bin(i, y) denotes a binomial random variable with i trials and the probability of occurrence y, so P(Bin(i, y) ≥ c) = Σ_{m=c}^{i} \binom{i}{m} y^{m}(1 − y)^{i−m}. I(y) is constructed to represent the asymptotic scaled total out degree from the default set at scaled time y and attains its form Eq (78) from the solution of a set of differential equations.

Since I(0) = (1/λ) Σ_{i∨j<Mϵ} j p(i,j,0) ≥ 0, and from the definition of λ,

I(1) = (1/λ) Σ_{i∨j<Mϵ} j Σ_{c=0}^{i} p(i,j,c) ≤ 1, (79)

and I(y) is continuous and increasing, it has at least one fixed point in [0, 1]. Further define

J(y) ≔ Σ_{i∨j<Mϵ} Σ_{c=0}^{i} p(i,j,c) P(Bin(i, y) ≥ c). (80)
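For illustration, the following Python sketch (not part of the paper's computations) evaluates I and J for an assumed toy distribution p(i, j, c) and locates the smallest fixed point of I, which by theorem 1 below governs the final default fraction. All numerical values in the sketch are illustrative assumptions.

```python
import numpy as np
from scipy.stats import binom

# p[(i, j, c)]: limiting fraction of nodes with in degree i, out degree j, initial equity c.
p = {
    (2, 2, 0): 0.05,   # initially defaulted nodes
    (2, 2, 1): 0.45,   # vulnerable: one unit of equity
    (3, 3, 2): 0.30,   # vulnerable: two units of equity
    (3, 3, 4): 0.20,   # c > i: invulnerable nodes
}
lam = sum(j * q for (i, j, c), q in p.items())          # asymptotic mean degree lambda

def I(y):
    """Eq (78): scaled total out degree of the default set at scaled time y."""
    return sum(j * q * binom.sf(c - 1, i, y)            # binom.sf(c-1, i, y) = P(Bin(i, y) >= c)
               for (i, j, c), q in p.items() if c <= i) / lam

def J(y):
    """Eq (80): asymptotic fraction of defaulted nodes at scaled time y."""
    return sum(q * binom.sf(c - 1, i, y) for (i, j, c), q in p.items() if c <= i)

# Smallest fixed point of I on [0, 1]: first grid point where I(y) <= y.
ys = np.linspace(0.0, 1.0, 10001)
y_star = ys[np.argmax(np.array([I(y) for y in ys]) <= ys)]
print(f"y* = {y_star:.4f},  final default fraction J(y*) = {J(y_star):.4f}")
```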

Theorem 1. (Extends theorem 3.8 of [24]) Consider a sequence of networks with initial conditions (Pn)n≥1 satisfying assumption 1, where p = (p(i,j,c))_{i,j,0≤c≤i} is such that p(i,j,c) = 0 for i ∨ j ≥ Mϵ, 0 ≤ c ≤ i, and suppose no interventions are implemented. Let y* ∈ [0, 1] be the smallest fixed point of I.

  1. If y* = 1, then asymptotically almost all nodes default during the contagion process, i.e.
    D_{T_n}/n →^{p} 1. (81)
  2. If y* < 1 and it is a stable fixed point, i.e. I′(y*) < 1, then asymptotically the fraction of final defaulted nodes satisfies
    D_{T_n}/n →^{p} J(y*). (82)
    Particularly, if I(0) = 0 and I′(0) < 1, then
    D_{T_n}/n →^{p} Σ_{i∨j<Mϵ} p(i,j,0). (83)

Remark 8. Theorem 1 states that the stopping time of the default contagion process is governed by the smallest fixed point of the function I(y), and that the asymptotic fraction of defaulted nodes at the end of the process can be 1, 0 or a fraction in between, corresponding to almost all nodes defaulting, almost no nodes defaulting, or a positive fraction of the nodes defaulting, respectively.

Contagion process with interventions

We present the theorems for the contagion process with interventions as the result of solving the finite dimensional optimal control problem (FOCP) in definition 8. For ϵ > 0, recall that Mϵ is defined as in definition 5. By lemma 1, the optimal objective function value of (FOCP) is within (K + 1)ϵ distance of the optimal objective function value of the infinite dimensional Eq (39). That is why we solve (FOCP) and present the results below with (i, j) in the range i ∨ j < Mϵ. From remark 6, for a given vector p = (p(i, j, c))_{0≤i, 0≤j, 0≤c≤i}, the restriction of (i, j) to i ∨ j < Mϵ indicates that we use only p(i, j, c), i ∨ j < Mϵ, 0 ≤ c ≤ i in the calculation. It is equivalent to setting p(i, j, c) = 0 for i ∨ j ≥ Mϵ, 0 ≤ c ≤ i while keeping p(i, j, c), i ∨ j < Mϵ, 0 ≤ c ≤ i unchanged, which implies that asymptotically nodes with i ∨ j ≥ Mϵ are all invulnerable (note that the p(i, j, c) need not sum to one).

First we define the optimization problem Eq (86) based on which we present theorem 2 and theorem 3.

Definition 10. (I˜ and J˜ functions) Let x = (x^β)_{β∈Φϵ} where x^β = x^β(y, v), and let p = (p(i,j,c))_{i,j,0≤c≤i}. We define the functions I˜(y,v,z) and J˜(y,v,z) as

I˜(y,v,z) = (1/λ) Σ_{i∨j<Mϵ} j [ Σ_{c=0}^{i} p(i,j,c) P(Bin(i, x^{i,j,c,c−1}) ≥ c) − 1(vj − 1 = −K) p(i,j,i)(y^{i} − z^{i}) ], (84)
J˜(y,v,z) = Σ_{i∨j<Mϵ} [ Σ_{c=0}^{i} p(i,j,c) P(Bin(i, x^{i,j,c,c−1}) ≥ c) − 1(vj − 1 = −K) p(i,j,i)(y^{i} − z^{i}) ]. (85)

Note we may write I˜(y;v,z) and J˜(y;v,z) to indicate that we treat y as the variable and v, z as the fixed parameters. To present the main results, we define the following optimization problem.

Definition 11. Define the following optimization problem (86):

min_{y,v,z}  K · r˜(y,v,z) + J˜(y,v,z) (87)
s.t.  (1 − y) H˜(y,v) = λv(1 − y), (88)
  I˜(y,v,z) = y, (89)
  x^{i,j,c,c−1} = { y, if K + vj − 1 ≥ 0 or c = 0;  1 − (1 − y)(i − c)K/((i − c + 1)K + vj − 1), if K + vj − 1 < 0 and 1 ≤ c < i + (K + vj − 1)/(Ky);  0, otherwise },  ∀(i, j, c, c − 1) ∈ Φϵ, (90)
  0 ≤ z ≤ y ≤ 1, (91)

where

r˜(y,v,z) = Σ_{i∨j<Mϵ} { Σ_{c=1}^{i} p(i,j,c) Σ_{m=c}^{i} Σ_{n=0}^{c−1} (m − c + 1) P(Multin(i, x^{i,j,c,c−1}, y − x^{i,j,c,c−1}, 1 − y) = (n, m − n, i − m)) + 1(vj − 1 = −K) p(i,j,i)(y^{i} − z^{i}) },
H˜(y,v) = Σ_{i∨j<Mϵ} max{−K, vj − 1} i Σ_{c=1}^{i} p(i,j,c) [P(Bin(i − 1, y) ≥ c − 1) − P(Bin(i − 1, x^{i,j,c,c−1}) ≥ c)], (92)

where Bin(i, y) denotes a binomial random variable with i trials and the probability of occurrence y, so P(Bin(i, y) ≥ c) = Σ_{m=c}^{i} \binom{i}{m} y^{m}(1 − y)^{i−m}, and Multin(i, x, y, 1 − x − y) = (a, b, i − a − b) denotes the event that a multinomial random vector with i trials and category probabilities x, y and 1 − x − y has a, b and i − a − b occurrences of the three categories, respectively, so P(Multin(i, x, y, 1 − x − y) = (a, b, i − a − b)) = \binom{i}{a, b, i−a−b} x^{a} y^{b} (1 − x − y)^{i−a−b}.
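As a quick sanity check on the two probability expressions above (it is not part of the paper's derivation), the following sketch compares the closed forms with standard library routines; the numbers i, c, y, x, a, b are arbitrary illustrative choices.

```python
from math import comb
from scipy.stats import binom, multinomial

i, c, y = 6, 2, 0.3
tail_closed_form = sum(binom.pmf(m, i, y) for m in range(c, i + 1))   # sum_{m=c}^{i} C(i,m) y^m (1-y)^{i-m}
tail_sf = binom.sf(c - 1, i, y)                                       # P(Bin(i, y) >= c)
print(tail_closed_form, tail_sf)                                      # identical up to rounding

x, a, b = 0.2, 1, 2                                                   # category probabilities x, y, 1-x-y; counts a, b, i-a-b
tri_scipy = multinomial.pmf([a, b, i - a - b], n=i, p=[x, y, 1 - x - y])
tri_direct = comb(i, a) * comb(i - a, b) * x**a * y**b * (1 - x - y)**(i - a - b)
print(tri_scipy, tri_direct)                                          # identical up to rounding
```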

Remark 9. A feasible solution (y, v, z) has a natural interpretation for the optimal control problem Eq (39) on the deterministic process (sτ)τ∈[0,λ]: y = τf/λ is the scaled end time of the process; v is an intervention indicator in that we should intervene on nodes with out degree j satisfying vj − 1 ≤ −K, and v also determines the starting time of the intervention; z specifies the starting time of the intervention for nodes in state (i, j, i, i − 1) when vj − 1 = −K. Moreover, the auxiliary variables (x^{i,j,c,c−1})_{(i,j,c,c−1)∈Φϵ} specify the starting times of interventions for nodes with different (i, j) values.

Then we are ready to present the next main theorem about the optimal control policy.

Theorem 2. For the asymptotic control problem Eq (9), suppose the following.

  1. Consider a sequence of networks with initial conditions (Pn)n≥1 satisfying assumption 1, where p = (p(i,j,c))_{i,j,0≤c≤i} is such that p(i,j,c) = 0 for i ∨ j ≥ Mϵ, 0 ≤ c ≤ i.

  2. Let (Gn)n≥1 be the sequence of control policies for the contagion process on the sequence of networks and (Gn)n≥1 satisfy assumption 2.

  3. Let (y*, v*, z*) be the optimal solution for the optimization problem Eq (86).

If y* = 1, or if y* ∈ [0, 1) and it is a stable fixed point of I˜(y; v*, z*), i.e. (d/dy) I˜(y*; v*, z*) < 1, then the optimal control policy G_n^* = (g_1^{(n)*}, …, g_m^{(n)*}) satisfies, for 0 ≤ k ≤ m − 1,

g_{k+1}^{(n)*}(s, w) = { 1(k ≥ nλ(x*)^{i,j,c,c−1}), if w = (i, j, c, c − 1) ∈ Φϵ, except when c = i and v*j − 1 = −K;  1(k ≥ nλz*), if w = (i, j, i, i − 1) and v*j − 1 = −K;  0, otherwise, (93)

where

(x*)^{i,j,c,c−1} = { y*, if K + v*j − 1 ≥ 0 or c = 0;  1 − (1 − y*)(i − c)K/((i − c + 1)K + v*j − 1), if K + v*j − 1 < 0 and 1 ≤ c < i + (K + v*j − 1)/(Ky*);  0, otherwise, (94)

for (i, j, c, c − 1) ∈ Φϵ.
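To make the policy concrete, the following sketch evaluates the intervention start times (x*)^{i,j,c,c−1} of Eq (94); by Eq (93), a selected node in state (i, j, c, c − 1) receives an intervention once the step index k exceeds nλ(x*)^{i,j,c,c−1}. The values of K, y* and v* below are assumptions standing in for an actual solution of Eq (86). The output illustrates the monotonicity discussed in the Discussion and summary section: x* is decreasing in c and, for v* < 0, decreasing in j.

```python
K = 0.5
y_star, v_star = 0.8, -0.1       # assumed values, not an actual optimizer of Eq (86)

def x_star(i, j, c):
    """Eq (94): scaled intervention start time for a node in state (i, j, c, c-1)."""
    if K + v_star * j - 1 >= 0 or c == 0:
        return y_star
    if 1 <= c < i + (K + v_star * j - 1) / (K * y_star):
        return 1.0 - (1.0 - y_star) * (i - c) * K / ((i - c + 1) * K + v_star * j - 1)
    return 0.0                   # intervene from the very beginning of the process

# By Eq (93), a selected node in state (i, j, c, c-1) is rescued once step k >= n * lambda * x_star.
for j in (2, 5):
    for c in range(1, 5):
        print(f"(i, j, c) = (4, {j}, {c}):  x* = {x_star(4, j, c):.3f}")
```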

The next theorem states conclusions for the asymptotic fraction of final defaulted nodes under the optimal policy satisfying theorem 2.

Theorem 3. For the asymptotic control problem Eq (9), suppose the following.

  1. Consider a sequence of networks with initial conditions (Pn)n≥1 satisfying assumption 1, where p = (p(i,j,c))_{i,j,0≤c≤i} is such that p(i,j,c) = 0 for i ∨ j ≥ Mϵ, 0 ≤ c ≤ i.

  2. Let (Gn)n≥1 be the sequence of control policies for the contagion process on the sequence of networks and (Gn)n≥1 satisfy assumption 2.

  3. Let (y*, v*, z*) be the optimal solution for the optimization problem Eq (86).

Then under the optimal control policy Gn* as in theorem 2, we have the following conclusions for the asymptotic fraction of final defaulted nodes.

  1. If y* = 1, then asymptotically almost all nodes default during the default contagion process, i.e.
    D_{T_n}/n →^{p} 1. (95)
  2. If y* ∈ [0, 1) and it is a stable fixed point of I˜(y; v*, z*), i.e. (d/dy) I˜(y*; v*, z*) < 1, then asymptotically the fraction of final defaulted nodes satisfies
    D_{T_n}/n →^{p} J˜(y*, v*, z*), (96)
    where x^{i,j,c,c−1} in J˜ is
    x^{i,j,c,c−1} = { y*, if K + v*j − 1 ≥ 0 or c = 0;  1 − (1 − y*)(i − c)K/((i − c + 1)K + v*j − 1), if K + v*j − 1 < 0 and 1 ≤ c < i + (K + v*j − 1)/(Ky*);  0, otherwise, (97)
    for (i, j, c, c − 1) ∈ Φϵ. Particularly, if y* = 0 and (d/dy) I˜(0; v*, z*) < 1, then
    D_{T_n}/n →^{p} Σ_{i∨j<Mϵ} p(i,j,0), (98)
    i.e. the final defaulted nodes consist only of the initially defaulted nodes.

The first case in theorem 3 indicates that the network is highly vulnerable and interventions are costly, so the regulator would rather let the whole network default without implementing any interventions. In the second case interventions are less expensive or the contagion effect is weaker, so it is better for the regulator to implement interventions to counteract the contagion process.

Discussion and summary

The key to solving Eq (86) lies in solving the two equations Eqs (88) and (89). First we claim that the optimal v* for Eq (86) must be nonpositive.

Lemma 4. For Eq (86), the optimal v* ≤ 0.

Here we give an algorithm to solve Eq (86) numerically.

Algorithm 1. Solving Eq (86) numerically.

  1. Assume v = 0.
    1. If K = 1, solve Eqs (88) and (89), e.g. by Newton’s method, for y and z.
    2. If K ≠ 1, Eq (86) does not depend on z, so solve for y and let z ≤ y be arbitrary.
  2. Assume v < 0.
    1. If K = 1, Eq (86) does not depend on z, so solve for y and v such that v < 0 and let z ≤ y be arbitrary.
    2. If K ≠ 1, let y = z and solve Eqs (88) and (89) for y and v such that 0 ≤ y ≤ 1 and v < 0.
    3. If additionally K > 1, let v = (1 − K)/j for j > 0 and solve Eqs (88) and (89) for y and z such that 0 ≤ z ≤ y ≤ 1, for each j ∈ {1, …, Mϵ}.
  3. Choose among all the solutions above (if any) the one that minimizes the objective function Eq (87).
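As an illustration of the numerical solution (it is a sketch, not the code used for the experiments below), the following self-contained example solves Eqs (88) and (89) and evaluates the objective Eq (87) for a toy network with a single degree pair; the degree pair, equity fractions and K = 0.5 are assumptions. Since K < 1, the singular case vj − 1 = −K would require v > 0 and so cannot occur under lemma 4; z therefore plays no role and we set z = y, as in step 2 of the algorithm. Instead of the case analysis, the sketch scans y on a grid, solves Eq (88) for v by fixed-point iteration, and keeps the approximate solutions of Eq (89).

```python
import numpy as np
from math import comb
from scipy.stats import binom

K = 0.5
i, j = 3, 3                                       # single degree pair in the toy network
p = {0: 0.10, 1: 0.30, 2: 0.30, 3: 0.30}          # p(i, j, c) for c = 0, ..., i (assumed)
lam = j * sum(p.values())                         # asymptotic mean degree

def x_of(c, y, v):
    """Eq (90): scaled intervention start time for state (i, j, c, c-1)."""
    if K + v * j - 1 >= 0 or c == 0:
        return y
    if 1 <= c < i + (K + v * j - 1) / (K * y):
        return 1.0 - (1.0 - y) * (i - c) * K / ((i - c + 1) * K + v * j - 1)
    return 0.0

# The 1(vj - 1 = -K) terms of Eqs (84), (85) and (92) vanish here: with K < 1,
# vj - 1 = -K would require v > 0, which contradicts lemma 4.
def I_tilde(y, v):
    return j * sum(p[c] * binom.sf(c - 1, i, x_of(c, y, v)) for c in range(i + 1)) / lam

def J_tilde(y, v):
    return sum(p[c] * binom.sf(c - 1, i, x_of(c, y, v)) for c in range(i + 1))

def H_tilde(y, v):
    return max(-K, v * j - 1) * i * sum(
        p[c] * (binom.sf(c - 2, i - 1, y) - binom.sf(c - 1, i - 1, x_of(c, y, v)))
        for c in range(1, i + 1))

def trinom(n1, n2, n3, q1, q2, q3):
    """P(Multin(n1+n2+n3, q1, q2, q3) = (n1, n2, n3)) computed directly."""
    return comb(n1 + n2 + n3, n1) * comb(n2 + n3, n2) * q1**n1 * q2**n2 * q3**n3

def r_tilde(y, v):
    return sum(p[c] * (m - c + 1) * trinom(n, m - n, i - m,
                                           x_of(c, y, v), y - x_of(c, y, v), 1.0 - y)
               for c in range(1, i + 1) for m in range(c, i + 1) for n in range(c))

best = None
for y in np.linspace(0.002, 0.998, 499):
    v = 0.0
    for _ in range(25):                           # Eq (88) with y < 1 reduces to v = H_tilde(y, v)/lam
        v = 0.5 * v + 0.5 * H_tilde(y, v) / lam   # damped fixed-point iteration
    if abs(I_tilde(y, v) - y) < 1e-2:             # Eq (89): keep approximate fixed points only
        val = K * r_tilde(y, v) + J_tilde(y, v)   # objective Eq (87)
        if best is None or val < best[0]:
            best = (val, y, v)
print("objective, y*, v* =", best)
```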

Recall that a node is in state (i, j, c, l) if it has in and out degree pair (i, j), the sum of initial equity and accumulated interventions equal to c (called the total buffer) and l revealed in-links. Similar to [24], we refer to the in-links of a node that has only one unit of equity remaining (“distance to default” equal to one) as “contagious” links. So a node in state (i, j, 1, 0) has i contagious links, a node in state (i, j, 2, 1) has i − 1 contagious links, and so on; as shown in Fig 5, the states associated with contagious links are colored in blue. The insights from the optimal intervention policy are summarized as follows.

Fig 5. Optimal intervention policy summary where the states indicating one unit of equity remaining are colored in blue.

Fig 5

  1. It is never optimal to intervene on a node if it is not selected or has at least two units of remaining equity when selected. Thus the optimal control policy described in theorem 2 only specifies the optimal intervention decision on a node that, when selected, has one unit of equity remaining, i.e. l = c − 1. In other words, the use of interventions is to counteract the effects of contagious links.

  2. The optimal control policy depends strongly on K, the relative cost of interventions. At a higher K value, interventions are costly and the regulator would rather let the contagion evolve without any interventions. At a lower K value, the regulator implements more and more interventions, up to a “complete” intervention strategy, that is, intervening on every selected node with “distance to default” equal to one from the beginning of the process.

  3. The optimal control policy is “monotonic” in the out degree of a node. There exists a cutoff value of the out degree such that it is only optimal to intervene on a node with out degree larger than this cutoff value and not otherwise, regardless of its in degree, total buffer and revealed in-links. For nodes with out degree equal to the cutoff value, we have the singular control case in which only those in state (i, j, i, i − 1) need interventions, and the starting time of interventions is determined by the variable z from the optimization problem Eq (86).

  4. For nodes that are candidates to receive interventions, the starting time of interventions (determined by the variable x^{i,j,c,c−1}) is “monotonic” in the total buffer. The higher the total buffer, the earlier we should begin to intervene, as illustrated in Fig 5, where x^{i,j,c,c−1} is decreasing in c. Moreover, the starting time is also “monotonic” in the in and out degrees. For the same out degree, the smaller the in degree, the earlier the intervention should begin. If v < 0, the larger the out degree, the earlier the intervention should begin. The economic meaning is that we should focus on systematically important nodes as well as nodes that are close to invulnerability and thus easier to save. In other words, we achieve our objective by protecting the nodes that would have a large impact on the network after defaulting and by bringing easy-to-save nodes into invulnerable states.

  5. Once we have begun intervening on a node we should keep intervening on it every time it is selected. In other words, we do not allow nodes that have received interventions to default. This is an interesting result. In the partial information setting, [3] consider interventions on banks that record more write-downs later and default as “wasted government money” and mention the tradeoff associated with intervention: potentially wasted money versus less capital depletion. Our finding is that there is no wasted money under the optimal policy, and the tradeoff is the high intervention cost versus a smaller magnitude of defaults.

Indeed, following the optimal policy, we achieve an earlier termination time of the contagion process and a smaller fraction of final defaulted nodes. We can quantify the improvement by comparing I˜ and J˜ in theorem 3 with I and J defined in theorem 1, respectively. Note that in the following we suppress the asterisk “*” and the indexes (i, j) are in the range i ∨ j < Mϵ.

  1. I˜(y,v,z) plays the same role as I(y) in theorem 1: it represents the asymptotic scaled total out degree from the default set at scaled time y. Since
    I(y) − I˜(y,v,z) = (1/λ) Σ_{i∨j<Mϵ} j { Σ_{c=0}^{i} p(i,j,c) [P(Bin(i, y) ≥ c) − P(Bin(i, x^{i,j,c,c−1}) ≥ c)] + 1(vj − 1 = −K) p(i,j,i)(y^{i} − z^{i}) }, (99)
    and note that
    x^{i,j,c,c−1} ≤ y for (i, j, c, c − 1) ∈ Φϵ, (100)
    thus
    P(Bin(i, y) ≥ c) − P(Bin(i, x^{i,j,c,c−1}) ≥ c) ≥ 0. (101)

    Then for the same initial conditions p = (p(i,j,c))_{i∨j<Mϵ, 0≤c≤i}, the smallest fixed point of I˜(y; v*, z*) is no greater than that of I(y), which implies that the default contagion process under optimal interventions terminates no later than under no interventions.

  2. Similarly, J˜(y,v,z) plays the same role as J(y) in theorem 1: it represents the asymptotic fraction of final defaulted nodes under the optimal control policy. The difference
    J(y) − J˜(y,v,z) = Σ_{i∨j<Mϵ} { Σ_{c=0}^{i} p(i,j,c) [P(Bin(i, y) ≥ c) − P(Bin(i, x^{i,j,c,c−1}) ≥ c)] + 1(vj − 1 = −K) p(i,j,i)(y^{i} − z^{i}) } ≥ 0 (102)
    quantifies the fraction of nodes that are prevented from defaulting by the optimal control policy.

Numerical experiments

Introduction

While the theoretical framework described above allows heterogeneous networks with degree sequences and initial equity levels drawn from arbitrary distributions, we present a relatively simple case in the numerical experiments for illustration purposes, in which the degree sequences and initial equity levels satisfy specific distributions. Consider a sequence of networks with the number of nodes n growing to infinity, whose in and out degrees are between 1 and 10 and where each node’s in degree equals its own out degree, i.e. d^−(v) = d^+(v), v ∈ [n], so we refer to either the in or the out degree as the degree of the node. This allows us to combine the two indexes i and j into one index i, so the state of a node becomes (i, c, l) and the empirical probability Pn and the limiting probability p of the degree and initial equity become Pn(i, c) and p(i, c), respectively. Additionally we assume the initial equity levels are between 1 and 10. In sum, we consider degree and initial equity level pairs in the set Γ′ ≔ {(i, c) ∈ N_0^2: 1 ≤ i ≤ 10, 0 ≤ c ≤ 10}.

Next we decide on the limiting probability p. Note that Γ′ contains three initial types of nodes: defaulted (with c = 0), vulnerable (with 1 ≤ c ≤ i) and invulnerable (with c > i). In this numerical experiment, we set the total fraction of initial defaults to ξ and assume the fraction of initial defaults is the same across all degrees, i.e. p(i, 0) = ξ/10 for i ∈ [1, 10]. For the initially liquid nodes, the joint probability of the degree and initial equity conditional on being liquid is constructed through a binormal copula with correlation ρ and two marginal probabilities. The marginal probabilities of the degree and initial equity are assumed to follow Zipf’s law, i.e.

P(deg = i) = i^{−(1+a_1)} / Σ_{l=1}^{10} l^{−(1+a_1)},   P(initial equity = c) = c^{−(1+a_2)} / Σ_{l=1}^{10} l^{−(1+a_2)}, (103)

where a1, a2 > 0. Zipf’s law is a form of power law with Pareto tails, which has been observed for the distributions of the degrees and equity levels of financial networks in many empirical studies; see e.g. [19, 29].
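The construction of p(i, c) from these marginals and the copula can be sketched as follows. The discretization below (coupling the two marginals through a bivariate Gaussian copula evaluated on their CDFs) is one plausible implementation rather than the exact code used for the experiments; the parameter values are those of the setup described later.

```python
import numpy as np
from scipy.stats import norm, multivariate_normal

a1, a2, rho, xi = 0.8, 0.7, 0.9, 0.5
levels = np.arange(1, 11, dtype=float)
marg_deg = levels ** -(1.0 + a1) / np.sum(levels ** -(1.0 + a1))   # P(deg = i), Eq (103)
marg_eq = levels ** -(1.0 + a2) / np.sum(levels ** -(1.0 + a2))    # P(initial equity = c)

mvn = multivariate_normal(mean=[0.0, 0.0], cov=[[1.0, rho], [rho, 1.0]])

def copula_cdf(u, v):
    """Bivariate Gaussian copula C(u, v) with correlation rho (arguments clipped away from 0 and 1)."""
    u = min(max(u, 1e-12), 1 - 1e-12)
    v = min(max(v, 1e-12), 1 - 1e-12)
    return mvn.cdf([norm.ppf(u), norm.ppf(v)])

F_deg = np.concatenate(([0.0], np.cumsum(marg_deg)))   # F_deg[i] = P(degree <= i)
F_eq = np.concatenate(([0.0], np.cumsum(marg_eq)))     # F_eq[c]  = P(equity <= c)

p = {}
for i in range(1, 11):
    p[(i, 0)] = xi / 10.0                              # initial defaults, equal across degrees
    for c in range(1, 11):
        joint = (copula_cdf(F_deg[i], F_eq[c]) - copula_cdf(F_deg[i - 1], F_eq[c])
                 - copula_cdf(F_deg[i], F_eq[c - 1]) + copula_cdf(F_deg[i - 1], F_eq[c - 1]))
        p[(i, c)] = (1.0 - xi) * joint                 # joint of (degree, equity) given liquid
print("total probability:", round(sum(p.values()), 6))  # should be close to 1
```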

In a network of size n with the joint probability Pn(i, c) of the degree and initial equity, a contagion process under interventions occurs as described in section Dynamics. Recall that we only need to consider intervening on a node that, when selected, has only one unit of equity left, i.e. a node with “distance to default” equal to one. Here we consider two types of intervention policies, the optimal policy and an alternative policy: intervening, from the beginning of the process, on nodes with degree between 8 and 10 and “distance to default” equal to one. The alternative policy intervenes on nodes with high degrees that are close to default, representing the typical policy employed by a central bank or government in a real financial crisis.

Our objective is to verify the convergence in probability of R_{T_n}/n and D_{T_n}/n as well as the convergence of the scaled termination time T_n/m, as stated in proposition 5. Moreover, we study the convergence rates of the standard deviation and IQR (interquartile range) to examine whether the asymptotic variables provide good approximations for realistic values of n.

Under the optimal policy in the form given in theorem 2, the limits of R_{T_n}/n, D_{T_n}/n and T_n/m as n → ∞ are r˜(y*,v*,z*), J˜(y*,v*,z*) and y*, respectively, in Eq (86), where (y*, v*, z*) is the optimal solution. On the other hand, the alternative policy is that, for 0 ≤ k ≤ m − 1,

g_{k+1}^{(n),alt}(s, w) = { 1(k ≥ nλ x_{alt}^{i,c,c−1}), if w = (i, c, c − 1) ∈ Φ′;  0, otherwise, (104)

where for (i, c, c − 1) ∈ Φ′,

x_{alt}^{i,c,c−1} = { 0, if i ∈ {8, 9, 10};  y, otherwise, (105)

and y is the solution of (1/λ) Σ_{i=0}^{10} i Σ_{c=0}^{i} p(i,c) P(Bin(i, x_{alt}^{i,c,c−1}) ≥ c) = y. Then the limits of R_{T_n}/n, D_{T_n}/n and T_n/m as n → ∞ can be calculated as:

R_{T_n}/n →^{p} Σ_{i=0}^{10} Σ_{c=1}^{i} p(i,c) Σ_{m=c}^{i} Σ_{n=0}^{c−1} (m − c + 1) P(Multin(i, x_{alt}^{i,c,c−1}, y − x_{alt}^{i,c,c−1}, 1 − y) = (n, m − n, i − m)),
D_{T_n}/n →^{p} Σ_{i=0}^{10} Σ_{c=0}^{i} p(i,c) P(Bin(i, x_{alt}^{i,c,c−1}) ≥ c),
T_n/m →^{p} y, (106)

where P(Bin(i, y) ≥ c) = Σ_{m=c}^{i} \binom{i}{m} y^{m}(1 − y)^{i−m} and P(Multin(i, x, y, 1 − x − y) = (a, b, i − a − b)) = \binom{i}{a, b, i−a−b} x^{a} y^{b} (1 − x − y)^{i−a−b}.
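The following sketch evaluates these limits for an assumed toy joint probability p(i, c) (not the copula calibration of the experiments, so the numbers do not reproduce Table 1): it locates the smallest fixed point y and then computes the limits in Eq (106).

```python
import numpy as np
from math import comb
from scipy.stats import binom

K = 0.5
p = {(i, c): 1.0 / 30 for i in range(1, 11) for c in range(3)}   # assumed toy p(i, c), sums to 1
lam = sum(i * q for (i, c), q in p.items())                       # asymptotic mean degree

def x_alt(i, c, y):
    """Eq (105): intervene from the start on degrees 8-10, otherwise only at the end time y."""
    return 0.0 if i in (8, 9, 10) else y

def I_alt(y):
    """Left-hand side of the fixed-point equation for y under the alternative policy."""
    return sum(i * q * binom.sf(c - 1, i, x_alt(i, c, y))
               for (i, c), q in p.items() if c <= i) / lam

ys = np.linspace(1e-6, 1.0, 2001)
y = ys[np.argmax(np.array([I_alt(t) - t for t in ys]) <= 0)]      # smallest fixed point of I_alt

def trinom(n1, n2, n3, q1, q2, q3):
    """P(Multin(n1+n2+n3, q1, q2, q3) = (n1, n2, n3))."""
    return comb(n1 + n2 + n3, n1) * comb(n2 + n3, n2) * q1**n1 * q2**n2 * q3**n3

R_lim = sum(q * (m - c + 1) * trinom(n, m - n, i - m, x_alt(i, c, y), y - x_alt(i, c, y), 1 - y)
            for (i, c), q in p.items() if 1 <= c <= i
            for m in range(c, i + 1) for n in range(c))
D_lim = sum(q * binom.sf(c - 1, i, x_alt(i, c, y)) for (i, c), q in p.items() if c <= i)
print(f"T/m -> {y:.3f},  R/n -> {R_lim:.3f},  D/n -> {D_lim:.3f},  K*R/n + D/n -> {K * R_lim + D_lim:.3f}")
```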

Simulation

The set up

We have the following setup.

  1. A sequence of six networks with increasing number of nodes n ∈ {5^4, 6^4, …, 10^4} and there are 100 runs for each network under either intervention policy.

  2. To determine the asymptotic fraction p(⋅, ⋅) of the degree and initial equity pair (i, c) where (i, c) ∈ Γ′, we set the following parameters.

    1. The fraction of initial defaults ξ = 0.5, indicating half of the nodes have defaulted. As stated before, we assume in this numerical experiment that the fraction of initial defaults is the same across all degrees, thus p(i, 0) = ξ/10 for i ∈ [1, 10].

    2. The probability of the degree and initial equity for liquid nodes, p(i, e), i ∈ {1, …, 10}, e ∈ {1, …, 10}, is determined by a binormal copula with the exponents of the marginal probabilities of the degree and initial equity (a1, a2) = (0.8, 0.7) and the correlation coefficient ρ = 0.9. Note that a smaller a1 indicates a larger fraction of nodes with higher degrees, thus higher connectivity; a smaller a2 indicates a larger fraction of nodes with higher initial equities; and ρ determines how likely higher-degree nodes are to have higher initial equities.

  3. After determining the asymptotic fraction p(⋅, ⋅), we construct a sequence of empirical fractions Pn(⋅, ⋅) for each network that converge to p(⋅, ⋅) by
    P_n(i, c) = [n p(i, c)]/n, (i, c) ∈ Γ′, (107)
    where [⋅] is the round function. In other words, the number of nodes with degree i and initial equity c is [n p(i, c)] for a network of n nodes (see the short sketch after this list).
  4. We consider two intervention policies described as before.

  5. The relative cost for the interventions K = 0.5.
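For step 3, the rounding construction of Eq (107) can be sketched as follows; the toy p(⋅, ⋅) below is an assumed placeholder rather than the copula calibration above.

```python
# Assumed toy p over a handful of (degree, initial equity) pairs.
p = {(2, 0): 0.20, (2, 1): 0.35, (3, 2): 0.30, (3, 4): 0.15}

def empirical_fractions(n, p):
    """Eq (107): Pn(i, c) = [n p(i, c)] / n, with [.] the round function."""
    return {state: round(n * q) / n for state, q in p.items()}

for n in (5 ** 4, 10 ** 4):
    Pn = empirical_fractions(n, p)
    err = max(abs(Pn[s] - p[s]) for s in p)
    print(f"n = {n}: max |Pn - p| = {err:.2e}")    # the rounding error shrinks as n grows
```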

In the following we suppress Tn in the subscripts and write R/n, D/n and T/m instead of R_{T_n}/n, D_{T_n}/n and T_n/m, respectively.

Theoretical results

The theoretical limits of R/n, D/n and T/m under the optimal and alternative policies are summarized in Table 1.

Table 1. Theoretical limits of R/n, D/n and T/m under the optimal and alternative policies.
Policies R/n D/n K·R/n + D/n T/m
Optimal 0.306 0.503 0.657 0.728
Alternative 0.019 0.821 0.830 0.866

We first verify that the objective function K·R/n + D/n in Eq (86) is indeed smaller under the optimal policy. Moreover, compared with the alternative policy, the optimal policy intervenes more but results in a smaller fraction of final defaulted nodes and ends the contagion process earlier.

Simulation results

We show the plots for R/n, D/n and T/m under the optimal and alternative policies in Figs 6–11.

Fig 6. The boxplot and log-log plot of standard deviation and IQR for R/n under the optimal policy.

Fig 7. The boxplot and log-log plot of standard deviation and IQR for D/n under the optimal policy.

Fig 8. The boxplot and log-log plot of standard deviation and IQR for T/m under the optimal policy.

Fig 9. The boxplot and log-log plot of standard deviation and IQR for R/n under the alternative policy.

Fig 10. The boxplot and log-log plot of standard deviation and IQR for D/n under the alternative policy.

Fig 11. The boxplot and log-log plot of standard deviation and IQR for T/m under the alternative policy.

  1. Under either policy and for each variable, there are four plots in each figure. The first two plots are boxplots. The top boxplot visualizes five summary statistics (min, mean − standard deviation, mean, mean + standard deviation, max) while the bottom boxplot uses another set of summary statistics (1st quartile − 1.5 IQR, 1st quartile, median, 3rd quartile, 3rd quartile + 1.5 IQR), with the data outside this range treated as outliers; here IQR stands for the interquartile range, i.e. the difference between the third and the first quartiles.

  2. The blue dashed horizontal line in each plot indicates the theoretical limit of R/n, D/n or T/m based on p(⋅, ⋅), and the red solid line in each box indicates the theoretical limit calculated with Pn(⋅, ⋅) for each n. We calculate the theoretical values in both ways because, for small n, Pn(⋅, ⋅) determined by Eq (107) has a relatively large rounding error and thus deviates somewhat from p(⋅, ⋅); calculating with Pn(⋅, ⋅) instead of p(⋅, ⋅) effectively removes these deviations in the inputs to the model. However, given p(⋅, ⋅), Pn(⋅, ⋅) differs across n values, so the theoretical values of a variable calculated with Pn(⋅, ⋅) also differ across n.

  3. The black dots in the boxplots indicate the results of the 100 runs; they are jittered left and right by a random amount to avoid overplotting. From the black dots we can see the distributions of the results. Note that the black dots in the top and bottom boxplots show the same results for each n; they look different because they are jittered by different random amounts.

  4. The last two plots in every figure show the log-log plots of the standard deviation and IQR of R/n, D/n or T/m against n, together with a fitted straight line and its slope.

From the simulation results, we make the following conclusions.

  1. From the boxplots of R/n, D/n and T/m under both intervention policies, we observe that the mean or median converges to the calculated theoretical value with shrinking standard deviation or IQR. Because the theoretical value is a constant given the joint probability of degree and initial equity p(⋅, ⋅), convergence of the mean to the theoretical value with variance converging to zero is equivalent to convergence in probability. This observation therefore provides evidence for the convergence in probability of R/n, D/n and T/m to their theoretical values, supporting proposition 5 and theorem 3.

  2. By comparing the blue dashed line and the red solid line we see that the mean or median is closer to the red solid line, i.e. the theoretical value calculated with Pn(⋅, ⋅) instead of p(⋅, ⋅). This reflects the rounding error introduced by Eq (107) in the inputs to the calculation. Using the more accurate fractions, we observe that the closeness of the mean or median to the theoretical value does not vary with n, although the results of different runs become more and more concentrated around the mean or median as n grows.

  3. The log-log plots of the standard deviation and IQR of each variable with the fitted straight lines further show that both decrease with power law tails, i.e. in the form z = Cx^{−a} where C is a constant and a > 0 is the exponent. The absolute value of the slope of the fitted line serves as the exponent. It is interesting to observe that the exponents for the standard deviation and IQR of the different variables are all in the range 0.4–0.5 under both intervention policies. This implies that the dispersions of all variables converge to zero at roughly the same rate under both policies, as illustrated by the sketch below.
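The exponent can be estimated exactly as described, by fitting a straight line to the log-log data. In the sketch below the dispersion values are made-up placeholders standing in for the simulated standard deviations of R/n across the 100 runs; only the fitting procedure is illustrated.

```python
import numpy as np

n_values = np.array([5 ** 4, 6 ** 4, 7 ** 4, 8 ** 4, 9 ** 4, 10 ** 4])
std_Rn = np.array([0.035, 0.026, 0.019, 0.015, 0.012, 0.010])   # placeholder dispersions

slope, _ = np.polyfit(np.log(n_values), np.log(std_Rn), 1)      # fit log(std) = log(C) - a*log(n)
print(f"fitted exponent a = {-slope:.2f}   (dispersion ~ C * n^(-a))")
```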

Conclusion

From the simulation part we can make the following conclusions.

  1. The convergence of R/n, D/n and T/m to their theoretical values (stated in proposition 5) is supported by the simulation results. It is worth noting that the closeness of the mean or median to the theoretical value does not vary across n once the rounding error in the initial fractions is removed, but the dispersion of each variable shrinks as n grows.

  2. The dispersion of each variable decreases following a power law. The exponents are close to each other under both intervention policies and for all variables, indicating a uniform convergence rate of the dispersions of all the variables under both policies.

Appendix A: Proofs

Proof of proposition 1

We give a proof in words, similar to the proof of proposition 3.4 in [2], which treats a different objective function: optimizing the value of the financial system at the end of the process under a budget constraint. We observe that the objective function Jn depends on the set of defaulted nodes only through its cardinality. Any node will affect the states of other nodes only after it defaults, because the set of unrevealed out links of the default set, which determines the contagion process, grows only after a node defaults. And a default can occur only when a node has one unit of equity (distance to default equal to one) at the time of being selected. Before that time, the equity only decreases by one every time it is selected. Moreover, there is always a chance to intervene on a node before it defaults. However, if we intervene on a node that is not selected at the current step or that has more than one unit of remaining equity when selected, it is possible that the node will not be selected again before the process ends, in which case we would have implemented redundant interventions without reducing the number of defaults.

Proof of proposition 2

Proof. Assume uτ = b ≔ (bβ)β∈Φ a constant vector for τ ∈ [τ1, τ2) ⊆ [0, λ), 0 ≤ τ1 < τ2 < ∞ and bβ ∈ {0, 1}. Note that the ODEs are “separable” in that sτi,j,c,l only depends on the entries of sτ with the same (i, j), so we can only focus on the system of ODEs with the same (i, j). For the same (i, j), define Γi,j ≔ {(c, l):0 ≤ l < ci or c = i + 1, l = i} and suppress (i, j) in the superscripts in definition 4, then we obtain the system of ODEs for τ ∈ [τ1, τ2) with the initial condition sτ1=s1(s1c,l)(c,l)Γi,j.

Letting t = −ln(λ − τ), t1 = −ln(λ − τ1) and t2 = −ln(λ − τ2), we have an autonomous system of ODEs for t ∈ [t1, t2) that

dstc,0dt=-istc,0for1ci,dstc,ldt=(i-l+1)stc,l-1-(i-l)stc,lfor3ci,1lc-2,dstc,c-1dt=(i-c+2)stc-1,c-2bc-1,c-2+(i-c+2)stc,c-2-(i-c+1)stc,c-1for2ci,dsti+1,idt=sti,i-1bi,i-1, (108)

with the initial condition st1=s1.

By induction, we can prove the solution st on [t1, t2) is

stc,l=e(i-l)(t1-t)r=0ls1c,r(i-rl-r)(1-et1-t)l-rfor2ci,0lc-2,stc,c-1=e(i-c+1)(t1-t)r=0c-1q=r+1ck=qc-1bk,k-1s1q,r(i-rc-1-r)(1-et1-t)c-1-rfor1ci,sti+1,i=s1i+1,i+r=0i-1q=r+1ik=qibk,k-1s1q,r(1-et1-t)i-r, (109)

where k=cc-1bk,k-11.

By changing the variable t to τ by t = −ln(λ − τ), Eqs (15), (16) and (17) follow. Let the initial condition be sτ1i,j,c,l=p(i,j,c)1(l=0) for (i, j, c, l) ∈ Γ at τ1 = 0, then Eq (18) follows from Eq (15).

Proof of proposition 3

Proof. For the following proof we need to adapt Wormald’s theorem in Appendix B: Wormald’s theorem. For notational convenience we suppress the tilde sign for R˜, r˜. Since m/n → λ as n → ∞, for the given ϵ and λ̂ = λ − ϵ, we can find n0 ∈ N such that 0 < λ̂ < m/n < λ + 0.1 for n ≥ n0. Let z = (zα)α∈Γϵ and

U={(τ,z,r)R|Γϵ|+2:-ϵ<τ<λ^,-ϵ<zα<1.1,-ϵ<r<λ+0.1}, (110)

then U contains the closure of

{(0,z,0):P(S0α=zαn,αΓϵ,R0=0)0forsomen}. (111)

Define the stopping time TU=min{1km:(kn,Skn,Rkn)U}.

By definition 1 and definition of Rk, 0Skαn, α ∈ Γϵ and 0 ≤ Rk ≤ (λ + 0.1)n hold ∀k ≥ 0 and nn0. Recall that Sk=(Skα)αΓ and Skn=(Skαn)αΓ. The following conditions hold:

  1. For 0 ≤ k < TU and α ∈ Γϵ,
    |Sk+1α-Skα|1,|Rk+1-Rk|1, (112)

    i.e. ρ1 = 1.

  2. There exists ρ2 = O(n−1) such that for 0 ≤ k < TU and α ∈ Γϵ,
    |E(Sk+1α-SkαFk)-hα(kn,Skn)|ρ2,|E(Rk+1-RkFk)-h0(kn,Skn)|ρ2, (113)
    where h = (hα)α∈Γϵ and h0 are
    hi,j,c,l(t,z)={-izi,j,c,0λ-tif1ci,l=0(i-l+1)zi,j,c,l-1λ-t-(i-l)zi,j,c,lλ-tif3ci,1lc-2(i-c+2)zi,j,c-1,c-2λ-tuti,j,c-1,c-2+(i-c+2)zi,j,c,c-2λ-t-(i-c+1)zi,j,c,c-1λ-tif2cizi,j,i,i-1λ-tuti,j,i,i-1ifc=i,l=i-1,h0(t,z)=(i,j,c,c-1)Φϵ(i-c+1)zi,j,c,c-1λ-tuti,j,c,c-1. (114)
    In particular, Eq (113) follows from
    |(i,j,c,c-1)Φϵ(i-c+1)Ski,j,c,c-1m-kukni,j,c,c-1-(i,j,c,c-1)Φϵ(i-c+1)Ski,j,c,c-1nλ-knukni,j,c,c-1|(i,j,c,c-1)Φϵ|(i-c+1)Ski,j,c,c-1nmn-knukni,j,c,c-1-(i-c+1)Ski,j,c,c-1nλ-knukni,j,c,c-1|=O(n-1). (115)

However, for β ∈ Φϵ, hβ and h0 are not Lipschitz continuous because uτβ can have step changes on [0, λ). So we need to adapt the proof. Note that uτβ is piecewise constant {0, 1} valued function thus hβ(τ, s) and h0(τ, s) are Lipschitz continuous in each interval where u˜=(uβ)βΦ is a constant vector and then we can apply the Wormald’s theorem in Appendix B: Wormald’s theorem. In the following define sτ(τ′, x) as the solution of the ODEs,

ddτsτ=h(τ,sτ), (116)

with initial condition at τ′, sτ = x ≔ (xα)α∈Γϵ.

In what follows, define the points where any component of ũτ has a step change: τl ≔ inf{τ > τl−1: uτ^β has a step change for some β ∈ Φϵ} ∧ λ̂ for l ≥ 1, and τ0 ≔ 0. Also let kl = ⌈nτl⌉, where ⌈⋅⌉ is the ceiling function. As a result, kl − 1 < nτl ≤ kl. Recall the initial condition s0 = (s0^α)_{α∈Γϵ} with s0^{i,j,c,l} = p(i,j,c) 1(l = 0). Because every u^β for β ∈ Φϵ has only a finite number of step changes on [0, λ) and Φϵ is a finite set, there are in total a finite number of step changes for all the component functions of ũ on [0, λ).

Then by the Wormald’s theorem, let ρ=n-14, it follows that

sup0kk1-1Skαn-sknα(0,s0)=O(n-14) (117)

with probability 1-O(n14exp(-n14)), ∀α ∈ Γϵ. Note that we will write “with probability 1-O(n14exp(-n14))” as whp hereinafter.

In particular, we have that

Sk1-1αn-sk1-1nα(0,s0)=O(n-14)whp. (118)

Additionally by the Wormald’s theorem again we have that

supk1kk2-1Skαn-sknα(k1n,Sk1n)=O(n-14)whp. (119)

Note that

|Sk1αn-Sk1-1αn|1nαΓϵ, (120)

and by the Lipschitz continuity of sτα(0,s0) on (0,τ1-),

sk1-1nα(0,s0)-sτ1α(0,s0)=O(n-1). (121)

So by Eqs (118), (120) and (121), we have

|Sk1αn-sτ1α(0,s0)||Sk1αn-Sk1-1αn|+|Sk1-1αn-sk1-1nα(0,s0)|+|sk1-1nα(0,s0)-sτ1α(0,s0)|=n-1+O(n-14)+O(n-1)whp. (122)

Thus we have that

Sk1n-sτ1(0,s0)=O(n-14)+O(n-1)whp. (123)

where ‖η‖ is the norm for ηR|Γ|. We do not specify the norm because all norms in Rl are equivalent, lN. From proposition 2 we see that the partial derivatives of sτα(τ,x) with respect to the initial time τ′ and every entry of x are continuous in τ′ and every entry of x respectively, and are bounded for any τ in a subinterval of [0,λ^) on which u˜ is a constant vector function, i.e.

sτα(τ,x)(τ,x)M1< (124)

where M1 is a constant. Recall that |k1n-τ1|<n-1, so by Eqs (123) and (124), it follows from the fundamentals of calculus (e.g. theorem 9.19 and 9.21 in [31]) that

sτα(k1n,Sk1n)-sτα(τ1,sτ1(0,s0))=sτα(k1n,Sk1n)-sτα(0,s0)=O(n-14)+O(n-1)whp, (125)

for τ ∈ (τ1, τ2). So it follows from Eq (119) that ∀α ∈ Γϵ,

supk1kk2-1Skαn-sknα(0,s0)=O(n-14)whp. (126)

Similarly for Rk, define rτ(τ′, x, y) as the solution of

ddτrτ=h0(τ,sτ), (127)

with the initial condition at τ′, (sτ, rτ) = (x, y). Applying the Wormald’s theorem for Rk and rτ gives that,

sup0kk1-1Rkn-rkn(0,s0,0)=O(n-14)whp,supk1kk2-1Rkn-rkn(k1n,Sk1n,Rk1n)=O(n-14)whp. (128)

In particular,

Rk1-1n-rk1-1n(0,s0,0)=O(n-14)whp, (129)

Further note that

|Rk1n-Rk1-1n|1nαΓϵ, (130)

and by the Lipschitz continuity of rτ(0, s0, 0) on (0,τ1-),

rk1-1n(0,s0,0)-rτ1(0,s0,0)=O(n-1). (131)

So by Eqs (129), (130) and (131) we have

|Rk1n-rτ1(0,s0,0)||Rk1n-Rk1-1n|+|Rk1-1n-rk1-1n(0,s0,0)|+|rk1-1n(0,s0,0)-rτ1(0,s0,0)|=n-1+O(n-14)+O(n-1)whp. (132)

Here we apply the fact we shall prove later that the partial derivatives of rτ(τ′, x, y) with respect to the initial time τ′ and every entry of x and y are continuous in τ′, every entry of x and y respectively, and are bounded for any τ in a subinterval of [0,λ^) on which u˜ is a constant vector function, i.e.

rτ(τ,x,y)(τ,x,y)M2< (133)

for some constant M2. Recall that |k1n-τ1|<n-1, so by Eqs (123), (132) and (133), it follows from the fundamentals of calculus that

rτ(k1n,Sk1n,Rk1n)-rτ(τ1,sτ1(0,s0),rτ1(0,s0,0))=rτ(k1n,Sk1n,Rk1n)-rτ(0,s0,0)=O(n-14)+O(n-1)whp, (134)

for τ ∈ (τ1, τ2). So it follows from Eq (128) that

supk1kk2-1Rkn-rkn(0,s0,0)=O(n-14)whp. (135)

We can repeat the above procedure every time any uτβ has a step change, β ∈ Φϵ and there are only a finite number of step changes in [0, λ). Because sτα1 and rτ ≤ λ, d((sτ,rτ),U)0.1Cn-14, for a sufficiently large constant C. Thus the supremum of τ that (sτ, rτ) can be extended to the boundary of U is λ^, i.e. in Eq (177) of the Wormald’s theorem in Appendix B: Wormald’s theorem,

σ=sup{τ0:d((sτ,rτ),U)Cn-14}=λ^. (136)

So it follows that

sup0knλ^Skαn-sknα(0,s0)=O(n-14)whp,sup0knλ^Rkn-rkn(0,s0,0)=O(n-14)whp. (137)

At last we prove the claim that the partial derivatives of rτ(τ′, x, y) with respect to the initial time τ′ and every entry of x and y are all continuous and bounded as in Eq (133) for any τ in a subinterval of [0,λ^) on which u˜ is a constant vector function b = (bβ)β∈Φϵ. Note first that rτ with initial condition s¯=(sτ,rτ) at τ = τ′ in a subinterval of [0,λ^) on which u˜=b satisfies that

rτ=rτ+ττ(i,j,c,c-1)Φϵ(i-c+1)bi,j,c,c-1λ-ysyi,j,c,c-1(τ,sτ)dy. (138)

We shall prove the boundedness by showing the boundedness of ∂rτ/∂s̄ and ∂rτ/∂τ′ separately. First we take the derivatives of rτ with respect to the initial condition s̄ and obtain

rτs¯=elast+ττ(i,j,c,c-1)Φϵ(i-c+1)bi,j,c,c-1λ-ysyi,j,c,c-1(τ,sτ)s¯dy, (139)

where elast is a vector of zeros except an entry of one at the last. The continuity of every entry in rτs¯ is obvious. For boundedness,

rτs¯1+ττ(i,j,c,c-1)Φϵ(i-c+1)bi,j,c,c-1λ-ysyi,j,c,c-1(τ,sτ)s¯dy. (140)

By Eq (124), syi,j,c,c-1s¯<M1, so rτs¯ is bounded. Next we take the derivative of rτ with respect to the initial time τ′ by the Leibniz integral rule and obtain that

rττ=-(i,j,c,c-1)Φϵ(i-c+1)bi,j,c,c-1λ-τsτi,j,c,c-1(τ,sτ)+ττ(i,j,c,c-1)Φϵ(i-c+1)bi,j,c,c-1λ-ysyi,j,c,c-1(τ,sτ)τdy, (141)

where sτi,j,c,c-1(τ,sτ)=sτi,j,c,c-1. The continuity of rττ follows. By Eq (124), syi,j,c,c-1(τ,sτ)τ is bounded, so rττ is bounded. We have proved Eq (133).

Proof of proposition 4

Proof. For some [τ1, τ2) ⊆ [0, λ) on which uτ is a constant vector function, by remark 5 we have for every fixed (i, j) pair and Γi,j = {(c, l):0 ≤ l < ci or c = i + 1, l = i} that

(c,l)Γi,jsτi,j,c,l1cip(i,j,c), (142)

and thus it follows from Eq (23) that

0ijMϵ0cijp(i,j,c)-(i,j,c,l)Γ\Γϵjsτi,j,c,lijMϵ0cijp(i,j,c)<ϵ. (143)

Similarly because by the definition of Ski,j,c,l for 1 ≤ km, for fixed (i, j) pair, (i, j, c, l) ∈ Γ,

0c,lSki,j,c,ln1ciPn(i,j,c), (144)

thus it follows from Eq (25) that

0ijMϵ0cijPn(i,j,c)-(i,j,c,l)Γ\ΓϵjSki,j,c,lnijMϵ0cijPn(i,j,c)<ϵ. (145)

For any k where 0kλ^, by proposition 3, it follows that

|Dk-n-dkn-|<|ij<Mϵ,0cijPn(i,j,c)-(i,j,c,l)ΓϵjSki,j,c,ln-(ij<Mϵ,0cijp(i,j,c)-(i,j,c,l)Γϵjskni,j,c,l)|+2ϵ=|ij<Mϵ,0cij(Pn(i,j,c)-p(i,j,c))-(i,j,c,l)Γϵj(Ski,j,c,ln-skni,j,c,l)|+2ϵij<Mϵ,0cij|Pn(i,j,c)-p(i,j,c)|+(i,j,c,l)Γϵj|Ski,j,c,ln-skni,j,c,l|+2ϵMϵ|Γϵ|(o(1)+op(1))+2ϵ=op(1)+2ϵ, (146)

and similarly,

|Dkn-dkn|<|ij<Mϵ,0ciPn(i,j,c)-(i,j,c,l)ΓϵSki,j,c,ln-(ij<Mϵ,0cip(i,j,c)-(i,j,c,l)Γϵskni,j,c,l)|+2ϵ=|ij<Mϵ,0ci(Pn(i,j,c)-p(i,j,c))-(i,j,c,l)Γϵ(Ski,j,c,ln-skni,j,c,l)|+2ϵij<Mϵ,0ci|Pn(i,j,c)-p(i,j,c)|+(i,j,c,l)Γϵ|Ski,j,c,ln-skni,j,c,l|+2ϵ|Γϵ|(o(1)+op(1))+2ϵ=op(1)+2ϵ. (147)

Proof of proposition 5

Proof. By Eq (25) for n large enough and 1 ≤ km, we have

1n=1kijMϵ1ci1(W=(i,j,c,c-1))u-1ni,j,c,c-11nijMϵ1ciinPn(i,j,c)ijMϵ,ciPn(i,j,c)<ϵ. (148)

The first inequality holds because the number of times nodes with states in the range ijMϵ, 1 ≤ ci are selected during the process is bounded above by their total in degree. Similarly by Eq (23), for τλ^,

0τijMϵ,1ci(i-c+1)sti,j,c,c-1λ-tuti,j,c,c-1dt0τijMϵ,1ciip(i,j,c)λ-tdtϵ0τ1λ-tdt=ϵlnλλ-τϵlnλϵ=O(ϵ). (149)

For any k where 0knλ^, by proposition 3 it follows that

|Rkn-rkn||R˜kn+1n=1kijMϵ1ci1(W=(i,j,c,c-1))u-1ni,j,c,c-1-(r˜kn+0knijMϵ,1ci(i-c+1)sti,j,c,c-1λ-tuti,j,c,c-1dt)||R˜kn-r˜kn|+O(ϵ)op(1)+O(ϵ), (150)

thus we have that

sup0knλ^|Rkn-rkn|=op(1)+O(ϵ). (151)

If τf = λ, it implies that dτ->0 for τ(0,λ^), then it follows from proposition 4 that Tnn=λ^+O(ϵ)+op(1). Then because at each step there is at most one more node defaulting, DTnn=Dnλ^n+O(ϵ)+op(1) and from proposition 4 again, D[nλ^]n=dλ^+O(ϵ)+op(1). ⌊⋅⌋ denotes the floor function. Further by the continuity of dτ on [0, λ], DTnn=dλ+O(ϵ)+op(1). Similarly, by Eq (151) and the continuity of rτ on [0, λ], we have that RTnn=rλ+O(ϵ)+op(1).

If τf < λ and ddτdτf-<0, by definition 4, sτ is continuous and thus by Eq (33) dτ- is also continuous. So there exists some τ′ > 0 such that dτ-<0 for τ ∈ (τf, τf + τ′) by the continuity of dτ-. Since ϵ is arbitrary, let ϵ be small enough such that infτ(τf,τf+τ)dτ-<-2ϵ and τ^ be the first time dτ- reaches the minimum. Because dτ^-<-2ϵ, then by proposition 4 Dnτ^-n<0 with high probability, so it holds that Tnn=τf+O(ϵ)+op(1). Again by the continuity of dτ and rτ on [0, λ], proposition 4 and Eq (151), DTnn=dτf+O(ϵ)+op(1) and RTnn=rτf+O(ϵ)+op(1).

In both cases we conclude that Eq (35) holds by tending ϵ → 0.

To prove Eq (37), note that R_{T_n} ≤ m ≤ (λ + 0.1)n for large n and D_{T_n} ≤ n, so R_{T_n}(G_n, P_n)/n and D_{T_n}(G_n, P_n)/n are bounded and thus uniformly integrable. For a sequence of uniformly integrable random variables, convergence in probability implies convergence in expectation. Therefore Eq (37) holds.

Proof of lemma 1

Proof. Solve (FOCP) for the optimal (u˜,τ˜f). Note that u˜β=0 for β ∈ Φ∖Φϵ. If there exists some p(i, j, c)>0, ijMϵ, 0 ≤ ci, then by remark 5, at τ˜f by summing over (i, j) pairs satisfying ijMϵ we can show that

ijMϵ,0cip(i,j,c)-(i,j,c,l)Γ\Γϵsτ˜fi,j,c,l0, (152)

and by the definition of τ˜f that

dτ˜f-=ijMϵ,0cijp(i,j,c)-(i,j,c,l)Γ\Γϵjsτ˜fi,j,c,l+ij<Mϵ,0cijp(i,j,c)-(i,j,c,l)Γϵjsτ˜fi,j,c,l-τ˜f=ijMϵ,0cijp(i,j,c)-(i,j,c,l)Γ\Γϵjsτ˜fi,j,c,l0. (153)

Now we construct a function u as uτ=u˜τ, ττ˜f and uτβ=1, τ˜f<τ, β ∈ Φ. Note that under u, there are always interventions after τ˜f, thus by remark 5, for a fixed (i, j) pair with the set Γi,j = {(c, l):0 ≤ l < ci or c = i + 1, l = i}, (c,l)Γi,jsτi,j,c,l will not change, i.e. (c,l)Γi,jsτi,j,c,l=(c,l)Γi,jsτ˜fi,j,c,l for τ>τ˜f. Let τf be the solution of dτf-=0 under u, then it follows that

dτf-d˜τ˜f=i,j,0cip(i,j,c)-(i,j,c,l)Γsτfi,j,c,l-(ij<Mϵ,0cip(i,j,c)-(i,j,c,l)Γϵsτ˜fi,j,c,l)=ijMϵ,0cip(i,j,c)-((i,j,c,l)Γsτ˜fi,j,c,l-(i,j,c,l)Γϵsτ˜fi,j,c,l)=ijMϵ,0cip(i,j,c)-(i,j,c,l)Γ\Γϵsτ˜fi,j,c,lijMϵ,0cip(i,j,c)<ϵ, (154)

and similarly

rτf-r˜τ˜f=ijMϵ,0cijp(i,j,c)-(i,j,c,l)Γ\Γϵjsτ˜fi,j,c,lijMϵ,0cijp(i,j,c)<ϵ. (155)

By the definition of ζ and ζ˜,

ζ(u,τf,p)ζ˜(u˜,τ˜f,p)+(K+1)ϵ. (156)

Recall (u*,τf*) is the optimal solution for the infinite dimensional Eq (39). By remark 6, (FOCP) assumes that the high degree nodes are invulnerable and because the (u˜,τ˜f) is the optimal solution for (FOCP), it provides the lower bound for the optimal objective function of the infinite dimensional Eq (39), i.e.

ζ˜(u˜,τ˜f,p)ζ(u*,τf*,p). (157)

Let the objective function be ζ(u, τf, p) under u, then by the optimality of (u*,τf*), we have that

ζ(u*,τf*,p)ζ(u,τf,p). (158)

In sum, we have that

ζ˜(u˜,τ˜f,p)ζ(u*,τf*,p)ζ(u,τf,p)ζ˜(u˜,τ˜f,p)+(K+1)ϵ. (159)

Thus the conclusion follows.

Proof of proposition 6

Proof. We apply the Extended Pontryagin Maximum Principle (EPMP) in Appendix C: Extended pontryagin maximum principle. First we present the correspondence of a notation A in EPMP and B in our application in the form AB.

tt,t0t0,tftf,(xti)i{1,,nx}(stα)αΓϵ,(uti)i{1,,nu}(utβ)βΦϵ,U{0,1},λ˚w˚,(λti)i{1,,nx}(wtα)αΓϵ,(t,xt,ut)Ki,j,1ci(i-c+1)sti,j,c,c-1uti,j,c,c-1,ϕ(tf,xtf)dtf,ψk(tf,xtf)=0,k=1,,nψdtf-=0. (160)

Let (w˚,wt) be the adjoint variables then w˚=1, since otherwise the necessary conditions of optimality becomes independent of the cost functional in Eq (42). The Hamiltonian function Eq (48) is a direct result of Eq (183). Note that nψ = 1 and

ψ(t,s)=i,j,0cijp(i,j,c)-(i,j,c,l)Γϵjsi,j,c,l-λ(1-et0-t). (161)

Taking partial derivative yields sψ(tf,stf)=(j,j,,j) which has rank 1.

Since the Hamiltonian function is affine in the control variable ut, by condition (1) of EPMP, we attain that, for 1 ≤ ci,

uti,j,c,c-1={0if(K+wti,j,c+1,c)sti,j,c,c-1>01if(K+wti,j,c+1,c)sti,j,c,c-1<00or1if(K+wti,j,c+1,c)sti,j,c,c-1=0. (162)

By distinguishing the two cases sti,j,c,c-1>0 and sti,j,c,c-1=0, we have the equivalent form in Eq (50).

Taking partial derivative of H with regard to s yields the differential equations of wt in condition (2). Note that H is autonomous, then according to condition (3) of EPMP, H(st,ut,wt) is a constant for t ∈ [t0, tf], which is condition (3).

Then define

Ψ(t,s)i,j,0cip(i,j,c)-(i,j,c,l)Γϵsi,j,c,l+v(i,j,0cijp(i,j,c)-(i,j,c,l)Γϵjstfi,j,c,l-λ(1-et0-tf)) (163)

and taking partial derivatives with respect to s and t respectively by condition (4) of EPMP together with the terminal condition Eq (47) leads to condition (4).

Proof of theorem 1

Proof. For the contagion process without intervention, we relate our model to the auxiliary model used in the proof of theorem 3.8 in [24].

Recall that in Dynamics we are given a set of nodes [n] and the degree sequence (d(v), d+(v))v∈[n] as well as the initial equity levels (e(v))v∈[n] and the network is constructed sequentially by matching any out half-link from the default set to a uniformly chosen unconnected in half-link at every step. For each node v we assign each in half-link a number ranging in {1,…,d(v)}. Let ∑v be the set of all permutations of the in half-links of node v ∈ [n], then a permutation π ∈ ∑v specifies the order in which the in half-links are connected.

Because every in half-link of v represents one unit of loan, v will default after e(v) of its in half-links have been connected (or e(v) of its in links have been revealed), for every permutation π ∈ Σv. So the default threshold θ(v, π) for node v, when the order in which the in half-links are connected is specified by π, is θ(v, π) = e(v), ∀π ∈ Σv. Further, our assumption 1 is equivalent to assumptions 4.1 and 4.2 in [24]. Moreover, under no intervention, the random graph generated in Dynamics conforms to the model defined in definition 5.4 of [24] with in and out degree sequences (d^−(v), d^+(v))_{v∈[n]} and default thresholds (e(v))_{v∈[n]}. So by theorem 3.8 in [24] we obtain the conclusions of theorem 1.

Proof of theorem 2

Proof. To simplify notation we suppress the asterisk “*”. In lemma 2 we presented the optimal control policy (ut)t∈[t0,tf] in terms of t, t0, tf, ts, t^{i,j,c}. Recall that in Eq (77) we have the following relations,

y=1-et0-tf,z=1-et0-ts,xi,j,c,c-1=1-et0-ti,j,c={yifK+vj-10orc=01-(1-y)(i-c)K(i-c+1)K+vj-1ifK+vj-1<0and1c<i+K+vj-1Ky0otherwise, (164)

as well as t = −ln(λ − τ), t0 = −ln λ, so we can change the variable from t to τ. Particularly we apply mapping f(t) = 1 − et0t which is strictly increasing in t, then we have

f(t)=τλ,f(t0)=0,f(ti,j,c)=xi,j,c,c-1,f(ts)=z,f(tf)=y. (165)

We replace each variable t, t0, tf, ts, ti,j,c in lemma 2 with its corresponding variable in Eq (165) resulting in the expressions for uτi,j,c,c-1. At last by assumption 2 on the relations between the control policy Gn=(g1(n),,gm(n)) and the function u, we have the conclusion in theorem 2.

Proof of theorem 3

Proof. In proposition 7 we have obtained the expressions for dtf- and dtf with ij < Mϵ in terms of (v, tf, ts), after change of variables to (v, y, z) with y=τfλ we have the following expressions for dτf- and dτf with their relations to I˜(y;v,z) and J˜(y;v,z) in definition 10.

dτf-=ij<Mϵj[c=0ip(i,j,c)P(Bin(i,xi,j,c,c-1)c)-1(vj-1=-K)p(i,j,i)((τfλ)i-zi)]-τf=λ(I˜(τfλ;v,z)-τfλ),dτf=ij<Mϵ[c=0ip(i,j,c)P(Bin(i,xi,j,c,c-1)c)-1(vj-1=-K)p(i,j,i)((τfλ)i-zi)]=J˜(τfλ;v,z). (166)

Suppose (y*, v*, z*) is an optimal solution for the optimization problem Eq (86) and note that y* is the smallest fixed point of I˜(y;v*,z*) and y*=τf*λ.

  1. If y* = 1, then τf*=λ. By the definition of dτf-, it can only occur when ij<Mjc=0ip(i,j,c)=λ and z*=τf*λ=1, thus we have dτf*=dλ=1, then by proposition 5,
    DTnnp1, (167)
    which proves (1) of theorem 3.
  2. If y* < 1 and I˜(y*;v*,z*)<1, then τf*<λ and ddτdτf*-=I˜(τf*λ;v*,z*)-1<0. Again it follows from proposition 5,
    DTnnpdτf*=J˜(y*;v*,z*). (168)
    which proves (2) of theorem 3. This concludes the proof of theorem 3.

It is important to note that the two cases in theorem 3 correspond to τf* = λ, and τf* < λ with (d/dτ) d^−_{τf*} < 0, respectively. By proposition 5 they guarantee that the limits of E[R_{T_n}(G_n, P_n)]/n and E[D_{T_n}(G_n, P_n)]/n in Eq (9) as n → ∞ are well defined; these limits are r_{τf} and d_{τf}, respectively.

Proof of lemma 4

Proof. In the following we suppress “*”. Note first that if v > 0, xi,j,c,c−1 is increasing in j. This implies that xi,j1,c,c−1 < xi,j2,c,c−1 for the two states in Φϵ, (i, j1, c, c − 1) and (i, j2, c, c − 1) where j1 < j2. By theorem 2, this further implies that at some step k such that nλxi,j1,c,c−1knλxi,j2,c,c−1, we should intervene on a node in state (i, j1, c, c − 1) when it is selected at k but not on a node in state (i, j2, c, c − 1). But this control policy is not optimal because both nodes are the same except the out degree and the node in state (i, j2, c, c − 1) is systematically more important.

Appendix B: Wormald’s theorem

The following is from [30]. Let a ≥ 2 be a fixed integer and ((Ytl)1la)t0 denote a sequence of real valued random variables indexed by n with its natural filtration (Ft)t0. Assume that there is a constant C0 > 0 such that YtlC0n for ∀n, t ≥ 0 and 1 ≤ la. Let fl:Ra+1R be functions and URa+1 be some bounded connected open set containing the closure of

{(0,z1,,za):P(Y0l=zln,1la)0forsomen}. (169)

Define the stopping time TU=inf{t1:(tn,Yt1n,,Ytan)U}. Assume the following three conditions are satisfied:

  1. (Boundedness) For some function ρ1 = ρ1(n)≥1 and ∀t < TU and 1 ≤ la,
    |Yt+1l-Ytl|ρ1. (170)
  2. (Trend) For some function ρ2 = ρ2(n) = o(1) and ∀t < TU and 1 ≤ la,
    |E(Yt+1l-YtlFt)-fl(tn,Yt1n,,Ytan)|ρ2. (171)
  3. (Lipschitz continuity) The functions (fl)1≤la are continuous and satisfies a Lipschitz condition on
    U{(t,z1,,za):t0} (172)
    with the same Lipschitz constant for each l.

Then the following holds:

  1. For (0,z^1,,z^a)U the system of differential equations
    dzlds=fl(s,z1,,za),1la (173)
    has a unique solution in U for zl:RR passing through
    z0l=z^l,1la (174)
    and which extends to points arbitrarily close to the boundary of U.
  2. Let ρ > ρ2 and ρ = o(1). For a sufficiently large constant C, with probability 1-O(ρ1ρexp(-nρ3ρ13)), it holds that
    sup0tnσ(Ytln-ztnl)=O(ρ) (175)
    where zsl is the solution in (1) with
    z0l=Y0ln (176)
    and
    σ=σ(n)=sup{s0:d(((zsl)1la),U)Cρ}, (177)
    where d(u, v) = max1≤ij|uivi| for u=(u1,,uj)Rj and v=(v1,,vj)Rj.

Appendix C: Extended pontryagin maximum principle

The following is from [32]. Consider the optimal control problem to minimize the cost functional including a terminal term

J(u,tf)t0tf(t,xt,ut)dt+ϕ(tf,xtf), (178)

with fixed initial time t0 and free terminal time tf, subject to the dynamical system

x˙t=f(t,xt,ut);xt0=x0, (179)

where the vector function xC1^[t0,T]nx represents the state variables characterizing the behavior of the system at any time instant t, and some general terminal constraints

ψk(tf,xtf)=0,k=1,,nψ. (180)

The admissible controls shall be taken in the class of piecewise continuous functions

uU[t0,T]{uC^[t0,T]nu:utUfort0ttf}, (181)

with tf ∈ [t0, T], where T > t0 and the nonempty, possibly closed and nonconvex set U denotes the control region.

Suppose ℓ and f are continuous and have continuous first partial derivatives with respect to (t, x, u) on [t0,T]×Rnx×Rnu, and also ϕ and ψ(ψk)k=1,,nψ are continuous and have continuous first partial derivatives with respect to (t, x) on [t0,T]×Rnx. Suppose that the terminal constraints Eq (180) satisfy the constraint qualification

rank(ψx(tf*,xtf**))=nψ (182)

where ψx(tf*,xtf**) denotes the Jacobian matrix of the partial derivatives of components of ψ with respect to x evaluated at (tf*,xtf**). Define the Hamiltonian function

H(t,x,u,λ˚,λ)=λ˚(t,x,u)+λTf(t,x,u). (183)

Let (u*,tf*)C^[t0,T]nu×[t0,T) denote a minimizer for the problem, and x*C1^[t0,T] the optimal state, then there exists a nx dimensional piecewise continuously differentiable vector function λt* and λ˚*{0,1} ((λ˚*,λt*) are called adjoint variables) and a Lagrange multiplier vector v*Rnψ such that (λ˚*,λt*)0 for every t[t0,tf*] and the following conditions hold:

  1. The function H(t,xt*,w,λ˚*,λt*) attains its minimum on U at w=ut* for every t[t0,tf*], i.e.
    H(t,xt*,w,λ˚*,λt*)H(t,xt*,ut*,λ˚*,λt*),wU. (184)
  2. (xt*,ut*,λ˚*,λt*) verifies the equations
    ddtxt*=f(t,xt*,ut*),ddtλt*=-xH(t,xt*,ut*,λ˚*,λt*) (185)
    at each instant t of continuity of u* and λ˚*{0,1}.
  3. H(t,xt*,ut*,λ˚*,λt*)=H(tf*,xtf**,utf**,λ˚*,λtf**)-ttf*tH(τ,xτ*,uτ*,λ˚,λτ*)dτ. Therefore, if tH=0, i.e. H is autonomous, then H is a constant over time.

  4. (Transversal condition) Define Ψ(t,x)λ˚*ϕ(t,x)+v*Tψ(t,x), then
    λtf**=xΨ(tf*,xtf**),H(tf*,xtf**,utf**,λ˚tf**,λtf**)=-tΨ(tf*,xtf**) (186)
    together with the terminal condition Eq (180) at t=tf*, i.e. ψk(tf*,xtf**)=0 for k = 1, …, nψ.
  5. The optimal control u* may or may not be continuous; in the latter case we have a corner point. In particular, the conditions that must hold at any corner point θ[t0,tf*] are
    xθ-*=xθ+*,λθ-*=λθ+*,H(θ-,xθ*,uθ-*,λ˚*,λθ*)=H(θ+,xθ*,uθ+*,λ˚*,λθ*). (187)

Proof. See theorem 3.33 and theorem 3.34 in [32].

Appendix D: Preliminary list of notations

Dk: the default set at step k;

Dk=Dk;

DTn: the number of defaulted nodes by the end of the process Tn;

Dk-: the number of unrevealed out links from the default set at step k;

En: the set of links in a random network;

Grn: a graph on n nodes;

Gn,m: the set of networks on n nodes with at most m directed links;

(Gk)0km: the filtration on the default contagion process;

Gn=(g1(n),,gm(n)): a control policy for a network of size n;

I(y),J(y), I˜(y,v,z), J˜(y,v,z): special functions defined for theorem 1, theorem 2 and theorem 3;

K: the “cost” of an intervention relative to a defaulted node;

Mϵ: an integer defined based on ϵ;

N0{0,1,2,};

N{1,2,}, the set of positive integers;

P: probability measure on the set Gn,m;

Pn(i, j, c): the empirical probability of the degrees and initial equity levels and Pn = (Pn(i, j, c))i,j,0≤ci;

Qk: the set of hidden out links from the default set at step k;

Rk: the accumulative number of interventions by step k;

RTn: the accumulative number of interventions by the end of the process Tn;

Ski,j,c,l: the number of nodes that are vulnerable initially and in state (i, j, c, l) at step k and Sk(Skα)αΓ;

Tn: the contagion process end time;

U: the set of all (Gk)0km adapted process μ;

(Vk, Wk): a pair of random variables denoting the link from node Vk to node Wk revealed at step k; By abuse of notation, Wk denotes the state of the node selected at k when there is no confusion;

b ≔ (bβ)β∈Φ: a vector of {0, 1} constants;

ckv: the sum of initial equity and accumulative interventions of node v at step k;

d(v),d+(v): the in and out degree of a node v;

dτ-: the asymptotic number of unrevealed out links of the default set at time τ;

dτ: the asymptotic fraction of defaults at time τ;

e(v): the initial equity of a node;

gk(n): the control function at step k for a network of size n;

h: the set of ODEs of sτ and st;

h′: the set of ODEs of wt;

lkv: the number of revealed in links of node v at step k;

m = m(n): the total in or out degree of a network of size n, maybe index variable as well;

n: the number of nodes, may be index variable as well;

[n] = {1, …, n};

uτi,j,c,c-1: a piecewise constant function; uτ=(uτβ)βΦ;

rτ: the asymptotic scaled number of interventions by time τ;

v: the Lagrange multiplier;

(vk, wk): the link from node vk to node wk revealed at step k;

wti,j,c,l: the adjoint variable;

ϵ: a positive real;

λ: the asymptotic mean of in (out) degree;

λ̂ ≔ λ − ϵ;

μk: intervention at step k and μ = (μk)1≤km;

μn: in Introduction denotes an intervention sequence for a network of n nodes;

τf: the end time in of the asymptotic process;

‖‖: any norm of Rl for some lN;

⌈⋅⌉: the ceiling function;

[⋅]: the round function;

˜: the corresponding variable resulted by the constraint ij < Mϵ, e.g. R˜k, u˜, τ˜f, r˜τ, d˜τ;

Bin(i, y): a binomial random variable in i trials with the probability of occurrence y;

Multin(i, x, y, 1 − xy) = (a, b, iab): a multinomial distribution in i trials with the probabilities of occurrence of each of three types being x, y and 1 − xy, and turns out to have a, b and iab occurrences of each type.

Important sets:

Φ ≔ {(i, j, c, c − 1): 0 ≤ i, 0 ≤ j, 1 ≤ c ≤ i},  Φϵ ≔ {(i, j, c, c − 1): i ∨ j < Mϵ, 1 ≤ c ≤ i},  Γ+ ≔ {(i, j, c, l): 0 ≤ i, 0 ≤ j, 0 ≤ c, 0 ≤ l ≤ i},  Γ ≔ {(i, j, c, l): 0 ≤ i, 0 ≤ j, 0 ≤ l < c ≤ i or c = i + 1, l = i},  Γϵ ≔ {(i, j, c, l): i ∨ j < Mϵ, 0 ≤ l < c ≤ i or c = i + 1, l = i},  Γi,j ≔ {(c, l): 0 ≤ l < c ≤ i or c = i + 1, l = i}. (188)

Supporting information

S1 Dataset. Dataset generated from the numerical experiments, on which Figs 6–11 are based.

The spreadsheet contains two tables reporting, under the optimal and the alternative intervention policies respectively, the scaled number of interventions R_{T_n}/n, the scaled number of defaults D_{T_n}/n, the scaled process end time T_n/m and the objective function value, for numbers of nodes n = 54, 64, …, 104, with 100 runs for each n. The dataset is publicly available via https://figshare.com/articles/Simulation_result/7477562 with DOI 10.6084/m9.figshare.7477562.

(XLSX)
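For readers who wish to tabulate summary statistics from S1 Dataset, the following minimal Python sketch (not part of the original submission) loads the downloaded workbook and averages the scaled quantities over the 100 runs for each network size n; the file name, sheet names and column labels below are placeholders and must be adjusted to the actual spreadsheet layout.

    import pandas as pd

    # Load every sheet of the downloaded S1 Dataset workbook (one table per
    # intervention policy) into a dict of DataFrames; the file name is a placeholder.
    tables = pd.read_excel("Simulation_result.xlsx", sheet_name=None)

    for policy, df in tables.items():
        # Assumed column labels: "n", "R_Tn/n", "D_Tn/n", "T_n/m", "objective".
        summary = df.groupby("n")[["R_Tn/n", "D_Tn/n", "T_n/m", "objective"]].mean()
        print(f"Intervention policy: {policy}")
        print(summary.round(4))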

Acknowledgments

The author thanks Lisa R Goldberg, Ilan Adler, Mariana Olvera-Cravioto and Alexander D Shkolnik for their valuable comments and suggestions.

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

The author received no specific funding for this work.

References

  • 1. Vollmer U. In: Backhaus J, editor. Lender of Last Resort. New York, NY: Springer; 2014. p. 1–6.
  • 2. Amini H, Minca A, Sulem A. Control of interbank contagion under partial information. SIAM Journal on Financial Mathematics. 2015;6(1):1195–1219. 10.1137/140981538
  • 3. Amini H, Minca A, Sulem A. Optimal equity infusions in interbank networks. Journal of Financial Stability. 2017;31:1–17. 10.1016/j.jfs.2017.05.008
  • 4. Cont R, Moussa A, Santos EBe. Network Structure and Systemic Risk in Banking Systems. SSRN eLibrary. 2010; p. 327–368.
  • 5. Hurd TR. Contagion! The Spread of Systemic Risk in Financial Networks; 2015.
  • 6. Diamond DW, Dybvig PH. Bank Runs, Deposit Insurance, and Liquidity. Journal of Political Economy. 1983;91(3):401–419. 10.1086/261155
  • 7. Allen F, Gale D. Financial contagion. Journal of Political Economy. 2000;108(1):1–33. 10.1086/262109
  • 8. Freixas X, Parigi B, Rochet JC. Systemic Risk, Interbank Relations and Liquidity Provision by the Central Bank. Journal of Money, Credit and Banking. 2000;32(3):611–638. 10.2307/2601198
  • 9. Acharya VV, Yorulmazer T. Cash-in-the-market pricing and optimal resolution of bank failures. Review of Financial Studies. 2008;21(6):2705–2742. 10.1093/rfs/hhm078
  • 10. Gorton G, Huang L. Liquidity, efficiency, and bank bailouts. American Economic Review. 2004;94(3):455–483. 10.1257/0002828041464650
  • 11. Philippon T, Schnabl P. Efficient Recapitalization. Journal of Finance. 2013;68(1):1–42. 10.1111/j.1540-6261.2012.01793.x
  • 12. Rogers LCG, Veraart LAM. Failure and Rescue in an Interbank Network. Management Science. 2013;59(4):882–898. 10.1287/mnsc.1120.1569
  • 13. Leitner Y. Financial networks: Contagion, commitment, and private sector bailouts. Journal of Finance. 2005;60:2925–2953. 10.1111/j.1540-6261.2005.00821.x
  • 14. Garcia-de Andoain C, Heider F, Hoerova M, Manganelli S. Lending-of-last-resort is as lending-of-last-resort does: Central bank liquidity provision and interbank market functioning in the euro area. Journal of Financial Intermediation. 2016;28:32–47. 10.1016/j.jfi.2016.01.003
  • 15. Afonso G, Kovner A, Schoar A. Stressed, not frozen: The federal funds market in the financial crisis. Journal of Finance. 2011;66(4):1109–1139. 10.1111/j.1540-6261.2011.01670.x
  • 16. Berger AN, Bouwman CHS, Kick T, Schaeck K. Bank liquidity creation following regulatory interventions and capital support. Journal of Financial Intermediation. 2016;26:115–141. 10.1016/j.jfi.2016.01.001
  • 17. Carlson MA, Duygan-Bump B, Nelson WR. Why Do We Need Both Liquidity Regulations and a Lender of Last Resort? A Perspective from Federal Reserve Lending During the 2007-09 U.S. Financial Crisis; 2015. Available from: http://www.ssrn.com/abstract=2573767.
  • 18. Furfine CH. Interbank Exposures: Quantifying the Risk of Contagion. Journal of Money, Credit and Banking. 2003;35(1):111–128. 10.1353/mcb.2003.0004
  • 19. Boss M, Elsinger H. An empirical analysis of the network structure of the Austrian interbank market. Financial Stability Report. 2004;(7):77–87.
  • 20. May RM, Arinaminpathy N. Systemic risk: the dynamics of model banking systems. Journal of the Royal Society, Interface. 2010;7(46):823–838. 10.1098/rsif.2009.0359
  • 21. Haldane AG, May RM. Systemic risk in banking ecosystems. Nature. 2011;469:351–355. 10.1038/nature09659
  • 22. Gai P, Haldane A, Kapadia S. Complexity, concentration and contagion. Journal of Monetary Economics. 2011;58:453–470. 10.1016/j.jmoneco.2011.05.005
  • 23. Nier E, Yang J, Yorulmazer T, Alentorn A. Network models and financial stability. Journal of Economic Dynamics and Control. 2007;31(6):2033–2060. 10.1016/j.jedc.2007.01.014
  • 24. Amini H, Cont R, Minca A. Resilience to contagion in financial networks. Mathematical Finance. 2013;00(0):1–37.
  • 25. Eisenberg L, Noe TH. Systemic risk in financial systems. Management Science. 2001;47(2):236–249. 10.1287/mnsc.47.2.236.9835
  • 26. Chinazzi M, Fagiolo G. Systemic Risk, Contagion, and Financial Networks: A Survey. SSRN Electronic Journal. 2013. 10.2139/ssrn.2243504
  • 27. van der Hofstad R. Random Graphs and Complex Networks; 2014.
  • 28. Furfine CH. The Microstructure of the Federal Funds Market. Financial Markets, Institutions and Instruments. 1999;8(5):24–44. 10.1111/1468-0416.00031
  • 29. Bech ML, Atalay E. The topology of the federal funds market. Physica A: Statistical Mechanics and its Applications. 2010;389(22):5223–5246. 10.1016/j.physa.2010.05.058
  • 30. Wormald NC. The Differential Equation Method for Random Network Processes and Greedy Algorithms. In: Lectures on Approximation and Randomized Algorithms; 1999. p. 73–155.
  • 31. Rudin W. Principles of Mathematical Analysis. 3rd ed. McGraw-Hill Education; 1976.
  • 32. Chachuat BC. Nonlinear and Dynamic Optimization: From Theory to Practice—IC-32: Spring Term 2009. EPFL; 2009.
