Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Mar 1.
Published in final edited form as: Adv Comput Math. 2020 May 4;46(3):42. doi: 10.1007/s10444-020-09791-1

Analytic regularity and stochastic collocation of high-dimensional Newton iterates

Julio E Castrillón-Candás 1, Mark Kon 1
PMCID: PMC7201586  NIHMSID: NIHMS1575300  PMID: 32377059

Abstract

In this paper we introduce concepts from uncertainty quantification (UQ) and numerical analysis for the efficient evaluation of stochastic high dimensional Newton iterates. In particular, we develop complex analytic regularity theory of the solution with respect to the random variables. This justifies the application of sparse grids for the computation of statistical measures. Convergence rates are derived and are shown to be subexponential or algebraic with respect to the number of realizations of random perturbations. Due the accuracy of the method, sparse grids are well suited for computing low probability events with high confidence. We apply our method to the power flow problem. Numerical experiments on the non-trivial 39 bus New England power system model with large stochastic loads are consistent with the theoretical convergence rates. Moreover, compared to the Monte Carlo method our approach is at least 1011 times faster for the same accuracy.

Keywords: Uncertainty Quantification, Newton-Kantorovich Theorem, Sparse Grids, Approximation Theory, Complex Analysis, Power Flow, Non-linear Stochastic Newtworks

1. Introduction

Newton iteration is a powerful method for solving many scientific and engineering problems, naturally arising in the context of power flow problems (Non-linear networks) [8], non-linear Partial Differential Equations [16], among others.

With the advent of massive computational resources, complex mathematical models are widely used for prediction in many scientific and engineering areas such as finance, weather forecasting, seismology, and semiconductor design. Due to the complex nature of these problems uncertainty naturally arises that can affect the reliability of these forecasts. Mathematically rigorous Uncertainty Quantification (UQ) has become an instrumental approach to judge the reliability of such predictions.

Uncertainty quantification is a mathematical approach that allows the charactizations of uncertainty through out the computational model and for a given Quantity of Interest (QoI). Even with present day computational resources, the application of UQ to large and complex non-linear networks such as electric power grids is a formidable undertaking. In particular, due the highly computational challenges. In this paper we seek to develop a UQ process that is mathematically rigorous, computationally efficient and easy to adapted with currently available power flow solvers [21, 32].

One of the most widely used UQ techniques is the Monte Carlo method [11], which is robust and easy to implement. Indeed, a deep analysis or understanding of the underlying stochastic model is not required, making this an attractive approach for the practicing engineer and scientist. However, convergence rates for iterative approximation methods can be very slow. For the application of UQ to the power flow problem, due to the large numbers of generators, loads and transmission lines one potentially faces, the problem can be high dimensional, non-linear, non-Gaussian, and not feasible with current computational resources. An alternative approach is the use of tensor product methods. However, these methods suffer significantly from the curse of dimensionality, thus making them unattractive even for moderate dimensionalities.

If the regularity of a QoI is relatively high with respect to the fundamental random variables, then application of stochastic collocation with Smolyak sparse grids [24, 25, 29] is a good choice. Indeed, this method has become popular in the field of computational applied mathematics and engineering as a surrogate model of stochastic Partial Differential Equations (sPDEs) [9] where the QoI is composed of moderately large numbers of random variables. The method is easy to implement and non-intrusive, i.e., each collocation point corresponds to uncoupled deterministic problems. The stochastic collocation method can be used with non-linear dependence of the QoI on the random variables. However, such grids still suffer from the curse of dimensionality. Alternative adaptive techniques have been developed, including anisotropic sparse grids [24], dimension adaptive quadrature [12] and quasi-optimal sparse grids [7, 26]. Yet these methods are still not feasible for very high dimensional problems and/or low regularity of the QoI. In addition, although quasi-optimal sparse grids lead to exponential convergence rates with respect to numbers of realizations, there is to our knowledge still no systematic way to construct them.

We have particular interest in the application of the Newton iteration to the solution of the power flow equations of electric grids [8]. In practice many of the generators (wind, solar, etc) and loads are stochastic in nature, and thus a traditional deterministic power flow analysis is insufficient. UQ applied to mathematical/statistical modeling of electrical power grids is still in its infancy. A major 2016 National Academy of Sciences report underscores the importance of this new area of research in its potential to contribute to this and the next generation of electric power grids [22]. The incorporation of uncertainty in the grid has gathered interest in the power system community.

In [15] Hockenberry et al. proposed the probabilistic collocation method (PCM) with applications to power systems. Although the results of this work are good, the uncertainty is computed only with respect to a single parameter. In [27] the authors test a polynomial chaos collocation and Galerkin approach to study the uncertainty of power flow on a small 2-bus power system. The results are good for low stochastic dimensions, but it is not clear how this approach will scale for large electric power grids, with higher associated dimensions. Moreover, there is little mathematical theory on the effectiveness of this approach.

More recently, in [17], Tang et al. proposed a dimension-adaptive sparse grid method [12] by using the off-the-shelf Matlab Sparse Grid Toolbox [18, 19]. The numerical results show the feasibility of this approach. However, there are also some weaknesses. The authors did not analyze the regularity of the power flow with respect to the uncertainty, so that the rate of convergence of the sparse grid is not known. Reduced regularity of the stochastic power flow can choke the accuracy. In contrast to Monte Carlo methods, which are robust, sparse grid methods are sensitive to the regularity of the function at hand. Lack of regularity will lead to erroneous results.

This motivates the application of numerical analysis and UQ theory to the Newton iteration for such problems as the electric power grid. Many of the ideas of this paper originate from the numerical solution of stochastic PDEs [9, 24, 25], where UQ is having a large impact. The goals of our paper is to integrate these methods with the theory and practice of power systems, where we believe that UQ will eventually have a strong impact. Furthermore, from the numerical analysis perspective regularity can be determined by complex analytic extensions of the functions of interest [3, 9, 31]. These lead to sharper convergence rates than regularity in terms of derivatives.

In Section 2 the mathematical background for this paper is introduced. In particular, the Newton-Kantorovich Theorem, sparse grids and convergence rates are discussed. Furthermore, complex analysis from the numerical analysis perspective for polynomial approximation is also treated. In Section 3 the complex analytic regularity of the solution of the Newton iteration is developed with respect to the random perturbations. In Section 4 the theory developed in Section 3 justifies the application of sparse grids to the power flow equations. The sparse grids are applied to the power flow equations of the 39 bus, 10 Generator, New England model. Subexponential or algebraic convergence rates are obtained that are consistent with the sparse grid convergence rates. These convergence rates make the sparse grid method suitable for computing stochastic moments to high accuracy. Furthermore, they will also be well suited for computing small event tail probabilities with high accuracy.

2. Mathematical background

In this section we introduce the general notation and mathematical background that will be used in this paper. We provide a summary of the three important topics: i) Newton-Kantorovich Theorem, ii) Stochastic spaces and iii) Sparse grids (approximation theory).

2.1. Newton-Kantorovich Theorem

Consider the Fréchet differentiable operator f : XY that maps a convex open set D of a Banach space X into a Banach space Y . Suppose that we are interested in finding an element xD such that

f(x)=0. (1)

Assuming that a solution of equation (1) exists, a series of successive approximations xvD, where v00, can be built. Consider the space of bounded linear operators L(Y,X) from Y into X. Let J(xv) be the Fréchet derivative of f(xv). Suppose that J(xv)−1L(Y,X) and consider the sequence

xv+1=xvJ(xv)1f(xv).

Assumption 1

We assume that that J satisfies the Lipschitz condition

J(x)J(y)λxy (2)

for some constant λ ≥ 0, and all x, yD. Furthermore, assume that x0D and there exist positive constants ϰ and δ such that

J(x0)1ϰ,
J(x0)1f(x0)δ, (3)
h2ϰλδ1,

U(x0, t*) ⊂ D, with U(x, r) the open ball {y : ‖yx‖ ≤ r} and t*=2h(11h)δ.

If Assumption 1 above is satisfied, then by the Newton-Kantorovich theorem [2], for all v0,

  1. The Newton iterates xv+1 = xvJ(xv)−1f(xv) and J(xv)−1 exist.

  2. xvU(x0, t*) ⊂ D.

  3. x* = limv→∞xv exists, x*U(x0,t*)¯, and f(x*) = 0 uniquely.

2.2. Stochastic spaces

Let be the set of outcomes from the complete probability space (, F, ), where F is a sigma algebra of events and is a probability measure. Define Lq(Ω), q ∈ [1, ∞], as the Banach spaces

Lq(Ω){u:Ω|Ω|u(ω)|qd(ω)<} and
L(Ω){u:Ω|-ess supωΩ|u(ω)|<}.

Let W := [W1,…,WN] be an N-component random vector measurable in (, F, ) that takes values on ΓΓ1××ΓNN, with Γn := [−1, 1]. Let B(Γ) be the Borel σ-algebra. Suppose that a induced measure μW on (Γ, B(Γ)) is defined as μW(A)(W1(A)) for all AB(Γ). Given that the induced measure is absolutely continuous with respect to Lebesgue measure on Γ, then there exists a density function ρ(q) : Γ → [0, +∞) such that

(WA)(W1(A))=Aρ(q)dq,

for any event AB(Γ). Furthermore for any measurable function W[L1(Γ)]N define the expected value as

E[W]=Γqρ(q)dq.

Define also the Banach spaces (for q ≥ 1)

Lρq(Γ){u:Γ|Ω|u(q)|qρ(q)dq<} and
Lρ(Γ){u:Γ|ρ-ess supqΓ|u(q)|<}.

Note that the above ρ-essential supremium is with respect to the measure induced by the density function ρ(·), rather than Lebesgue measure itself.

In general the density ρ(·) will not factorize into independent probability density functions, making higher dimensional manipulations difficult in some cases. In [3] the authors recommend use of an auxiliary probability density function ρ^:Γ+ that factorizes into N independent ones, i.e.,

ρ^(q)=n=1Nρ^(qn),q=(q1,,qN)Γ, (4)

where we will assume that ρ(q)ρ^(q)L(Γ)<. Note that in contrast L(Γ) (without a subscript) is with respect to the supremium in the Lebesgue measure.

2.3. Sparse grids

Our objective is to efficiently approximate a function u:Γ defined on high dimensional domains using global polynomials. The accuracy of the approach will directly depend on the regularity of the function. Let Pp(Γ)L2(Γ) be the span of tensor product polynomials of degree at most p = (p1, …,pN); i.e., Pp(Γ)=n=1NPpn(Γn) with Ppn(Γn)span(qnm,m=0,,pn), n = 1,…,N. For univariate polynomial approximation, we define a sequence of levels i = 0, 1, 2, 3,… corresponding to increasing degrees m(i)0 polynomial approximation for given coordinates. Here for a given approximation scheme, m(·) is a fixed function. In general our multivariate approximation scheme will assume different levels of approximation in for different coordinates n = 1,…,N.

We consider separate univariate Lagrange interpolants in Γ along each dimension n, given as Inm(in):C0(Γn)Pm(in)1(Γn). Specifically, let

Inm(in)(u(qn))jn=1m(in)u(qjni)ln,jn(qn), (5)

where {ln,j}j=1m(in) is a Lagrange basis for the space Ppn(Γn), the set {qjni}jn=1m(in), represents m(in) discrete locations (the interpolation knots) in Γn, the index in ≥ 0 is the level of approximation, and m(in)+ is the number of collocation nodes at level in+ where m(0) = 0, m(1) = 1 and m(in) ≤ m(in +1) if in ≥ 1.

Remark 1 Since m(in) represents the number of interpolation knots for the Lagrange basis, we have pn = m(in) + 1.

One of the most common approaches to constructing Lagrange interpolants in high dimensions is the formation of tensor products of Inm(in) along each dimension n. However, as N increases the dimension of Pp increases as n=1N(pn+1). Thus even for moderate dimensions N the computational cost of a Lagrange approximation becomes prohibitive. However, in the case of sufficient complex analytic regularity of the function u with respect to the random variables defined on Γ, a better choice is the application of Smolyak sparse grids. In the rest of this section the construction of the classical Smolyak sparse grid (see e.g. [6, 29]) is summarized. More details can be found in [4].

Consider the difference operator along the nth dimension given by

Δnm(in)Inm(in)Inm(in1). (6)

Given an integer w ≥ 0, called the approximation level, and a multi-index i=(i1,,iN)0N, let g:0N be a strictly increasing function in each argument and define a sparse grid approximation of function u(q) ∈ C0(Γ), restricted in order by g:

Swm,g[u(q)]=i0N:g(i)wn1N(Δnm(in))(u(q)). (7)

We observe that the sparse grid approximation is constructed from a linear combination of polynomial tensor product interpolations. However, the size of the polynomial space is controlled by g(i) ≤ w in (7).

Define the following Ntuple m(i) := (m(i1),…,m(iN)) and consider the set of polynomial multi-degrees

Λm,g(w)={pN,g(m1(p+1))w},

where 1 is an N dimensional vector of ones, and the associated multivariate polynomial space

Λm,g(w)(Γ)=span{n=1Nqnpn, with pΛm,g(w)}.

For a Banach space V let

C0(Γ;V){u:ΓV is continuous on Γ and maxyΓu(y)V<}.

It can be shown that the approximation formula given by Swm,g is exact in Λm,g(w)(Γ). We state the following proposition that is proved in [4].

Proposition 1

  1. For any uC0(Γ; V ), we have Swm,g[u]Λm,g(w)V.

  2. Moreover, Swm,g[u]=uuΛm,g(w)V.

Remark 2 The tensor product space Λm,g(w)V is more easily understood as the space of polynomials with Banach-valued coefficients. Furthermore, Swm,g[u]Λm,g(w)V is interpreted as a sparse grid approximation of a V -valued continuous function.

A good choice of m and g is given by the Smolyak sparse grid definitions (see [6, 29])

m(in)={1for in=12in1+1for in>1  and  g(i)=n=1N(in1).

Furthermore Λm,g(w){pN:nf(pn)w} where

f(pn)={0,pn=01,pn=1log2(pn),pn2.

Other common choices are shown in Table 1.

Table 1.

Sparse grid approximations formulas for TD and HC.

Approx. space sparse grid: m, g polynomial space: Λ(w)
Total Degree (TD) m(in) = in
g(i) = ∑n(in − 1) ≤ w
{p0N:npnw}
Hyperbolic Cross (HC) m(i) = i
g(i) = ∏n(in) ≤ w + 1
{p0N:Πn(pn+1)w+1}

With this choice of (m, g) and the application of Clenshaw-Curtis abscissas (which are locations of interpolation points given as extrema of Chebyshev polynomials) leads to nested sequences of one dimensional interpolation formulae. In consequence, the sparse grid formed by this choice is highly compressed in comparison to the full tensor product grid. Another good choice includes Gaussian abscissas [23]. For any choice of m(in) > 1 the Clenshaw-Curtis abscissas are given by

qjnin=cos(π(jn1)m(in)1),  jn=1,,m(in).

In Figure 1 an example of Clenshaw Curtis and Gaussian abscissas are shown for w = 5.

Fig. 1.

Fig. 1

Clenshaw-Curtis (left) and Gaussian abscissas (right) for w = 5 levels.

As previously pointed out, the probability density function ρ does not necessarily factorize in higher dimensions. As an alternative we use the auxiliary distribution ρ^, which factorizes as ρ^(q)=n=1Nρ^n(qn) and is close to the original distribution ρ(q). Suppose k is a given global index determined by the set of indices k1,kN as k = k1+p1(k2−1)+p1p2(k3−1)+p1p2p3(k4−1)+…. Given a function u : ΓV , the quadrature scheme Eρ^p[u] that approximates the integral E[u(q)]Γu(q)ρ^(q)dq can now be computed based on the distribution ρ^(q) as

Eρ^p[u]=k=1NPωku(q(k)),ωk=n=1Nωknωkn=Γnln,kn2(qn)ρ^n(qn)dqn,

with q(k)n the locations of the quadrature knots, and Np the number of Gauss quadrature points controlling the accuracy of the quadrature scheme. Recall that ln,kn2(qn) define Lagrange polynomials defined in equation (5). The term E[u(q)] can be approximated as

E[Swm,g[u(q)]]Eρ^p[Swm,g[u(q)]ρρ^],

and similarly the variance var[u(q)] is approximated as

var[u(q)]E[(Swm,g[u(q)])2]E[Swm,g[u(q)]]2***Eρ^p[(Swm,g[u(q)])2ρρ^]Eρ^p[Swm,g[u(q)]ρρ^]2.

Remark 3 The weights ωkn and node locations q(k) are computed from the auxiliary density ρ^. For standard distributions of ρ^ such as uniform and Gaussian, these are already tabulated to full accuracy. Otherwise they must be computed by solving for roots of orthogonal polynomials and using a quadrature scheme. However, the integrals involved are only one dimensional. See [3] (Section 2) for details.

We now develop some rigorous numerical bounds for the accuracy of the sparse grid approximation. Let Cmixk(Γ;) denote the space of functions with continuous mixed derivatives up to degree k:

Cmixk(Γ;)={u:Γ:α1,,αNuα1q1αNqNC0(Γ;),n=1,,N,αnk}

and equipped with the following norm:

uCmixk(Γ;)={u:ΓmaxqΓ|α1,,αNu(q)α1q1αNqN|<}.

Assume that uCmixk(Γ;). In [6] the authors show that it must follow that

uSwm,g[u]L(Γ)C(k,N)uCmixk(Γ)ηk(logη)(k+2)(N1)+1,

where η is the number of knots of the sparse grid Swm,g. However, the coefficient C(k,N) is in general not known [6].

If the function u admits a complex analytic extension, a better approach for deriving error bounds for the polynomial approximation arises from exploitation of analysis in the complex plane. In [25] the authors derive Lρ(Γ) bounds based on analytic extensions of u on a well defined region ΨN with respect to the variables q. These bounds are explicit, and the coefficients can be estimated and depend on the size of the region Ψ.

In [24, 25] error estimates are derived for the isotropic and anisotropic Smolyak sparse grids with the choice of Clenshaw-Curtis and Gaussian abscissas. Let η be the number of collocation knots and Eσ^1,,σ^NΠn=1NEn,σ^nΨN, where

En,σ^n={z with Re z=eδn+eδn2cos(θ),  Im z=eδneδn2sin(θ):θ[0,2π),σ^nδn0}

and σ^n>0 (see the Bernstein ellipse in Figure 2). The authors show that the error uSwm,g[u]Lρ(Γ) exhibits algebraic or sub-exponential convergence with respect to η (see Theorems 3.10, 3.11, 3.18 and 3.19 in [25] for more details) whenever uC0(Γ,) admits an analytic extension on the polyellipse Eσ^1,,σ^N.

Fig. 2.

Fig. 2

Bernstein ellipse along the nth dimension. The ellipse crosses the real axis at eσ^n+eσ^n2 and the imaginary axis at eσ^neσ^n2.

We now recall the definition of Chebyshev polynomials, useful in deriving error estimates on sparse grids. Let Tk:Γ1, k = 0, 1,…, be a kth order Chebyshev polynomial over [−1, 1]. These polynomials are defined recursively as:

T0(y)=1,T1(y)=y,,Tk+1(y)=2yTk(y)Tk1(y),.

The following theorem characterizes approximation of analytic functions using Chebyshev polynomials.

Theorem 1

Let u be analytic and absolutely bounded by M on Elogζ, ζ>1. Then the expansion

u(y)=α0+2k=1αkTk(y),

holds for all yElogζ where

αk=1π11u(y)Tk(y)1y2dy

and, additionally, |αk| ≤ M/ζk. Furthermore if y ∈ [−1,1] then

|u(y)α02k=1mαkTk(y)|2Mζ1ζm.

Proof See Theorem 8.2 in [31] □

We follow the arguments in [3, 23] and using the fact that the interpolation operator Inm(in) is exact on the space Ppn1, i.e. for any vPpn1 we have that Inm(in)(v)=v, it can be shown that if u is continuous on [−1,1] and has an analytic extension on Eσn we have, from Theorem 1,

(IInm(in))uLρ(Γn)(1+Λm(i))minvPm(in)1uvC0(Γ,)****(1+Λm(i))2M(u)eσn1eσnm(in),

where Λm(in) is the Lebesgue constant and is bounded by 2π−1(log(m − 1)+1) (see [3]) and M = M(u) is the maximal value of u on Eσn. Thus, for n = 1,…,N we have

(IInm(i))uLρ(Γn)M(u)C(σ^n)ineσn2in, (8)

where σn=σ^n2>0 and C(σn)2(eσn1). Recalling the definition of Δnm(in) from equation (6), we have that for all n = 1,…,N

Δ(u)m(in)Lρ(Γn)=(Inm(in)Inm(in1))uLρ(Γn)(IInm(in))uLρ(Γn)+(IInm(in1))uLρ(Γn)2M(u)C(σn)ineσn2in1. (9)

By applying equation (9) to Lemma 3.5 in [25], we are now in a position to slightly modify Theorems 3.10 and 3.11 in [25] and restate them into a single theorem given below. However, the following assumptions and definitions are first needed:

  • We set σ^minn=1,,Nσ^n, i.e. for an isotropic sparse grid the general sub-exponential decay will be restricted by the smallest σ^.

  • Let
    M˜(u)=supgEσ^1,,σ^N|u(g)|,
    σ=σ^/2,μ1=σ1+log(2N), and μ2(N)=log(2)N(1+log(2N)),
    a(δ,σ)exp(δσ{1σ log2(2)+1log(2)2σ+2(1+1log(2)π2σ)}),
    C˜2(σ)=1+1log2π2σ,δ*(σ)=e log(2)1C˜2(σ),
    C1(σ,δ,M˜(u))=4M˜(u)C(σ)a(δ,σ)eδσ,
    μ3=σδ*C˜2(σ)1+2 log(2N), and 
    Q(σ,δ*(σ),N,M˜(u))=C1(σ,δ*(σ),M˜(u))exp(σδ*(σ)C˜2(σ))max{1,C1(σ,δ*(σ),M˜(u))}N|1C1(σ,δ*(σ),M˜(u))|.

Theorem 2

Suppose that uC0(Γ;) has an analytic extension on Eσ^1,,σ^N and is absolutely bounded by M˜(u). If w > N / log 2 and a sparse grid with Clenshaw-Curtis abscissas is used, then the following bound is valid:

uSwm,guLρ(Γ)Q(σ,δ*(σ),N,M˜(u))ημ3(σ,δ*(σ),N) exp(Nσ21/Nημ2(N)), (10)

Furthermore, if wN/log 2 then the following algebraic convergence bound holds:

uSwm,guLρ(Γ)C1(σ,δ*(σ),M˜(u))max{1,C1(σ,δ*(σ),M˜(u))}N|1C1(σ,δ*(σ),M˜(u))|ημ1. (11)

Proof This is proved by applying the inequality of equation (9) to the proof of Theorems 3.10 and 3.11 in [25]. □

In many practical cases not all dimensions of Γ are equally important. In these cases the dimensionality of the sparse grid can be significantly reduced by means of anisotropic sparse grids. It is not hard to construct associated anisotropic sparse approximation formulae by having the restriction function g depend on the input random variables qn.

2.4. Analyticity

Throughout this paper we will apply several important complex analysis results. Suppose that f:U is a complex valued function defined on the open set U. Rewrite f into real and complex parts as f(z) := u(z) + iv(z), where z = x + iy. The Cauchy-Riemann equations are then stated as:

ux=vyuy=vx (12)

From the Cauchy-Riemann criterion (Chap1, p3 in [14]) if f satisfies equation (12) and is continuously differentiable on the real axis then f is analytic.

We state Hartog’s theorem (Chap1, p32 in [20]). This theorem allows us to determine analyticity for multivariate functions a series of single dimensional analytic functions.

Theorem 3 (Hartog’s theorem)

Let UN be an open set and f:U. Suppose that for each j = 1,…,N and each fixed z1,…,zj−1,zj+1,…,zN the function ψf(z1,…,zj−1,ψ,zj+1,…,zN) is holomorphic, in the classical one variable sense, on the set U(z1,,zj1,zj+1,,zN){ψ:(z1,,zj1,ψ,zj+1,,zN)U}. Then f is continuous on U.

From Osgood’s lemma continuity of the function f on U implies analyticity (Chap1, p2 in [14]).

3. Analyticity of the Newton iteration

It is profitable here for purposes of clarification to consider the Newton iteration in a general function space context. Specifically, let X and Y (see section 2) be Banach spaces. Consider the following problem: Find xDX such that

f(x,q)=0, (13)

where qΓ and f : D × ΓY . Equation (13) is then solved using the Newton iteration under the conditions of the Newton-Kantorovich Theorem (see section 2).

The convergence rate of the Newton iterates based on a sparse grid approximation as a function of grid size is directly affected by the regularity properties with respect to parameters qΓ. Regularity is characterized in terms of an analytic extension in N of the iterates.

In the sequel we will treat qΓ as a random parameter, which will be suppressed occasionally. Thus for example, below we will write ffq, JJq, MvMv,q, etc., and we can write f(x, q) ≡ fq(x). For any qΓ (which we fix for now) consider the Newton sequence

xv=Mv(xv1)xv1J(xv1)1f(xv1). (14)

where J : XY is the Fréchet derivative of f : DY . Assume that x0D0D and for all v let DvD be the successive images under the map Mv, so that Mv(Dv−1) = Dv. Note at this point the random parameter qΓ is unchanging throughout the iteration; the iterated domains Dv however depend on q.

Suppose that the parameter q is now extended to a complex parameter g with gΨΓ, where ΨN. We can now form a complex extension of the sequence (14) as follows. We will complexify the pair (x, q) into a pair of complex variables (z, g), with z the complexification of x. Assume that for f(x0) ≡ fq(x0) : D0E0 there exists an analytic extension f(x0) ≡ fg(z0) : Θ0Φ0, where Θ0 and Φ0 are contained in a suitable complex Banach spaces, which are the respective complex extensions of X and Y . Given a function f on a real linear domain D, and a function f* on a complex linear domain ΘD, we say that f* is an analytic extension of f if f* is analytic on its domain, and the restriction f*|D = f. When there exists an analytic extension f* we say that f can be analytically extended.

Remark 4 Note that as before we write ffg, JJg, MvMv,g, etc. It is understood from context that the notational equivalence is over the extension of the variables (x, q) into the complex pair (z, g).

Similarly, assume that J(x0) ∈ L(D0,E0) can be extended analytically as J(z0) ∈ L(Θ00). Here L(·,·) is the space of bounded operators between two spaces. Through the above complexifications, equation (12) defines a complexification of the mapping Mv. We now repeat the above iteration using the complexified maps defined here. Thus there exists a series of sets Θ0,…,Θv and Φ0,…,Φv, such that for f(xv) : DvEv and J(xv) ∈ L(Dv,Ev) the analytic extensions f(zv) : ΘvΦv and J(zv) ∈ L(Θvv) are onto. Thus the sequence (14) is extended in N as follows: Let z0 = x0 and for all v and gΨ (which we also fix) form the sequence

zv=Mv(zv1)zv1J(zv1)1f(zv1), (15)

where Mv : Θv−1Θv.

Remark 5 The domain Θ0 contains the initial condition z0. Under certain assumptions and with a judicious choice of Θ0 it can be shown that the sequence in (15) converges in a pointwise sense inside Θ0. This will be explored in detail in section 3.1.

Suppose that zv is an analytic extension of xv on ΨN (see Figure 3). Then the convergence rates of the sparse grid applied to any entry of interest of zv can be characterized. The size of the set Ψ determines the regularity properties of the solution. From the sparse grid discussion in section 2.3 we embed a polyellipse Eσ^1,,σ^NΠn=1NEn,σ^n in Ψ. From Theorem 10 the Lρ(Γ) convergence rate of the sparse grid is sub-exponential (or algebraic) with respect to the number of sparse grid knots η. The decay of the sparse grid is dominated by σ = minn=1,…,N σn. Thus, the larger σ is the faster the convergence rate.

Fig. 3.

Fig. 3

Analytic extension of the domain Γ. Any vector qΓ is extended in Ψ by adding a vector vN i.e. g = q + v.

Remark 6 For finite dimensional spaces X=Y=m, m0, the Fréchet derivative J corresponds to the Jacobian of f. In the rest of the paper it is assumed that Dm and ‖ · ‖ corresponds to the standard Euclidean norm or the standard matrix norm, depending on context. For the case of the power flow equations m will be simply related to the number of nodes of the power system [8]. We will be using the notion of analytic extensions, which can be defined as follows.

We can now prove an important theorem for our purposes. First, denote f(zv) : ΘvΦv as fv, and J(zv) ∈ L(Θvv) as Jv (note that this depends on the generic initial z0Θ0 at which the Jacobian is computed).

Theorem 4

Assume that for all v0:DvX and

  1. fv : DvEv can be analytically extended to fv : ΘvΦv.

  2. There exists a coefficient cv > 0 such that
    σmin([JRvJIvJIvJRv])cv,

    where σmin(·) refers to the minimum singular value, JRvRe Jv and JIvIm Jv.

Then for all v0 there exists an analytic extension of xv on Ψ.

Proof The main strategy for this proof is to use the Cauchy-Riemann equations. This avoids having to explicitly show that the inverse of the complex Jacobian matrix Jv is analytic. The existence of an analytic extension for zv for each separate complex dimension is shown. The Hartog’s theorem and Osgood’s lemma (Chap1, p2 in [14]) is then used to show analyticity with respect to all the complex dimensions.

For fixed v consider the extension xvzv = xv + wv in Θv, where wvm and zvΘv. In complex form zv=zRv+izIv, where zRv=Re zv, and zIv=Im zv. Furthermore, consider the extension of qg = q + v in Ψ, where vN and gΨ. The extension of the iteration (14) on Θv ×Ψ leads to the following block form iteration

[JRvJIvJIvJRv]([zRv+1zIv+1][zRvzIv])=[fRvfIv], (16)

where fRvRe fv and fIvIm fv. From (ii) it follows that equation (16) is well posed and is a valid extension of equation (14) on Θv × Ψ. We now show that zv+1 is an analytic extension on Θv × Ψ.

We focus our attention on the kth variable of zv as zkv and write it in complex form as zkv=s+iw. By differentiating equation (16) with respect to s and w we obtain

s[JRvJIvJIvJRv][[zRv+1zIv+1][zRvzIv]]+[JRvJIvJIvJRv]s[[zRv+1zIv+1][zRvzIv]]=s[fRvfIv]w[JRvJIvJIvJRv][[zRv+1zIv+1][zRvzIv]]+[JRvJIvJIvJRv]w[[zRv+1zIv+1][zRvzIv]]=w[fRvfIv]. (17)

From assumption (ii) we conclude that szRv+1, szIv+1, wzRv+1 and wzIv+1 exist on Θv × Ψ. The following step is to show that the Cauchy-Riemann equations for zv+1 are satisfied on Θv × Ψ.

Let P(zv)szRvwzIv and Q(zv)wzRv+szIv, then from equation (17)

[(sJRvwJIv)(sJIv+wJRv)(sJIv+wJRv)(sJRvwJIv)]([zRv+1zIv+1][zRvzIv])+[JRvJIvJIvJRv]([P(zv+1)Q(zv+1)][P(zv)Q(zv)])=[sfRvwfIvsfIv+wfRv].

Now, (i) implies that JvL(Dv,Ev) can be analytically extended to JvL(Θvv). Since J(zv, g) and fv(zv, g) are analytic on Θv ×Ψ then from the Cauchy-Riemann equations

[(sJRvwJIv)(sJIv+wJRv)(sJIv+wJRv)(sJRv+wJIv)]=0 and [sfRvwfIvsfIv+wfRv]=0.

Since zkv is a linear polynomial of s + iw then P(zv) = Q(zv) = 0 on CN and thus P(zv+1) = Q(zv+1) = 0 on Θv × Ψ. We conclude that zv+1 is analytic for the kth variable for all zvΘv and gΨ. Following a similar argument we can show that for l = 1,…,N the lth variable extension of q has leads to an analytic extension of zv+1 whenever zvΘv and gΨ. We now extend the analyticity of zv+1 on all of Θv × Ψ.

Since zkv+1 is analytic for all k = 1,…,m and the lth variable of q has an analytic extension for all l = 1,…,N whenever zvΘv and gΨ, then from Hartog’s theorem we conclude that zv+1 is continuous on Θv × Ψ. From Osgood’s lemma it follows that zv+1 is analytic on Θv ×Ψ. From an induction argument and using that fact that the composition of analytic functions is analytic then it follows that zv+1 is analytic in Ψ, for all v. □

If the assumptions of Theorem 4 are satisfied then zv is complex analytic in Ψ and it is reasonable to construct a series of sparse grid surrogate models of the entries of the vector xv. Note that in practice we restrict out attention to a subset of the variables of interest of xv. With a slight abuse of notation denote Swm,g[xv(q)] as the sparse grid approximation of the entries of interest of the vector xv.

From Theorem 2 we observe that the accuracy of the sparse grid approximation is a function of i) the size of the polyellipse Eσ^1,,σ^NΨ and ii)

M˜(zv)=supzvΘv,k=1,,m|zkv(g)|.

Remark 7 It is important to note that if the complex sequence (15) does not converge, then the size of the sets Θv can become unbounded. In particular, it is possible that M˜(zv) as v → ∞ even if J(zv)−1L(Φvv) exists for all v0. Thus the sparse grid error bound given by Theorem 2 explodes. A control of the size of the sets Θv are need. To this end we shall use the Newton Kantorovich theorem to show that the sets Θv are bounded and there exists a constant cv > 0 such that σmin([JRvJIvJIvJRv])cv>0.

Our objective now is to analyze under what conditions the complex sequence remains bounded. In particular, for all v0, we ask if it is possible to construct bounded regions Um and ΨN such that zv is contained in U and thus

M˜(zv)supzvUzv.

To help answer this question we first show that the complex sequence (15) is itself a Newton sequence.

Remark 8 We have to clarify what we mean by the Fréchet derivative of the complex function f : ΘvΦv. The algebraic problem of equation (13) can be complexified as follows: Find zΘ0 such that f(z, g) = 0 for all gΨ. This can be re-written in vector form as: Find z = zR + izIΘ0 such that Ref(zR, zI, gR, gI) = 0 and Im f(zR, zI, gR, gI) = 0 for all g = gR +igIΨ. The corresponding Newton iteration is based on

[zRfRvzIfRvzRfIvzIfIv]([zRv+1zIv+1][zRvzIv])=[fRvfIv], (18)

where zRfRv is the Fréchet derivative of fRv with respect to the variables zR and similarly for the rest. We refer to the matrix

Jzv[zRfRvzIfRvzRfIvzIfIv]

as the Fréchet derivative of f : ΘvΦv.

Lemma 1

Suppose assumption i) of Theorem 4 are satisfied. Then the complex analytic extension J(zv) ∈ L(Θv, Φv) of J(xv) ∈ L(Dv,Ev) is equivalent to the Fréchet derivative of fv : ΘvΦv, i.e.

[JRvJIvJIvJRv]=[zRfRvzIfRvzRfIvzIfIv].

Proof We first prove this result for m = 1 dimension. Suppose that f:D, is a Fréchet differentiable function and let f:Ξ be the analytic continuation on the non-empty open set Ξ. The analytic function f:Ξ can be rewritten as f(x, y) = fR(x, y) + ifI(x, y) for all x + iyΞ. Since f is analytic on Ξ, from the the identity theorem [1] (uniqueness of complex analytic extensions) we have that fR and fI are unique in Ξ. Furthermore, since f is analytic the Cauchy-Riemann equations are satisfied. Thus

[xfRxfIxfIxfR]=[xfRyfRxfIyfI] (19)

in Ξ and f:Ξ is Fréchet differentiable. Now, xf(x, y) = xfR(x, y) + i∂xfI(x, y) in Ξ, and from the uniqueness property of the Fréchet derivative all the terms are unique. Recall that xf(x) defined in D is the Fréchet derivative of f:D. Write the analytic extension of xf(x) (defined in D) on Ξ as g(x, y) + ih(x, y), with x + iyΞ. Since xf(x) = xf(x, y) = xfR(x, y) + i∂xfI(x, y) for y = 0 and xD, from the uniqueness of the analytic extension we conclude g = xfR and h = xfI for all x + iyΞ. From equation (19) the conclusion follows.

We can now prove our statement for the general case using a simple extension of the above argument. Since fv : ΘvΦv is complex analytic, from the identity theorem [1] it is the unique extension of fv : DvEv. (Note that the unique extension of the identity theorem applies in multi-variate case, which includes the variables xv and q in the domains Θv and Ψ respectively.) From the Cauchy-Riemann equations the functions fv : ΘvΦv are Fréchet differentiable and unique. Now, with a slight abuse of notation, denote JZRv as the Jacobian of fv : ΘvΦv, with respect to the real variables ZRv only. By using the above one dimensional argument we can show that the analytic extension of each entry of J(xv) matches JZv on the real part of Θv. From the Cauchy-Riemann equations we conclude that J(zv) ∈ L(Θv, Φv) is equivalent to the Fréchet derivative of fv : ΘvΦv. □

From Theorem 1 it follows that the complex sequence (15) is a Newton sequence. We can now apply the Newton-Kantorovich Theorem to study the sequence convergence as v → ∞.

3.1. Regions of Analyticity

The size of a polyellipse embedded in the domain Ψ and the magnitude of zvΘv (for any v0) directly impacts the accuracy of the sparse grid (c.f. Theorem 2). For each v0 the size of the domains Θv and Ψ will be characterized by the magnitude of the minimum singular value

σmin([JRvJIvJIvJRv])cv>0 (20)

for some cv > 0. However, constructing the domain Ψ would require imposing inequality conditions for each Newton iteration. This leads to a highly complex coupled problem that is hard to solve. Moreover, if the complex extension zv grows rapidly with respect to v then the size of the domain Ψ will be most likely severely constrained. In contrast, by applying the Newton-Kantorovich Theorem it is sufficient to impose conditions on the initial Jacobian (v = 0) to construct a region of analyticity for ΨN. Furthermore, the size of the iteration zv will be controlled.

Consider the iteration

αv+1=αvJ(αv,g)1f(αv,g), (21)

where gΨ,

α0[x00],  αv[zRvzIv],  J(αv,g)[JRvJIvJIvJRv], and f(αv,g)[fRvfIv],

for all v0.

Remark 9 From Lemma 1 or, alternatively, the Cauchy-Riemann equations, the matrix J(αv,g) corresponds to the Fréchet derivative of f(αv,g). Thus the sequence (21) is an Newton iteration and the Newton-Kantorovich Theorem can be used to analyze its convergence properties.

Assumption 2

For all qΓ Assumption 1 is satisfied.

Assumption 3

Assume that D˜, where DD˜, is an open convex set in 2m and the following Lipschitz condition is satisfied:

J(x,g)J(y,g)λexy,

for all x, yD˜, gΨ, and λe ≥ 0. Furthermore assume that for all gΨ

J(α0,g)1xe,
J(α0,g)1f(α0,g)δe,
he=2xeλeδe1,

and U(α0,te*)D˜, where te*=2he(11he)δe.

Theorem 5

If Assumption 3 is satisfied then for all gΨ

  1. The Newton iterates αv+1 = αv + J(αv, g)−1f(αv, g) exist and αv  U(α0,te*)D˜.

  2. α* := limv→∞ αv exists, α*U(α0,te*)¯, and f(α*, g) = 0.

  3. J(αv, g)−1 exists for all v and equation (20) is satisfied.

Proof From Theorem 1 and Remark 9 we have that the iterates αv+1 = αv + J(αv, g)−1f(αv, g) are a valid Newton sequence. In particular, J(αv, g) corresponds to the Fréchet derivative of f(αv, g). From Assumption 3 and the Newton-Kantorovich theorem [2] the result follows.

Remark 10 Recall from condition (ii) of Theorem 4 for each v the minimum singular value of the Jacobian matrix J(αv, g) is bounded by a constant cv > 0. The constant cv can be obtained from the proof of the Newton-Kantorovich theorem. See the details of the proof of Theorem 2.2.4 in [2]. In particular, the proof of this theorem shows the existence of a sequence of constants bv such that σmax(J(αv, g)−1) ≤ bv (Equation (2.2.21) on page 44). Note that the sequence b0,…,bv depends on the initial parameters ϰe, δe and λe , i.e. bv(ϰe, δe, λe).

Remark 11 From Assumptions 2 and 3 and from the fact that extended Newton iteration is a valid extension of the sequence (14) then we have that λeλ, ϰeϰ, δeδ, heh. This implies that te*t* and therefore U(x0,t*)U(α0,te*) for all gΨ (See Figure 4). From the Newton-Kantorovich Theorem it follows that that αvU(α0,te*) for all  v0.

Fig. 4.

Fig. 4

Region of convergence U(α0,te*) for the extended Newton iteration.

We can construct a region Ψ such that for all gΨ the extended Newton iteration converges. Let y=[yRyI]=[RegImg], and apply the multivariate Taylor theorem for each k = 1,…,n, l = 1,…,n entry of the Jacobian matrix JR(α0, g). Evaluating g at q + v, we have that

[JR(α0,q+vR,0+vI)]k,l=[JR(x0,q,0)]k,l+Rk,l(x0)][vRvI],

vR := Rev, vI := Im v, and

Rk,l(x0)=[Rk,l1(x0),,Rk,l2m(x0)].

The entries of the remainder term Rk,l(x0) are bounded by

|Rk,lβ(x0)|maxt(0,1)|yβ[JR(x0,[q0]+t[vRvI])]k,l|,

where refers to the derivative of the βth variable of the vector y. Form the matrix

E[0JI(x0,q,vR,vI)JI(x0,q,vR,vI)0]+[Q(x0,q,vR,vI)00Q(x0,q,vR,vI)]

and let

Qk,l(x0,q,vR,vI)=Rk,l(x0)[vRvI]

be the k = 1,…,n, l = 1,…,n entry of the matrix Q. Then

[JR(x0,q)JI(x0,q,vR,vI)JI(x0,q,vR,vI)JR(x0,q)]=J+E=J(I+J1E),

where J[JR(x0,q)00JR(x0,q)].

Theorem 6

Suppose that ϰeϰ and

E(α0,g)<1xxex

whenever gΨ then

J(α0,g)1xe.

Proof First note that

J(α0,g)1(J(I+J1E))1J1(I+J1E)1. (22)

From Lemma 2.2.3 in [13], if J1E2<1 then (I+J1E) is invertible and

(I+J1E)1<11J1E.

Given that J(x0,q)1x (From Assumption 1) whenever qΓ, it follows

J1I+J1E<x1J1E and J1EJ1ExE. (23)

We conclude that if

E(α0,g)<1xxex

whenever gΨ, then from Equations (22) and (23)

J(α0,g)1xe.

Applying the multivariate Taylor’s theorem for each k = 1,…,n, entry of the vector fR(α0, g) where g = q + v, we have

[fR(α0,q+vR,0+vI)]k=[fR(x0,q,0)]k+Sk(x0,q,0)][vRvI],

where vR := Rev, vI := Im v, and

Sk(x0,q,0)=[Sk1(x0,q,0),,Sk2m(x0,q,0)].

The remainder term Sk(x0, q) is bounded by

|Skβ(x0,q,0)|maxt(0,1)|yβ[fR([q0]+t[vRvI])]k|,

where refers to the derivative of the βth variable of y. We can now rewrite the vector f(α0, g) as

f(α0,g)=F(x0,q)+G(x0,q,vR,vI),

where F[fR(x0,q)0], G[P(x0,q,vR,vI)fI(x0,q,vR,vI)] and

Pk(x0,q,vR,vI)=Sk(x0,q,0)[vRvI].

Theorem 7

Suppose that ϰeϰ and δeδ. Then if

G(α0,g)<δexeδx

whenever gΨ, it follows

J(α0,g)1f(α0,g)δe.

Proof For each of the entries k = 1,…,n of the vector P.

J(α0,g)1f(α0,g)=(J+E)1(F+G)(I+J1E)J1(F+G)***(I+J1E)J1F+(I+J1E)J1G***I+J1EJ1F+I+J1EJ1G***I+J1Eδ+I+J1EGx. (24)

Since E2<1xxex<x (from Theorem 6) from Lemma 2.2.3 in [13] it follows that (I+J1E) is invertible and

(I+J1E)1<11J1Exex. (25)

Combining equations (24) and (25) we have

J(α0,g)1f(α0,g)<xex(δ+G(x0,q,vR,vI)x).

The result follows. □

From the values of ϰ, δ, λ and ϰe, δe, λe and Theorems 6 and 7, the region of analyticity Ψ can be constructed. From this region, convergence rates from Theorem 2 for the sparse grid interpolation can be estimated. If we are interested in forming a sparse grid for each of the entries of the vector xvn, then there are potentially n sparse grids. To estimate the convergence rate of the sequence of sparse grids it is sufficient to embed a polydisk Eσ1,,σNΨ for a suitable set of coefficients {σ1 …,σN}. Furthermore since αvU(α0,te*), the maximal coefficient M˜(zv) can be bounded as

M˜(zv)te*+x0l2(2m).

4. Application to power flow

The theory developed in Section 3 can be applied to the computation of the statistics of stochastic power flow. In particular we concentrate on the random perturbations of the generators, loads and admittance uncertainty of the transmission lines. Much of the power system network model presented in this section is based on [8].

Consider a network with m+1 mechanical constant power generators. The electrical power injected into the network at each generator is given by

PGk=l=0mVkVlsin(θkθl+φk,l)|Yk,l|, (26)

where the operands of the summation are the power from bus k transmitted to bus l through a line with admittance Yk,l = Gk,l + iBk,l, phase shift φk,l, and voltage Vk at the buses. These form the algebraic constraints of the power system. The dynamic constraints at generator k are given by

Mkθ¨i+Dkθ˙i+PGk=PMk+PIk(ω)+PLk(ω) (27)

where Mk is the moment of inertia of generator i, Dk is the damping factor, PMk denotes the mechanical power, PLk(ω) is the stochastic load and PIk(ω) is the intermittent stochastic power applied to bus i. Equations (26) and (27) constitute the swing equation model. Since the intermittent power generators and loads are stochastic, the rotor angle θk(ω) and power generation PGk(ω) will be stochastic as well. A simple example of a 3 bus power system is shown in Figure 5. From the steady state response the power flow equations are given by

Pk(x)=l=0mVkVl[Gikcos(θkθl)+Biksin(θkθl)]
Qk(x)=l=0mVkVl[Giksin(θkθl)+Bikcos(θkθl)]

for k = 0,…,m.

Fig. 5.

Fig. 5

2 generators, 3 buses, 1 load simple power system example. This figure is modified from [10]. Bus 1 is the slack bus. Bus 2 contains a stochastic generator. Bus 3 contains the random load. Note that voltages and power flows are in p.u.

It is assumed that at each node the active and reactive power injections (or loads) are given by P1,…,Pm and Q1,…,Qm. The first bus is assumed to be slack bus with known angle θ0 = 0 and fixed voltage V0.

Remark 12 According to power system convention, the numbering of the buses (nodes) starts with 1 instead of 0. To simplify the notation in this section we start from 0. However, for the examples and numerical results we revert to the power system standard.

In this paper we limit our discussion of power flow to the case where the power injections P1,…,Pm and Q1,…,Qm are assumed to be known, but could be stochastic. The unknowns are formed by the angles θ1,…,θm and voltages V1,…,Vm. The power flow equations are solved with a Newton iteration and posed as

θ[θ1θm],V[V1Vm],x[θV],f(x)=[ΔP(x)ΔQ(x)],

where

ΔP(x)[P1(x)P1Pm(x)Pm] and ΔQ(x)[Q1(x)Q1Qm(x)Qm].

The Jacobian matrix is given in block form as

J=[J11J12J21J22],

where J11, J12, J21, J22n×n. For k, l = 1,…,m let θk,l := θkθl and if kl

Jk,l11=VkVlGk,lsin(θk,l)Bk,l(cos(θk,l)),
Jk,l21=VkVlGk,lcos(θk,l)+Bk,l(sin(θk,l)),
Jk,l12=VkGk,lcos(θk,l)+Bk,l(sin(θk,l)),
Jk,l22=VkGk,lsin(θk,l)Bk,l(cos(θk,l)),

otherwise

Jk,k11=Qk(x)Bk,kVk2Jk,k21=Pk(x)Gk,kVk2Jk,k12=Pk(x)Vk+Gk,kVkJk,k22=Qk(x)VkBk,kVk.

It is clear from the structure of f and the Jacobian J that they are analytic everywhere except for Vk = 0, for k = 1,…,m. However, in practice the domain Θ0 is chosen such that the origin is avoided. Otherwise the analyticity assumptions of Theorem 4 are not satisfied.

There are many forms of uncertainty that can be present in the solution of the power flow equations. We concentrate on the following cases:

  • Random loads: The power loads Pk and Qk, k = 1,…,m, will be a function of the random vector qΓ:
    Pk+iQk=Pk0(1+ckqk)+Qk0(1+ck+1qk+1),

    where Pk0 and  Qk0 are the nominal power loads (or generators), qk ∈ [−1, 1] and ck,ck+1.

  • Random admittances: The transmission line admittances Yk,l will be functions of the random vector qΓ. Let A be the set of network index tuples (k, l) such that the admittance is stochastic. Thus for all k,lA let
    Yk,l=Gk,l+iBk,l=Gk,l0(1+ck,l,1qk,l,1)+Bk,l0(1+ck,l,2qk,l,2),
    where Gk,l0 and Bk,l0 are the nominal conductance and susceptance, qk,l,1, qk,l,2 ∈ [−1, 1] and ck,l,1,ck,l,2. Note that with a slight of abuse of notation the vector q consists of all the stochastic random variables {qk,l,1,qk,l,2}(k,l)A.

For sufficiently small coefficients ck, ck+1 with k = 1,…,m and ck,l,1, ck,l,2 for all tuples (k,l)A the assumptions of Theorems 5, 6 and 7 are satisfied for some initial condition x0 and thus we can justify the use of the sparse grids. Due to the extent of a detailed analysis of the size of these coefficients, it is left for a future work emphasizing the details of power systems. However, in Appendix A we present the case for random generators and loads.

We test the sparse grid approximation on the New England 39 Bus, 10 Generator, power system model provided from the Matpower 6.0 steady state simulator [21, 32]. In this model buses 1 – 29 are PQ buses, buses 30, 32–39 are generators and bus 31 is the reference (slack).

The following numerical examples indicate that the conclusions of the analyticity theorems we proved are valid i.e. algebraic and sub-exponential convergence of the stochastic norm with respect to the random variables. This is despite the conservative bounds placed on the analyticity region of Ψ from the Newton-Kantorovich Theorem. We expand on this point in the conclusion section.

Two tests are performed. We randomly perturb either the loads or the admittances of the transmission lines. The mean and variance of the voltage V22 at bus 22 are computed. The mean E[V22] and variance var[V22] are computed with the Clenshaw-Curtis isotropic Sparse Grid Matlab Kit [4, 30] for N = 2, 4, 12 dimensions and up to the w = 7 level. This last level, w = 7 is taken as the “true” solution. The errors are computed up to level w = 4 with respect to this solution. Two tests are performed:

  • Random loads: The loads are considered stochastic and are perturbed by up to ± 50% of their nominal value. For each k = 1,…,N, the so-called kth PQ bus is stochastically perturbed as
    Pk+iQk=Pk0(1+qk2)+Qk0(1+qk2),

    where Pk0 and Qk0 are the nominal power loads, qk ∈ [−1, 1], ρ(qk) has a uniform distribution and the random variables q1,…,qN are independent. Note that although the load random perturbations are independent, the power flows will be dependent on all the random variables q1,…,qN.

    In Figure 6 (a) & (b) the mean and variance convergence error for the stochastic voltage V22 of bus 22 are shown. A surrogate model based on the sparse grid operator is formed as Swm,g[V22] with Clenshaw-Curtis abscissas. Each of the circles corresponds to a sparse grid Swm,g starting with level w = 1 up to level w = 4. The y-axis corresponds to the error of the mean or variance. The x-axis is the number of sparse grid knots needed to form the grid Swm,g. The dimension of the sparse grid is given by N = 2, 4, 12.

    From Figure 6 (a) & (b) we observe that the error decreases faster than polynomially with respect to the number of knots η. As we increase the number w of levels, sub-exponential convergence is achieved. This is much faster than the η12 convergence rate of the Monte Carlo method. For example, for N = 12, then mean is computed approximately 1011 times faster for the same accuracy. This is the difference between 2 hours of computation on a simple 4 core processor and 20 million years with Monte Carlo. However, as the number of dimensions N increases the convergence rate of the sparse grid decreases, as predicted by Theorem 2. Moreover, if the level w is not large enough then the error bound gives algebraic convergence.

  • Random transmission line admittances: The admittances of the network are assumed to be random with
    Yk,l=Gk,l+iBk,l=Gk,l0(1+qk,l,12)+Bk,l0(1+qk,l,22),

    where Gk,l0 and Bk,l0 are the nominal conductance and susceptance. The coefficients qk,l,1, qk,l,2 ∈ [−1,1] have a uniform distribution and are all independent. Figure 6 (c) & (d) indicate sub-exponential convergence of the mean and variance of the voltage V22 at bus 22 for a sufficiently large number of knots. However, as the number of stochastic dimensions N increases to 12, the convergence rate decreases and almost approaches polynomial convergence. From Theorem 1 the sufficient condition w > N/log2 leads to subexponential convergence. For N = 12 we have that w has to be larger than 18 to guarantee sub-exponential convergence. In Figures (c) & (d) the largest level for w is 4.

Fig. 6.

Fig. 6

Sparse grid convergence rates. (a) & (b) Mean and variance error of the voltage V22 of bus 22 given a stochastic load perturbation with dimension N and the number of knots of the sparse grid. (c) & (d) Mean and variance of error of the voltage V22 of bus 22 given a random admittance with dimension N. Notice that for all 4 cases the convergence rates are faster than polynomial, indicating a sub-exponential convergence rate.

5. Conclusions

In this paper we have introduced ideas from UQ and numerical analysis, typically used in the field of stochastic PDEs, and applied them to non-linear stochastic networks. More specifically, these ideas are applied to the Newton iteration. We have developed a regularity analysis of the solution with respect to the random perturbations. Under sufficient conditions based on the Newton-Kantorovich Theorem there exists analytic extensions of the solution of the Newton iteration. These indicate that the application of sparse grids for the computation of the stochastic moments leads to sub-exponential or algebraic convergence. For a moderate number of dimensions the convergence rates are much faster than traditional Monte Carlo approaches (η12). In addition, numerical experiments applied to the power flow problem confirm these subexponential and algebraic convergence rates.

A weakness in the application of the Newton-Kantorovich Theorem is that it constricts the size of the region of analyticity Ψ, thus leading to a conservative convergence rate of the sparse grid. This motivates the application of less restrictive methods such as damped Newton iterates [5]. In addition, if we incorporate the assumption that all the Newton iterations converge for each of the knots of the space grid, then by developing an a posteriori method convergence rates can be further improved.

Future work includes the important application of this method to the security constrained problem [28] from the probabilistic perspective. In other words, given stochastic perturbations of the loads and sources what are the optimal power injections into the grid such that the probability of failure is below a tolerance level. Current approaches rely on simplifications of the stochastic perturbations to deal with the high dimensions. However, this can lead to suboptimal results. The high dimensional stochastic quadrature approach developed in this paper will allow more optimal results.

A Analyticity regions for random generators and loads

We examine the case of random generators and loads to obtain convergence rates of the sparse grid. The task is to synthesize an analyticity region Ψ for the extended Newton iteration to converge. We only check that the conditions of Theorem 7 are satisfied and assume that 6 is satisfied. A full analysis will be done in a future work. Thus, we synthesis the region of analyticity for Ψ by checking that

G(α0,g)<δexeδx,

whenever gΨ. Without loss of generality assume that the first τm buses contain stochastic power generators (or loads) with active power P1(ω) := (q1 + v1,R + iv1,I)c1 + a1,,…,Pτ(ω) := (qτ +vq,R+ivq,I)cτ +aτ and reactive power Q1(ω) := (qτ+1 +vτ+1,R + ivτ+1,I)cτ+1 +aτ+1,…, Qτ(ω) := (qN +vN,R +ivN,I)cN +aN. The random vector qΓ is assumed to be stochastic with joint distribution ρ(q) and for all k = 1,...,N vk := vk,R + ivk,I is the complex extension of each qkΓk. Furthermore, for k = 1,…,N the variables ak and ck+, where ak + ck and ak indicate the maximum and minimum range of the stochastic perturbation. Thus we have

f(α0)[P1(x0)P1(ω)Pτ(x0)Pτ(ω)Pτ+1(x0)Pτ+1Pm(x0)PmQ1(x0)Q1(ω)Qτ(x0)Qτ(ω)Qτ+1(x0)Qτ+1Qm(x0)Qm],
Re f(α0)=[P1(x0)(q1+v1,R)c1+a1Pτ(x0)(qτ+vq,R)cτ+aτPτ+1(x0)Pτ+1Pm(x0)PmQ1(x0)(qτ+1+vq+1,R)cτ+1+aτ+1Qτ(x0)(qN+vN,R)cN+aNQτ+1(x0)Qτ+1Qm(x0)Qm],
Im f(α0)=[v1,Ic1    vτ,Icτ  0    0  |vτ+1,Icτ+1    vN,IcN  0    0]T,

and therefore

PP=[(v1,R+v1,I)c1(vτ,R+vτ,I)cτ00],PQ=[(vτ+1,R+vτ+1,I)cτ+1(vN,R+vN,I)cN00],

P=[PPQQ] and thus G22=k=1N(vk,R+vk,I)2ck2+k=1Nvk,I2ck2. The last equality is due to the integral remainder form of Taylor’s Theorem. Let vk,R = vR/ck and vk,I = vI/ck for k = 1,…,2N, where vR, vI, then for any ϵ > 0

G22=NvR2+2NvI2+2NvRvI***vR2N(1+2ϵ)+vI2N(2+ϵ1/2)***(δexeδx)2.

The last inequality is obtained by using Cauchy’s inequality. Let γeδexeδx, thus the inequality

vR2α2+vI2β21, (28)

where α2γe2N(1+2ϵ) and β2γe2N(2+ϵ1/2), forms an elliptical region Σ (See Figure 7) such that ‖G2γe.

Fig. 7.

Fig. 7

Embedding of Bernstein ellipse Eσ in the domain Φ.

Consider the region Φ := {g = q+vR +ivI |q ∈ [−1,1], (vR, vI) ∈ Σ} where ‖G2γe is satisfied. Suppose that we want to embed a Bernstein ellipse Eσ in Φ. To achieve this consider the foci points of Eσ at −1 and 1. It is not hard to show that eσeσ2eσ+eσ21 for σ > 0. At the foci point ±1 trace the ellipse from Equation (28) and set β=eσeσ2 (See Figure 7).

Choose an ϵ > 0 such that Eσ is embedded in Φ by solving the following equation

αβ=4+ϵ12(1+2ϵ)

leading to

ϵ=(c2+4)12c+24c>0,

where c := (α/β)2 > 0. Pick ϵ > 0 such that c = (α/β)2 = 1. Pick σ > 0 such that eσeσ2=β. This leads to

σ=log(β+(β2+1)12)

and eσ+eσ21β=α. The region bounded by the ellipse Eσ is therefore embedded in Φ.

A polyellipse in CN can now be constructed such that such that ‖G2γe. Recall that vk,R = vR/ck and vk,I = vI/ck for k = 1,…,N and consider the regions Ψk := {g = q + vk,R + ivk,I |q ∈ [−1, 1], (vR, vI) ∈ Σ}. By following the procedure for embedding Eσ in Φ for each k = 1,…,N an ellipse Eϱk can be embedded in Ψk with

ϱklog(βck+(β2ck2+1)12).

The polyellipse εϱ1,,ϱNEϱ1××EϱN is embedded in Ψ1 × ⋯ × ΨN. Thus for any gεϱ1,,ϱN

G(α0,g)γe.

With the region bounded by the polyellipse εϱ1,,ϱN the convergence rate of the sparse grid can be estimated with respect to the magnitude of the coefficients ck, for k = 1,...,N.

Remark 13 From this analysis we observe that the size of the analyticity region of Ψ depends directly on

β=γe2N(2+ϵ1/2),

where ϵ=5+14. The size of the region Ψ decays as square root with respect to the number of stochastic dimensions N, thus reducing the convergence rate of the sparse grid.

Example 1 Consider the 3 Bus simple power system with stochastic load and generator based on Example 10.6 in [8] and Figure 5. Bus 1 is the slack bus with Inline graphic. Bus 2 voltage is fixed as V2 = 1.05 p.u. and contains a stochastic generator PE = q1c1 + a1, where q1 ∈ [−1, 1] with a1 = 0.6661 p.u. and c1+, i.e. the generator is random within the range 0.6661±c1 . Bus 3 contains the random load with PL+iQL = (q2c2+a2)+i(q3c3+a3), where q2,q3 ∈ [−1, 1] with a2 = 2.8653 p.u., a3 = 1.2244 p.u. and c2, c3+, i.e. the load is random within the range 2.8653 ± c2 of the active power and 1.2244 ± c3 for the reactive power. The admittance matrix is to the network is

Y_{\mathrm{bus}} \approx i \begin{bmatrix} -20 & 10 & 10 \\ 10 & -20 & 10 \\ 10 & 10 & -20 \end{bmatrix}.

The vector of unknowns is $x := [\theta_2, \theta_3, V_3]^T$ and $f(x, q) = [P_1(x) - (q_1 c_1 + a_1),\, P_2(x) - (q_2 c_2 + a_2),\, P_3(x) - (q_3 c_3 + a_3)]^T$ for all $q \in \Gamma$, and therefore $f(\alpha, g) = [P_1(\alpha) - ((q_1 + v_{1,R} + iv_{1,I})c_1 + a_1),\, P_2(\alpha) - ((q_2 + v_{2,R} + iv_{2,I})c_2 + a_2),\, P_3(\alpha) - ((q_3 + v_{3,R} + iv_{3,I})c_3 + a_3)]^T$ for all $g \in \Psi$. With $q = 0$ the Newton algorithm converges in 20 iterations to $x^* = [-5.2361 \times 10^{-2}, -1.7445 \times 10^{-1}, 0.9500]^T$ with a $10^{-15}$ tolerance. Furthermore,

J^{-1}(x^*) = \begin{pmatrix} 0.065427 & 0.033847 & 0.0013531 \\ 0.033847 & 0.07112 & 0.011273 \\ 0.001284 & 0.010697 & 0.065493 \end{pmatrix} \quad \text{and} \quad \varkappa = \|J^{-1}(x^*)\|_2 = 0.1043.

Since there are no direct stochastic components ($q_1$, $q_2$, $q_3$) in the Jacobian matrix, we have

J(x) = \begin{bmatrix} 10.5\left(\cos\theta_2 + V_3\cos(\theta_2 - \theta_3)\right) & -10.5\,V_3\cos(\theta_2 - \theta_3) & 10.5\sin(\theta_2 - \theta_3) \\ -10.5\,V_3\cos(\theta_3 - \theta_2) & 10\,V_3\cos\theta_3 + 10.5\,V_3\cos(\theta_3 - \theta_2) & 10.5\sin\theta_3 + 10.5\sin(\theta_3 - \theta_2) \\ -10.5\,V_3\sin(\theta_3 - \theta_2) & 10.5\,V_3\left(\sin\theta_3 + \sin(\theta_3 - \theta_2)\right) & -\left(10\cos\theta_3 + 10.5\cos(\theta_3 - \theta_2) - 39.96\,V_3^2\right) \end{bmatrix}.
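As a cross-check of the reported values, the sketch below evaluates the Jacobian above (as transcribed here) at $x^* = [-5.2361\times 10^{-2}, -1.7445\times 10^{-1}, 0.9500]^T$ and computes $\|J^{-1}(x^*)\|_2$, which should be close to the value $\varkappa = 0.1043$ quoted above. This is only a verification sketch, not the power flow solver used in the paper.

```python
import numpy as np

def jacobian(x):
    """Jacobian of the 3-bus mismatch equations, transcribed from the matrix above."""
    th2, th3, V3 = x
    return np.array([
        [10.5 * (np.cos(th2) + V3 * np.cos(th2 - th3)),
         -10.5 * V3 * np.cos(th2 - th3),
         10.5 * np.sin(th2 - th3)],
        [-10.5 * V3 * np.cos(th3 - th2),
         10.0 * V3 * np.cos(th3) + 10.5 * V3 * np.cos(th3 - th2),
         10.5 * np.sin(th3) + 10.5 * np.sin(th3 - th2)],
        [-10.5 * V3 * np.sin(th3 - th2),
         10.5 * V3 * (np.sin(th3) + np.sin(th3 - th2)),
         -(10.0 * np.cos(th3) + 10.5 * np.cos(th3 - th2) - 39.96 * V3**2)],
    ])

x_star = np.array([-5.2361e-2, -1.7445e-1, 0.9500])
kappa = np.linalg.norm(np.linalg.inv(jacobian(x_star)), 2)  # spectral norm of J^{-1}(x*)
print(kappa)   # expected to be approximately 0.1043
```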

From the mean value theorem we have that

\lambda = \sum_{k,l=1}^{m} \|J_{k,l}(x)\|_{L^\infty(D \times \Gamma)} < \infty,

where $D$ is a bounded set and $J_{k,l}(x)$ is the entry in the $k$th row and $l$th column of the Jacobian matrix $J(x)$.

With the initial condition $x_0 = x^*(0)$, for small enough coefficients $c_1$, $c_2$ and $c_3$ we have that:

  1. $\|J^{-1}(x_0)\| \le \varkappa = 0.1043$.

  2. Furthermore, since $f$ is continuous and $f(x_0, 0) = 0$, then $\|J^{-1}(x_0) f(x_0, q)\| \le \delta < \infty$, where $\delta$ can be made arbitrarily small by taking $c_1$, $c_2$, $c_3$ sufficiently small.

  3. It follows that $h < 1$ for all $x \in \overline{B(x^*(0), t^*)} \subset D$, where $t^* = \frac{2}{h}\left(1 - \sqrt{1 - h}\right)\delta$.

  4. From the Newton–Kantorovich theorem the iteration converges for all $q \in \Gamma$ and

\|V_3(q)\|_{L^\infty(\Gamma)} \le \sup_{x \in \overline{B(x_0, t^*)}} |x[3]| \le t^* + \|x^*(0)\|_{l^2(\mathbb{R}^m)}.

Now, pick $\delta_e > 0$ such that $\delta < \delta_e$, and also pick $\varkappa_e = \varkappa$, $\lambda_e = \lambda$ such that $h < h_e \le 1$. From the random load analysis we have $\gamma_e := \delta_e \varkappa_e^{-1} - \delta \varkappa^{-1}$ and

\|G(\alpha_0, g)\|_2 \le \gamma_e

for all $g \in \mathcal{E}_{\varrho_1, \varrho_2, \varrho_3}$, where

\beta = \left(\frac{1 + \sqrt{5}}{6(2 + \sqrt{5})}\right)^{1/2} \gamma_e

and for k = 1,…,3

\varrho_k := \log\left(\frac{\beta}{c_k} + \left(\frac{\beta^2}{c_k^2} + 1\right)^{1/2}\right).
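For concreteness, $\beta$ and the $\varrho_k$ of this example are straightforward to evaluate once $\gamma_e$ and the perturbation magnitudes $c_k$ are fixed; the numerical values below are placeholders, since the paper leaves $\gamma_e$ and $c_1, c_2, c_3$ as free parameters.

```python
import numpy as np

gamma_e = 0.05                          # placeholder for delta_e * kappa_e^{-1} - delta * kappa^{-1}
c = np.array([0.05, 0.05, 0.05])        # placeholder magnitudes c_1, c_2, c_3

beta = np.sqrt((1.0 + np.sqrt(5.0)) / (6.0 * (2.0 + np.sqrt(5.0)))) * gamma_e
rho = np.log(beta / c + np.sqrt((beta / c)**2 + 1.0))
print(beta, rho)                        # parameters rho_1, rho_2, rho_3 of the polyellipse
```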

Assuming that the hypotheses of Theorem 6 are satisfied, it follows from Theorem 7 that whenever $g \in \Psi$,

\|J(x_0)^{-1} f(\alpha_0, g)\| \le \delta_e,

the Newton iteration converges and its limit is holomorphic in $\mathcal{E}_{\varrho_1, \varrho_2, \varrho_3} \subset \Psi^3$.

Acknowledgments

This material is based upon work supported by the National Science Foundation under Grant No. 1736392. Research reported in this technical report was supported in part by the National Institute of General Medical Sciences (NIGMS) of the National Institutes of Health under award number 1R01GM131409–01.

References

  • 1. Ablowitz MJ, Fokas AS (2003) Complex Variables: Introduction and Applications, 2nd edn. Cambridge University Press, Cambridge, UK
  • 2. Argyros IK (2008) Convergence and Applications of Newton-type Iterations. Springer
  • 3. Babuska I, Nobile F, Tempone R (2010) A stochastic collocation method for elliptic partial differential equations with random input data. SIAM Review 52(2):317–355, DOI 10.1137/100786356
  • 4. Bäck J, Nobile F, Tamellini L, Tempone R (2011) Stochastic spectral Galerkin and collocation methods for PDEs with random coefficients: A numerical comparison. In: Hesthaven JS, Rønquist EM (eds) Spectral and High Order Methods for Partial Differential Equations, Lecture Notes in Computational Science and Engineering, vol 76, Springer, Berlin Heidelberg, pp 43–62
  • 5. Bank RE, Rose DJ (1982) Analysis of a multilevel iterative method for nonlinear finite element equations. Mathematics of Computation 39(160):453–465
  • 6. Barthelmann V, Novak E, Ritter K (2000) High dimensional polynomial interpolation on sparse grids. Advances in Computational Mathematics 12:273–288
  • 7. Beck J, Nobile F, Tamellini L, Tempone R (2014) Convergence of quasi-optimal stochastic Galerkin methods for a class of PDEs with random coefficients. Computers & Mathematics with Applications 67(4):732–751, DOI 10.1016/j.camwa.2013.03.004
  • 8. Bergen AR, Vittal V (2000) Power Systems Analysis, 2nd edn. Pearson/Prentice Hall
  • 9. Castrillón-Candás JE, Nobile F, Tempone R (2016) Analytic regularity and collocation approximation for PDEs with random domain deformations. Computers and Mathematics with Applications 71(6):1173–1197
  • 10. Fiandrino C (2013) How can I do a power electric system in circuitikz? tex.stackexchange.com/questions/145197/how-can-i-do-a-power-electric-system-in-circuitikz
  • 11. Fishman GS (1996) Monte Carlo: Concepts, Algorithms, and Applications. Springer Series in Operations Research, Springer, New York, URL http://opac.inria.fr/record=b1079070
  • 12. Gerstner T, Griebel M (2003) Dimension-adaptive tensor-product quadrature. Computing 71(1):65–87
  • 13. Golub GH, Van Loan CF (1996) Matrix Computations, 3rd edn. Johns Hopkins University Press, Baltimore, MD, USA
  • 14. Gunning R, Rossi H (1965) Analytic Functions of Several Complex Variables. American Mathematical Society
  • 15. Hockenberry JR, Lesieutre BC (2004) Evaluation of uncertainty in dynamic simulations of power system models: The probabilistic collocation method. IEEE Transactions on Power Systems 19(3)
  • 16. Holst M (1994) The Poisson-Boltzmann Equation: Analysis and Multilevel Numerical Solution, 1st edn. Applied Mathematics and CRPC, California Institute of Technology
  • 17. Tang J, Ni F, Ponci F, Monti A (2015) Dimension-adaptive sparse grid interpolation for uncertainty quantification in modern power systems: Probabilistic power flow. IEEE Transactions on Power Systems 19
  • 18. Klimke A (2007) Sparse Grid Interpolation Toolbox – User's Guide. Tech. Rep. IANS report 2007/017, University of Stuttgart
  • 19. Klimke A, Wohlmuth B (2005) Algorithm 847: spinterp: Piecewise multilinear hierarchical sparse grid interpolation in MATLAB. ACM Transactions on Mathematical Software 31(4)
  • 20. Krantz SG (1992) Function Theory of Several Complex Variables. AMS Chelsea Publishing, Providence, Rhode Island
  • 21. Murillo-Sánchez CE, Zimmerman RD, Anderson CL, Thomas RJ (2013) Secure planning and operations of systems with stochastic sources, energy storage, and active demand. IEEE Transactions on Smart Grid 4(4):2220–2229
  • 22. National Academies of Sciences, Engineering, and Medicine (2016) Analytic Research Foundations for the Next-Generation Electric Grid. The National Academies Press, Washington, DC, DOI 10.17226/21919
  • 23. Nobile F, Tempone R (2009) Analysis and implementation issues for the numerical approximation of parabolic equations with random coefficients. International Journal for Numerical Methods in Engineering 80(6–7):979–1006, DOI 10.1002/nme.2656
  • 24. Nobile F, Tempone R, Webster C (2008) An anisotropic sparse grid stochastic collocation method for partial differential equations with random input data. SIAM Journal on Numerical Analysis 46(5):2411–2442
  • 25. Nobile F, Tempone R, Webster C (2008) A sparse grid stochastic collocation method for partial differential equations with random input data. SIAM Journal on Numerical Analysis 46(5):2309–2345
  • 26. Nobile F, Tamellini L, Tempone R (2016) Convergence of quasi-optimal sparse-grid approximation of Hilbert-space-valued functions: application to random elliptic PDEs. Numerische Mathematik 134(2):343–388, DOI 10.1007/s00211-015-0773-y
  • 27. Prempraneerach P, Hover F, Triantafyllou M, Karniadakis G (2010) Uncertainty quantification in simulations of power systems: Multi-element polynomial chaos methods. Reliability Engineering & System Safety 95(6):632–646, DOI 10.1016/j.ress.2010.01.012
  • 28. Roald L, Oldewurtel F, Krause T, Andersson G (2013) Analytical reformulation of security constrained optimal power flow with probabilistic constraints. In: 2013 IEEE Grenoble Conference, pp 1–6, DOI 10.1109/PTC.2013.6652224
  • 29. Smolyak S (1963) Quadrature and interpolation formulas for tensor products of certain classes of functions. Soviet Mathematics, Doklady 4:240–243
  • 30. Tamellini L, Nobile F (2009–2015) Sparse Grids Matlab Kit. http://csqi.epfl.ch/page-107231-en.html
  • 31. Trefethen LN (2012) Approximation Theory and Approximation Practice (Other Titles in Applied Mathematics). Society for Industrial and Applied Mathematics, Philadelphia, PA, USA
  • 32. Zimmerman RD, Murillo-Sánchez CE, Thomas RJ (2011) MATPOWER: Steady-state operations, planning, and analysis tools for power systems research and education. IEEE Transactions on Power Systems 26(1):12–19
