Fast orthogonal transforms and generation of Brownian paths

Gunther Leobacher

doi:10.1016/j.jco.2011.11.003

. 2012 Apr;28(2):278–302. doi: 10.1016/j.jco.2011.11.003

Fast orthogonal transforms and generation of Brownian paths

Gunther Leobacher ¹

PMCID: PMC3587409 PMID: 23471545

Abstract

We present a number of fast constructions of discrete Brownian paths that can be used as alternatives to principal component analysis and Brownian bridge for stratified Monte Carlo and quasi-Monte Carlo. By fast we mean that a path of length $n$ can be generated in $O (n log (n))$ floating point operations. We highlight some of the connections between the different constructions and we provide some numerical examples.

Keywords: Fast path generation, Fast orthogonal transform, Variance reduction, Quasi-Monte Carlo

Highlights

► Linear constructions of Brownian paths correspond to orthogonal transforms. ► Some orthogonal transforms enhance quasi-Monte Carlo. ► Principal component construction can be approximated by fast cosine transform. ► Orthogonal transforms can also enhance simulation of Levy paths.

1. Orthogonal transforms and Brownian paths

There are several constructions that are frequently used to construct discrete Brownian paths, by which we mean a random function $B$ on a given set ${t_{1}, \dots, t_{n}} \subseteq R$ , $0 < t_{1} < \dots < t_{n} \leq 1$ , so that $B = (B_{t_{1}}, \dots, B_{t_{n}})$ is a Gaussian vector with mean zero and covariance matrix

{(min (t_{j}, t_{k}))}_{j, k = 1}^{n} = (\begin{matrix} t_{1} & t_{1} & t_{1} & \dots & t_{1} \\ t_{1} & t_{2} & t_{2} & \dots & t_{2} \\ t_{1} & t_{2} & t_{3} & \dots & t_{3} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ t_{1} & t_{2} & t_{3} & \dots & t_{n} \end{matrix}) .

The case where the $t_{j}$ are evenly spaced is the most important one from the practical point of view. In that case the covariance matrix equals

{(\frac{1}{n} min (j, k))}_{j, k = 1}^{n} = \frac{1}{n} (\begin{matrix} 1 & 1 & 1 & \dots & 1 \\ 1 & 2 & 2 & \dots & 2 \\ 1 & 2 & 3 & \dots & 3 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & 2 & 3 & \dots & n \end{matrix}) .

Throughout the paper this matrix will be denoted by $Σ^{(n)}$ or, if there is no danger of confusion, simply by $Σ$ .

The arguably most straightforward method (a.k.a. forward method or step-by-step method) is to compute the cumulative sum of $n$ independent normal variables of mean zero and variance $\frac{1}{n}$ . All constructions we present in this article are equivalent to this simple method from the probabilistic point of view.

However, there are refined simulation methods, for example stratified sampling (cf. [10]) and quasi-Monte Carlo methods (see [19]), which achieve higher convergence rates for some problems. Those techniques have in common that they require the identification of more important and less important input variables. For many problems the straightforward method does not provide this.

For this reason alternatives to the forward construction are frequently used, the Brownian bridge (BB) construction (a.k.a. Lévy–Ciesielski construction or midpoint displacement) and the principal component analysis (PCA) construction (a.k.a. singular value construction). The first use of the BB construction in finance is due to [18], the first use of PCA for financial applications was presented in [2], both with dramatic improvement of convergence rates.

It has been mentioned by Papageorgiou [20] that in fact any decomposition $A A^{T} = Σ$ provides a construction for a discrete approximation of a Brownian path via $Y = A X$ , where $X$ is a standard normal vector. In that context, the forward construction corresponds to the Cholesky decomposition of $Σ$ , $Σ = S S^{⊤}$ , where $S$ is the summation operator

S = \frac{1}{\sqrt{n}} (\begin{matrix} 1 & 0 & \dots & 0 \\ 1 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & 1 & \dots & 1 \end{matrix}) .

(1)

PCA corresponds to $A = V D$ , where $Σ = V D^{2} V^{⊤}$ is the singular value decomposition of $Σ$ . A corresponding decomposition for the BB algorithm is given, for example, in [15].

However, Papageorgiou [20] notes that there are examples where BB and PCA are not giving better results than the forward method in connection with quasi-Monte Carlo. He further shows that the worst case error of integration for a certain class of payoff functions is independent of the path construction. Thus in the sense of worst case error all decompositions are equivalent.

This has been investigated further by Sloan and Wang [24]. The authors show another equivalence principle which roughly states that every decomposition is equally bad and good for QMC, depending on the function that one wants to integrate. For every decomposition $A$ that is good for one payoff function $f$ , and every decomposition $\tilde{A}$ there is another payoff function $\tilde{f}$ for which $\tilde{A}$ is equally good.

It is therefore prudent to tailor the decomposition to the problem at hand. This is done, for example, by Imai and Tan [12].

While the possible decompositions of $Σ$ provide a clean framework for the study of algorithms for generation of Brownian paths, they are of limited practical value because the matrix-vector multiplication is comparatively slow for all but very small values of $n$ . This is the case since general matrix-vector multiplication uses $O (n^{2})$ floating point operation (flops), while the forward method and the Brownian bridge use only $O (n)$ flops.

Until recently this has been considered a serious disadvantage of PCA as well, cf. [10]. Yet it has been shown by Scheicher [23], using results from Åkesson and Lehoczky [1], that PCA can be computed using the fast sine transform, thereby using $O (n log (n))$ flops.

While the importance of the proper choice of the decomposition $A A^{⊤} = Σ$ for problems arising from quasi-Monte Carlo pricing of financial derivatives is stressed by a number of authors, see e.g. [24,12], there has been a lack of alternatives to the three aforementioned constructions that allow for fast matrix-vector multiplication, by which we mean generation of a path using at most $O (n log (n))$ flops.

The present paper aims to narrow this gap by providing a number of fast constructions of Brownian paths that can be used as alternatives to the three constructions presented above. We know from Sloan and Wang [24] that every one of those constructions will have some payoff functions for which they are especially well suited (and some other payoff functions for which they are ill suited).

The practitioner who is willing to use alternative constructions of Brownian paths therefore is presented with the following alternatives: she might want to tailor a (slow) construction to her special problem in the spirit of Imai and Tan [12], or alternatively, she might want to find out which of the fast constructions presented here is relatively well suited to her problem. Which choice is the better will depend on the special problem. But if one of the fast constructions is reasonably close to the optimal construction, its use will reduce the computing time considerably, with essentially the same error.

Papageorgiou [20] observed that there is a one-to-one correspondence between constructions of Brownian paths and orthogonal transforms. We present his simple theorem here since it is essential for all of our constructions.

Theorem 1.1 Papageorgiou —

If $Σ = S S^{⊤}$ is the Cholesky decomposition of $Σ$ , then any orthogonal transformation $T$ on $R^{n}$ defines a decomposition $Σ = S T {(S T)}^{⊤}$ .

Conversely, for every $n \times n$ -matrix $A$ with $Σ = A A^{⊤}$ there exists an orthogonal transform $T$ such that $A = S T$ .

Proof

For any orthogonal transform $T$ we have $T^{⊤} = T^{- 1}$ , such that $S T {(S T)}^{⊤} = S T T^{⊤} S^{⊤} = S S^{⊤}$ .

On the other hand, $S$ is invertible, so that for $T = S^{- 1} A$ we have $A = S T$ and

$T T^{⊤} = S^{- 1} A A^{⊤} {(S^{- 1})}^{⊤} = S^{- 1} Σ {(S^{- 1})}^{⊤} = S^{- 1} S S^{⊤} {(S^{⊤})}^{- 1} = id,$

such that $T$ is orthogonal. □

Note that by “orthogonal” we mean that $T$ preserves lengths and angles. Strictly speaking such transforms are “orthonormal”, but the term “orthogonal” is more common.

Matrix-vector multiplication with $S$ and $S^{- 1}$ can be done in linear time. Searching for decompositions of $Σ$ which admit fast matrix-vector multiplication is therefore equivalent to searching for orthogonal matrices that allow for matrix-vector multiplication using at most $O (n log (n))$ flops.

In this paper we consider a variety of mostly well-known orthogonal transforms that can be computed using at most $O (n log n)$ flops.

Any of those constructions can be modified by combining them with permutations, which do not require any flops at all. Also the multiplication of two or more orthogonal matrices is orthogonal, and the product matrix admits fast matrix-vector multiplications if all of the factors admit this, provided the number of factors is small compared to the number of dimensions.

The remainder of the paper is structured as follows: Section 2 reviews the principal component analysis (PCA) construction and the general Brownian bridge (BB) construction, whereby we give some generalizations like interpolation and $m$ -step construction, that allow the combination of BB with PCA and other fast constructions.

In the main Section 3 we present a number of fast orthogonal transforms derived from fast Fourier transform (FFT) type constructions. A special emphasis is on the fast cosine transform which, as it turns out, corresponds to a construction which is close to PCA. The proof of proximity is one of the main results of our paper.

Section 4 contains some fast constructions that do not fit the previous schemes, and connections to previous methods are highlighted.

Some numerical examples are computed in Section 5 to illustrate the advantages of some of the proposed methods. Section 6 concludes.

Since a large portion of the paper consists of listing fast construction methods, it is probably of interest to the reader to know which of those constructions have previously been known and which are new. This shall be done here, though we need to tread carefully: due to the vast applicability of Brownian motion and simulation we concede that it is too easy to overlook even significant contributions if they are related to a field foreign to the author and maybe even use different terminology.

While the first construction of PCA in Section 2.1 is due to Scheicher [23], the second construction using the corresponding orthogonal transform is new. The idea of using the Brownian bridge for interpolation to generalize an equal-step generation method to non-equal steps seems to have been previously unknown, cf. Section 2.3, and the same holds for the general $m$ -step forward and BB methods.

To the authors best knowledge, none of the orthogonal transforms presented in Section 3 have previously been used for the (fast) construction of Brownian paths, with the sole exception of Scheicher’s use of the DST-I to calculate the PCA. But it is worth noting that if one uses DST-I as the underlying orthogonal transform, cf. Theorem 1.1, then one obtains a construction different from Scheicher’s. Nevertheless all of those transforms are of course well known, as are their respective properties. The fact that the construction obtained from DCT-IV is close to PCA is entirely new and consequently so is the idea to use this construction as a faster substitute for PCA when constructing Lévy paths.

The facts from Section 4 seem to be folklore to a large extent. However, the construction obtained from the Haar transform is new and it seems that it has not been observed previously that the Kronecker product can be used to combine different (fast) construction methods for Brownian paths.

2. PCA and Brownian bridge

2.1. PCA construction

Åkesson and Lehoczky [1] showed that for $k = 1, \dots, n$ the $k$ -th eigenvalue and eigenvector of the matrix $Σ$ are given by

λ_{k} = {(4 n {sin}^{2} (\frac{2 k - 1}{2 n + 1} \frac{π}{2}))}^{- 1}

and $v_{k} = {(v_{k, 1}, \dots, v_{k, n})}^{⊤}$ where

v_{k, j} = \frac{2}{\sqrt{2 n + 1}} sin (\frac{(2 k - 1) j}{2 n + 1} π), j = 1, \dots, n,

respectively. Therefore $Σ = V D {(V D)}^{⊤}$ , where $V$ is the matrix

V = (v_{1}, \dots, v_{n}),

that is, $V_{j k} = v_{k, j}$ ,

{(V x)}_{j} = \sum_{k = 1}^{n} \frac{2}{\sqrt{2 n + 1}} sin (\frac{(2 k - 1) j}{2 n + 1} π) x_{k}

and $D$ is the diagonal matrix that has $λ_{1}^{1 / 2}, \dots, λ_{n}^{1 / 2}$ as its diagonal elements.

It has been observed by Scheicher [23] that $v_{i}$ is essentially the $(2 i - 1)$ -th basis function of the discrete sine transform (DST-I) in dimension $2 n$ : Recall the definition of DST-I (provided Section 3.3) on the set ${1, \dots, N}$ ,

M_{DST-I} (y) ≔ {(\sqrt{\frac{2}{N + 1}} \sum_{k = 1}^{N} y_{k} sin (\frac{k j}{N + 1} π))}_{j = 1}^{N} .

Therefore $V = \sqrt{2} P M_{DST-I} Q$ , where $P$ is the projection of a $2 n$ -vector onto its first $n$ elements, $M_{DST-I}$ is DST-I in dimension $N = 2 n$ and $Q$ is the mapping

Q x ≔ (x_{1}, 0, x_{2}, 0, \dots, x_{n}, 0),

{(M_{DST-I} Q x)}_{j} = \sqrt{\frac{2}{N + 1}} \sum_{k = 1}^{n} x_{k} sin (\frac{(2 k - 1) j}{N + 1} π) .

Multiplication with $D, Q$ and $P$ can be done in linear time while multiplication with $M_{DST-I}$ takes $O (2 n log (2 n))$ flops (see [26]), therefore the generation of a Brownian path using PCA can be done using $O (n log (n))$ flops.

Another fast way to construct the PCA is to calculate the corresponding orthogonal transform $T_{PCA} ≔ S^{- 1} D V$ and compute the PCA via $S T_{PCA}$ .

Theorem 2.1

$T_{PCA} = \sqrt{2} \hat{P} {\hat{M}}_{DCT - III} \hat{Q},$

where ${\hat{M}}_{DCT - III}$ is the discrete cosine transform $DCT - III$ in dimension $2 n + 1$ ,

$\hat{Q} x ≔ (0, x_{1}, 0, x_{2}, 0, \dots, x_{n}, 0),$

and $\hat{P}$ is the projection of a $2 n + 1$ -vector onto its first $n$ elements.

Proof

For $x \in R^{n}$

${(D V x)}_{j} = \sum_{k = 1}^{n} \frac{1}{\sqrt{n (2 n + 1)}} {(sin (\frac{2 k - 1}{2 n + 1} \frac{π}{2}))}^{- 1} sin (\frac{2 k - 1}{2 n + 1} j π) x_{k}$

and for $y \in R^{n}$

${(S^{- 1} y)}_{j} = {\begin{cases} \sqrt{n} y_{1} & j = 1 \\ \sqrt{n} (y_{j} - y_{j - 1}) & 2 \leq j \leq n . \end{cases}$

Therefore

${(S^{- 1} D V x)}_{j} = \sum_{k = 1}^{n} \frac{1}{\sqrt{2 n + 1}} {(sin (\frac{2 k - 1}{2 n + 1} \frac{π}{2}))}^{- 1} (sin (\frac{2 k - 1}{2 n + 1} j π) - sin (\frac{2 k - 1}{2 n + 1} (j - 1) π)) x_{k} .$

But using the addition theorem for the sine we get

$sin (\frac{2 k - 1}{2 n + 1} j π) - sin (\frac{2 k - 1}{2 n + 1} (j - 1) π) = sin (\frac{2 k - 1}{2 n + 1} (j - \frac{1}{2} + \frac{1}{2}) π) - sin (\frac{2 k - 1}{2 n + 1} (j - \frac{1}{2} - \frac{1}{2}) π) = 2 cos (\frac{2 k - 1}{2 n + 1} (j - \frac{1}{2}) π) sin (\frac{2 k - 1}{2 n + 1} \frac{π}{2}),$

such that

${(S^{- 1} D V x)}_{j} = \sqrt{2} \sum_{k = 1}^{n} \sqrt{\frac{2}{2 n + 1}} cos (\frac{2 k - 1}{2 n + 1} (j - \frac{1}{2}) π) x_{k} .$

Comparing this with the definition in [26] of the DCT-III transform finishes the proof. □

2.2. General Brownian bridge

Suppose $Z = (Z_{1}, \dots, Z_{n})$ is a vector of independent standard normal variables.

Consider real numbers $0 = t_{0} < t_{1} < \dots < t_{n} = 1$ . We want to construct a discrete Brownian path $(B_{t_{1}}, \dots, B_{t_{n}})$ , i.e. a Gaussian vector with covariance matrix ${(min (t_{j}, t_{k}))}_{j, k}$ .

The so-called forward construction solves this problem in the following way: $B_{t_{1}} ≔ \sqrt{t_{1}} Z_{1}$ and $B_{t_{k + 1}} ≔ B_{t_{k}} + \sqrt{t_{k + 1} - t_{k}} Z_{k + 1}$ for $k = 1, \dots, n - 1$ . It is easy to check that the vector $(B_{t_{1}}, \dots, B_{t_{n}})$ constructed in this way has indeed the required covariance matrix. The construction requires $O (n)$ flops (note that in a (quasi-)Monte Carlo simulation with many scenarios the values $\sqrt{t_{k + 1} - t_{k}}$ , $k = 1, \dots, n - 1$ need to be computed only once).

An alternative construction is the well-known Brownian bridge, which we will repeat for the convenience of the reader.

Suppose the elements of $(B_{t_{1}}, \dots, B_{t_{n}})$ should be computed in the order $B_{t_{π (1)}}, B_{t_{π (2)}}, \dots, B_{t_{π (n)}}$ for some permutation $π$ of $n$ elements. Consequently, in computing $B_{t_{π (j)}}$ we need to take into account the previously computed elements. Fortunately at most two of those are of relevance, the one next to $π (j)$ on the left and the one next to $π (j)$ on the right.

Formally define for every $j \in {1, \dots, n}$ two sets,

L (j) ≔ {k : k < π (j) and π^{- 1} (k) < j}

R (j) ≔ {k : k > π (j) and π^{- 1} (k) < j} .

That is, $L$ contains all the indices $k$ that are smaller than $π (j)$ and for which $B_{t_{k}}$ has already been constructed and $R$ contains all the indices $k$ that are greater than $π (j)$ and for which $B_{t_{k}}$ has already been constructed. Now define

l (j) ≔ {\begin{cases} 0 & if L_{j} = 0̸ \\ max L_{j} & if L_{j} \neq 0̸ \end{cases}

r (j) ≔ {\begin{cases} \infty & if R_{j} = 0̸ \\ min R_{j} & if R_{j} \neq 0̸ \end{cases}

and set $B_{t_{0}} = 0$ ,

B_{t_{π (j)}} ≔ {\begin{cases} B_{t_{l (j)}} + \sqrt{t_{π (j)} - t_{l (j)}} Z_{j} & if r (j) = \infty \\ \frac{t_{r (j)} - t_{π (j)}}{t_{r (j)} - t_{l (j)}} B_{t_{l (j)}} + \frac{t_{π (j)} - t_{l (j)}}{t_{r (j)} - t_{l (j)}} B_{t_{r (j)}} + \sqrt{\frac{(t_{π (j)} - t_{l (j)}) (t_{r (j)} - t_{π (j)})}{t_{r (j)} - t_{l (j)}}} Z_{j} & if r (j) < \infty . \end{cases}

It is straightforward to check that the vector $(B_{t_{1}}, \dots, B_{t_{n}})$ constructed in that way has again covariance matrix ${(min (t_{j}, t_{k}))}_{j, k}$ . The functions $l$ and $r$ , as well as the factors of $B_{t_{l (j)}}$ , $B_{t_{r (j)}}$ , $Z_{j}$ , do not depend on the random vector $Z$ so their computation needs to be done only once. In some special cases the functions $l$ and $r$ can be computed explicitly, for example if the $π (t_{j})$ are the first $n$ elements of the van der Corput sequence or of the ${k α}$ -sequence with $α = \frac{1 + \sqrt{5}}{2}$ , see [15]. Since in each step only two of the already constructed values are used it follows that the Brownian bridge construction uses $O (n)$ floating point operations.

Moreover we see that the forward construction is a special case of the Brownian bridge construction where $π (j) = j$ for all $j$ .

The classical Brownian bridge construction as presented in Caflisch and Morokoff [18] corresponds to the setup $n = 2^{L}$ ,

t_{k} = k 2^{- L}, k = 0, \dots, n

where the $B_{t_{k}}$ are constructed in the order $B_{1}$ , $B_{1 / 2}$ , $B_{1 / 4}$ , $B_{3 / 4}$ , $B_{1 / 8}$ , $B_{3 / 8}$ , $B_{5 / 8}$ , $B_{7 / 8}$ , $\dots$ . This is therefore another example where the functions $l$ and $r$ can easily be computed.

2.3. Interpolation using Brownian bridge

The preceding section inspires another class of constructions: we may start with constructing a possibly rough approximation of the discrete path and fill in the gaps using the Brownian bridge.

More concretely, suppose we have nodes $0 < t_{1} < \dots < t_{N} = 1$ and we want to generate a sample of the corresponding discrete Brownian path, $B_{t_{1}}, \dots, B_{t_{N}}$ . We proceed as follows: choose some convenient natural number $n$ of roughly the same or a smaller magnitude as $N$ and use some fast construction to generate $B_{1 / n}, \dots, B_{1}$ . Then partition the set ${t_{1}, \dots, t_{N}}$ into subsets

{t_{1}, \dots, t_{N}} = T \cup ⋃_{k = 1}^{n} T_{k},

where $T_{k} ≔ {t_{j} : \frac{k - 1}{n} < t_{j} < \frac{k}{n}}$ and $T ≔ {t_{j} : t_{j} = \frac{k}{n} for some 0 < k \leq n}$ . The values $B_{t}$ for $t \in T$ are already known. For every $0 < k \leq n$ we now compute ${B_{t} : t \in T_{k}}$ using the Brownian bridge construction.

The fast construction we used above to generate $B_{1 / n}, \dots, B_{1}$ may be any of the constructions presented in Section 3 or PCA. Yet it may as well be something entirely different: suppose we have some $n \times n$ matrix $A$ with $A A^{⊤} = Σ^{(n)}$ and with $n$ being of the order of magnitude of $\sqrt{N}$ . Then $(B_{1 / n}, \dots, B_{1})$ can be constructed in $n^{2} \approx N$ steps (multiplication of an $n$ -vector with $A$ ) and interpolation using the Brownian bridge can be done in at most $O (N)$ steps.

Another possible solution to the problem of generating a Brownian path with unequally spaced time nodes is presented by Keiner and Waterhouse [14]. They describe an approximate PCA that constructs paths with unequal time-steps.

2.4. $m$ -step forward method

Suppose we want to generate a discrete Brownian path on ${\frac{1}{n}, \frac{2}{n}, \dots, 1}$ where $n = n_{1} m$ for some $n_{1}, m \in N$ . There is a straightforward generalization of the forward method whereby $m$ forward steps are generated in one “forward leap” in the following way: fix any decomposition $Σ^{(m)} = A A^{⊤}$ .

(B_{((k + 1) n_{1} + 1) / n}, \dots, B_{((k + 1) n_{1} + m) / n}) ≔ (B_{(k n_{1}) / n}, \dots, B_{(k n_{1}) / n}) + \sqrt{1 / n_{1}} A Z_{k + 1},

where $Z_{1}, Z_{2}, \dots$ are independent standard Gaussian vectors of dimension $m$ . Clearly the number of flops needed is $n_{1}$ times the number needed by one multiplication by $A$ . In any case it is less then $n_{1} m^{2}$ which is $O (n)$ for constant $m$ . If $A$ allows for fast multiplication, the number of operations is $O (n log (m))$ .

2.5. $m$ -step Brownian bridge

Similarly to the $m$ -step forward method, we can generalize the Brownian bridge construction to an $m$ -step method: suppose we want to generate a discrete Brownian path on ${\frac{1}{n}, \frac{2}{n}, \dots, 1}$ where $n = {(m + 1)}^{ν}$ for some $m, ν \in N$ .

Let $(B_{\frac{1}{m + 1}}^{0}, B_{\frac{2}{m + 1}}^{0}, \dots, B_{1}^{0})$ be a discrete Brownian path and consider the Gaussian vector

X = (X_{\frac{1}{m + 1}}, \dots, X_{\frac{m}{m + 1}}) ≔ (B_{\frac{1}{m + 1}}^{0} - \frac{1}{m + 1} B_{1}^{0}, \dots, B_{\frac{m}{m + 1}}^{0} - \frac{m}{m + 1} B_{1}^{0}) .

(2)

It is easy to see that for any standard normal variable $Z$ independent of $B$ the vector

(X_{\frac{1}{m + 1}} + \frac{1}{m + 1} Z, X_{\frac{2}{m + 1}} + \frac{2}{m + 1} Z, \dots, X_{\frac{m}{m + 1}} + \frac{m}{m + 1} Z, Z)

is a discrete Brownian path. Moreover the covariance matrix of $(X_{\frac{1}{m + 1}}, X_{\frac{2}{m + 1}}, \dots, X_{\frac{m}{m + 1}})$ is given by

Γ^{(m)} = {(\frac{min (j, k)}{m + 1} - \frac{j k}{{(m + 1)}^{2}})}_{j, k = 1}^{m} .

We call any Gaussian vector $X = (X_{\frac{1}{m + 1}}, \dots, X_{\frac{m}{m + 1}})$ with covariance matrix $Γ^{(m)}$ a discrete Brownian bridge on ${\frac{1}{m + 1}, \dots, \frac{m}{m + 1}}$ . We write $Γ$ instead of $Γ^{(m)}$ if there is no danger of confusion. $Γ$ is a symmetric positive definite matrix which can therefore be decomposed as $Γ = C C^{⊤}$ .

This provides us with a generalization of the classical Brownian bridge algorithm where in every step (except the first one) $m$ points are generated at once.

Let $z_{0}$ be a standard normal variable and let

{Z_{j, k} : j = 0, \dots, ν - 1, k = 0, \dots, m^{j} - 1}

be a collection of independent standard normal vectors of dimension $m$ .

Define $B_{1} ≔ z_{0}$ , subsequently

{(B_{m^{- 1}}, \dots, B_{(m - 1) m^{- 1}})}^{⊤} = C Z_{0, 0},

further for $j = 1, \dots, ν - 1$

{(B_{m^{- j - 1} + k m^{- j}}, \dots, B_{(m - 1) m^{- j - 1} + k m^{- j}})}^{⊤} = B_{k m^{- j}} {(1, \dots, 1)}^{⊤} + \frac{1}{m} (B_{(k + 1) m^{- j}} - B_{k m^{- j}}) {(1, \dots, m - 1)}^{⊤} + \sqrt{m^{- j}} C Z_{j, k},

for $k \in {0, \dots, m^{j} - 1}$ . It can be shown that this construction generates a Brownian path as required.

Let us now turn to the number of multiplications needed. Multiplication of a vector by $C$ can be done using at most $m^{2}$ scalar multiplications, we need $1 + m + \dots + m^{ν - 1}$ such matrix-vector multiplications, giving a total of $m^{2} \frac{m^{ν} - 1}{m - 1} \approx m n$ scalar multiplications. If matrix-vector multiplication can be done in $m log (m)$ steps, then we get $log (m) n$ . Both results are $O (n)$ for constant $m$ .

The last sentence begs the question under which circumstances matrix-vector multiplication can be done in $m log (m)$ steps. We do not know of a direct decomposition of $Γ^{(m)}$ that makes that possible, but every fast decomposition of $Σ^{(m + 1)}$ gives rise to a fast generation of a discrete Brownian bridge via Eq. (2), using $O ((m + 1) log (m + 1)) = O (m log (m))$ operations.

A possible drawback is that we use $m + 1$ random numbers to generate an $m$ -dimensional Gaussian vector, so that one random number is “lost” or, in other words, the dimension of the integration problem becomes higher with each step.

But this can be remedied to a certain degree: note that if $X$ is constructed from $B^{0}$ according to Eq. (2), then $X$ is uncorrelated and therefore independent of $B_{1}^{0}$ :

E ((B_{\frac{k}{m + 1}}^{0} - \frac{k}{m + 1} B_{1}^{0}) B_{1}^{0}) = \frac{k}{m + 1} - \frac{k}{m + 1} = 0 .

Thus $B_{1}^{0}$ may be “recycled” as a component of one of the $Z_{j, k}$ in a later step and so only in the very last step there is one random variable wasted. So if the $m$ -step Brownian bridge construction is used to generate a discrete Brownian path on ${\frac{1}{n}, \frac{2}{n}, \dots, 1)$ with $n = {(m + 1)}^{ν}$ , then using the above method we need $n + 1 = {(m + 1)}^{ν} + 1$ random numbers, i.e. the integration problem becomes $(n + 1)$ -dimensional.

Finally we want to remark that the generation of $X$ above can be used for $m$ -step interpolation, analog to the generalized Brownian bridge in Section 2.3.

3. Generation by FFT-type transforms

Probably the most famous example of a unitary transform that allows for fast matrix-vector multiplication is the discrete Fourier transform DFT (with the corresponding fast multiplication algorithm FFT),

{(F x)}_{j} = \frac{1}{\sqrt{n}} \sum_{k = 1}^{n} x_{k} e^{- (j - 1) (k - 1) \frac{2 π i}{n}}, j = 1, \dots, n,

for a vector $x \in C^{n}$ . (Usually the indices range from 0 to $n - 1$ , but for consistency with other constructions in this paper we chose the above definition.)

There are a many variants of the discrete Fourier transform that map real functions to real functions and are therefore orthogonal. We present some of those variants.

3.1. Modified Fourier transform

Most Fourier variants that map real functions to real functions have some additional special properties which, for example, make them useful for the fast generation of convolutions. From our point of view we are content with any such variant, indeed we want to have many different variants to choose from for a particular application.

We present one modification of the Fourier transform that maps real functions to real functions. It is well known (and easy to check) that the discrete Fourier transform of a vector $(x_{1}, \dots, x_{n})$ is real-valued iff $x_{1} \in R$ and $x_{n + 2 - k} = {\bar{x}}_{k}$ for $2 \leq k \leq \frac{n}{2} + 1$ . Consider the linear map $B$ ,

{(B x)}_{k} = {\begin{cases} x_{1} & k = 1 \\ x_{k} + i x_{⌊ n / 2 ⌋ + k} & 2 \leq k < n / 2 + 1 \\ x_{n / 2 + 1} & k = n / 2 + 1 \\ x_{n + 2 - k} - i x_{⌊ n / 2 ⌋ + n + 2 - k} & n / 2 + 1 < k \leq n . \end{cases}

(Note that one may use this definition for odd $n$ as well.) Then for any standard normal vector $X$ , $F B X$ is a (real-valued) standard normal vector.

A drawback of this method is that the FFT algorithm uses complex multiplication, which uses 4 real multiplications. However many fast orthogonal transforms that use only real multiplications have been developed, mainly–as is to be expected–by researchers in the area of signal processing.

3.2. Hartley transform and Hilbert transform

The Hartley transform on $[0, 2 π]$ is a variant of the Fourier transform where the orthonormal basis is given by the functions $x \mapsto \frac{1}{\sqrt{2 π}} (cos (k x) + sin (k x))$ , $k \in Z$ . See [5,6].

It is easy to see that the Hartley transform on $[0, 2 π]$ is given by

H (f) ≔ ℜ F (f) - ℑ F (f),

(3)

where $F$ denotes the Fourier transform on $[0, 2 π]$ ,

F (f) (k) = \frac{1}{\sqrt{2 π}} \int_{0}^{2 π} f (x) e^{- 2 π i k x} d x .

In analogy to the Fourier transform there is a discrete version of the Hartley transform together with a fast algorithm, see [27, Section 3.2] (or one might just use the FFT and Eq. (3)): for a vector $x \in R^{n}$ define

{(H x)}_{j} = \frac{1}{\sqrt{n}} \sum_{k = 1}^{n} x_{k} (cos ((j - 1) (k - 1) \frac{2 π}{n}) + sin ((j - 1) (k - 1) \frac{2 π}{n})) = ℜ (F x) (j) - ℑ (F x) (j), j = 1, \dots, n .

We identify the linear map $H$ with the corresponding matrix.

Theorem 3.1

The matrix $H$ has the following properties:

1.
$H$ is orthogonal;

2.
$H$ is self-inverse, $H^{- 1} = H$ ;

3.
for any real vector $x$ the product $H x$ can be computed in $O (n log (n))$ steps.

A proof can be found in [6, Chapter 12].

Another important property of the Fast Discrete Hartley transform is that it can be computed using only real multiplications and additions. Consult Bracewell [6] for more details on the Hartley transform and its discrete version.

Another example of an orthogonal transform is the so-called Hilbert transform. We will not go into the details of the Hilbert transform. The definition and basic properties can be found in [6]. There, a result from Pei and Jaw [21] is also stated which says that the discrete Hilbert transform is of the form

H \circ π \circ H,

where $H$ is the discrete Hartley transform and $π$ is the permutation

π (j) = {\begin{cases} 0 & for j = 1 \\ n + 1 - j & for j \neq 1 . \end{cases}

The discrete Hilbert transform is therefore an orthogonal matrix admitting fast multiplication.

3.3. Sine and cosine transforms

Two of the most important orthogonal transforms that map real input to real output are the sine and cosine transforms. There are four widely used variants of the sine and cosine transform and there is also the so-called $W$ -transform which is similarly defined.

For the convenience of the reader we will recall the definitions of the discrete sine and cosine transforms. Wang [26] or Wickerhauser [27, Section 3.3] provide details on fast implementations.

The definitions are not entirely uniform throughout the literature. We have chosen definitions which are in a form that already describes an orthogonal transform. Usually, as for example in [26], the indices range from 0 to $n - 1$ , but for consistence with the rest of our paper we will let them range from 1 to $n$ .

Define

κ_{A} (j) ≔ {\begin{cases} 1 & if j \neq A \\ 1 / \sqrt{2} & if j \in A . \end{cases}

All matrices below are in $M_{n, n} (R)$ for some $n \geq 2$ , so the indices $j, k$ range from 1 to $n$ .

{(M_{DCT-I})}_{j, k} ≔ \sqrt{\frac{2}{n - 1}} κ_{{1, n}} (j) κ_{{1, n}} (k) cos (\frac{π}{n - 1} (j - 1) (k - 1))

{(M_{DCT-II})}_{j, k} ≔ \sqrt{\frac{2}{n}} κ_{{1}} (j) cos (\frac{π}{n} (j - 1) (k - \frac{1}{2}))

{(M_{DCT-III})}_{j, k} ≔ \sqrt{\frac{2}{n}} κ_{{1}} (k) cos (\frac{π}{n} (j - \frac{1}{2}) (k - 1))

{(M_{DCT-IV})}_{j, k} ≔ \sqrt{\frac{2}{n}} cos (\frac{π}{n} (j - \frac{1}{2}) (k - \frac{1}{2}))

{(M_{DST-I})}_{j, k} ≔ \sqrt{\frac{2}{n + 1}} sin (\frac{π}{n + 1} j k)

{(M_{DST-II})}_{j, k} ≔ \sqrt{\frac{2}{n}} κ_{{n}} (j) sin (\frac{π}{n} j (k - \frac{1}{2}))

{(M_{DST-III})}_{j, k} ≔ \sqrt{\frac{2}{n}} κ_{{n}} (k) sin (\frac{π}{n} (j - \frac{1}{2}) k)

{(M_{DST-IV})}_{j, k} ≔ \sqrt{\frac{2}{n}} sin (\frac{π}{n} (j - \frac{1}{2}) (k - \frac{1}{2})) .

Wang introduced another set of orthogonal transforms which was called discrete $W$ transform, or DWT. The abbreviation “DWT” is nowadays more frequently used for discrete wavelet transform.

{(M_{W-I})}_{j, k} ≔ \sqrt{\frac{2}{n}} sin (\frac{π}{4} + \frac{2 π}{n} (j - 1) (k - 1))

{(M_{W-II})}_{j, k} ≔ \sqrt{\frac{2}{n}} κ_{{n}} (j) sin (\frac{π}{4} + \frac{2 π}{n} (j - 1) (k - \frac{1}{2}))

{(M_{W-III})}_{j, k} ≔ \sqrt{\frac{2}{n}} κ_{{n}} (k) sin (\frac{π}{4} + \frac{2 π}{n} (j - \frac{1}{2}) (k - 1))

{(M_{W-IV})}_{j, k} ≔ \sqrt{\frac{2}{n}} sin (\frac{π}{4} + \frac{2 π}{n} (j - \frac{1}{2}) (k - \frac{1}{2})) .

One aspect of the cosine transform is especially worth noting: its corresponding Brownian paths agree to a large extent with those of the principal component analysis¹.

Let $C ≔ M_{DCT-IV}$ , that is

C_{j k} = \sqrt{\frac{2}{n}} cos (\frac{π}{n} (k - \frac{1}{2}) (j - \frac{1}{2})) .

We want to compute $S C$ , where $S$ is the scaled summation

S = \frac{1}{\sqrt{n}} (\begin{matrix} 1 & 0 & \dots & 0 \\ 1 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & 1 & \dots & 1 \end{matrix}) .

{(S C)}_{l k} = \frac{\sqrt{2}}{n} \sum_{j = 1}^{l} cos (\frac{π}{n} (k - \frac{1}{2}) (j - \frac{1}{2})) = \frac{\sqrt{2}}{n} \sum_{j = 1}^{l} cos (\frac{π}{n} (k - \frac{1}{2}) (j - \frac{1}{2})) \overset{(*)}{=} \frac{1}{\sqrt{n (2 n)}} {(sin (\frac{2 k - 1}{2 n} \frac{π}{2}))}^{- 1} sin (\frac{2 k - 1}{2 n} l π),

where for $(*)$ one writes $cos (x) = \frac{e^{i x} + e^{- i x}}{2}$ and computes the resulting geometric sums.

Recall the PCA construction

λ_{k} = {(4 n {sin}^{2} (\frac{2 k - 1}{2 n + 1} \frac{π}{2}))}^{- 1}

and

V_{l, k} = \frac{2}{\sqrt{2 n + 1}} sin (\frac{2 k - 1}{2 n + 1} l π), l = 1, \dots, n,

{(V D)}_{l k} = λ_{k}^{1 / 2} V_{k, l} = \frac{1}{\sqrt{n (2 n + 1)}} {(sin (\frac{2 k - 1}{2 n + 1} \frac{π}{2}))}^{- 1} sin (\frac{2 k - 1}{2 n + 1} l π) .

We see that for fixed $l, k$ we have ${lim}_{n \to \infty} ({(S C)}_{l k} - {(V D)}_{l k}) = 0$ . On its own that does not say a lot about the proximity of paths generated with the two different methods, but Figs. 1 and 2 illustrate that those paths are rather close indeed, so we want to investigate that topic further.

Fig. 1 — Two Brownian paths with $n = 2^{6}$ , one generated via PCA and one via DCT-IV.

Fig. 2 — Two Brownian paths with $n = 2^{8}$ , one generated via PCA and one via DCT-IV.

Consider the expected squared Euclidean norm of the difference of two paths generated by two $n \times n$ -matrices $P, Q$ from the same set of independent standard normals $(X_{1}, \dots, X_{n})$ :

E ({‖ P X - Q X ‖}^{2}) = E (\sum_{l = 1}^{n} {({(P X)}_{l} - {(Q X)}_{l})}^{2}) = E (\sum_{l = 1}^{n} {(\sum_{k = 1}^{n} {(P - Q)}_{l k} X_{k})}^{2}) = \sum_{l = 1}^{n} \sum_{k = 1}^{n} {(P - Q)}_{l k}^{2} ≕ d_{n} {(P, Q)}^{2} .

The last expression is the square of the Euclidean norm of $P - Q$ in $R^{n^{2}}$ which is known as the Hilbert–Schmidt norm of $P - Q$ .

It turns out that $S C$ is close to $V D$ in the Hilbert–Schmidt norm: for small $n$ the value of $d_{n} (S C, V D)$ can be computed numerically. Fig. 3 shows $d_{n} {(S C, V D)}^{2}$ for $n$ ranging from 0 to 50. The graph suggests that $d_{n} {(S C, V D)}^{2}$ converges for $n \to \infty$ and that the limit is close to $\frac{1}{3}$ .

In fact we have $d_{n} (S C, V D) < 1$ for all $n \in N$ and

\underset{n \to \infty}{lim sup} d_{n} {(S C, V D)}^{2} \leq \frac{2 (48 - π^{2})}{{(π^{2} - 24)}^{2}} = 0.381 \dots .

This will be proved below. As a corollary we note that the average of the variances ${(E ({(S C X - V D X)}_{k}^{2}))}_{k = 1}^{n}$ tends to 0 as $n$ tends to infinity.

Define

δ (n, k) ≔ \sum_{l = 1}^{n} {(S C - V D)}_{l k}^{2}

and

u (n, x) ≔ \frac{1}{n x^{2}} (\frac{9 (\frac{1}{n} + 1)}{{(\frac{1}{2 n} + 1)}^{2} {(6 - {(\frac{1}{2 n} + 1)}^{2} x^{2})}^{2}} + \frac{9}{{(6 - x^{2})}^{2}} - \frac{\sqrt{n} \sqrt{4 n + 2}}{4 n + 1}) .

(4)

Proposition 3.2

$u$ is increasing in the second variable on $[0, \frac{π}{2}]$ .

Proof

Compute the partial derivative of $u$ w.r.t. $x$ as

$\frac{\partial}{\partial k} u (n, k) = \frac{1}{n x^{3}} (\frac{9 (\frac{1}{n} + 1) {(2 n + 1)}^{2} x^{2}}{{(\frac{1}{2 n} + 1)}^{2} n^{2} {(6 - {(\frac{1}{2 n} + 1)}^{2} x^{2})}^{3}} + \frac{6^{2} x^{2}}{{(6 - x^{2})}^{3}} - \frac{2 \cdot 9}{{(6 - x^{2})}^{2}} - \frac{2 \cdot 9 (\frac{1}{n} + 1)}{{(\frac{1}{2 n} + 1)}^{2} {(6 - {(\frac{1}{2 n} + 1)}^{2} x^{2})}^{2}} + \frac{2 \sqrt{n} \sqrt{4 n + 2}}{4 n + 1}) .$

We need to show that $\frac{\partial}{\partial k} u (n, x) > 0$ , which is equivalent to

$\hat{u} (n, x) ≔ n x^{3} \frac{\partial}{\partial k} u (n, x) > 0 .$

(Note that $x > 0$ ). This in turn is shown by proving that $\hat{u} (n, 0) > 0$ and $\frac{\partial}{\partial k} \hat{u} (n, 0) > 0$ . We leave those last calculations to the reader. □

Lemma 3.3

For all $n$ and all $1 \leq k \leq n$ we have

$δ (n, k) < u (n, \frac{2 k - 1}{2 n + 1} \frac{π}{2}) .$

Proof

The sum $δ (n, k) ≔ \sum_{l = 1}^{n} {(S C - V D)}_{l k}^{2}$ can be computed by writing $sin (x) = \frac{1}{2 i} (e^{i x} - e^{- i x})$ for $x \in {\frac{2 k - 1}{2 n} l π, \frac{2 k - 1}{2 n + 1} l π}$ so that the sum becomes a sum of 4 geometric sums. The result can be simplified to

$δ (n, k) = \frac{{csc}^{2} (\frac{π (2 k - 1)}{4 n})}{8 n^{2}} + \frac{{csc}^{2} (\frac{π (2 k - 1)}{4 n})}{4 n} + \frac{{csc}^{2} (\frac{π (2 k - 1)}{2 (2 n + 1)})}{4 n (2 n + 1)} + \frac{2 {csc}^{2} (\frac{π (2 k - 1)}{2 (2 n + 1)})}{4 (2 n + 1)} - \frac{\sqrt{2} csc (\frac{π (2 k - 1)}{2 (2 n + 1)}) csc (\frac{π (2 k - 1)}{4 n (2 n + 1)})}{4 n \sqrt{n (2 n + 1)}} - \frac{sin (\frac{π (2 k - 1) (2 n + 1)}{2 n}) csc (\frac{π (2 k - 1)}{2 n}) {csc}^{2} (\frac{π (2 k - 1)}{4 n})}{8 n^{2}} - \frac{sin (π (2 k - 1)) {csc}^{2} (\frac{π (2 k - 1)}{2 (2 n + 1)}) csc (\frac{π (2 k - 1)}{2 n + 1})}{4 n (2 n + 1)} + \frac{\sqrt{2} sin (\frac{π (2 k - 1) (4 n + 1)}{4 n}) csc (\frac{π (2 k - 1)}{2 (2 n + 1)}) csc (\frac{π (2 k - 1) (4 n + 1)}{4 n (2 n + 1)}) csc (\frac{π (2 k - 1)}{4 n})}{4 n \sqrt{n (2 n + 1)}} .$

We make use of the facts that $sin (π (2 k - 1)) = 0$ , $sin (\frac{π (2 k - 1) (2 n + 1)}{2 n}) = - sin (\frac{π (2 k - 1)}{2 n})$ and $sin (\frac{π (2 k - 1) (4 n + 1)}{4 n}) = - sin (\frac{π (2 k - 1)}{4 n})$ for integers $k, n$ , such that

$δ (n, k) = \frac{{csc}^{2} (\frac{π (2 k - 1)}{4 n})}{8 n^{2}} + \frac{{csc}^{2} (\frac{π (2 k - 1)}{4 n})}{4 n} + \frac{{csc}^{2} (\frac{π (2 k - 1)}{2 (2 n + 1)})}{4 n (2 n + 1)} + \frac{2 {csc}^{2} (\frac{π (2 k - 1)}{2 (2 n + 1)})}{4 (2 n + 1)} - \frac{\sqrt{2} csc (\frac{π (2 k - 1)}{2 (2 n + 1)}) csc (\frac{π (2 k - 1)}{4 n (2 n + 1)})}{4 n \sqrt{n (2 n + 1)}} + \frac{{csc}^{2} (\frac{π (2 k - 1)}{4 n})}{8 n^{2}} - \frac{\sqrt{2} csc (\frac{π (2 k - 1)}{2 (2 n + 1)}) csc (\frac{π (2 k - 1) (4 n + 1)}{4 n (2 n + 1)})}{4 n \sqrt{n (2 n + 1)}},$

that is,

$δ (n, k) = \frac{1}{4 n} (\frac{1}{n} + 1) {csc}^{2} (\frac{π (2 k - 1)}{4 n}) + \frac{1}{4 n} {csc}^{2} (\frac{π (2 k - 1)}{2 (2 n + 1)}) - \frac{\sqrt{2} csc (\frac{π (2 k - 1)}{2 (2 n + 1)})}{4 n \sqrt{n (2 n + 1)}} (csc (\frac{π (2 k - 1)}{4 n (2 n + 1)}) + csc (\frac{π (2 k - 1) (4 n + 1)}{4 n (2 n + 1)})) .$

Next we use that for $x \in (0, \frac{π}{2})$ one has $x - \frac{x^{3}}{6} < sin (x) < x$ and therefore

$\frac{1}{x - \frac{x^{3}}{6}} > csc (x) > \frac{1}{x},$

such that, after some simplifications we get the estimate

$δ (n, k) < \frac{1}{n {(\frac{π (2 k - 1)}{2 (2 n + 1)})}^{2}} (\frac{9 (\frac{1}{n} + 1)}{{(\frac{1}{2 n} + 1)}^{2} {(6 - {(\frac{1}{2 n} + 1)}^{2} {(\frac{π (2 k - 1)}{2 (2 n + 1)})}^{2})}^{2}} + \frac{9}{{(6 - {(\frac{π (2 k - 1)}{2 (2 n + 1)})}^{2})}^{2}} - \frac{\sqrt{n} \sqrt{4 n + 2}}{4 n + 1}) .$

Therefore

$δ (n, k) < u (n, \frac{2 k - 1}{2 n + 1} \frac{π}{2}),$

□

Theorem 3.4

The sequence $d_{n} {(S C, V D)}^{2}$ is bounded by $\frac{2 (48 - π^{2})}{{(π^{2} - 24)}^{2}}$ .

Proof

We know from Proposition 3.2 that $u$ is increasing in $x$ and from Lemma 3.3 that $δ (n, k) < u (n, \frac{2 k - 1}{2 n + 1} \frac{π}{2})$ for all $k \in {1, \dots, n}$ , so we conclude that $δ (n, k) < u (n, \frac{π}{2})$ and therefore

$d_{n} {(S C, V D)}^{2} = \sum_{l = 1}^{n} \sum_{k = 1}^{n} {(S C - V D)}_{l k}^{2} = \sum_{k = 1}^{n} δ (n, k) < n u (n, \frac{π}{2}) .$ (5)

It is readily checked that

$lim_{n \to \infty} n u (n, \frac{π}{2}) = \frac{2 (48 - π^{2})}{{(π^{2} - 24)}^{2}} = 0.381 \dots$

so that $d_{n} (S C, V D)$ is indeed bounded. □

Some extra work is needed to find a uniform bound on $d_{n} (S C, V D)$ :

Theorem 3.5

$d_{n} (S C, V D) < 1$ .

Proof

Observe that $\frac{\sqrt{4 n + 2} \sqrt{n}}{4 n + 1} \geq \frac{1}{2} - \frac{1}{64 n^{2}}$ , so that from (4) to (5)

$d_{n} {(S C, V D)}^{2} < n u (n, \frac{π}{2}) \leq \frac{4}{π^{2}} (\frac{(\frac{1}{n} + 1)}{{(\frac{1}{2 n} + 1)}^{2} {(2 - {(\frac{1}{2 n} + 1)}^{2} \frac{π^{2}}{12})}^{2}} + \frac{1}{{(2 - \frac{π^{2}}{12})}^{2}} - (\frac{1}{2} - \frac{1}{64 n^{2}})) \leq \frac{4}{π^{2}} (\frac{2}{{(\frac{1}{2 n} + 1)}^{2} {(2 - {(\frac{1}{2 n} + 1)}^{2} \frac{π^{2}}{12})}^{2}} - (\frac{1}{2} - \frac{1}{64 n^{2}})) .$

Now use the estimates $\frac{4}{π^{2}} < \frac{41}{100}$ and $\frac{π^{2}}{12} < \frac{83}{100}$ to verify that $n u (n, \frac{π}{2}) < 1$ for all $n \geq 3$ .

For $n = 1, 2$ direct computation shows that $d_{n} (S C, V D) < 1$ . □

3.4. Intermezzo: simulation of Lévy processes

A Lévy process $L$ is a stochastic process with the following properties:

1.
$L_{0} = 0$ a.s.;
2.
for fixed $t > 0$ the random variables $L_{s + t} - L_{s}$ have the same distribution for all $s \geq 0$ ;
3.
for $0 \leq t_{1} \leq t_{2} \leq \dots \leq t_{n}$ the random variables $L_{t_{1}}, L_{t_{2}} - L_{t_{1}}, \dots, L_{t_{n}} - L_{t_{n - 1}}$ are independent.
4.
$L$ is continuous in probability.

This is a generalization of Brownian motion, which is obtained if $L_{t}$ has normal distribution with mean 0 and variance $t$ .

The distribution of $L_{t}$ for arbitrary $t$ is already determined by the distribution of $L_{1}$ : if $ψ$ is the generating function of $L_{1}$ , then by definition of a Lévy process and elementary properties of characteristic functions, $L_{m / n}$ has characteristic function $ψ^{m / n}$ for any rational number $m / n > 0$ and by continuity, $L_{t}$ has characteristic function $ψ^{t}$ for any real number $t > 0$ . See for example [22].

If we want to know the values of a sample path for $t = \frac{1}{n}, \frac{2}{n}, \dots, 1$ we can use a direct generalization of the forward method, provided that we know the inverse of the cumulative probability distribution function (CDF) of $L_{1 / n}$ : let $F$ be the CDF of $L_{1 / n}$ and $F^{- 1}$ its inverse. Given independent $(0, 1)$ -uniform random variables $U_{1}, \dots, U_{n}$ define

L_{1 / n} ≔ F^{- 1} (U_{1})

L_{2 / n} ≔ F^{- 1} (U_{1}) + F^{- 1} (U_{2})

⋮

L_{1} ≔ F^{- 1} (U_{1}) + F^{- 1} (U_{2}) + \dots + F^{- 1} (U_{n}) .

In contrast, the general construction underlying all the generation methods mentioned so far in this paper do not generalize to Lévy processes since those constructions depend heavily on the normality of the generated vector.²

It has first been observed in [16] that constructions of Brownian paths like BB and PCA can be generalized to constructions of paths of Lévy processes in the way described below.

Let $Φ$ denote the standard normal distribution function, i.e.

Φ (x) ≔ \int_{- \infty}^{x} \frac{1}{\sqrt{2 π}} exp (- \frac{y^{2}}{2}) d y .

Given any Brownian path $(B_{0}, B_{1 / n}, \dots, B_{1})$ we have that the random variables ${\hat{U}}_{k} ≔ Φ (n^{1 / 2} (B_{k / n} - B_{(k - 1) / n}))$ , $k \in {1, \dots, n}$ are independent and uniformly distributed on $(0, 1)$ . These transformed uniform variables ${\hat{U}}_{1}, \dots, {\hat{U}}_{n}$ can now be used in the forward construction of a Lévy path.

Thus every orthogonal transform $T$ can be used for a construction of Lévy paths using the following algorithm:

1.
Generate independent, standard normal random variables $X_{1}, \dots, X_{n}$ ;
2.
Compute $Y = T X$ ;
3.
Compute independent, uniformly distributed random variables ${\hat{U}}_{1}, \dots, {\hat{U}}_{n}$ via ${\hat{U}}_{k} = Φ (Y_{k})$ , $k = 1, \dots, n$ ;
4.
Compute a discrete path of $L$ via the forward construction, i.e. $L_{1 / n} = F^{- 1} ({\hat{U}}_{1}), L_{2 / n} = F^{- 1} ({\hat{U}}_{1}) + F^{- 1} ({\hat{U}}_{2}), \dots, L_{1} = F^{- 1} ({\hat{U}}_{1}) + \dots + F^{- 1} ({\hat{U}}_{n})$ .

If the $T$ in that algorithm corresponds to Brownian path generation using PCA, then the above method can be viewed as a kind of PCA for the Lévy process. In that case $B = V D X$ and ${\hat{U}}_{k} = Φ (Y_{k})$ , where $Y = S^{- 1} V D X$ .

If, on the other hand, we use for $T$ the cosine transform of Section 3.3, this will give a result that is close to the one obtained from PCA while using less than half the number of flops, since $P C A$ involves the sine transform on $R^{2 n}$ .

This method is actually entirely general: every simulation algorithm can use standard normals instead of uniform random variables, and it is always possible to apply an orthogonal transform to those standard normals. From the probabilistic point of view the resulting integration problem will be equivalent to the original one, but it may still hold that convergence when using QMC or stratification becomes faster.

3.5. Walsh–Hadamard transform

Walsh functions and the corresponding transform were first introduced by Walsh in [25]. Subsequently, Fine [8] augmented the theory and made the connection to classical Fourier theory by observing that the Walsh functions are just the characters of the group $([0, 1), \oplus)$ , where $\oplus$ is digit-wise addition modulo 2. Like the Fourier transform, the Walsh–Hadamard transform has a discrete version which will be outlined below.

Consider the group structure on $(0, \dots, 2^{n} - 1)$ that is induced by bit-wise operations in base 2. For example $3 \oplus 3 = {(1, 1)}_{2} +_{Z_{2}^{n}} {(1, 1)}_{2} = {(0)}_{2} = 0$ , $7 \oplus 4 = {(1, 1, 1)}_{2} +_{Z_{2}^{n}} {(1, 0, 0)}_{2} = {(1, 1)}_{2} = 3$ .

Then the characters of the group are of the form

χ_{k} (j) = {(- 1)}^{k ⊙ j},

where $⊙$ denotes bit-wise inner product, i.e. $k ⊙ j = k_{0} j_{0} + \dots + k_{m} j_{m}$ for $k = k_{0} + k_{1} 2 + \dots + k_{m} 2^{m}$ and $j = j_{0} + j_{1} 2 + \dots + j_{m} 2^{m}$ . The $χ_{k}$ are also called (discrete) Walsh functions. See [7] for details.

From the character property it follows immediately that $(\frac{1}{\sqrt{n}} χ_{0}, \dots, \frac{1}{\sqrt{n}} χ_{n - 1})$ form an orthonormal basis of $R^{n}$ and therefore the coordinate transformation from the canonical basis to Walsh functions is orthogonal. Let $W$ denote the corresponding matrix. Multiplication of a vector with $W$ is called the discrete Hadamard–Walsh transform.

Theorem 3.6

The matrix $W$ has the following properties:

1.
$W$ is orthogonal;

2.
$W$ is self-inverse, $W^{- 1} = W$ ;

3.
the entries of $W$ are all in ${- 1, 1}$ ;

4.
for any real vector $x$ the product $W x$ can be computed using $O (n log (n))$ flops.

We omit the easy proof as it is mostly a special case of the Kronecker product of orthogonal matrices which is treated in Section 4.3. A detailed description of the fast algorithm can be found in [9].

There is a nice one-line Mathematica implementation of the fast Walsh transform, taken from [28, Notes for Chapter 10], which we would like to share with the reader:

A disadvantage of the method is that it relies on $n$ being a power of 2. A big advantage is that most of the occurring floating point multiplications are by 1 or $- 1$ , with only $n$ multiplications by ${(\sqrt{n})}^{- 1}$ .

4. Further approaches

4.1. Wavelet analysis

Another interesting orthogonal transform is the Haar transform as introduced in [11]. We repeat the basic definitions, loosely following [13]. For a vector $x^{0} ≔ (x_{1}^{0}, \dots, x_{n}^{0})$ of length $n = 2^{L}$ define

x_{k}^{1} ≔ \frac{1}{\sqrt{2}} (x_{2 k - 1}^{0} + x_{2 k}^{0})

d_{k}^{1} ≔ \frac{1}{\sqrt{2}} (x_{2 k - 1}^{0} - x_{2 k}^{0}) .

It is not hard to convince oneself that the mapping $x^{0} \mapsto (x^{1}, d^{1})$ constitutes an orthogonal transform.

Moreover, the construction can be repeated on $x^{1}$ and in general we define

x_{k}^{j} ≔ \frac{1}{\sqrt{2}} (x_{2 k - 1}^{j - 1} + x_{2 k}^{j - 1})

d_{k}^{j} ≔ \frac{1}{\sqrt{2}} (x_{2 k - 1}^{j - 1} - x_{2 k}^{j - 1})

for $k = 1, \dots, 2^{L - j}$ , $j = 1, \dots, L$ and it is again easy to see that the mapping

x^{0} \mapsto (x^{L}, d^{L}, d^{L - 1}, d^{L - 2}, \dots, d^{1})

is an orthogonal transform. The inverse can be computed similarly:

x_{2 k - 1}^{j - 1} ≔ \frac{1}{\sqrt{2}} (x_{k}^{j} + d_{k}^{j})

x_{2 k}^{j - 1} ≔ \frac{1}{\sqrt{2}} (x_{k}^{j} - d_{k}^{j})

for $k = 1, \dots, 2^{L - j}$ , $j = L, \dots, 1$ .

Let $H$ denote the Haar transform on $1, \dots, n$ where $n$ is a power of 2. It is well known (and easy to check) that $S H^{- 1}$ , where $S$ is again the normalized summation matrix given in Eq. (1), is exactly the classical Brownian bridge construction. General wavelet analysis may therefore be viewed as a generalization of the Brownian bridge construction.

While the inverse Haar transform does not provide us with a new fast construction method for discrete Brownian paths, the Haar transform $H$ does.

The Haar transform as well as its inverse can be computed using $O (n log (n))$ flops. See [13] for a proof. If some care is taken the transforms can be done using $O (n log (n))$ additions/subtractions and $n$ multiplications with powers of $\frac{1}{\sqrt{2}}$ .

4.2. Block-diagonal orthogonal matrices

The following method can be viewed as a generalization of the $m$ -step forward method of Section 2.4: Consider a block-diagonal matrix

C = (\begin{matrix} C_{1} \\ C_{2} \\ ⋱ \\ C_{k} \end{matrix}),

where $C_{1}, \dots, C_{k}$ are orthogonal matrices and $C_{j}$ has dimension $n_{j} \times n_{j}$ , such that $C$ has dimension $n \times n$ with $n = n_{1} + \dots + n_{k}$ .

We have that $C$ admits fast matrix-vector multiplication if every $C_{j}$ , $j = 1, \dots, k$ , does, or if ${max}_{1 \leq j \leq k} n_{j} \leq K log (n)$ for some constant $K$ .

4.3. Kronecker product of orthogonal matrices

Recall the Kronecker product³ $A \otimes B$ of two matrices $A$ and $B$ :

(\begin{matrix} A_{11} & \dots & A_{1 m} \\ ⋮ & ⋱ & ⋮ \\ A_{m 1} & \dots & A_{m m} \end{matrix}) \otimes (\begin{matrix} B_{11} & \dots & B_{1 k} \\ ⋮ & ⋱ & ⋮ \\ B_{k 1} & \dots & B_{k k} \end{matrix}) ≔ (\begin{matrix} A_{11} B & \dots & A_{1 m} B \\ ⋮ & ⋱ & ⋮ \\ A_{m 1} B & \dots & A_{m m} B \end{matrix}) .

Note that

(A_{1} \otimes A_{2}) (B_{1} \otimes B_{2}) = A_{1} B_{1} \otimes A_{2} B_{2}

and

{(A_{1} \otimes A_{2})}^{⊤} = A_{1}^{⊤} \otimes A_{2}^{⊤}

and that the Kronecker product of two identity matrices is an identity matrix. From this it follows that the Kronecker product of orthogonal matrices is an orthogonal matrix.

The significance of the Kronecker product for our considerations is that matrix-vector multiplication with the $m k \times m k$ -matrix $A \otimes B$ can be done using less than ${(m k)}^{2}$ operations. To that end we use the following algorithm:

1.
partition the $m k$ -dimensional vector $x$ into $k$ vectors ${\hat{x}}^{1}, \dots, {\hat{x}}^{k}$ of dimension $m$ , ${\hat{x}}_{j}^{i} ≔ x_{(i - 1) k + j}$ ;
2.
compute $y_{j} ≔ B x_{j}$ for $j = 1, \dots, k$ ;
3.
concatenate the $y_{j}$ ’s to an $m k$ -dimensional vector $y$ ;
4.
partition $y$ into $m$ vectors ${\hat{y}}_{1}, \dots, {\hat{y}}_{m}$ of dimension $k$ , ${\hat{y}}_{j}^{i} ≔ y_{(j - 1) m + i}$ ;
5.
compute $z_{j} ≔ A {\hat{y}}_{j}$ for $j = 1, \dots, m$ ;
6.
concatenate the $z_{j}$ ’s to an $m k$ -dimensional vector $z$ .

Then it is easy to check that $z = (A \otimes B) x$ . The algorithm needs $O (k m^{2} + m k^{2}) = O (m k (m + k))$ flops, compared to $O ({(m k)}^{2})$ flops for classical multiplication of $A \otimes B$ with $x$ . If multiplication with $A$ and $B$ is fast, then the complexity further reduces to $O (k m (log (m) + log (k)))$ .

Suppose $n = 2^{k}$ and that

C = A_{1} \otimes A_{2} \otimes \dots \otimes A_{k}

for $k 2 \times 2$ -matrices $A_{1}, \dots, A_{k}$ . Then it follows from induction that the multiplication $C x$ , where $x \in R^{n}$ , can be done in $O (2^{k} k) = O (n log (n))$ operations. An example is the aforementioned Walsh–Hadamard transform which may be written as $W_{2} \otimes \dots \otimes W_{2}$ ( $k$ times), where

W_{2} = \frac{1}{\sqrt{2}} (\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}) .

This is also an easy proof by induction.

5. Numerical examples

We consider some numerical examples that illustrate the methods described in the paper.

We consider a European-style option, i.e. one that can be exercised only at the expiry date $T$ . We assume that the price process is of the form

S_{t} = S_{0} exp (σ B_{t} + (r - \frac{σ^{2}}{2}) t), t \in [0, T],

(6)

where $B$ is a standard Brownian motion, that is, we describe the price process of the share in a Black–Scholes market with interest rate $r$ under the risk-neutral measure.

The arbitrage free price of a derivative $X$ depending only on the path of $S$ up to time $T$ is then given by

p = e^{- r T} E (X),

(7)

provided the expectation is finite. See for example [4] for the general theory of derivative pricing in the Black–Scholes setting.

We limit ourselves to the case where $T = 1$ and where $X$ may only depend on $S_{1 / n}, S_{2 / n}, \dots, S_{1}$ or, equivalently, on $B_{1 / n}, B_{2 / n}, \dots, B_{1}$ .

In that case Eq. (7) takes on the form

p = E (f (B_{1 / n}, \dots, B_{1})),

or, in shortened notation, $p = E (f (B))$ , for some function $f$ .

5.1. Asian option

A classical example is an average value option, also called Asian option, where

X = max (\frac{1}{n} \sum_{k = 1}^{n} S_{\frac{k}{n} T} - K, 0) = max (\frac{1}{n} \sum_{k = 1}^{n} S_{0} exp (σ \sqrt{T} B_{\frac{k}{n}} + (r - \frac{σ^{2}}{2}) \frac{k}{n} T) - K, 0) .

In Fig. 4, for example, an Asian option is considered with $n = 64$ , $r = 0.045$ , $σ = 0.3$ , $S_{0} = K = 100$ , $T = 1$ . The option price was evaluated using a quasi-Monte Carlo rule based on a randomly shifted Sobol sequence and the standard normal variates were generated by inversion of the cumulative probability distribution function. Each line shows the ${log}_{2}$ of the standard deviation divided by the mean over 64 runs of the rule using a different generation method. Along the $x$ -axis we plot the ${log}_{2}$ of the number of points used by the QMC-rule.

We see that PCA outperforms the forward construction and DST-III. The inverse Haar transform, which is equivalent to the Brownian bridge, can be seen to be quite close to PCA.

This behavior is rather typical for simple options, e.g. for European style options that depend on the price process of a single share only and in an economically meaningful way, PCA seems to be (almost) a panacea. A remarkable exception is the ratchet option considered by Papageorgiou [20], for which the forward method performs much better.

5.2. Double average option

We give an example of an exotic derivative for which one of the methods proposed in this paper gives a slightly better convergence compared to the standard methods.

A double average option is an option for which the payoff is a function of $A - A_{1}$ , where $A$ and $A_{1}$ are arithmetic averages of the price process over different intervals. See [29, Section 3.3] for details on this kind of Asian option. An example payoff of a double average put is the following:

X = max (\frac{2}{n} \sum_{k = 1}^{n / 2} S_{\frac{k}{n} T} - \frac{2}{n} \sum_{k = n / 2 + 1}^{n} S_{\frac{k}{n} T}, 0) .

Fig. 5 shows a comparison between several generation methods, the three classical methods as well as the one corresponding to the DST-III transform. We see that DST-III consistently outperforms the classical methods, though not by a large margin. But it is worth noting that DST-III is significantly better than the methods using $O (n)$ flops, Brownian bridge and forward construction.

5.3. Weighted Asian option

It is not hard to find theoretical examples that strongly prefer some non-classical generation method, like it is done by Sloan and Wang [24]. We sketch their idea briefly: suppose the payoff of the option, expressed as a function of the Brownian path, is of the form $f (B) = g (w \cdot B)$ . Suppose further that $w$ is of the special form

w^{⊤} = (1, 0, \dots, 0) T^{- 1} S^{- 1},

where $T$ is a fixed orthogonal transform and $S$ is scaled summation.

If the Brownian motion is generated via the same orthogonal transform $T$ , then $B = S T Z$ , where $Z$ is a vector of independent standard normals, and so

w \cdot (S T Z) = w^{⊤} S T Z = (1, 0, \dots, 0) T^{- 1} S^{- 1} S T Z = (1, 0, \dots, 0) Z = Z_{1},

such that $f (B) = g (Z_{1})$ , and the integration problem becomes one-dimensional.

Therefore one may expect a much faster convergence of the QMC rule than for the $n$ -dimensional original problem that is obtained for most other generation methods of the Brownian path.

But the same effect may be observed in the (practically more interesting) case that the payoff is only approximately of the form $f (B) = g (w \cdot B)$ .

Consider a weighted version of the Asian option, that is

X = max (\sum_{k = 1}^{n} w_{k} S_{\frac{k}{n} T} - K, 0) = max (\sum_{k = 1}^{n} w_{k} S_{0} exp (σ \sqrt{T} B_{\frac{k}{n}} + (r - \frac{σ^{2}}{2}) \frac{k}{n} T) - K, 0),

where $w^{⊤} = (1, 0, \dots, 0) T_{OTP}^{- 1} S^{- 1}$ , and where $T_{OTP}$ is an orthogonal Kronecker product as described in Section 4.3, $T = A \otimes \dots \otimes A$ (6 times) with

A = (\begin{matrix} cos (ϕ) & sin (ϕ) \\ - sin (ϕ) & cos (ϕ) \end{matrix})

and $ϕ = 2 π / 3$ .⁴ Fig. 6 shows the corresponding graphs. We observe that now OTP is the best method by a large margin, while PCA and BB perform roughly as well as the forward method.

Fig. 7 shows the weight $w$ that is used in the payoff function of the weighted Asian option in this particular example. It shows that the option corresponding to this weight is rather exotic indeed.

Remark 5.1

As can already be seen from the construction of $w$ , there will in general be many orthogonal transforms that make the integration problem (roughly) one-dimensional.

5.4. Weighted Asian option in the NIG Lévy model

We conclude our study of Asian options with considering the same example of a (weighted) Asian option in an NIG Lévy setup. Here the log-increments of the price process have NIG distribution instead of normal distribution, i.e. model (6) is replaced by

S_{t} = S_{0} exp (L_{t} + (r - ξ) t),

(8)

where $L$ is an NIG Lévy process, that is, $L_{t + Δ t} - L_{t}$ has NIG density

f_{NIG} (x; α, β, δ, μ) = \frac{α}{π} exp (δ \sqrt{α^{2} - β^{2}} + β (x - μ)) \frac{δ K_{1} (α \sqrt{δ^{2} + {(x - μ)}^{2}})}{\sqrt{δ^{2} + {(x - μ)}^{2}}},

where $K_{1} (x)$ is the modified Bessel function of the second kind with index 1, that is

K_{1} (x) = \frac{1}{2} \int_{0}^{\infty} exp (- \frac{1}{2} x (z + z^{- 1})) d z

and $α, β, δ, μ$ are the parameters of the distribution. The parameters $α$ and $β$ need to satisfy $0 \leq | β | \leq α$ and $δ > 0$ . $ξ$ is chosen so that $e^{- r T} E (S_{T}) = S_{0}$ .

The parameters in our numerical example are $α = 18.3$ , $β = - 1.06$ , $δ = 0.0184$ and $μ = 0.000434$ . Those have been found from the stock price process of General Electrics using maximum likelihood estimation of the data from January 2000 to September 2010. The popularity of Lévy processes derives from the empirical fact that they give a much better fit to market data than the Gaussian Black–Scholes model. See Fig. 8 for a comparison of the respective maximum likelihood estimated densities with observed market frequencies.

Fig. 8 — Fit of NIG and Gaussian distribution to market log-returns.

Fig. 9 shows the result for the same weighted Asian option as before, but now the NIG model has been used to generate the price process. We see that the behavior is similar to the Black–Scholes case, but it is much less expressed.

6. Conclusions and open questions

We have given a number of constructions of discrete Brownian paths that provide alternatives to the classical constructions, that is, forward, Brownian bridge and PCA construction. All the constructions presented have the desirable property that they have computational complexity $O (n log (n))$ or $O (n)$ .

We have used the orthogonal representation of path construction to derive a new method for efficient computation of the PCA construction and an even faster method for approximate computation of the PCA construction.

We provided numerical examples illustrating two main points: (1) there are cases where alternative constructions are more efficient then the classical ones, (2) however for most practical cases PCA is best with the computationally less complex Brownian bridge being close.

We need to stress that the examples presented all depended on only one Brownian path. Imai and Tan [12] provide examples where a generation method other than PCA is far more effective, however their method uses full matrix multiplication. It would be useful to try and find a fast version of their method or, more generally, find a generic algorithm which combines the orthogonal transforms presented here in an optimal (or near optimal) way for a given integration problem.

Acknowledgments

The author would like to thank Gerhard Larcher, Fritz Pillichshammer and the anonymous referees for valuable comments.

The author is partially supported by the Austrian Science Foundation (FWF), Project P21196.

Footnotes

This is not a complete surprise, since the scaled summation $S$ serves as a kind of integral and the indeterminate integral of the cosine is the sine. Nevertheless the above assertion is not trivial to prove, and it does not hold for DCT-I and DCT-III.

Nevertheless there are Bridge constructions for Lévy processes for which the conditional distribution of $L_{t / 2}$ given $L_{t}$ can be obtained explicitly, like for the variance-gamma process, cf. [3].

We confine ourselves to quadratic matrices.

⁴

This kind of transform has been dubbed “CRAFOT”–constant rotation angle fast orthogonal transform–by Misans and Terauds [17].

References

1.F. Åkesson, J.P. Lehoczky, Discrete eigenfunction expansion of the multi-dimensional Brownian motion and the Ornstein–Uhlenbeck process, Technical report, Carnegie-Mellon University, 1998.
2.Acworth P., Broadie M., Glasserman P. A comparison of some Monte Carlo and quasi-Monte Carlo techniques for option pricing. In: Niederreiter H., Hellekalek P., Larcher G., Zinterhof P., editors. Monte Carlo and Quasi-Monte Carlo Methods 1996, Proceedings of a Conference at the University of Salzburg, Austria, July 912, 1996. Springer; New York: 1998. pp. 1–18. [Google Scholar]
3.Avramidis A.N., L’Ecuyer P. Efficient Monte Carlo and quasi-Monte Carlo option pricing under the variance gamma model. Manag. Sci. 2006;52:1930–1944. [Google Scholar]
4.Bjørk T. Oxford University Press; 2004. Arbitrage Theory in Continuous Time. [Google Scholar]
5.Bracewell R.N. Aspects of the Hartley transform. Proc. IEEE. 1994;82(3) [Google Scholar]
6.Bracewell R.N. third ed. McGraw-Hill; 2000. The Fourier Transform and its Applications. [Google Scholar]
7.Dick J., Pillichshammer F. Discrepancy Theory and Quasi-Monte Carlo Integration. Cambridge University Press; Cambridge: 2010. Digital nets and sequences. [Google Scholar]
8.Fine N.J. On the Walsh functions. Trans. Amer. Math. Soc. 1949;65:372–414. [Google Scholar]
9.Fino B.J., Algazi V.R. Unified matrix treatment of the fast Walsh–Hadamard transform. IEEE Trans. Comput. 1976;C-25(11):1142–1146. [Google Scholar]
10.Glasserman P. Springer; 2004. Monte Carlo Methods in Financial Engineering. [Google Scholar]
11.Haar A. Zur theorie der orthogonalen funktionensysteme. Math. Ann. 1910;69:331–371. [Google Scholar]
12.Imai J., Tan K.S. A general dimension reduction technique for derivative pricing. J. Comput. Finance. 2007;10:129–155. [Google Scholar]
13.Kaiser G. The fast Haar transform. IEEE Potentials. 1998;17(2):34–37. [Google Scholar]
14.Keiner J., Waterhouse B.J. Fast principal components analysis method for finance problems with unequal time steps. In: L’ Ecuyer P., Owen A.B., editors. Monte Carlo and Quasi-Monte Carlo Methods 2008. Proceedings of the 8th International Conference Monte Carlo and Quasi-Monte Carlo Methods in Scientific Computing, Montréal, Canada, July 6–11, 2008. Springer; Berlin: 2009. pp. 455–465. [Google Scholar]
15.Larcher G., Leobacher G., Scheicher K. On the tractability of the Brownian bridge algorithm. J. Complexity. 2003;19:511–528. [Google Scholar]
16.Leobacher G. Stratified sampling and quasi-Monte Carlo simulation of Lévy processes. Monte-Carlo Methods Appl. 2006;12(3–4):231–238. [Google Scholar]
17.Misans P., Terauds M. Errors of constant rotation angle fast orthogonal transforms used for fixed-point arithmetic dsp applications: Preliminary results. Elektron. Elektrotech. 2005;60(4) [Google Scholar]
18.Moskowitz B., Caflisch R.E. Smoothness and dimension reduction in quasi-Monte Carlo methods. Math. Comput. Model. 1996;23(8–9):37–54. [Google Scholar]
19.Niederreiter H. Low-discrepancy simulation. In: Duan J., Härdle W., Gentle J.E., editors. Handbook of Computational Finance, Springer Handbooks of Computational Statistics. Springer; Berlin: 2012. pp. 703–730. [Google Scholar]
20.Papageorgiou A. The Brownian bridge does not offer a consistent advantage in quasi-Monte Carlo integration. J. Complexity. 2002;18(1):171–186. [Google Scholar]
21.Pei S.C., Jaw S.B. Discrete Hilbert transform by FHT. IEEE T. Circuits Syst. 1989;36:1251–1252. [Google Scholar]
22.Protter P.E. Stochastic Integration and Differential Equations. second ed. vol. 21. Springer-Verlag; Berlin: 2004. (Applications of Mathematics (New York)). Stochastic Modelling and Applied Probability. [Google Scholar]
23.Scheicher K. Complexity and effective dimension of discrete Lévy areas. J. Complexity. 2007;23(2):152–168. [Google Scholar]
24.Sloan I.H., Wang X. Quasi-Monte Carlo methods in financial engineering: An equivalence principle and dimension reduction. Oper. Res. 2011;59(1):80–95. [Google Scholar]
25.Walsh J.L. A closed set of normal orthogonal functions. Amer. J. Math. 1923;45:5–24. [Google Scholar]
26.Wang Z. Fast algorithms for the discrete $W$ transform and for the discrete Fourier transform. IEEE Trans. Acoust. Speech Signal Process. 1984;32(4) [Google Scholar]
27.Wickerhauser M.V. A. K. Peters, Ltd.; Wellesley, MA: 1994. Adapted Wavelet Analysis from Theory to Software. [Google Scholar]
28.Wolfram S. Wolfram Media; 2002. A New Kind of Science. [Google Scholar]
29.Zhu Y., Wu X., Chern I. Springer Finance; New York: 2004. Derivative Securities and Difference Methods. [Google Scholar]

[br000005] 1.F. Åkesson, J.P. Lehoczky, Discrete eigenfunction expansion of the multi-dimensional Brownian motion and the Ornstein–Uhlenbeck process, Technical report, Carnegie-Mellon University, 1998.

[br000010] 2.Acworth P., Broadie M., Glasserman P. A comparison of some Monte Carlo and quasi-Monte Carlo techniques for option pricing. In: Niederreiter H., Hellekalek P., Larcher G., Zinterhof P., editors. Monte Carlo and Quasi-Monte Carlo Methods 1996, Proceedings of a Conference at the University of Salzburg, Austria, July 912, 1996. Springer; New York: 1998. pp. 1–18. [Google Scholar]

[br000015] 3.Avramidis A.N., L’Ecuyer P. Efficient Monte Carlo and quasi-Monte Carlo option pricing under the variance gamma model. Manag. Sci. 2006;52:1930–1944. [Google Scholar]

[br000020] 4.Bjørk T. Oxford University Press; 2004. Arbitrage Theory in Continuous Time. [Google Scholar]

[br000025] 5.Bracewell R.N. Aspects of the Hartley transform. Proc. IEEE. 1994;82(3) [Google Scholar]

[br000030] 6.Bracewell R.N. third ed. McGraw-Hill; 2000. The Fourier Transform and its Applications. [Google Scholar]

[br000035] 7.Dick J., Pillichshammer F. Discrepancy Theory and Quasi-Monte Carlo Integration. Cambridge University Press; Cambridge: 2010. Digital nets and sequences. [Google Scholar]

[br000040] 8.Fine N.J. On the Walsh functions. Trans. Amer. Math. Soc. 1949;65:372–414. [Google Scholar]

[br000045] 9.Fino B.J., Algazi V.R. Unified matrix treatment of the fast Walsh–Hadamard transform. IEEE Trans. Comput. 1976;C-25(11):1142–1146. [Google Scholar]

[br000050] 10.Glasserman P. Springer; 2004. Monte Carlo Methods in Financial Engineering. [Google Scholar]

[br000055] 11.Haar A. Zur theorie der orthogonalen funktionensysteme. Math. Ann. 1910;69:331–371. [Google Scholar]

[br000060] 12.Imai J., Tan K.S. A general dimension reduction technique for derivative pricing. J. Comput. Finance. 2007;10:129–155. [Google Scholar]

[br000065] 13.Kaiser G. The fast Haar transform. IEEE Potentials. 1998;17(2):34–37. [Google Scholar]

[br000070] 14.Keiner J., Waterhouse B.J. Fast principal components analysis method for finance problems with unequal time steps. In: L’ Ecuyer P., Owen A.B., editors. Monte Carlo and Quasi-Monte Carlo Methods 2008. Proceedings of the 8th International Conference Monte Carlo and Quasi-Monte Carlo Methods in Scientific Computing, Montréal, Canada, July 6–11, 2008. Springer; Berlin: 2009. pp. 455–465. [Google Scholar]

[br000075] 15.Larcher G., Leobacher G., Scheicher K. On the tractability of the Brownian bridge algorithm. J. Complexity. 2003;19:511–528. [Google Scholar]

[br000080] 16.Leobacher G. Stratified sampling and quasi-Monte Carlo simulation of Lévy processes. Monte-Carlo Methods Appl. 2006;12(3–4):231–238. [Google Scholar]

[br000085] 17.Misans P., Terauds M. Errors of constant rotation angle fast orthogonal transforms used for fixed-point arithmetic dsp applications: Preliminary results. Elektron. Elektrotech. 2005;60(4) [Google Scholar]

[br000090] 18.Moskowitz B., Caflisch R.E. Smoothness and dimension reduction in quasi-Monte Carlo methods. Math. Comput. Model. 1996;23(8–9):37–54. [Google Scholar]

[br000095] 19.Niederreiter H. Low-discrepancy simulation. In: Duan J., Härdle W., Gentle J.E., editors. Handbook of Computational Finance, Springer Handbooks of Computational Statistics. Springer; Berlin: 2012. pp. 703–730. [Google Scholar]

[br000100] 20.Papageorgiou A. The Brownian bridge does not offer a consistent advantage in quasi-Monte Carlo integration. J. Complexity. 2002;18(1):171–186. [Google Scholar]

[br000105] 21.Pei S.C., Jaw S.B. Discrete Hilbert transform by FHT. IEEE T. Circuits Syst. 1989;36:1251–1252. [Google Scholar]

[br000110] 22.Protter P.E. Stochastic Integration and Differential Equations. second ed. vol. 21. Springer-Verlag; Berlin: 2004. (Applications of Mathematics (New York)). Stochastic Modelling and Applied Probability. [Google Scholar]

[br000115] 23.Scheicher K. Complexity and effective dimension of discrete Lévy areas. J. Complexity. 2007;23(2):152–168. [Google Scholar]

[br000120] 24.Sloan I.H., Wang X. Quasi-Monte Carlo methods in financial engineering: An equivalence principle and dimension reduction. Oper. Res. 2011;59(1):80–95. [Google Scholar]

[br000125] 25.Walsh J.L. A closed set of normal orthogonal functions. Amer. J. Math. 1923;45:5–24. [Google Scholar]

[br000130] 26.Wang Z. Fast algorithms for the discrete $W$ transform and for the discrete Fourier transform. IEEE Trans. Acoust. Speech Signal Process. 1984;32(4) [Google Scholar]

[br000135] 27.Wickerhauser M.V. A. K. Peters, Ltd.; Wellesley, MA: 1994. Adapted Wavelet Analysis from Theory to Software. [Google Scholar]

[br000140] 28.Wolfram S. Wolfram Media; 2002. A New Kind of Science. [Google Scholar]

[br000145] 29.Zhu Y., Wu X., Chern I. Springer Finance; New York: 2004. Derivative Securities and Difference Methods. [Google Scholar]

PERMALINK

Fast orthogonal transforms and generation of Brownian paths

Gunther Leobacher

Abstract

Highlights

1. Orthogonal transforms and Brownian paths

Theorem 1.1 Papageorgiou —

Proof

2. PCA and Brownian bridge

2.1. PCA construction

Theorem 2.1

Proof

2.2. General Brownian bridge

2.3. Interpolation using Brownian bridge

2.4. m-step forward method

2.5. m-step Brownian bridge

3. Generation by FFT-type transforms

3.1. Modified Fourier transform

3.2. Hartley transform and Hilbert transform

Theorem 3.1

3.3. Sine and cosine transforms

Fig. 1.

Fig. 2.

Fig. 3.

Proposition 3.2

Proof

Lemma 3.3

Proof

Theorem 3.4

Proof

Theorem 3.5

Proof

3.4. Intermezzo: simulation of Lévy processes

3.5. Walsh–Hadamard transform

Theorem 3.6

4. Further approaches

4.1. Wavelet analysis

4.2. Block-diagonal orthogonal matrices

4.3. Kronecker product of orthogonal matrices

5. Numerical examples

5.1. Asian option

Fig. 4.

5.2. Double average option

Fig. 5.

5.3. Weighted Asian option

Fig. 6.

Fig. 7.

Remark 5.1

5.4. Weighted Asian option in the NIG Lévy model

Fig. 8.

Fig. 9.

6. Conclusions and open questions

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

2.4. $m$ -step forward method

2.5. $m$ -step Brownian bridge