Abstract
We present a number of fast constructions of discrete Brownian paths that can be used as alternatives to principal component analysis and Brownian bridge for stratified Monte Carlo and quasi-Monte Carlo. By fast we mean that a path of length can be generated in floating point operations. We highlight some of the connections between the different constructions and we provide some numerical examples.
Keywords: Fast path generation, Fast orthogonal transform, Variance reduction, Quasi-Monte Carlo
Highlights
► Linear constructions of Brownian paths correspond to orthogonal transforms. ► Some orthogonal transforms enhance quasi-Monte Carlo. ► Principal component construction can be approximated by fast cosine transform. ► Orthogonal transforms can also enhance simulation of Levy paths.
1. Orthogonal transforms and Brownian paths
There are several constructions that are frequently used to construct discrete Brownian paths, by which we mean a random function on a given set , , so that is a Gaussian vector with mean zero and covariance matrix
The case where the are evenly spaced is the most important one from the practical point of view. In that case the covariance matrix equals
Throughout the paper this matrix will be denoted by or, if there is no danger of confusion, simply by .
The arguably most straightforward method (a.k.a. forward method or step-by-step method) is to compute the cumulative sum of independent normal variables of mean zero and variance . All constructions we present in this article are equivalent to this simple method from the probabilistic point of view.
However, there are refined simulation methods, for example stratified sampling (cf. [10]) and quasi-Monte Carlo methods (see [19]), which achieve higher convergence rates for some problems. Those techniques have in common that they require the identification of more important and less important input variables. For many problems the straightforward method does not provide this.
For this reason alternatives to the forward construction are frequently used, the Brownian bridge (BB) construction (a.k.a. Lévy–Ciesielski construction or midpoint displacement) and the principal component analysis (PCA) construction (a.k.a. singular value construction). The first use of the BB construction in finance is due to [18], the first use of PCA for financial applications was presented in [2], both with dramatic improvement of convergence rates.
It has been mentioned by Papageorgiou [20] that in fact any decomposition provides a construction for a discrete approximation of a Brownian path via , where is a standard normal vector. In that context, the forward construction corresponds to the Cholesky decomposition of , , where is the summation operator
| (1) |
PCA corresponds to , where is the singular value decomposition of . A corresponding decomposition for the BB algorithm is given, for example, in [15].
However, Papageorgiou [20] notes that there are examples where BB and PCA are not giving better results than the forward method in connection with quasi-Monte Carlo. He further shows that the worst case error of integration for a certain class of payoff functions is independent of the path construction. Thus in the sense of worst case error all decompositions are equivalent.
This has been investigated further by Sloan and Wang [24]. The authors show another equivalence principle which roughly states that every decomposition is equally bad and good for QMC, depending on the function that one wants to integrate. For every decomposition that is good for one payoff function , and every decomposition there is another payoff function for which is equally good.
It is therefore prudent to tailor the decomposition to the problem at hand. This is done, for example, by Imai and Tan [12].
While the possible decompositions of provide a clean framework for the study of algorithms for generation of Brownian paths, they are of limited practical value because the matrix-vector multiplication is comparatively slow for all but very small values of . This is the case since general matrix-vector multiplication uses floating point operation (flops), while the forward method and the Brownian bridge use only flops.
Until recently this has been considered a serious disadvantage of PCA as well, cf. [10]. Yet it has been shown by Scheicher [23], using results from Åkesson and Lehoczky [1], that PCA can be computed using the fast sine transform, thereby using flops.
While the importance of the proper choice of the decomposition for problems arising from quasi-Monte Carlo pricing of financial derivatives is stressed by a number of authors, see e.g. [24,12], there has been a lack of alternatives to the three aforementioned constructions that allow for fast matrix-vector multiplication, by which we mean generation of a path using at most flops.
The present paper aims to narrow this gap by providing a number of fast constructions of Brownian paths that can be used as alternatives to the three constructions presented above. We know from Sloan and Wang [24] that every one of those constructions will have some payoff functions for which they are especially well suited (and some other payoff functions for which they are ill suited).
The practitioner who is willing to use alternative constructions of Brownian paths therefore is presented with the following alternatives: she might want to tailor a (slow) construction to her special problem in the spirit of Imai and Tan [12], or alternatively, she might want to find out which of the fast constructions presented here is relatively well suited to her problem. Which choice is the better will depend on the special problem. But if one of the fast constructions is reasonably close to the optimal construction, its use will reduce the computing time considerably, with essentially the same error.
Papageorgiou [20] observed that there is a one-to-one correspondence between constructions of Brownian paths and orthogonal transforms. We present his simple theorem here since it is essential for all of our constructions.
Theorem 1.1 Papageorgiou —
If is the Cholesky decomposition of , then any orthogonal transformation on defines a decomposition .
Conversely, for every -matrix with there exists an orthogonal transform such that .
Proof
For any orthogonal transform we have , such that .
On the other hand, is invertible, so that for we have and
such that is orthogonal. □
Note that by “orthogonal” we mean that preserves lengths and angles. Strictly speaking such transforms are “orthonormal”, but the term “orthogonal” is more common.
Matrix-vector multiplication with and can be done in linear time. Searching for decompositions of which admit fast matrix-vector multiplication is therefore equivalent to searching for orthogonal matrices that allow for matrix-vector multiplication using at most flops.
In this paper we consider a variety of mostly well-known orthogonal transforms that can be computed using at most flops.
Any of those constructions can be modified by combining them with permutations, which do not require any flops at all. Also the multiplication of two or more orthogonal matrices is orthogonal, and the product matrix admits fast matrix-vector multiplications if all of the factors admit this, provided the number of factors is small compared to the number of dimensions.
The remainder of the paper is structured as follows: Section 2 reviews the principal component analysis (PCA) construction and the general Brownian bridge (BB) construction, whereby we give some generalizations like interpolation and -step construction, that allow the combination of BB with PCA and other fast constructions.
In the main Section 3 we present a number of fast orthogonal transforms derived from fast Fourier transform (FFT) type constructions. A special emphasis is on the fast cosine transform which, as it turns out, corresponds to a construction which is close to PCA. The proof of proximity is one of the main results of our paper.
Section 4 contains some fast constructions that do not fit the previous schemes, and connections to previous methods are highlighted.
Some numerical examples are computed in Section 5 to illustrate the advantages of some of the proposed methods. Section 6 concludes.
Since a large portion of the paper consists of listing fast construction methods, it is probably of interest to the reader to know which of those constructions have previously been known and which are new. This shall be done here, though we need to tread carefully: due to the vast applicability of Brownian motion and simulation we concede that it is too easy to overlook even significant contributions if they are related to a field foreign to the author and maybe even use different terminology.
While the first construction of PCA in Section 2.1 is due to Scheicher [23], the second construction using the corresponding orthogonal transform is new. The idea of using the Brownian bridge for interpolation to generalize an equal-step generation method to non-equal steps seems to have been previously unknown, cf. Section 2.3, and the same holds for the general -step forward and BB methods.
To the authors best knowledge, none of the orthogonal transforms presented in Section 3 have previously been used for the (fast) construction of Brownian paths, with the sole exception of Scheicher’s use of the DST-I to calculate the PCA. But it is worth noting that if one uses DST-I as the underlying orthogonal transform, cf. Theorem 1.1, then one obtains a construction different from Scheicher’s. Nevertheless all of those transforms are of course well known, as are their respective properties. The fact that the construction obtained from DCT-IV is close to PCA is entirely new and consequently so is the idea to use this construction as a faster substitute for PCA when constructing Lévy paths.
The facts from Section 4 seem to be folklore to a large extent. However, the construction obtained from the Haar transform is new and it seems that it has not been observed previously that the Kronecker product can be used to combine different (fast) construction methods for Brownian paths.
2. PCA and Brownian bridge
2.1. PCA construction
Åkesson and Lehoczky [1] showed that for the -th eigenvalue and eigenvector of the matrix are given by
and where
respectively. Therefore , where is the matrix
that is, ,
and is the diagonal matrix that has as its diagonal elements.
It has been observed by Scheicher [23] that is essentially the -th basis function of the discrete sine transform (DST-I) in dimension : Recall the definition of DST-I (provided Section 3.3) on the set ,
Therefore , where is the projection of a -vector onto its first elements, is DST-I in dimension and is the mapping
Multiplication with and can be done in linear time while multiplication with takes flops (see [26]), therefore the generation of a Brownian path using PCA can be done using flops.
Another fast way to construct the PCA is to calculate the corresponding orthogonal transform and compute the PCA via .
Theorem 2.1
where is the discrete cosine transform in dimension ,
and is the projection of a -vector onto its first elements.
Proof
For
and for
Therefore
But using the addition theorem for the sine we get
such that
Comparing this with the definition in [26] of the DCT-III transform finishes the proof. □
2.2. General Brownian bridge
Suppose is a vector of independent standard normal variables.
Consider real numbers . We want to construct a discrete Brownian path , i.e. a Gaussian vector with covariance matrix .
The so-called forward construction solves this problem in the following way: and for . It is easy to check that the vector constructed in this way has indeed the required covariance matrix. The construction requires flops (note that in a (quasi-)Monte Carlo simulation with many scenarios the values , need to be computed only once).
An alternative construction is the well-known Brownian bridge, which we will repeat for the convenience of the reader.
Suppose the elements of should be computed in the order for some permutation of elements. Consequently, in computing we need to take into account the previously computed elements. Fortunately at most two of those are of relevance, the one next to on the left and the one next to on the right.
Formally define for every two sets,
That is, contains all the indices that are smaller than and for which has already been constructed and contains all the indices that are greater than and for which has already been constructed. Now define
and set ,
It is straightforward to check that the vector constructed in that way has again covariance matrix . The functions and , as well as the factors of , , , do not depend on the random vector so their computation needs to be done only once. In some special cases the functions and can be computed explicitly, for example if the are the first elements of the van der Corput sequence or of the -sequence with , see [15]. Since in each step only two of the already constructed values are used it follows that the Brownian bridge construction uses floating point operations.
Moreover we see that the forward construction is a special case of the Brownian bridge construction where for all .
The classical Brownian bridge construction as presented in Caflisch and Morokoff [18] corresponds to the setup ,
where the are constructed in the order , , , , , , , , . This is therefore another example where the functions and can easily be computed.
2.3. Interpolation using Brownian bridge
The preceding section inspires another class of constructions: we may start with constructing a possibly rough approximation of the discrete path and fill in the gaps using the Brownian bridge.
More concretely, suppose we have nodes and we want to generate a sample of the corresponding discrete Brownian path, . We proceed as follows: choose some convenient natural number of roughly the same or a smaller magnitude as and use some fast construction to generate . Then partition the set into subsets
where and . The values for are already known. For every we now compute using the Brownian bridge construction.
The fast construction we used above to generate may be any of the constructions presented in Section 3 or PCA. Yet it may as well be something entirely different: suppose we have some matrix with and with being of the order of magnitude of . Then can be constructed in steps (multiplication of an -vector with ) and interpolation using the Brownian bridge can be done in at most steps.
Another possible solution to the problem of generating a Brownian path with unequally spaced time nodes is presented by Keiner and Waterhouse [14]. They describe an approximate PCA that constructs paths with unequal time-steps.
2.4. -step forward method
Suppose we want to generate a discrete Brownian path on where for some . There is a straightforward generalization of the forward method whereby forward steps are generated in one “forward leap” in the following way: fix any decomposition .
where are independent standard Gaussian vectors of dimension . Clearly the number of flops needed is times the number needed by one multiplication by . In any case it is less then which is for constant . If allows for fast multiplication, the number of operations is .
2.5. -step Brownian bridge
Similarly to the -step forward method, we can generalize the Brownian bridge construction to an -step method: suppose we want to generate a discrete Brownian path on where for some .
Let be a discrete Brownian path and consider the Gaussian vector
| (2) |
It is easy to see that for any standard normal variable independent of the vector
is a discrete Brownian path. Moreover the covariance matrix of is given by
We call any Gaussian vector with covariance matrix a discrete Brownian bridge on . We write instead of if there is no danger of confusion. is a symmetric positive definite matrix which can therefore be decomposed as .
This provides us with a generalization of the classical Brownian bridge algorithm where in every step (except the first one) points are generated at once.
Let be a standard normal variable and let
be a collection of independent standard normal vectors of dimension .
Define , subsequently
further for
for . It can be shown that this construction generates a Brownian path as required.
Let us now turn to the number of multiplications needed. Multiplication of a vector by can be done using at most scalar multiplications, we need such matrix-vector multiplications, giving a total of scalar multiplications. If matrix-vector multiplication can be done in steps, then we get . Both results are for constant .
The last sentence begs the question under which circumstances matrix-vector multiplication can be done in steps. We do not know of a direct decomposition of that makes that possible, but every fast decomposition of gives rise to a fast generation of a discrete Brownian bridge via Eq. (2), using operations.
A possible drawback is that we use random numbers to generate an -dimensional Gaussian vector, so that one random number is “lost” or, in other words, the dimension of the integration problem becomes higher with each step.
But this can be remedied to a certain degree: note that if is constructed from according to Eq. (2), then is uncorrelated and therefore independent of :
Thus may be “recycled” as a component of one of the in a later step and so only in the very last step there is one random variable wasted. So if the -step Brownian bridge construction is used to generate a discrete Brownian path on with , then using the above method we need random numbers, i.e. the integration problem becomes -dimensional.
Finally we want to remark that the generation of above can be used for -step interpolation, analog to the generalized Brownian bridge in Section 2.3.
3. Generation by FFT-type transforms
Probably the most famous example of a unitary transform that allows for fast matrix-vector multiplication is the discrete Fourier transform DFT (with the corresponding fast multiplication algorithm FFT),
for a vector . (Usually the indices range from 0 to , but for consistency with other constructions in this paper we chose the above definition.)
There are a many variants of the discrete Fourier transform that map real functions to real functions and are therefore orthogonal. We present some of those variants.
3.1. Modified Fourier transform
Most Fourier variants that map real functions to real functions have some additional special properties which, for example, make them useful for the fast generation of convolutions. From our point of view we are content with any such variant, indeed we want to have many different variants to choose from for a particular application.
We present one modification of the Fourier transform that maps real functions to real functions. It is well known (and easy to check) that the discrete Fourier transform of a vector is real-valued iff and for . Consider the linear map ,
(Note that one may use this definition for odd as well.) Then for any standard normal vector , is a (real-valued) standard normal vector.
A drawback of this method is that the FFT algorithm uses complex multiplication, which uses 4 real multiplications. However many fast orthogonal transforms that use only real multiplications have been developed, mainly–as is to be expected–by researchers in the area of signal processing.
3.2. Hartley transform and Hilbert transform
The Hartley transform on is a variant of the Fourier transform where the orthonormal basis is given by the functions , . See [5,6].
It is easy to see that the Hartley transform on is given by
| (3) |
where denotes the Fourier transform on ,
In analogy to the Fourier transform there is a discrete version of the Hartley transform together with a fast algorithm, see [27, Section 3.2] (or one might just use the FFT and Eq. (3)): for a vector define
We identify the linear map with the corresponding matrix.
Theorem 3.1
The matrix has the following properties:
- 1.
is orthogonal;
- 2.
is self-inverse, ;
- 3.
for any real vector the product can be computed in steps.
A proof can be found in [6, Chapter 12].
Another important property of the Fast Discrete Hartley transform is that it can be computed using only real multiplications and additions. Consult Bracewell [6] for more details on the Hartley transform and its discrete version.
Another example of an orthogonal transform is the so-called Hilbert transform. We will not go into the details of the Hilbert transform. The definition and basic properties can be found in [6]. There, a result from Pei and Jaw [21] is also stated which says that the discrete Hilbert transform is of the form
where is the discrete Hartley transform and is the permutation
The discrete Hilbert transform is therefore an orthogonal matrix admitting fast multiplication.
3.3. Sine and cosine transforms
Two of the most important orthogonal transforms that map real input to real output are the sine and cosine transforms. There are four widely used variants of the sine and cosine transform and there is also the so-called -transform which is similarly defined.
For the convenience of the reader we will recall the definitions of the discrete sine and cosine transforms. Wang [26] or Wickerhauser [27, Section 3.3] provide details on fast implementations.
The definitions are not entirely uniform throughout the literature. We have chosen definitions which are in a form that already describes an orthogonal transform. Usually, as for example in [26], the indices range from 0 to , but for consistence with the rest of our paper we will let them range from 1 to .
Define
All matrices below are in for some , so the indices range from 1 to .
Wang introduced another set of orthogonal transforms which was called discrete transform, or DWT. The abbreviation “DWT” is nowadays more frequently used for discrete wavelet transform.
One aspect of the cosine transform is especially worth noting: its corresponding Brownian paths agree to a large extent with those of the principal component analysis1.
Let , that is
We want to compute , where is the scaled summation
where for one writes and computes the resulting geometric sums.
Recall the PCA construction
and
We see that for fixed we have . On its own that does not say a lot about the proximity of paths generated with the two different methods, but Figs. 1 and 2 illustrate that those paths are rather close indeed, so we want to investigate that topic further.
Fig. 1.
Two Brownian paths with , one generated via PCA and one via DCT-IV.
Fig. 2.
Two Brownian paths with , one generated via PCA and one via DCT-IV.
Consider the expected squared Euclidean norm of the difference of two paths generated by two -matrices from the same set of independent standard normals :
The last expression is the square of the Euclidean norm of in which is known as the Hilbert–Schmidt norm of .
It turns out that is close to in the Hilbert–Schmidt norm: for small the value of can be computed numerically. Fig. 3 shows for ranging from 0 to 50. The graph suggests that converges for and that the limit is close to .
Fig. 3.
The squared distance between PCA and DCT-method for .
In fact we have for all and
This will be proved below. As a corollary we note that the average of the variances tends to 0 as tends to infinity.
Define
and
| (4) |
Proposition 3.2
is increasing in the second variable on .
Proof
Compute the partial derivative of w.r.t. as
We need to show that , which is equivalent to
(Note that ). This in turn is shown by proving that and . We leave those last calculations to the reader. □
Lemma 3.3
For all and all we have
Proof
The sum can be computed by writing for so that the sum becomes a sum of 4 geometric sums. The result can be simplified to
We make use of the facts that , and for integers , such that
that is,
Next we use that for one has and therefore
such that, after some simplifications we get the estimate
Therefore
□
Theorem 3.4
The sequence is bounded by .
Proof
We know from Proposition 3.2 that is increasing in and from Lemma 3.3 that for all , so we conclude that and therefore
(5) It is readily checked that
so that is indeed bounded. □
Some extra work is needed to find a uniform bound on :
Theorem 3.5
.
Proof
Observe that , so that from (4) to (5)
Now use the estimates and to verify that for all .
For direct computation shows that . □
3.4. Intermezzo: simulation of Lévy processes
A Lévy process is a stochastic process with the following properties:
-
1.
a.s.;
-
2.
for fixed the random variables have the same distribution for all ;
-
3.
for the random variables are independent.
-
4.
is continuous in probability.
This is a generalization of Brownian motion, which is obtained if has normal distribution with mean 0 and variance .
The distribution of for arbitrary is already determined by the distribution of : if is the generating function of , then by definition of a Lévy process and elementary properties of characteristic functions, has characteristic function for any rational number and by continuity, has characteristic function for any real number . See for example [22].
If we want to know the values of a sample path for we can use a direct generalization of the forward method, provided that we know the inverse of the cumulative probability distribution function (CDF) of : let be the CDF of and its inverse. Given independent -uniform random variables define
In contrast, the general construction underlying all the generation methods mentioned so far in this paper do not generalize to Lévy processes since those constructions depend heavily on the normality of the generated vector.2
It has first been observed in [16] that constructions of Brownian paths like BB and PCA can be generalized to constructions of paths of Lévy processes in the way described below.
Let denote the standard normal distribution function, i.e.
Given any Brownian path we have that the random variables , are independent and uniformly distributed on . These transformed uniform variables can now be used in the forward construction of a Lévy path.
Thus every orthogonal transform can be used for a construction of Lévy paths using the following algorithm:
-
1.
Generate independent, standard normal random variables ;
-
2.
Compute ;
-
3.
Compute independent, uniformly distributed random variables via , ;
-
4.
Compute a discrete path of via the forward construction, i.e. .
If the in that algorithm corresponds to Brownian path generation using PCA, then the above method can be viewed as a kind of PCA for the Lévy process. In that case and , where .
If, on the other hand, we use for the cosine transform of Section 3.3, this will give a result that is close to the one obtained from PCA while using less than half the number of flops, since involves the sine transform on .
This method is actually entirely general: every simulation algorithm can use standard normals instead of uniform random variables, and it is always possible to apply an orthogonal transform to those standard normals. From the probabilistic point of view the resulting integration problem will be equivalent to the original one, but it may still hold that convergence when using QMC or stratification becomes faster.
3.5. Walsh–Hadamard transform
Walsh functions and the corresponding transform were first introduced by Walsh in [25]. Subsequently, Fine [8] augmented the theory and made the connection to classical Fourier theory by observing that the Walsh functions are just the characters of the group , where is digit-wise addition modulo 2. Like the Fourier transform, the Walsh–Hadamard transform has a discrete version which will be outlined below.
Consider the group structure on that is induced by bit-wise operations in base 2. For example , .
Then the characters of the group are of the form
where denotes bit-wise inner product, i.e. for and . The are also called (discrete) Walsh functions. See [7] for details.
From the character property it follows immediately that form an orthonormal basis of and therefore the coordinate transformation from the canonical basis to Walsh functions is orthogonal. Let denote the corresponding matrix. Multiplication of a vector with is called the discrete Hadamard–Walsh transform.
Theorem 3.6
The matrix has the following properties:
- 1.
is orthogonal;
- 2.
is self-inverse, ;
- 3.
the entries of are all in ;
- 4.
for any real vector the product can be computed using flops.
We omit the easy proof as it is mostly a special case of the Kronecker product of orthogonal matrices which is treated in Section 4.3. A detailed description of the fast algorithm can be found in [9].
There is a nice one-line Mathematica implementation of the fast Walsh transform, taken from [28, Notes for Chapter 10], which we would like to share with the reader:

A disadvantage of the method is that it relies on being a power of 2. A big advantage is that most of the occurring floating point multiplications are by 1 or , with only multiplications by .
4. Further approaches
4.1. Wavelet analysis
Another interesting orthogonal transform is the Haar transform as introduced in [11]. We repeat the basic definitions, loosely following [13]. For a vector of length define
It is not hard to convince oneself that the mapping constitutes an orthogonal transform.
Moreover, the construction can be repeated on and in general we define
for , and it is again easy to see that the mapping
is an orthogonal transform. The inverse can be computed similarly:
for , .
Let denote the Haar transform on where is a power of 2. It is well known (and easy to check) that , where is again the normalized summation matrix given in Eq. (1), is exactly the classical Brownian bridge construction. General wavelet analysis may therefore be viewed as a generalization of the Brownian bridge construction.
While the inverse Haar transform does not provide us with a new fast construction method for discrete Brownian paths, the Haar transform does.
The Haar transform as well as its inverse can be computed using flops. See [13] for a proof. If some care is taken the transforms can be done using additions/subtractions and multiplications with powers of .
4.2. Block-diagonal orthogonal matrices
The following method can be viewed as a generalization of the -step forward method of Section 2.4: Consider a block-diagonal matrix
where are orthogonal matrices and has dimension , such that has dimension with .
We have that admits fast matrix-vector multiplication if every , , does, or if for some constant .
4.3. Kronecker product of orthogonal matrices
Recall the Kronecker product3 of two matrices and :
Note that
and
and that the Kronecker product of two identity matrices is an identity matrix. From this it follows that the Kronecker product of orthogonal matrices is an orthogonal matrix.
The significance of the Kronecker product for our considerations is that matrix-vector multiplication with the -matrix can be done using less than operations. To that end we use the following algorithm:
-
1.
partition the -dimensional vector into vectors of dimension , ;
-
2.
compute for ;
-
3.
concatenate the ’s to an -dimensional vector ;
-
4.
partition into vectors of dimension , ;
-
5.
compute for ;
-
6.
concatenate the ’s to an -dimensional vector .
Then it is easy to check that . The algorithm needs flops, compared to flops for classical multiplication of with . If multiplication with and is fast, then the complexity further reduces to .
Suppose and that
for -matrices . Then it follows from induction that the multiplication , where , can be done in operations. An example is the aforementioned Walsh–Hadamard transform which may be written as ( times), where
This is also an easy proof by induction.
5. Numerical examples
We consider some numerical examples that illustrate the methods described in the paper.
We consider a European-style option, i.e. one that can be exercised only at the expiry date . We assume that the price process is of the form
| (6) |
where is a standard Brownian motion, that is, we describe the price process of the share in a Black–Scholes market with interest rate under the risk-neutral measure.
The arbitrage free price of a derivative depending only on the path of up to time is then given by
| (7) |
provided the expectation is finite. See for example [4] for the general theory of derivative pricing in the Black–Scholes setting.
We limit ourselves to the case where and where may only depend on or, equivalently, on .
In that case Eq. (7) takes on the form
or, in shortened notation, , for some function .
5.1. Asian option
A classical example is an average value option, also called Asian option, where
In Fig. 4, for example, an Asian option is considered with , , , , . The option price was evaluated using a quasi-Monte Carlo rule based on a randomly shifted Sobol sequence and the standard normal variates were generated by inversion of the cumulative probability distribution function. Each line shows the of the standard deviation divided by the mean over 64 runs of the rule using a different generation method. Along the -axis we plot the of the number of points used by the QMC-rule.
Fig. 4.
Asian option.
We see that PCA outperforms the forward construction and DST-III. The inverse Haar transform, which is equivalent to the Brownian bridge, can be seen to be quite close to PCA.
This behavior is rather typical for simple options, e.g. for European style options that depend on the price process of a single share only and in an economically meaningful way, PCA seems to be (almost) a panacea. A remarkable exception is the ratchet option considered by Papageorgiou [20], for which the forward method performs much better.
5.2. Double average option
We give an example of an exotic derivative for which one of the methods proposed in this paper gives a slightly better convergence compared to the standard methods.
A double average option is an option for which the payoff is a function of , where and are arithmetic averages of the price process over different intervals. See [29, Section 3.3] for details on this kind of Asian option. An example payoff of a double average put is the following:
Fig. 5 shows a comparison between several generation methods, the three classical methods as well as the one corresponding to the DST-III transform. We see that DST-III consistently outperforms the classical methods, though not by a large margin. But it is worth noting that DST-III is significantly better than the methods using flops, Brownian bridge and forward construction.
Fig. 5.
Double average put option.
5.3. Weighted Asian option
It is not hard to find theoretical examples that strongly prefer some non-classical generation method, like it is done by Sloan and Wang [24]. We sketch their idea briefly: suppose the payoff of the option, expressed as a function of the Brownian path, is of the form . Suppose further that is of the special form
where is a fixed orthogonal transform and is scaled summation.
If the Brownian motion is generated via the same orthogonal transform , then , where is a vector of independent standard normals, and so
such that , and the integration problem becomes one-dimensional.
Therefore one may expect a much faster convergence of the QMC rule than for the -dimensional original problem that is obtained for most other generation methods of the Brownian path.
But the same effect may be observed in the (practically more interesting) case that the payoff is only approximately of the form .
Consider a weighted version of the Asian option, that is
where , and where is an orthogonal Kronecker product as described in Section 4.3, (6 times) with
and .4Fig. 6 shows the corresponding graphs. We observe that now OTP is the best method by a large margin, while PCA and BB perform roughly as well as the forward method.
Fig. 6.
Weighted Asian option.
Fig. 7 shows the weight that is used in the payoff function of the weighted Asian option in this particular example. It shows that the option corresponding to this weight is rather exotic indeed.
Fig. 7.
Weight of the Asian option.
Remark 5.1
As can already be seen from the construction of , there will in general be many orthogonal transforms that make the integration problem (roughly) one-dimensional.
5.4. Weighted Asian option in the NIG Lévy model
We conclude our study of Asian options with considering the same example of a (weighted) Asian option in an NIG Lévy setup. Here the log-increments of the price process have NIG distribution instead of normal distribution, i.e. model (6) is replaced by
| (8) |
where is an NIG Lévy process, that is, has NIG density
where is the modified Bessel function of the second kind with index 1, that is
and are the parameters of the distribution. The parameters and need to satisfy and . is chosen so that .
The parameters in our numerical example are , , and . Those have been found from the stock price process of General Electrics using maximum likelihood estimation of the data from January 2000 to September 2010. The popularity of Lévy processes derives from the empirical fact that they give a much better fit to market data than the Gaussian Black–Scholes model. See Fig. 8 for a comparison of the respective maximum likelihood estimated densities with observed market frequencies.
Fig. 8.
Fit of NIG and Gaussian distribution to market log-returns.
Fig. 9 shows the result for the same weighted Asian option as before, but now the NIG model has been used to generate the price process. We see that the behavior is similar to the Black–Scholes case, but it is much less expressed.
Fig. 9.
Weighted Asian option in the NIG Lévy model.
6. Conclusions and open questions
We have given a number of constructions of discrete Brownian paths that provide alternatives to the classical constructions, that is, forward, Brownian bridge and PCA construction. All the constructions presented have the desirable property that they have computational complexity or .
We have used the orthogonal representation of path construction to derive a new method for efficient computation of the PCA construction and an even faster method for approximate computation of the PCA construction.
We provided numerical examples illustrating two main points: (1) there are cases where alternative constructions are more efficient then the classical ones, (2) however for most practical cases PCA is best with the computationally less complex Brownian bridge being close.
We need to stress that the examples presented all depended on only one Brownian path. Imai and Tan [12] provide examples where a generation method other than PCA is far more effective, however their method uses full matrix multiplication. It would be useful to try and find a fast version of their method or, more generally, find a generic algorithm which combines the orthogonal transforms presented here in an optimal (or near optimal) way for a given integration problem.
Acknowledgments
The author would like to thank Gerhard Larcher, Fritz Pillichshammer and the anonymous referees for valuable comments.
The author is partially supported by the Austrian Science Foundation (FWF), Project P21196.
Footnotes
This is not a complete surprise, since the scaled summation serves as a kind of integral and the indeterminate integral of the cosine is the sine. Nevertheless the above assertion is not trivial to prove, and it does not hold for DCT-I and DCT-III.
Nevertheless there are Bridge constructions for Lévy processes for which the conditional distribution of given can be obtained explicitly, like for the variance-gamma process, cf. [3].
We confine ourselves to quadratic matrices.
This kind of transform has been dubbed “CRAFOT”–constant rotation angle fast orthogonal transform–by Misans and Terauds [17].
References
- 1.F. Åkesson, J.P. Lehoczky, Discrete eigenfunction expansion of the multi-dimensional Brownian motion and the Ornstein–Uhlenbeck process, Technical report, Carnegie-Mellon University, 1998.
- 2.Acworth P., Broadie M., Glasserman P. A comparison of some Monte Carlo and quasi-Monte Carlo techniques for option pricing. In: Niederreiter H., Hellekalek P., Larcher G., Zinterhof P., editors. Monte Carlo and Quasi-Monte Carlo Methods 1996, Proceedings of a Conference at the University of Salzburg, Austria, July 912, 1996. Springer; New York: 1998. pp. 1–18. [Google Scholar]
- 3.Avramidis A.N., L’Ecuyer P. Efficient Monte Carlo and quasi-Monte Carlo option pricing under the variance gamma model. Manag. Sci. 2006;52:1930–1944. [Google Scholar]
- 4.Bjørk T. Oxford University Press; 2004. Arbitrage Theory in Continuous Time. [Google Scholar]
- 5.Bracewell R.N. Aspects of the Hartley transform. Proc. IEEE. 1994;82(3) [Google Scholar]
- 6.Bracewell R.N. third ed. McGraw-Hill; 2000. The Fourier Transform and its Applications. [Google Scholar]
- 7.Dick J., Pillichshammer F. Discrepancy Theory and Quasi-Monte Carlo Integration. Cambridge University Press; Cambridge: 2010. Digital nets and sequences. [Google Scholar]
- 8.Fine N.J. On the Walsh functions. Trans. Amer. Math. Soc. 1949;65:372–414. [Google Scholar]
- 9.Fino B.J., Algazi V.R. Unified matrix treatment of the fast Walsh–Hadamard transform. IEEE Trans. Comput. 1976;C-25(11):1142–1146. [Google Scholar]
- 10.Glasserman P. Springer; 2004. Monte Carlo Methods in Financial Engineering. [Google Scholar]
- 11.Haar A. Zur theorie der orthogonalen funktionensysteme. Math. Ann. 1910;69:331–371. [Google Scholar]
- 12.Imai J., Tan K.S. A general dimension reduction technique for derivative pricing. J. Comput. Finance. 2007;10:129–155. [Google Scholar]
- 13.Kaiser G. The fast Haar transform. IEEE Potentials. 1998;17(2):34–37. [Google Scholar]
- 14.Keiner J., Waterhouse B.J. Fast principal components analysis method for finance problems with unequal time steps. In: L’ Ecuyer P., Owen A.B., editors. Monte Carlo and Quasi-Monte Carlo Methods 2008. Proceedings of the 8th International Conference Monte Carlo and Quasi-Monte Carlo Methods in Scientific Computing, Montréal, Canada, July 6–11, 2008. Springer; Berlin: 2009. pp. 455–465. [Google Scholar]
- 15.Larcher G., Leobacher G., Scheicher K. On the tractability of the Brownian bridge algorithm. J. Complexity. 2003;19:511–528. [Google Scholar]
- 16.Leobacher G. Stratified sampling and quasi-Monte Carlo simulation of Lévy processes. Monte-Carlo Methods Appl. 2006;12(3–4):231–238. [Google Scholar]
- 17.Misans P., Terauds M. Errors of constant rotation angle fast orthogonal transforms used for fixed-point arithmetic dsp applications: Preliminary results. Elektron. Elektrotech. 2005;60(4) [Google Scholar]
- 18.Moskowitz B., Caflisch R.E. Smoothness and dimension reduction in quasi-Monte Carlo methods. Math. Comput. Model. 1996;23(8–9):37–54. [Google Scholar]
- 19.Niederreiter H. Low-discrepancy simulation. In: Duan J., Härdle W., Gentle J.E., editors. Handbook of Computational Finance, Springer Handbooks of Computational Statistics. Springer; Berlin: 2012. pp. 703–730. [Google Scholar]
- 20.Papageorgiou A. The Brownian bridge does not offer a consistent advantage in quasi-Monte Carlo integration. J. Complexity. 2002;18(1):171–186. [Google Scholar]
- 21.Pei S.C., Jaw S.B. Discrete Hilbert transform by FHT. IEEE T. Circuits Syst. 1989;36:1251–1252. [Google Scholar]
- 22.Protter P.E. Stochastic Integration and Differential Equations. second ed. vol. 21. Springer-Verlag; Berlin: 2004. (Applications of Mathematics (New York)). Stochastic Modelling and Applied Probability. [Google Scholar]
- 23.Scheicher K. Complexity and effective dimension of discrete Lévy areas. J. Complexity. 2007;23(2):152–168. [Google Scholar]
- 24.Sloan I.H., Wang X. Quasi-Monte Carlo methods in financial engineering: An equivalence principle and dimension reduction. Oper. Res. 2011;59(1):80–95. [Google Scholar]
- 25.Walsh J.L. A closed set of normal orthogonal functions. Amer. J. Math. 1923;45:5–24. [Google Scholar]
- 26.Wang Z. Fast algorithms for the discrete transform and for the discrete Fourier transform. IEEE Trans. Acoust. Speech Signal Process. 1984;32(4) [Google Scholar]
- 27.Wickerhauser M.V. A. K. Peters, Ltd.; Wellesley, MA: 1994. Adapted Wavelet Analysis from Theory to Software. [Google Scholar]
- 28.Wolfram S. Wolfram Media; 2002. A New Kind of Science. [Google Scholar]
- 29.Zhu Y., Wu X., Chern I. Springer Finance; New York: 2004. Derivative Securities and Difference Methods. [Google Scholar]









