Column reduced digital nets

Vishnupriya Anupindi; Peter Kritzer

doi:10.1007/s11075-025-02050-8

. 2025 Mar 24;101(3):1451–1473. doi: 10.1007/s11075-025-02050-8

Column reduced digital nets

Vishnupriya Anupindi ¹, Peter Kritzer ^1,^✉

PMCID: PMC12945907 PMID: 41767827

Abstract

Digital nets provide an efficient way to generate integration nodes of quasi-Monte Carlo (QMC) rules. For certain applications, as e.g. in uncertainty quantification, we are interested in obtaining a speed-up in computing products of a matrix with the vectors corresponding to the nodes of a QMC rule. In the recent paper The fast reduced QMC matrix-vector product (Dick et al. J. Comput. Appl. Math. 440, 115642 2024), a speed up was obtained by using so-called reduced lattices and row reduced digital nets. In this work, we propose a different multiplication algorithm where we exploit the repetitive structure of column reduced digital nets instead of row reduced digital nets. This method has advantages over the previous one, as it facilitates the error analysis when using the integration nodes in a QMC rule. We also provide an upper bound for the quality parameter of column reduced digital nets, and numerical tests to illustrate the efficiency of the new algorithm.

Keywords: Numerical integration, Digital net, Digital sequence, Reduced digital net

Introduction

The problem setting

In many applications, such as in statistics, finance, and uncertainty quantification, we would like to numerically compute

\begin{matrix} \int_{D} f (x^{⊤} A) d μ (x), \end{matrix}

where A is a real $s \times τ$ matrix, by quasi-Monte Carlo (QMC) rules

\begin{matrix} Q_{N} (f) : = \frac{1}{N} \sum_{k = 0}^{N - 1} f (x_{k}^{⊤} A), \end{matrix}

where $x_{k} = {(x_{k}^{(1)}, \dots, x_{k}^{(s)})}^{⊤}$ are column vectors corresponding to the points used in the QMC rule. Problems of this kind particularly arise in some important applications in statistics and uncertainty quantification. For instance, this approach can be used when approximating the expected value of a function with a multivariate normal random variable with some given covariance matrix, or when approximating the expected value of the solution of a PDE with random coefficients, see, e.g., [4]. In some cases the domain D in (1) will be chosen as $D = {[0, 1]}^{s}$ . In this case, it is natural to directly use QMC sample points like lattice point sets (see [3]) or (t, m, s)-nets (see [6]) as the points $x_{k}$ . This is the situation we shall mostly consider in the present paper, in particular with respect to the error analysis in Section 4.2. However, in some applications, such as those mentioned above, the domain may be, e.g., $D = R^{s}$ . Then the points $x_{k}$ frequently are of the form $x_{k} = Φ^{- 1} (y_{k})$ , $k \in {0, 1, \dots, N - 1}$ , where the $y_{k}$ are the QMC sample points, and $Φ^{- 1}$ is the inverse of the cumulative distribution function of a standard normal distribution, which is applied component-wise to vectors. In order to avoid that certain coordinates of the sample points are mapped to $\pm \infty$ , one can first shift the $y_{k}$ to the right by a sufficiently small quantity. Many results presented here also hold for this case, in particular the matrix product algorithm presented in Section 4.1, see Remark 5. We also refer to [4], where a similar situation is studied using a different computational method.

Computing the vector-matrix products $x_{k}^{⊤} A$ for all $k \in {0, \dots, N - 1}$ takes $O (N s τ)$ operations. This problem is equivalent to computing the matrix-matrix product XA, where

X = {[x_{0}^{⊤}, x_{1}^{⊤}, \dots, x_{N - 1}^{⊤}]}^{⊤}

is the $N \times s$ matrix whose k-th row is $x_{k}$ . Computing XA can be infeasible in situations where s and N are both large (which happens in many applications).

In the paper [4], it is shown that when using particular types of QMC rules, the cost to evaluate $Q_{N} (f)$ , as in (2), can be reduced to only $O (τ N log N)$ operations provided that $log N ≪ s$ . This reduction in computational cost is achieved by a fast matrix-matrix multiplication exploiting the fact that for specifically chosen point sets, such as (polynomial) lattice rules, the matrix X can be re-ordered to be of circulant structure.

The recent paper [1] studies an alternative method to reduce the computation time by imposing a certain structure of the points $x_{0}, \dots, x_{N - 1}$ . The key idea of this approach is to find situations in which the components of the points $x_{k}$ have a certain repetitive structure, which then facilitates systematic fast computation of the products $x_{k}^{⊤} A$ . This can be achieved by suitable modifications of (polynomial) lattice point sets using ideas from [2], but how to implement this idea for digital nets, which are more general than polynomial lattice point sets and among the most commonly used QMC node sets, is not straightforward. In [1], the authors made a first attempt and studied a reduction of the computation time for digital (t, m, s)-nets by setting certain rows of the generating matrices to zero (we refer to Section 1.2 for the precise definition of digital nets and their generating matrices). The basic idea in [1] is that for each of the s generating matrices $C_{j}^{(m)}$ , $1 \leq j \leq s$ , of the digital net, we identify a so-called reduction index $w_{j} \in Z$ and set the last $w_{j}$ rows of $C_{j}^{(m)}$ equal to zero. As shown in [1], this introduces a certain repetitiveness in the entries of the matrix X and speeds up the computation of the matrix-matrix product XA. We call such digital nets row reduced digital nets. However, for assessing the quality of reduced nets when used in QMC rules, it is more natural to study the situation where certain columns of the generating matrices are set to zero, since this directly corresponds to the reduced (polynomial) lattice point sets, resulting in the consideration of column reduced digital nets. The idea of column reduced digital nets is to set the last $w_{j}$ columns of the generating matrix $C_{j}^{(m)}$ , $1 \leq j \leq s$ , equal to zero, instead of setting rows equal to zero. Furthermore, in the present paper, we focus on digital nets that are obtained from digital sequences, which implies additional structure in the generating matrices. Again, the approach of using column reduced digital nets yields a speed-up in the computation of XA, but as we will see below, it also makes it easier to assess the properties of the resulting column reduced digital nets than doing the same for row reduced digital nets. Furthermore, the error analysis for approximating (1) by (2) becomes easier. This idea was already mentioned (but not pursued) in [1], and this is what we intend to do in the present paper.

Digital nets and sequences

In this section, we give the definitions of (t, m, s)-nets and (t, s)-sequences, the digital construction method for these, and shortly outline how to assess their quality.

Let $F_{b}$ be a finite field with b elements, where b is prime. We identify the elements of $F_{b}$ with the set ${0, 1, \dots, b - 1}$ . An elementary interval in base b and dimension s is a half-open interval of the form $\prod_{j = 1}^{s} [a_{j} b^{- d_{j}}, (a_{j} + 1) b^{- d_{j}})$ where the $a_{j}, d_{j}$ are nonnegative integers with $0 \leq a_{j} < b^{d_{j}}$ for $1 \leq j \leq s$ .

In the following, we recall the definition of (t, m, s)-nets and (t, s)-sequences, which have the property that the number of points in certain elementary intervals is proportional to their sizes. This guarantees a degree of uniform distribution of the point set in ${[0, 1)}^{s}$ , which is desirable when using such a point set in a QMC rule. For detailed discussions on (t, m, s)-nets and (t, s)-sequences, we refer to [6, 10].

Definition 1

For a given dimension $s \geq 1$ and nonnegative integers t, m with $0 \leq t \leq m$ , a (t, m, s)-net in base b is a point set $P \subset {[0, 1)}^{s}$ consisting of $b^{m}$ points such that any elementary interval in base b with volume $b^{t - m}$ contains exactly $b^{t}$ points of $P$ .

A sequence $(x_{0}, x_{1}, \dots)$ of points in ${[0, 1)}^{s}$ is called a (t, s)-sequence in base b if for all integers $m \geq t$ and $k \geq 0$ , the point set consisting of the points $x_{k b^{m}}, \dots, x_{k b^{m} + b^{m} - 1}$ forms a (t, m, s)-net in base b.

Note that the lower the value of t of a (t, m, s)-net or a (t, s)-sequence, the more uniformly the points are distributed in ${[0, 1)}^{s}$ , which is a desirable property when the point set is used as an integration node set in a QMC rule. This is the reason why t is referred to as the quality parameter of a net or sequence.

A (t, m, s)-net is called strict, if it does not fulfill the requirements of a $(t - 1, m, s)$ -net (for $t \geq 1$ ), and analogously for (t, s)-sequences. In general, any (t, m, s)-net is also a $(t + 1, m, s)$ -net for $t < m$ .

We point out that it is, in general, a non-trivial combinatorial question of which values of t can be reached for which configurations of the other parameters. We again refer to [6, 10] for details.

A common way to generate (t, m, s)-nets and (t, s)-sequences is using the digital method, which was first introduced by Niederreiter in [9].

Definition 2

A digital (t, m, s)-net over $F_{b}$ is a (t, m, s)-net $P = {x_{0}, \dots, x_{b^{m} - 1}}$ where the points are constructed as follows. Let $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ in $F_{b}^{m \times m}$ be matrices over $F_{b}$ . To generate the k-th point in $P$ , $0 \leq k \leq b^{m} - 1$ , we use the b-adic expansion $k = \sum_{i = 0}^{m - 1} k_{i} b^{i}$ with digits $k_{i} \in {0, \dots, b - 1}$ which we denote by $\vec{k} = {(k_{0}, \dots, k_{m - 1})}^{⊤}$ . The j-th coordinate $x_{k, j}$ of $x_{k} = (x_{k, 1}, \dots, x_{k, s})$ is obtained by computing

{\vec{x}}_{k, j} : = C_{j}^{(m)} \vec{k},

and then setting

x_{k, j} : = {\vec{x}}_{k, j} \cdot (b^{- 1}, b^{- 2}, \dots, b^{- m}) .

Similarly, a digital (t, s)-sequence $S$ over $F_{b}$ is generated by infinite matrices $C_{1}, \dots, C_{s}$ , where

\begin{matrix} C_{j} = {(c_{i, r}^{(j)})}_{i, r \in N} \in F_{b}^{N \times N} . \end{matrix}

To generate the k-th point in $S$ , $k \geq 0$ , we use the b-adic expansion $k = \sum_{i = 0}^{\infty} k_{i} b^{i}$ with digits $k_{i} \in {0, \dots, b - 1}$ which we denote by $\vec{k} = {(k_{0}, k_{1}, \dots)}^{⊤}$ . The j-th coordinate $x_{k, j}$ of $x_{k} = (x_{k, 1}, \dots, x_{k, s})$ is obtained by computing

{\vec{x}}_{k, j} : = C_{j}^{(m)} \vec{k},

and then setting

x_{k, j} : = {\vec{x}}_{k, j} \cdot (b^{- 1}, b^{- 2}, \dots) .

Note that from any digital (t, s)-sequence over $F_{b}$ with generating matrices $C_{1}, \dots, C_{s}$ , we can, for $m \geq t$ , derive a digital (t, m, s)-net over $F_{b}$ , simply by considering the point set generated by the left upper $m \times m$ submatrices $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ of $C_{1}, \dots, C_{s}$ . This is equivalent to considering the first $b^{m}$ points of the (t, s)-sequence.

As pointed out above, the quality of a (t, m, s)-net or (t, s)-sequence is determined by its t-value. For digital (t, m, s)-nets and (t, s)-sequences, we can determine the t-value from rank conditions on the generating matrices, using a quantity that we shall refer to as the linear independence parameter.

Definition 3

For any integers $1 \leq j \leq s$ and $m \geq 1$ , let $C_{1}^{(m)}, C_{2}^{(m)}, \dots, C_{s}^{(m)}$ be $m \times m$ matrices over $F_{b}$ . Then the linear independence parameter $ρ_{m} (C_{1}^{(m)}, C_{2}^{(m)}, \dots, C_{s}^{(m)})$ is defined as the largest integer such that for any choice of $d_{1}, \dots, d_{s} \in N_{0}$ , with $d_{1} + \dots + d_{s} = ρ_{m}$ , we have that

\begin{matrix} the first d_{1} rows of C_{1}^{(m)} together with \\ the first d_{2} rows of C_{2}^{(m)} together with \\ ⋮ \\ the first d_{s} rows of C_{s}^{(m)} \end{matrix}

are linearly independent over $F_{b}$ .

It is known (see, e.g., [6, 10]) that the generating matrices $C_{1}^{(m)}, C_{2}^{(m)}, \dots, C_{s}^{(m)}$ of a digital (t, m, s)-net over $F_{b}$ satisfy

\begin{matrix} ρ_{m} (C_{1}^{(m)}, C_{2}^{(m)}, \dots, C_{s}^{(m)}) \geq m - t, \end{matrix}

where we have equality if the net is a strict (t, m, s)-net. Similarly, for the generating matrices $C_{1}, \dots, C_{s}$ of a digital (t, s)-sequence over $F_{b}$ we must have $ρ_{m} (C_{1}^{(m)}, \dots, C_{s}^{(m)}) \geq m - t$ for all $m \geq max {t, 1}$ , where $C_{j}^{(m)}$ denotes the left upper $m \times m$ submatrix of $C_{j}$ for $j \in {1, \dots, s}$ . Hence, for digital nets and sequences, their quality can be assessed by checking linear independence conditions on the rows of the generating matrices.

The t-values of column reduced digital nets

Column reduction for (t, m, s)-nets

Now we turn towards the primary object of our study, which is the column reduced digital nets. We note that if we take a general digital (t, m, s)-net and set some columns of its generating matrices to zero, we cannot control the quality parameter of the reduced net. However, since digital (t, s)-sequences require stronger conditions on their generating matrices, we can estimate the quality parameter of reduced digital (t, m, s)-nets derived from digital sequences by taking the nets generated by the left upper $m \times m$ submatrices of the generating matrices of the sequences.

For $m \geq t$ , we consider the digital (t, m, s)-net generated by the matrices $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ , derived via the above principle from a digital (t, s)-sequence with generating matrices $C_{1}, \dots, C_{s}$ , $C_{j} = (c_{i, r}^{(j)})$ , $i, r \in N$ .

Let $0 = w_{1} \leq \dots \leq w_{s} \in N_{0}$ , we call these numbers the reduction indices, for the generating matrices $C_{j}^{(m)}$ . We derive the corresponding reduced matrices ${\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}$ , with ${\tilde{C}}_{j}^{(m)} = ({\tilde{c}}_{i, r}^{(j)})$ , $i, r \in {1, 2, \dots, m}$ , for $1 \leq j \leq s$ , where

\begin{matrix} {\tilde{c}}_{i, r}^{(j)} = \{\begin{matrix} c_{i, r}^{(j)} & if r \in {1, \dots, m - min (m, w_{j})}, \\ 0 & if r \in {m - min (m, w_{j}) + 1, \dots, m} . \end{matrix}) \end{matrix}

That is, the first $m - min (m, w_{j})$ columns of ${\tilde{C}}_{j}^{(m)}$ are the same as the columns of the matrix $C_{j}^{(m)}$ , and we set the last $min (m, w_{j})$ columns to zero, i.e, if $w_{j} < m$ ,

{\tilde{C}}_{j}^{(m)} = (\begin{matrix} c_{1, 1}^{(j)} & \dots & c_{1, (m - w_{j})}^{(j)} & 0 & \dots & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ \\ c_{(m - w_{j}), 1}^{(j)} & \dots & c_{(m - w_{j}), (m - w_{j})}^{(j)} & 0 & \dots & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ c_{m, 1}^{(j)} & \dots & c_{m, (m - w_{j})}^{(j)} & 0 & \dots & 0 \end{matrix}) .

We are interested in estimating the quality parameter of the digital net generated by the ${\tilde{C}}_{j}^{(m)}$ .

Apart from the main motivation outlined in Section 1, there is another computational advantage of using column reduced digital nets. Indeed, by the general construction principle of digital point sets, the generating matrices of a digital net or sequence are multiplied over $F_{b}$ by vectors representing the digits of the indices of the elements of the point set. By replacing the matrices $C_{j}^{(m)}$ by ${\tilde{C}}_{j}^{(m)}$ , we increase the sparsity of the generating matrices, which saves computation time in the generation of the point set.

Theorem 1

Let $P$ be a digital (t, m, s)-net over $F_{b}$ with generating matrices $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ derived from a digital (t, s)-sequence over $F_{b}$ , where we assume that $m \geq t$ . Let ${\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}$ be as defined in (5) with respect to reduction indices $0 = w_{1} \leq \dots \leq w_{s}$ and let $\tilde{t}$ be the minimal quality parameter of the net generated by the ${\tilde{C}}_{j}^{(m)}$ . Then,

\begin{matrix} max {0, m - w_{s} - t} \leq ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) \leq max {0, m - w_{s}}, \end{matrix}

and $\tilde{t} \leq min {m, w_{s} + t}$ .

Furthermore, if $P$ is a strict digital (t, m, s)-net, it is true that

\begin{matrix} ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) \leq max {0, m - max {t, w_{s}}} . \end{matrix}

Proof

We note that we have $m \geq t$ by assumption. If $w_{s} \geq m$ , then we trivially have $ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) = 0$ , as ${\tilde{C}}_{s}^{(m)}$ only contains zeros, and (6) holds.

Therefore, we will assume for the rest of the proof that $w_{s} < m$ .

We prove the second inequality in (6) first. We have

\begin{matrix} {\tilde{C}}_{s}^{(m)} = (\begin{matrix} c_{1, 1}^{(s)} & \dots & c_{1, (m - w_{s})}^{(s)} & 0 & \dots & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ \\ c_{(m - w_{s}), 1}^{(s)} & \dots & c_{(m - w_{s}), (m - w_{s})}^{(s)} & 0 & \dots & 0 \\ ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ c_{m, 1}^{(s)} & \dots & c_{m, (m - w_{s})}^{(s)} & 0 & \dots & 0 \end{matrix}) . \end{matrix}

Let D be the matrix containing the first $d_{1}$ rows of ${\tilde{C}}_{1}^{(m)}$ , the first $d_{2}$ rows of ${\tilde{C}}_{2}^{(m)}$ , etc., up to the first $d_{s}$ rows of ${\tilde{C}}_{s}^{(m)}$ , where $d_{1}, \dots, d_{s}$ are nonnegative integers satisfying $d_{1} + \dots + d_{s} = m - w_{s}$ . For the special choice $(d_{1}, \dots, d_{s}) = (0, \dots, 0, m - w_{s})$ , we have $rank (D) = rank ({\tilde{C}}_{s}^{(m)}) \leq m - w_{s}$ . Therefore,

\begin{matrix} ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) \leq m - w_{s} . \end{matrix}

Now we prove the first inequality in (6). If $m - w_{s} - t < 0$ , the inequality is trivial.

Otherwise, i.e., if $m - w_{s} \geq t$ , we know that

\begin{matrix} ρ_{k} (C_{1}^{(k)}, \dots, C_{s}^{(k)}) \geq k - t, \end{matrix}

for any $k \geq t$ , since our net is derived from a digital (t, s)-sequence. Here, $C_{j}^{(k)}$ , $1 \leq j \leq s$ , denotes the left upper $k \times k$ submatrix of $C_{j}$ . In particular, we observe that for the left upper $(m - w_{s}) \times (m - w_{s})$ submatrices of $C_{1}, \dots, C_{s}$ ,

ρ_{(m - w_{s})} (C_{1}^{(m - w_{s})}, \dots, C_{s}^{(m - w_{s})}) \geq m - w_{s} - t .

We now consider arbitrary integers $d_{1}, \dots, d_{s} \geq 0$ with $d_{1} + \dots + d_{s} = m - w_{s} - t$ . Let $k_{i}^{(j)}$ denote the i-th row vector of $C_{j}^{(m - w_{s})} \in F_{b}^{(m - w_{s}) \times (m - w_{s})}$ . We know that

\begin{matrix} k_{1}^{(1)}, \dots, k_{d_{1}}^{(1)}, k_{1}^{(2)}, \dots, k_{d_{2}}^{(2)}, \dots, \dots, k_{1}^{(s)}, \dots, k_{d_{s}}^{(s)} \end{matrix}

are linearly independent over $F_{b}$ . Let $c_{i}^{(j)}$ denote the i-th row vector of ${\tilde{C}}_{i}^{(m)} \in F_{b}^{m \times m}$ . We observe that for $1 \leq i \leq m - w_{s}$ ,

\begin{matrix} c_{i}^{(j)} = (k_{i}^{(j)}, u_{i}^{(j)}) \in F_{b}^{1 \times m}, \end{matrix}

where the $k_{i}^{(j)}$ are as above and $u_{i}^{(j)} \in F_{b}^{1 \times w_{s}}$ .

The row vectors

\begin{matrix} c_{1}^{(1)}, \dots, c_{d_{1}}^{(1)}, c_{1}^{(2)}, \dots, c_{d_{2}}^{(2)}, \dots, \dots, c_{1}^{(s)}, \dots, c_{d_{s}}^{(s)} \end{matrix}

are linearly independent, since otherwise the row vectors in (9), which are projections of $c_{i}^{(j)}$ onto the first $m - w_{s}$ entries, would be linearly dependent. Therefore,

\begin{matrix} ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) \geq m - w_{s} - t . \end{matrix}

This concludes the proof of (6). Using (4) and the lower bound in (6), we obtain the upper bound for $\tilde{t}$ .

It remains to show (7).

Let D be the matrix containing the first $d_{1}$ rows of ${\tilde{C}}_{1}^{(m)}$ , the first $d_{2}$ rows of ${\tilde{C}}_{2}^{(m)}$ , etc., up to the first $d_{s}$ rows of ${\tilde{C}}_{s}^{(m)}$ , where $d_{1}, \dots, d_{s}$ are nonnegative integers. As above, for the special choice $(d_{1}, \dots, d_{s}) = (0, \dots, 0, m - w_{s})$ , we have $rank (D) = rank ({\tilde{C}}_{s}^{(m)}) \leq (m - w_{s})$ . So,

\begin{matrix} ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) \leq m - w_{s} . \end{matrix}

However, since we assume that $P$ is a strict digital (t, m, s)-net in this part of the proof, there must exist a choice of $(d_{1}, \dots, d_{s})$ with $d_{1} + \dots + d_{s} = m - t + 1$ such that the corresponding rows of $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ are linearly dependent, and therefore also the corresponding rows of ${\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}$ are linearly dependent. This yields

\begin{matrix} ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) \leq m - t, \end{matrix}

so we must have

\begin{matrix} ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) \leq m - max {t, w_{s}} . \end{matrix}

$□$

Remark 1

For $t = 0$ and $w_{s} < m$ in Theorem 1, we obtain equality in (6) and therefore

\begin{matrix} ρ_{m} ({\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}) = m - w_{s}, \end{matrix}

and $\tilde{t} = w_{s}$ .

Remark 2

We now give an example that illustrates that the lower bound in Theorem 1 is sharp.

Assume that Q is a digital (0, 2)-sequence with generating matrices $D_{1}$ and $D_{2}$ (examples of Q exist, e.g., by choosing as Q a Niederreiter sequence, see [9]).

From Q, we construct a digital (t, 2)-sequence P, by prepending exactly t zero columns to both $D_{1}$ and $D_{2}$ . That is, we construct new generating matrices $C_{j}$ , $j \in {1, 2}$ , such that

graphic file with name 11075_2025_2050_Equ55_HTML.gif

It is easily checked that $C_{1}, C_{2}$ generate a digital (t, 2)-sequence; indeed, let $m \geq t$ be arbitrarily chosen but fixed. Then the matrices $C_{1}^{(m)}, C_{2}^{(m)}$ contain the matrices $D_{1}^{(m - t)}, D_{2}^{(m - t)}$ as submatrices. As $D_{1}, D_{2}$ generate a (0, 2)-sequence, for any $d_{1}, d_{2} \in N_{0}$ with $d_{1} + d_{2} = m - t$ the first $d_{1}$ rows of $D_{1}^{(m - t)}$ together with the first $d_{2}$ rows of $D_{2}^{(m - t)}$ must be linearly independent, so also the corresponding rows of $C_{1}^{(m)}$ and $C_{2}^{(m)}$ (with zeros prepended) must be linearly independent. This establishes that $C_{1}$ and $C_{2}$ generate a (t, 2)-sequence.

Let now $m \geq t$ , and let $w_{1} = 0$ , and $w_{2} \geq w_{1}$ be reduction indices such that $m - w_{2} - t \geq 0$ . Then ${\tilde{C}}_{1}^{(m)} = C_{1}^{(m)}$ , and

where $D_{2}^{(m \times (m - t - w_{2}))}$ denotes the left upper $m \times (m - t - w_{2})$ submatrix of $D_{2}$ . By Theorem 1, we know that $ρ_{m} ({\tilde{C}}_{1}^{(m)}, {\tilde{C}}_{2}^{(m)}) \geq m - t - w_{2}$ . However, $ρ_{m} ({\tilde{C}}_{1}^{(m)}, {\tilde{C}}_{2}^{(m)}) > m - t - w_{2}$ cannot hold since the first $m - t - w_{2} + 1$ rows of ${\tilde{C}}_{2}^{(m)}$ must be linearly dependent.

This implies that the lower bound in Theorem 1 is sharp.

Remark 3

Next, we provide an example showing that the upper bound (7) for strict digital nets in Theorem 1 is sharp.

We use the same notation as in Remark 2. We again start with the digital (0, 2)-sequence Q. Again, we transform Q into a (t, s)-sequence, now called R, with generating matrices $E_{1}$ and $E_{2}$ . For $E_{1}$ , we take the first generating matrix of P from above, i.e., $E_{1} = C_{1}$ . Furthermore, we choose $E_{2}$ as

graphic file with name 11075_2025_2050_Equ57_HTML.gif

where $D_{2}^{(t)}$ is the left upper $t \times t$ submatrix of $D_{2}$ . First, note that R really is a strict (t, 2)-sequence. Indeed, if we consider the matrix $E_{1}^{(m)}$ for $m < t$ , this matrix only contains zeros, so the quality parameter of R must be at least t. On the other hand, let $m \geq t$ and consider the matrices $E_{1}^{(m)}$ and $E_{2}^{(m)}$ . Choose $d_{1}, d_{2} \geq 0$ such that $d_{1} + d_{2} = m - t$ , and consider the first $d_{1}$ rows of $E_{1}^{(m)}$ together with the first $d_{2}$ rows of $E_{2}^{(m)}$ . We distinguish two cases.

If $d_{2} \leq t$ , then it is obvious that the first $d_{1}$ rows of $E_{1}^{(m)}$ together with the first $d_{2}$ rows of $E_{2}^{(m)}$ are linearly independent, as $D_{1}$ and $D_{2}$ generate a (0, 2)-sequence.
If $d_{2} > t$ , we proceed as follows. Assume to the contrary that the first $d_{1}$ rows of $E_{1}^{(m)}$ together with the first $d_{2}$ rows of $E_{2}^{(m)}$ were not linearly independent. By the structure of $E_{1}$ and $E_{2}$ , this would immediately imply that the first $d_{1}$ rows of $D_{1}^{(m - t)}$ together with the first $d_{2} - t$ rows of $D_{2}^{(m - t)}$ are not linearly independent, where $d_{1} + d_{2} - t = m - 2 t$ , which would be a contradiction to the property that $D_{1}$ and $D_{2}$ generate a digital (0, 2)-sequence.

Let now again $m \geq t$ , and let $w_{1} = 0$ , and $w_{2} \geq w_{1}$ be reduction indices such that $m - w_{2} - t \geq 0$ . Then ${\tilde{E}}_{1}^{(m)} : = E_{1}^{(m)}$ , and

graphic file with name 11075_2025_2050_Equ58_HTML.gif

We again distinguish two cases.

Case 1: $\max {t, w_{2}} = w_{2}$ . We claim that $ρ_{m} ({\tilde{E}}_{1}^{(m)}, {\tilde{E}}_{2}^{(m)}) = m - w_{2}$ . To this end, let $d_{1}, d_{2} \geq 0$ such that $d_{1} + d_{2} = m - w_{2}$ , which implies that $d_{1}$ and $d_{2}$ are both not larger than $m - t$ . Then, we consider two sub-cases.

If $d_{2} \leq t$ , it is clear because of the structure of the matrices that the first $d_{1}$ rows of ${\tilde{E}}_{1}^{(m)}$ together with the first $d_{2}$ rows of ${\tilde{E}}_{2}^{(m)}$ are linearly independent, as $D_{1}$ and $D_{2}$ generate a (0, 2)-sequence. This is guaranteed since we know that $d_{1}$ and $d_{2}$ are both not larger than $m - t$ .
If $d_{2} > t$ , we proceed as follows. Assume to the contrary that the first $d_{1}$ rows of ${\tilde{E}}_{1}^{(m)}$ together with the first $d_{2}$ rows of ${\tilde{E}}_{2}^{(m)}$ were not linearly independent. By the structure of ${\tilde{E}}_{1}^{(m)}$ and ${\tilde{E}}_{2}^{(m)}$ , this would immediately imply that the first $d_{1}$ rows of $D_{1}^{(m - t)}$ together with the first $d_{2} - t$ rows of $D_{2}^{((m - t) \times (m - t - w_{2}))}$ are not linearly independent, where $d_{1} + d_{2} - t = m - t - w_{2}$ . Note, however, that $D_{1}^{(m - t)}$ contains $D_{1}^{(m - t - w_{2})}$ as its left upper submatrix, and also $D_{2}^{((m - t) \times (m - t - w_{2}))}$ contains $D_{2}^{(m - t - w_{2})}$ as its left upper submatrix. By the property that $D_{1}$ and $D_{2}$ generate a (0, 2)-sequence, and by the assumption that $m - w_{2} \geq t$ , the first $d_{1}$ rows of $D_{1}^{(m - t - w_{2})}$ together with the first $d_{2} - t$ rows of $D_{2}^{(m - t - w_{2})}$ must be linearly independent. The same must, however, then also hold for the corresponding rows of $D_{1}^{(m - t)}$ and $D_{2}^{((m - t) \times (m - t - w_{2}))}$ , which yields a contradiction.

Hence we have shown that $ρ_{m} ({\tilde{E}}_{1}^{(m)}, {\tilde{E}}_{2}^{(m)}) \geq m - w_{2}$ , and by Theorem 1 we must actually have $ρ_{m} ({\tilde{E}}_{1}^{(m)}, {\tilde{E}}_{2}^{(m)}) = m - w_{2}$ .

Case 2: $max {t, w_{2}} = t$ . We claim that $ρ_{m} ({\tilde{E}}_{1}^{(m)}, {\tilde{E}}_{2}^{(m)}) = m - t$ . To this end, let $d_{1}, d_{2} \geq 0$ such that $d_{1} + d_{2} = m - t$ . Also here, we distinguish two sub-cases.

If $d_{2} \leq t$ , it is obvious that the first $d_{1}$ rows of ${\tilde{E}}_{1}^{(m)}$ together with the first $d_{2}$ rows of ${\tilde{E}}_{2}^{(m)}$ are linearly independent, as $D_{1}$ and $D_{2}$ generate a (0, 2)-sequence. This is guaranteed since we know that $d_{1}$ and $d_{2}$ are both not larger than $m - t$ .
If $d_{2} > t$ , we proceed as follows. Assume to the contrary that the first $d_{1}$ rows of ${\tilde{E}}_{1}^{(m)}$ together with the first $d_{2}$ rows of ${\tilde{E}}_{2}^{(m)}$ were not linearly independent. By the structure of ${\tilde{E}}_{1}^{(m)}$ and ${\tilde{E}}_{2}^{(m)}$ , this would immediately imply that the first $d_{1}$ rows of $D_{1}^{(m - t)}$ together with the first $d_{2} - t$ rows of $D_{2}^{((m - t) \times (m - t - w_{2}))}$ are not linearly independent, where $d_{1} + d_{2} - t = m - 2 t \leq m - t - w_{2}$ . Note, however, that $D_{1}^{(m - t)}$ contains $D_{1}^{(m - t - w_{2})}$ as its left upper submatrix, and also $D_{2}^{((m - t) \times (m - t - w_{2}))}$ contains $D_{2}^{(m - t - w_{2})}$ as its left upper submatrix. By the property that $D_{1}$ and $D_{2}$ generate a (0, 2)-sequence, by the fact that $d_{1} + d_{2} \leq m - t - w_{2}$ , and by the assumption that $m - w_{2} \geq t$ , the first $d_{1}$ rows of $D_{1}^{(m - t - w_{2})}$ together with the first $d_{2}$ rows of $D_{2}^{(m - t - w_{2})}$ must be linearly independent. The same must, however, then also hold for the corresponding rows of $D_{1}^{(m - t)}$ and $D_{2}^{((m - t) \times (m - t - w_{2}))}$ , which yields a contradiction.

In summary, we have shown that (7) is sharp for strict digital nets.

Column reduction for $(t, m, e, s)$ -nets

In [13] Tezuka introduced the concept of $(t, m, e, s)$ -nets, which are a generalization of (t, m, s)-nets. In this section, we briefly look at the quality parameter of column reduced nets under this generalized definition of nets. However, for the rest of the paper, we shall then stick to the notion of (t, m, s)-nets again.

Definition 4

Let $e = (e_{1}, \dots, e_{s})$ and $d = (d_{1}, \dots, d_{s})$ be integer vectors with $e_{i} \geq 1$ and $d_{i} \geq 0$ for $i \in {1, \dots, s}$ , where $s \geq 1$ is the dimension. Let t, m be non-negative integers with $0 \leq t \leq m$ . A point set $P \subset {[0, 1)}^{s}$ with $b^{m}$ points is called a $(t, m, e, s)$ -net in base b if every elementary interval $J \subseteq {[0, 1)}^{s}$ of volume $b^{t - m}$ and of the form

J = \prod_{j = 1}^{s} [\frac{a_{j}}{b^{e_{j} d_{j}}}, \frac{a_{j} + 1}{b^{e_{j} d_{j}}})

contains exactly $b^{t}$ points of $P$ , where $0 \leq a_{j} < b^{e_{j} d_{j}}$ for $j \in {1, \dots, s}$ and $d$ satisfies the equation $e_{1} d_{1} + \dots + e_{s} d_{s} = m - t$ .

If we choose $e = (1, \dots, 1) \in N^{s}$ , we obtain the classical definition of a (t, m, s)-net as given in Definition 1. For (t, m, s)-nets, we have the propagation rule that a (t, m, s)-net in base b is also a (v, m, s)-net in base b for any integer v with $t \leq v \leq m$ . However, with the above definition of $(t, m, e, s)$ -nets, we do not have this propagation rule. In [8], the authors provided a revised definition of $(t, m, e, s)$ -nets which ensures the above mentioned propagation rule. In this section, however, we work with the original definition provided by Tezuka in [13].

We note that all (t, m, s)-nets are also $(t, m, e, s)$ -nets, however for certain values of $e$ , we can obtain a lower t-value for the corresponding $(t, m, e, s)$ -net. In particular, for column-reduced digital nets, we can find certain examples where at least for some choices of $e$ , the reduced net retains the original quality parameter t. Let us give some examples.

Example 1

Let $b = 2, s = 2, m = 4$ and consider the (0, 4, 2)-net derived from the Sobol’ sequence, given by the generating matrices

\begin{matrix} C_{1} = (\begin{matrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}), C_{2} = (\begin{matrix} 1 & 1 & 1 & 1 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 1 \\ 0 & 0 & 0 & 1 \end{matrix}) . \end{matrix}

Let $w_{1} = 0$ and $w_{2} = 1$ , then the resulting column reduced digital net is a (1, 4, 2)-net according to Theorem 1. However, for $e = (e_{1}, e_{2})$ chosen such that $(e_{1} d_{1}, e_{2} d_{2}) = (4, 0)$ is the only solution to the equation $e_{1} d_{1} + e_{2} d_{2} = 4$ , we obtain a column reduced net that is still a $(0, 4, e, 2)$ -net. This is because the only elementary intervals J that satisfy the conditions in Definition 4 are of the form

J = [a_{1} / 2^{4}, (a_{1} + 1) / 2^{4}) \times [0, 1) .

Thus, the net property depends only on the first coordinates in the point set $P$ , and since we do not set any columns of $C_{1}$ to zero, i.e., $w_{1} = 0$ , the resulting column reduced net is a $(0, 4, e, 2)$ -net. Some concrete choices for $e$ with the above property are $(e_{1}, e_{2}) = (2, 3)$ , or $(e_{1}, e_{2}) = (1, e_{2})$ where $e_{2} > 4$ , or $(e_{1}, e_{2}) = (4, e_{2})$ where $e_{2} = 3$ or $e_{2} > 4$ .

In general, given a (t, m, s)-net $P$ derived from a (t, s)-sequence and reduction indices $0 = w_{1} \leq w_{2} \leq \dots \leq w_{s}$ , for $e = (e_{1}, \dots, e_{s}) = (m - t, k, \dots, k)$ , where either $1 < k < m - t$ with $gcd (k, m - t) = 1$ or $k > m - t > 0$ , the only solution to the equation $\sum_{j = 1}^{s} e_{j} d_{j} = m - t$ is $d = (1, 0, \dots, 0)$ . Thus, the column reduced net $\tilde{P}$ is an $(m - t, m, e, s)$ -net for the above choice of $e$ .

One might also consider digital (t, m, s)-nets generated by $C_{1}, \dots, C_{s}$ where the $C_{j}$ for $2 \leq j \leq s$ are derived from a (t, s)-sequence but $C_{1}$ is not necessarily derived from a digital sequence, since we usually choose the first reduction index $w_{1} = 0$ . In this case, one could perhaps find more choices of $e = (e_{1}, \dots, e_{s})$ such that the corresponding column reduced digital net is a $(t, m, e, s)$ -net, depending on the reduction indices $w_{2}, \dots, w_{s}$ .

A general and complete analysis of reduced $(t, m, e, s)$ -nets could be an interesting subject for future research.

Projections of column reduced digital nets

Due to the important role of the t-value, one sometimes also considers a slightly refined notion of a (t, m, s)-net, which is then referred to as a $({(t_{u})}_{u \subseteq [s]}, m, s)$ -net, where $[s] : = {1, \dots, s}$ . The latter notion means that for any $u \neq \emptyset$ , $u \subseteq [s]$ , the projection of the net onto those components with indices in $u$ is a $(t_{u}, m, |u|)$ -net. The notion of a $({(t_{u})}_{u \subseteq [s]}, s)$ -sequence is defined analogously. Moreover, for $u \neq \emptyset$ , we write $\bar{u} : = max (u)$ .

If we assume (which we always do in this paper) that the reduction indices satisfy $0 = w_{1} \leq w_{2} \leq \dots \leq w_{s}$ , then, for any non-empty $u \subseteq [s]$ , the reduction index $w_{\bar{u}}$ is the largest among all reduction indices corresponding to $u$ . This yields the following adaption of Theorem 1, which obviously can be shown in the same manner.

Corollary 1

Let $P$ be a digital $({(t_{u})}_{u \subseteq [s]}, m, s)$ -net over $F_{b}$ with generating matrices $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ , which has been derived from a digital $({(t_{u})}_{u \subseteq [s]}, s)$ -sequence, where we assume that $m \geq t$ . Let ${\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}$ be the reduced generating matrices with respect to reduction indices $0 = w_{1} \leq \dots \leq w_{s}$ and let ${({\tilde{t}}_{u})}_{u \subseteq [s]}$ be the minimal quality parameters of the projections of the net generated by the ${\tilde{C}}_{j}^{(m)}$ . Then, for every non-empty $u \subseteq [s]$ ,

\begin{matrix} max {0, m - w_{\bar{u}} - t_{u}} \leq ρ_{m} ({({\tilde{C}}_{j}^{(m)})}_{j \in u}) \leq max {0, m - w_{\bar{u}}}, \end{matrix}

and ${\tilde{t}}_{u} \leq min {m, w_{\bar{u}} + t_{u}}$ .

Furthermore, if, for a non-empty $u \subseteq [s]$ , the projection of $P$ onto the components in $u$ is a strict digital $(t_{u}, m, |u|)$ -net, it is true that

\begin{matrix} ρ_{m} ({({\tilde{C}}_{j}^{(m)})}_{j \in u}) \leq max {0, m - max {t_{u}, w_{\bar{u}}}} . \end{matrix}

Applications of column reduced digital nets

A reduced matrix product algorithm

In this section, we return to the problem outlined in Section 1. Let P be a digital (t, m, s)-net over $F_{b}$ , with generating matrices $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ . Let $w = {(w_{j})}_{j = 1}^{s} \in N_{0}^{s}$ be a sequence of reduction indices with $0 = w_{1} \leq w_{2} \leq \dots \leq w_{s}$ . Let $s^{*} \leq s$ be the largest index such that $w_{s^{*}} < m$ . Let ${\tilde{C}}_{1}^{(m)}, \dots, {\tilde{C}}_{s}^{(m)}$ be the reduced generating matrices corresponding to $w_{1}, \dots, w_{s}$ , and let Q be the corresponding reduced digital net. Let $x_{0}, \dots, x_{N - 1}$ be the points of Q, where we interpret $x_{0}, \dots, x_{N - 1}$ as column vectors. Let

X = {[x_{0}^{⊤}, x_{1}^{⊤}, \dots, x_{N - 1}^{⊤}]}^{⊤}

be the $N \times s$ matrix whose k-th row is the k-th point of Q for $0 \leq k \leq N - 1$ .

Let $ξ_{j}$ denote the j-th column of X, i.e., $X = [ξ_{1}, ξ_{2}, \dots, ξ_{s}]$ . Let $A = {[a_{1}, \dots, a_{s}]}^{⊤}$ , where $a_{j} \in R^{1 \times τ}$ is the j-th row of A. Then we have

\begin{matrix} X A = [ξ_{1}, ξ_{2}, \dots, ξ_{s}] \cdot {[a_{1}, \dots, a_{s}]}^{⊤} = ξ_{1} a_{1} + ξ_{2} a_{2} + \dots + ξ_{s} a_{s} . \end{matrix}

We will make use of a certain inherent repetitiveness of the reduced net Q, which we will illustrate by considering a reduction index $0 \leq w_{j} < m$ for $1 \leq j \leq s^{*}$ , and the corresponding generator matrix ${\tilde{C}}_{j}^{(m)}$ . The j-th components of the $N = b^{m}$ points of Q (i.e., the j-th column $ξ_{j}$ of X) are then given by

\begin{matrix} ξ_{j} = & {(({\tilde{C}}_{j}^{(m)}, \vec{0}) \cdot (b^{- 1}, \dots, b^{- m}), \dots, ({\tilde{C}}_{j}^{(m)}, \vec{(b^{m} - 1)}) \cdot (b^{- 1}, \dots, b^{- m}))}^{⊤} \\ = & {(\underset{b^{w_{j}} t i m e s}{\underset{⏟}{X_{j}, \dots, X_{j}}})}^{⊤}, \end{matrix}

where, as above, we write $\vec{k}$ to denote the vector of base b digits of length m for $k \in {0, 1 \dots, b^{m} - 1}$ , and where

X_{j} = {(({\tilde{C}}_{j}^{(m)}, \vec{0}) \cdot (b^{- 1}, \dots, b^{- m}), \dots, ({\tilde{C}}_{j}^{(m)}, \vec{(b^{m - w_{j}} - 1)}) \cdot (b^{- 1}, \dots, b^{- m}))}^{⊤} .

The reason for this repetitive structure is that, for any $w_{j}$ with $0 < w_{j} < m$ , the last $w_{j}$ columns of ${\tilde{C}}_{j}^{(m)}$ are equal to zero, and thus, in the product ${\tilde{C}}_{j}^{(m)} \vec{k}$ , the last $w_{j}$ entries of $\vec{k}$ become irrelevant. We will exploit this structure within Q to derive a fast matrix-matrix multiplication algorithm to compute XA.

Based on the above observations, it is possible to formulate the following algorithm to compute (11) in an efficient way. Note that for $j > s^{*}$ the j-th column of X consists only of zeros, so there is nothing to compute for the entries of X corresponding to these columns.

Remark 4

The number of computations needed for Algorithm 1 is of order

O (\sum_{j = 1}^{s^{*}}, b^{m - w_{j}}, (τ + m (m - w_{j}))) .

Note that this algorithm also generates the points of the reduced digital net, whereas the standard multiplication or the analogous “row reduced algorithm” [1, Algorithm 4], both require pre-computed points of the digital net as input. Generating the points of a non-reduced digital net requires $O (b^{m} s m^{2})$ operations, see also [1, Algorithm 3] and the standard non-reduced matrix-matrix multiplication usually requires $O (b^{m} s τ)$ operations. Therefore, Algorithm 1 improves the runtime of both steps. We also point out that the number of operations necessary for Algorithm 1 is independent of s, and only depends on $s^{*}$ . If the reduction indices $w_{j}$ grow sufficiently fast, then $s^{*}$ can be significantly lower than s.

Remark 5

Let us consider mappings $ϕ : {[0, 1]}^{s} \to R$ of the form $ϕ (x) = (ϕ_{1} (x_{1}), \dots, ϕ_{s} (x_{s}))$ that we apply simultaneously to all sample points of a given digital net. In this case, we can easily adapt Algorithm 1 such that we obtain a reduced net with points transformed by $ϕ$ , but do not change the order of the computation time outlined in Remark 4. Such an adaption is useful when considering the case $D = R^{s}$ , as pointed out in the introduction.

Error analysis

In the beginning of the paper we set out the task of approximating the integral (1) by the QMC rule (2). We have shown in the previous sections how to speed up the computation of the products $x_{k}^{⊤} A$ if we choose $x_{k}$ as the points of a column reduced digital net. However, we should also keep in mind the integration error made by using a QMC rule of the form (2) using those $x_{k}$ .

In this section, we restrict ourselves to the case $D = {[0, 1]}^{s}$ , such that we do not need to transform the sample points $x_{k}$ before applying the corresponding QMC rule. In many applications of quasi-Monte Carlo, one considers so-called weighted function spaces such as weighted Sobolev or weighted Korobov spaces (see, e.g., [3, 5, 6]). The idea of studying weighted function spaces goes back to the seminal paper [12] of Sloan and Woźniakowski. The motivation for weighted spaces is that in many applications, different coordinates or different groups of coordinates may have different influence on a multivariate problem. To give a simple example, consider numerical integration of a function $f : {[0, 1]}^{s} \to R$ , where

f (x_{1}, \dots, x_{s}) = e^{x_{1}} + \frac{x_{2} + \dots + x_{s}}{2^{s}} .

Clearly, for large s, the first variable has much more influence on this problem than the others. In order to make such observations more precise, one introduces weights, which are nonnegative real numbers $γ_{u}$ , one for each set $u \subseteq {1, \dots, s}$ . Intuitively speaking, the number $γ_{u}$ models the influence of the variables with indices in $u$ . Larger values of $γ_{u}$ mean more influence, smaller values less influence. Formally, we set $γ_{\emptyset} = 1$ , and we write $γ = {γ_{u}}_{u \subseteq {1, \dots, s}}$ . These weights can now be used to modify the norm in a given function space, thereby modifying the set over which a suitable error measure, as for example the worst-case error, of a problem is considered. By making this set smaller according to the weights (in the sense that also here, certain groups of variables may have less influence than others), a problem may thus become easier to handle and even lose the curse of dimensionality, provided that suitable conditions on the weights hold. This effect also corresponds to intuition—if a problem depends on many variables, of which only some have significant influence, it is natural to expect that the problem will be easier to solve than one where all variables have the same influence.

The weighted star discrepancy is (via the well-known Koksma-Hlawka inequality or its weighted version, see, e.g., [3, 6, 10]), a measure of the worst-case quadrature error for a QMC rule with node set Q, with $b^{m}$ nodes, defined as

\begin{matrix} D_{b^{m}, γ}^{*} (Q) : = sup_{x \in {(0, 1]}^{s}} max_{\emptyset \neq u \subseteq [s]} γ_{u} |Δ_{Q, u}, (x)|, \end{matrix}

where

\begin{matrix} Δ_{Q, u} (x) : = \frac{# {(y_{1}, \dots, y_{s}) \in Q : y_{j} < x_{j}, \forall j \in u}}{b^{m}} - \prod_{j \in u} x_{j} . \end{matrix}

Indeed, for certain weighted function classes based on Sobolev spaces of smoothness one, the weighted star discrepancy equals the worst-case quadrature error of a QMC rule with node set Q. Here, by the worst-case error, we mean the supremum of the integration error taken over the unit ball of the function class under consideration. We refer to [3, Section 5.3] for further details on the weighted Koksma-Hlawka inequality.

As shown in [11], we have

\begin{matrix} D_{b^{m}, γ}^{*} (Q) = & max_{\emptyset \neq u \subseteq [s]} sup_{x \in {(0, 1]}^{s}} γ_{u} |Δ_{Q, u}, (x)| \\ = & max_{\emptyset \neq u \subseteq [s]} γ_{u} sup_{x \in {(0, 1]}^{s}} |Δ_{Q, u}, (x)| . \end{matrix}

In the latter expression, the suprema over $x \in {(0, 1]}^{s}$ just yield the values of the star discrepancy of the projections of Q, and thus, one can use existing discrepancy bounds for the projections of Q. Let us proceed as follows. Assume that $P$ is a digital $({(t_{u})}_{u \subseteq [s]}, m, s)$ -net over $F_{b}$ with $m \times m$ generating matrices $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ derived from a digital $({(t_{u})}_{u \subseteq [s]}, s)$ -sequence, where $m \geq t$ . Let $\tilde{P}$ be the corresponding column reduced digital net based on the reduction indices $0 = w_{1} \leq w_{2} \leq \dots \leq w_{s}$ , and let ${({\tilde{t}}_{u})}_{u \subseteq [s]}$ be the minimal quality parameters of the projections of $\tilde{P}$ .

Whenever we consider a $u \subseteq [s]$ that is not a subset of $[s^{*}]$ , we know due to Corollary 1 that the quality parameter of the corresponding projection of $\tilde{P}$ is m and therefore we can bound its discrepancy only trivially by 1. Whenever we have $u \subseteq [s^{*}]$ , however, we can use existing discrepancy bounds for the corresponding net. To this end, we use the results from [7], which are, to our best knowledge, the currently best-known general upper discrepancy bounds for (t, m, s)-nets. This yields, for any non-empty set $u \subseteq [s]$ ,

\begin{matrix} sup_{x \in {(0, 1]}^{s}} |Δ_{\tilde{P}, u}, (x)| \leq \{\begin{matrix} 1 & if u ⊈ [s^{*}], \\ (b^{{\tilde{t}}_{u}} / b^{m}) \sum_{v = 0}^{|u| - 1} a_{v, b}^{(|u|)} m^{v} & if u \subseteq [s^{*}] and |u| \geq 2, \\ b^{{\tilde{t}}_{u}} / b^{m} & if u \subseteq [s^{*}] and |u| = 1, \end{matrix}) \end{matrix}

where

\begin{matrix} a_{v, b}^{(|u|)} = & (\binom{|u| - 2}{v}) {(\frac{b + 2}{2})}^{|u| - 2 - v} \frac{{(b - 1)}^{v}}{2^{v} v!} (a_{0, b}^{(2)} + {|u|}^{2} - 4) \\ + (\binom{|u| - 2}{v - 1}) {(\frac{b + 2}{2})}^{|u| - 1 - v} \frac{{(b - 1)}^{v - 1}}{2^{v - 1} v!} a_{1, b}^{(2)}, \end{matrix}

for $0 \leq v \leq |u| - 1$ , with

a_{0, b}^{(2)} = \{\begin{matrix} \frac{b + 8}{4} & if b is even, \\ \frac{b + 4}{2} & if b is odd, \end{matrix}) and a_{1, b}^{(2)} = \{\begin{matrix} \frac{b^{2}}{4 (b + 1)} & if b is even, \\ \frac{b - 1}{4} & if b is odd. \end{matrix})

This then yields

\begin{matrix} D_{b^{m}, γ}^{*} (\tilde{P}) \leq max \{\underset{u ⊈ [s^{*}]}{max_{\emptyset \neq u \subseteq [s]}} γ_{u}, \underset{|u| = 1}{max_{u \subseteq [s^{*}]}} γ_{u} \frac{b^{{\tilde{t}}_{u}}}{b^{m}}, \underset{|u| \geq 2}{max_{u \subseteq [s^{*}]}} γ_{u} \frac{b^{{\tilde{t}}_{u}}}{b^{m}} \sum_{v = 0}^{|u| - 1} a_{v, b}^{(|u|)} m^{v}\} . \end{matrix}

Let us analyze the three maxima in the curly brackets in (15) in greater detail. To this end, as also in [1], we restrict ourselves to product weights in the following, i.e., we assume weights $γ_{u} = \prod_{j \in u} γ_{j}$ with $γ_{1} \geq γ_{2} \geq \dots > 0$ .

For the first term, we proceed as in [1], namely we use that $w_{j} \geq m$ if $j \in u \ [s^{*}]$ , and obtain for $v = u \cap [s^{*}]$ that

\begin{matrix} γ_{u} \leq γ_{v} γ_{u \ v} \frac{1}{b^{m}} \prod_{j \in u \ v} (1 + b^{w_{j}}) \leq \frac{1}{b^{m}} \prod_{j \in u} γ_{j} (1 + b^{w_{j}}) . \end{matrix}

For the second maximum in (15), note that we have $u = {j}$ for some $j \in [s^{*}]$ , and hence ${\tilde{t}}_{u} \leq min {m, w_{j} + t_{{j}}}$ by Corollary 1. Consequently,

\begin{matrix} \underset{|u| = 1}{max_{u \subseteq [s^{*}]}} γ_{u} \frac{b^{{\tilde{t}}_{u}}}{b^{m}} \leq max_{j \in [s^{*}]} γ_{j} \frac{b^{min {m, w_{j} + t_{{j}}}}}{b^{m}} . \end{matrix}

For the third maximum in (15), we again use Corollary 1, and obtain

\begin{matrix} \underset{|u| \geq 2}{max_{u \subseteq [s^{*}]}} γ_{u} \frac{b^{{\tilde{t}}_{u}}}{b^{m}} \sum_{v = 0}^{|u| - 1} a_{v, b}^{(|u|)} m^{v} \leq \underset{|u| \geq 2}{max_{u \subseteq [s^{*}]}} γ_{u} \frac{b^{min {m, w_{\bar{u}} + t_{u}}}}{b^{m}} \sum_{v = 0}^{|u| - 1} a_{v, b}^{(|u|)} m^{v} . \end{matrix}

Using these estimates in (15), we obtain

\begin{matrix} D_{b^{m}, γ}^{*} (\tilde{P}) \\ \leq & max \{\underset{u ⊈ [s^{*}]}{max_{\emptyset \neq u \subseteq [s]}} \frac{1}{b^{m}} \prod_{j \in u} γ_{j} (1 + b^{w_{j}}), max_{j \in [s^{*}]} γ_{j} \frac{b^{w_{j} + t_{{j}}}}{b^{m}}, \underset{|u| \geq 2}{max_{u \subseteq [s^{*}]}} γ_{u} \frac{b^{min {m, w_{\bar{u}} + t_{u}}}}{b^{m}} \sum_{v = 0}^{|u| - 1} a_{v, b}^{(|u|)} m^{v}\} . \end{matrix}

Remark 6

A few remarks on (19) are in order. Note that only the first term in the curly brackets in (19) depends on s. The two remaining terms depend on $s^{*}$ , which can be independent of s if the reduction indices $w_{j}$ increase sufficiently fast. However, let us give a few further details on these observations.

We may want that the first term

\frac{1}{b^{m}} \prod_{j \in u} γ_{j} (1 + b^{w_{j}}) \leq \frac{1}{b^{m}} \prod_{j = 1}^{s} γ_{j} (1 + b^{w_{j}})

be bounded by $κ / b^{m}$ for some constant $κ > 0$ independent of s. Let $j_{0} \in N$ be minimal such that $γ_{j} \leq 1$ for all $j > j_{0}$ . Then we impose $\prod_{j = 1}^{s} γ_{j} (1 + b^{w_{j}}) \leq γ_{1}^{0} \prod_{j = 1}^{s} (1 + γ_{j} b^{w_{j}}) \leq κ$ . Hence it is sufficient to choose $κ > γ_{1}^{0}$ and for all $j \in [s]$ ,

\begin{matrix} w_{j} : = min (⌊{log}_{b}, (\frac{{(\frac{κ}{γ_{1}^{0}})}^{1 / s} - 1}{γ_{j}})⌋, m) . \end{matrix}

The choice of the $w_{j}$ in (20) depends on s. For sufficiently fast decaying weights $γ_{j}$ , it is possible to choose the $w_{j}$ such that they no longer depend on s. Indeed, suppose, e.g., that $γ_{j} = j^{- 2}$ . Then we could choose the $w_{j}$ such that, for some $τ \in (1, 2)$ ,

\begin{matrix} w_{j} \leq min (⌊{log}_{b}, (j^{2 - τ})⌋, m) . \end{matrix}

This then yields

\begin{matrix} \prod_{j = 1}^{s} (1 + γ_{j} b^{w_{j}}) \leq exp (\sum_{j = 1}^{s} log (1 + γ_{j} b^{w_{j}})) \leq exp (\sum_{j = 1}^{s}, γ_{j}, b^{w_{j}}) \leq exp (ζ (τ)), \end{matrix}

where $ζ (\cdot)$ is the Riemann zeta function. This gives a dimension-independent bound on the term $\prod_{j = 1}^{s} γ_{j} (1 + b^{w_{j}})$ from above, and hence a dimension-independent bound for all of $D_{b^{m}, γ}^{*} (\tilde{P})$ .

Regarding the second term in (19), this term only depends on one-dimensional projections of $\tilde{P}$ . In particular, if we choose the $w_{j}$ as in (21), this expression should be easy to bound from above. This is even more so if the t-values of the one-dimensional projections of the non-reduced net $P$ are low, which may often be the case (in fact, the t-values of one-dimensional projections might even be zero in many examples). Thus we can bound the second term by an expression of the form $κ^{*} / b^{m}$ , which only depends on $s^{*}$ but not on s.

Regarding the third term in (19), it crucially depends on the weights $γ$ and their interplay with the quality parameters of the projections of $P$ , $t_{u}$ . In particular, small quality parameters, in combination with sufficiently fast decaying weights and a suitable choice of the reduction indices $w_{j}$ , should yield tighter error bounds. Indeed, we could proceed similarly to [7, Corollary 1], and bound the third term in (19) by a term of the form

\underset{|u| \geq 2}{max_{u \subseteq [s^{*}]}} γ_{u} \frac{1}{b^{m}} (c_{|u|} m^{|u| - 1} + O (m^{|u| - 2})),

where $c_{|u|}$ depends on $b, t_{u}$ and $|u|$ , but not on m. Note that also the third term only depends on $s^{*}$ and not on s, so for sufficiently fast increasing reduction indices $w_{j}$ , the dimension s does not matter. In summary, we obtain

D_{b^{m}, γ}^{*} (\tilde{P}) \leq max \{\frac{κ}{b^{m}}, \frac{κ^{*}}{b^{m}}, \underset{|u| \geq 2}{max_{u \subseteq [s^{*}]}} γ_{u} \frac{1}{b^{m}} (c_{|u|} m^{|u| - 1} + O (m^{|u| - 2}))\} .

Remark 7

Note that our new result yields an advantage over the corresponding result for row reduced nets in [1]. In that paper, one needs to work with the quality parameters of the projections of the reduced net, which are, in general, not known. In the present paper, we benefit from the combination of the column reduction and the fact that the nets considered here are derived from digital sequences, which guarantees additional structure. Usually, it is computationally involved to determine the t-value of a digital net or sequence from the generating matrices, since many linear independence conditions need to be checked. Here, however, we can use Theorem 1 and Corollary 1, which relate the t-values of $P$ to those of $\tilde{P}$ , and thus give us an advantage. In particular, if $P$ is obtained from, say, a Sobol’ or a Niederreiter sequence, it should be possible to have t-values that are guaranteed to be reasonably low.

Numerical experiments

In this section, we test the computational performance of column reduced digital nets for matrix products XA, where A is an $s \times τ$ matrix, as detailed in Section 4.1. We implemented Algorithm 1 in the Julia programming language (Version 1.9.3).1 In the following plots, we compare the runtime of Algorithm 1 to the standard matrix multiplication and also the matrix multiplication using the points from row reduced digital nets as given in [1, Algorithm 4]. We remark that the reported runtimes are also affected by technical implementation details such as memory efficiency, a detailed discussion of which is out of scope here.

For the generating matrices $C_{1}^{(m)}, \dots, C_{s}^{(m)}$ , we used random matrices in $F_{b}^{m \times m}$ , since the matrix product computation itself does not depend on the entries of the matrix, i.e, we get similar relations of runtimes if we use generating matrices of specific digital sequences like Sobol’ or Niederreiter sequences.

In Fig. 1 we see, for fixed $b = 2, m = 12$ , and $τ = 20$ , how the runtime changes as we vary s. We compare this for two different choices of reduction indices $w_{j}$ . We see that in this case, using column reduced digital nets in Algorithm 1 performs better than the use of row reduced digital nets in [1, Algorithm 4] and also the standard matrix multiplication.

As the reduction indices $w_{j}$ increase more slowly (as in Fig. 1b), the difference in performance between the standard multiplication and Algorithm 1 reduces. We can see this also theoretically by inserting the weights in Remark 4.

In Fig. 2, we study the behavior for fixed $b = 2, s = 800$ , and $τ = 20$ as m increases. Note that we use the logarithmic scale for the time but not for m. We observe that also in this case Algorithm 1 seems to perform better than the row reduced case.

Overall, the numerical tests for the runtime using column reduced digital nets fit our theoretical estimate for the runtime as given in Remark 4 and comparison with the row reduced algorithm reveals that the column reduced algorithm could yield a better performance. Additionally to this practical advantage, column reduced matrices also have a theoretical advantage over row reduced matrices, as pointed out in Remark 7.

Conclusion

Column reduced digital nets have applications in the field of quasi-Monte Carlo methods. We can speed up the matrix-matrix multiplication in the quasi-Monte Carlo method by exploiting the repetitive structure of the points of a column reduced digital net. The bounds for the quality parameter (t-value) of column reduced digital nets have not been studied before.

In our research, we provide an algorithm for the matrix-matrix product using column reduced digital nets, which is faster than the standard matrix multiplication algorithm. In addition, we provide bounds for the t-value for column reduced digital nets. This is very essential for the error analysis of our method and has an advantage over the corresponding result for the row reduced nets in [1].

For future work, one could consider relaxing the conditions we impose on the t-value of the underlying digital sequence. One could also explore in-depth the interplay between column and row reduced digital nets.

Acknowledgements

The authors acknowledge the support of the Austrian Science Fund (FWF) Project 10.55776/P34808. For open access purposes, the authors have applied a CC BY public copyright license to any author accepted manuscript version arising from this submission.

Author Contributions

Both authors have contributed to all sections of the paper to equal parts.

Funding

Open access funding provided by Österreichische Akademie der Wissenschaften.

Data Availability

The supporting numerical experiments in the manuscript have been computed by source code that is available at: https://github.com/Vishnupriya-Anupindi/ReducedDigitalNets.jl.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Source code available at https://github.com/Vishnupriya-Anupindi/ReducedDigitalNets.jl

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Dick, J., Ebert, A., Herrmann, L., Kritzer, P., Longo, M.: The fast reduced QMC matrix-vector product. J. Comput. Appl. Math. 440, 115642 (2024) [Google Scholar]
2.Dick, J., Kritzer, P., Leobacher, G., Pillichshammer, F.: A reduced fast component-by-component construction of lattice points for integration in weighted spaces with fast decreasing weights. J. Comput. Appl. Math. 276, 1–15 (2015) [Google Scholar]
3.Dick, J., Kritzer, P., Pillichshammer, F.: Lattice Rules-Numerical Integration, Approximation, and Discrepancy. Springer, Cham (2022) [Google Scholar]
4.Dick, J., Kuo, F.Y., Le Gia, Q.T., Schwab, C.: Fast QMC matrix-vector multiplication. SIAM J. Sci. Comput. 37, A1436–A1450 (2015) [Google Scholar]
5.Dick, J., Kuo, F.Y., Sloan, I.H.: High-dimensional integration–the quasi-Monte Carlo way. Acta Numer. 22, 133–288 (2013) [Google Scholar]
6.Dick, J., Pillichshammer, F.: Digital Nets and Sequences. Discrepancy Theory and Quasi-Monte Carlo Integration. Cambridge University Press, Cambridge (2010) [Google Scholar]
7.Faure, H., Kritzer, P.: New star discrepancy bounds for -nets and -sequences. Monatsh. Math. 172, 55–75 (2013) [Google Scholar]
8.Hofer, R., Niederreiter, H.: A construction of (t, s)-sequences with finite-row generating matrices using global function fields. Finite Fields and Their Applications 21, 97–110 (2013) [Google Scholar]
9.Niederreiter, H.: Low-discrepancy point sets obtained by digital constructions over finite fields. Czechoslovak Math. J. 42, 143–166 (1992) [Google Scholar]
10.Niederreiter, H.: Random number generation and quasi-monte carlo methods. CBMS-NSF Regional Conference Series in Applied Mathematics, 63. Society for Industrial and Applied Mathematics (SIAM), Philadelphia (1992)
11.Pillichshammer, F.: Tractability properties of the weighted star discrepancy of regular grids. J. Complexity 46, 103–112 (2018) [Google Scholar]
12.Sloan, I.H., Woźniakowski, H.: When are quasi-Monte Carlo algorithms efficient for high-dimensional integrals? J. Complexity 14, 1–33 (1998) [Google Scholar]
13.Tezuka, S.: On the discrepancy of generalized Niederreiter sequences. J. Complexity 29, 240–247 (2013) [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The supporting numerical experiments in the manuscript have been computed by source code that is available at: https://github.com/Vishnupriya-Anupindi/ReducedDigitalNets.jl.

[CR1] 1.Dick, J., Ebert, A., Herrmann, L., Kritzer, P., Longo, M.: The fast reduced QMC matrix-vector product. J. Comput. Appl. Math. 440, 115642 (2024) [Google Scholar]

[CR2] 2.Dick, J., Kritzer, P., Leobacher, G., Pillichshammer, F.: A reduced fast component-by-component construction of lattice points for integration in weighted spaces with fast decreasing weights. J. Comput. Appl. Math. 276, 1–15 (2015) [Google Scholar]

[CR3] 3.Dick, J., Kritzer, P., Pillichshammer, F.: Lattice Rules-Numerical Integration, Approximation, and Discrepancy. Springer, Cham (2022) [Google Scholar]

[CR4] 4.Dick, J., Kuo, F.Y., Le Gia, Q.T., Schwab, C.: Fast QMC matrix-vector multiplication. SIAM J. Sci. Comput. 37, A1436–A1450 (2015) [Google Scholar]

[CR5] 5.Dick, J., Kuo, F.Y., Sloan, I.H.: High-dimensional integration–the quasi-Monte Carlo way. Acta Numer. 22, 133–288 (2013) [Google Scholar]

[CR6] 6.Dick, J., Pillichshammer, F.: Digital Nets and Sequences. Discrepancy Theory and Quasi-Monte Carlo Integration. Cambridge University Press, Cambridge (2010) [Google Scholar]

[CR7] 7.Faure, H., Kritzer, P.: New star discrepancy bounds for -nets and -sequences. Monatsh. Math. 172, 55–75 (2013) [Google Scholar]

[CR8] 8.Hofer, R., Niederreiter, H.: A construction of (t, s)-sequences with finite-row generating matrices using global function fields. Finite Fields and Their Applications 21, 97–110 (2013) [Google Scholar]

[CR9] 9.Niederreiter, H.: Low-discrepancy point sets obtained by digital constructions over finite fields. Czechoslovak Math. J. 42, 143–166 (1992) [Google Scholar]

[CR10] 10.Niederreiter, H.: Random number generation and quasi-monte carlo methods. CBMS-NSF Regional Conference Series in Applied Mathematics, 63. Society for Industrial and Applied Mathematics (SIAM), Philadelphia (1992)

[CR11] 11.Pillichshammer, F.: Tractability properties of the weighted star discrepancy of regular grids. J. Complexity 46, 103–112 (2018) [Google Scholar]

[CR12] 12.Sloan, I.H., Woźniakowski, H.: When are quasi-Monte Carlo algorithms efficient for high-dimensional integrals? J. Complexity 14, 1–33 (1998) [Google Scholar]

[CR13] 13.Tezuka, S.: On the discrepancy of generalized Niederreiter sequences. J. Complexity 29, 240–247 (2013) [Google Scholar]

PERMALINK

Column reduced digital nets

Vishnupriya Anupindi

Peter Kritzer

Abstract

Introduction

The problem setting

Digital nets and sequences

Definition 1

Definition 2

Definition 3

The t-values of column reduced digital nets

Column reduction for (t, m, s)-nets

Theorem 1

Proof

Remark 1

Remark 2

Remark 3

Column reduction for (t,m,e,s)-nets

Definition 4

Example 1

Projections of column reduced digital nets

Corollary 1

Applications of column reduced digital nets

A reduced matrix product algorithm

Algorithm 1.

Remark 4

Remark 5

Error analysis

Remark 6

Remark 7

Numerical experiments

Fig. 1.

Fig. 2.

Conclusion

Acknowledgements

Author Contributions

Funding

Data Availability

Declarations

Competing interests

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Column reduction for $(t, m, e, s)$ -nets