Skip to main content
Springer logoLink to Springer
. 2025 Mar 24;101(3):1451–1473. doi: 10.1007/s11075-025-02050-8

Column reduced digital nets

Vishnupriya Anupindi 1, Peter Kritzer 1,
PMCID: PMC12945907  PMID: 41767827

Abstract

Digital nets provide an efficient way to generate integration nodes of quasi-Monte Carlo (QMC) rules. For certain applications, as e.g. in uncertainty quantification, we are interested in obtaining a speed-up in computing products of a matrix with the vectors corresponding to the nodes of a QMC rule. In the recent paper The fast reduced QMC matrix-vector product (Dick et al. J. Comput. Appl. Math. 440, 115642 2024), a speed up was obtained by using so-called reduced lattices and row reduced digital nets. In this work, we propose a different multiplication algorithm where we exploit the repetitive structure of column reduced digital nets instead of row reduced digital nets. This method has advantages over the previous one, as it facilitates the error analysis when using the integration nodes in a QMC rule. We also provide an upper bound for the quality parameter of column reduced digital nets, and numerical tests to illustrate the efficiency of the new algorithm.

Keywords: Numerical integration, Digital net, Digital sequence, Reduced digital net

Introduction

The problem setting

In many applications, such as in statistics, finance, and uncertainty quantification, we would like to numerically compute

Df(xA)dμ(x), 1

where A is a real s×τ matrix, by quasi-Monte Carlo (QMC) rules

QN(f):=1Nk=0N-1f(xkA), 2

where xk=(xk(1),,xk(s)) are column vectors corresponding to the points used in the QMC rule. Problems of this kind particularly arise in some important applications in statistics and uncertainty quantification. For instance, this approach can be used when approximating the expected value of a function with a multivariate normal random variable with some given covariance matrix, or when approximating the expected value of the solution of a PDE with random coefficients, see, e.g., [4]. In some cases the domain D in (1) will be chosen as D=[0,1]s. In this case, it is natural to directly use QMC sample points like lattice point sets (see [3]) or (tms)-nets (see [6]) as the points xk. This is the situation we shall mostly consider in the present paper, in particular with respect to the error analysis in Section 4.2. However, in some applications, such as those mentioned above, the domain may be, e.g., D=Rs. Then the points xk frequently are of the form xk=Φ-1(yk), k{0,1,,N-1}, where the yk are the QMC sample points, and Φ-1 is the inverse of the cumulative distribution function of a standard normal distribution, which is applied component-wise to vectors. In order to avoid that certain coordinates of the sample points are mapped to ±, one can first shift the yk to the right by a sufficiently small quantity. Many results presented here also hold for this case, in particular the matrix product algorithm presented in Section 4.1, see Remark 5. We also refer to [4], where a similar situation is studied using a different computational method.

Computing the vector-matrix products xkA for all k{0,,N-1} takes O(Nsτ) operations. This problem is equivalent to computing the matrix-matrix product XA, where

X=x0,x1,,xN-1

is the N×s matrix whose k-th row is xk. Computing XA can be infeasible in situations where s and N are both large (which happens in many applications).

In the paper [4], it is shown that when using particular types of QMC rules, the cost to evaluate QN(f), as in (2), can be reduced to only O(τNlogN) operations provided that logNs. This reduction in computational cost is achieved by a fast matrix-matrix multiplication exploiting the fact that for specifically chosen point sets, such as (polynomial) lattice rules, the matrix X can be re-ordered to be of circulant structure.

The recent paper [1] studies an alternative method to reduce the computation time by imposing a certain structure of the points x0,,xN-1. The key idea of this approach is to find situations in which the components of the points xk have a certain repetitive structure, which then facilitates systematic fast computation of the products xkA. This can be achieved by suitable modifications of (polynomial) lattice point sets using ideas from [2], but how to implement this idea for digital nets, which are more general than polynomial lattice point sets and among the most commonly used QMC node sets, is not straightforward. In [1], the authors made a first attempt and studied a reduction of the computation time for digital (tms)-nets by setting certain rows of the generating matrices to zero (we refer to Section 1.2 for the precise definition of digital nets and their generating matrices). The basic idea in [1] is that for each of the s generating matrices Cj(m), 1js, of the digital net, we identify a so-called reduction index wjZ and set the last wj rows of Cj(m) equal to zero. As shown in [1], this introduces a certain repetitiveness in the entries of the matrix X and speeds up the computation of the matrix-matrix product XA. We call such digital nets row reduced digital nets. However, for assessing the quality of reduced nets when used in QMC rules, it is more natural to study the situation where certain columns of the generating matrices are set to zero, since this directly corresponds to the reduced (polynomial) lattice point sets, resulting in the consideration of column reduced digital nets. The idea of column reduced digital nets is to set the last wj columns of the generating matrix Cj(m), 1js, equal to zero, instead of setting rows equal to zero. Furthermore, in the present paper, we focus on digital nets that are obtained from digital sequences, which implies additional structure in the generating matrices. Again, the approach of using column reduced digital nets yields a speed-up in the computation of XA, but as we will see below, it also makes it easier to assess the properties of the resulting column reduced digital nets than doing the same for row reduced digital nets. Furthermore, the error analysis for approximating (1) by (2) becomes easier. This idea was already mentioned (but not pursued) in [1], and this is what we intend to do in the present paper.

Digital nets and sequences

In this section, we give the definitions of (tms)-nets and (ts)-sequences, the digital construction method for these, and shortly outline how to assess their quality.

Let Fb be a finite field with b elements, where b is prime. We identify the elements of Fb with the set {0,1,,b-1}. An elementary interval in base b and dimension s is a half-open interval of the form j=1s[ajb-dj,(aj+1)b-dj) where the aj,dj are nonnegative integers with 0aj<bdj for 1js.

In the following, we recall the definition of (tms)-nets and (ts)-sequences, which have the property that the number of points in certain elementary intervals is proportional to their sizes. This guarantees a degree of uniform distribution of the point set in [0,1)s, which is desirable when using such a point set in a QMC rule. For detailed discussions on (tms)-nets and (ts)-sequences, we refer to [6, 10].

Definition 1

For a given dimension s1 and nonnegative integers tm with 0tm, a (t, m, s)-net in base b is a point set P[0,1)s consisting of bm points such that any elementary interval in base b with volume bt-m contains exactly bt points of P.

A sequence (x0,x1,) of points in [0,1)s is called a (t, s)-sequence in base b if for all integers mt and k0, the point set consisting of the points xkbm,,xkbm+bm-1 forms a (tms)-net in base b.

Note that the lower the value of t of a (tms)-net or a (ts)-sequence, the more uniformly the points are distributed in [0,1)s, which is a desirable property when the point set is used as an integration node set in a QMC rule. This is the reason why t is referred to as the quality parameter of a net or sequence.

A (tms)-net is called strict, if it does not fulfill the requirements of a (t-1,m,s)-net (for t1), and analogously for (ts)-sequences. In general, any (tms)-net is also a (t+1,m,s)-net for t<m.

We point out that it is, in general, a non-trivial combinatorial question of which values of t can be reached for which configurations of the other parameters. We again refer to [6, 10] for details.

A common way to generate (tms)-nets and (ts)-sequences is using the digital method, which was first introduced by Niederreiter in [9].

Definition 2

A digital (tms)-net over Fb is a (tms)-net P={x0,,xbm-1} where the points are constructed as follows. Let C1(m),,Cs(m) in Fbm×m be matrices over Fb. To generate the k-th point in P, 0kbm-1, we use the b-adic expansion k=i=0m-1kibi with digits ki{0,,b-1} which we denote by k=(k0,,km-1). The j-th coordinate xk,j of xk=(xk,1,,xk,s) is obtained by computing

xk,j:=Cj(m)k,

and then setting

xk,j:=xk,j·(b-1,b-2,,b-m).

Similarly, a digital (ts)-sequence S over Fb is generated by infinite matrices C1,,Cs, where

Cj=(ci,r(j))i,rNFbN×N. 3

To generate the k-th point in S, k0, we use the b-adic expansion k=i=0kibi with digits ki{0,,b-1} which we denote by k=(k0,k1,). The j-th coordinate xk,j of xk=(xk,1,,xk,s) is obtained by computing

xk,j:=Cj(m)k,

and then setting

xk,j:=xk,j·(b-1,b-2,).

Note that from any digital (ts)-sequence over Fb with generating matrices C1,,Cs, we can, for mt, derive a digital (tms)-net over Fb, simply by considering the point set generated by the left upper m×m submatrices C1(m),,Cs(m) of C1,,Cs. This is equivalent to considering the first bm points of the (ts)-sequence.

As pointed out above, the quality of a (tms)-net or (ts)-sequence is determined by its t-value. For digital (tms)-nets and (ts)-sequences, we can determine the t-value from rank conditions on the generating matrices, using a quantity that we shall refer to as the linear independence parameter.

Definition 3

For any integers 1js and m1, let C1(m),C2(m),,Cs(m) be m×m matrices over Fb. Then the linear independence parameter ρm(C1(m),C2(m),,Cs(m)) is defined as the largest integer such that for any choice of d1,,dsN0, with d1++ds=ρm, we have that

the firstd1rows ofC1(m)together withthe firstd2rows ofC2(m)together withthe firstdsrows ofCs(m)

are linearly independent over Fb.

It is known (see, e.g., [6, 10]) that the generating matrices C1(m),C2(m),,Cs(m) of a digital (tms)-net over Fb satisfy

ρm(C1(m),C2(m),,Cs(m))m-t, 4

where we have equality if the net is a strict (tms)-net. Similarly, for the generating matrices C1,,Cs of a digital (ts)-sequence over Fb we must have ρm(C1(m),,Cs(m))m-t for all mmax{t,1}, where Cj(m) denotes the left upper m×m submatrix of Cj for j{1,,s}. Hence, for digital nets and sequences, their quality can be assessed by checking linear independence conditions on the rows of the generating matrices.

The t-values of column reduced digital nets

Column reduction for (tms)-nets

Now we turn towards the primary object of our study, which is the column reduced digital nets. We note that if we take a general digital (tms)-net and set some columns of its generating matrices to zero, we cannot control the quality parameter of the reduced net. However, since digital (ts)-sequences require stronger conditions on their generating matrices, we can estimate the quality parameter of reduced digital (tms)-nets derived from digital sequences by taking the nets generated by the left upper m×m submatrices of the generating matrices of the sequences.

For mt, we consider the digital (tms)-net generated by the matrices C1(m),,Cs(m), derived via the above principle from a digital (ts)-sequence with generating matrices C1,,Cs, Cj=(ci,r(j)), i,rN.

Let 0=w1wsN0, we call these numbers the reduction indices, for the generating matrices Cj(m). We derive the corresponding reduced matrices C~1(m),,C~s(m), with C~j(m)=(c~i,r(j)), i,r{1,2,,m}, for 1js, where

c~i,r(j)=ci,r(j)ifr{1,,m-min(m,wj)},0ifr{m-min(m,wj)+1,,m}. 5

That is, the first m-min(m,wj) columns of C~j(m) are the same as the columns of the matrix Cj(m), and we set the last min(m,wj) columns to zero, i.e, if wj<m,

C~j(m)=c1,1(j)c1,(m-wj)(j)00c(m-wj),1(j)c(m-wj),(m-wj)(j)00cm,1(j)cm,(m-wj)(j)00.

We are interested in estimating the quality parameter of the digital net generated by the C~j(m).

Apart from the main motivation outlined in Section 1, there is another computational advantage of using column reduced digital nets. Indeed, by the general construction principle of digital point sets, the generating matrices of a digital net or sequence are multiplied over Fb by vectors representing the digits of the indices of the elements of the point set. By replacing the matrices Cj(m) by C~j(m), we increase the sparsity of the generating matrices, which saves computation time in the generation of the point set.

Theorem 1

Let P be a digital (tms)-net over Fb with generating matrices C1(m),,Cs(m) derived from a digital (ts)-sequence over Fb, where we assume that mt. Let C~1(m),,C~s(m) be as defined in (5) with respect to reduction indices 0=w1ws and let t~ be the minimal quality parameter of the net generated by the C~j(m). Then,

max{0,m-ws-t}ρmC~1(m),,C~s(m)max{0,m-ws}, 6

and t~min{m,ws+t}.

Furthermore, if P is a strict digital (tms)-net, it is true that

ρmC~1(m),,C~s(m)max{0,m-max{t,ws}}. 7

Proof

We note that we have mt by assumption. If wsm, then we trivially have ρmC~1(m),,C~s(m)=0, as C~s(m) only contains zeros, and (6) holds.

Therefore, we will assume for the rest of the proof that ws<m.

We prove the second inequality in (6) first. We have

C~s(m)=c1,1(s)c1,(m-ws)(s)00c(m-ws),1(s)c(m-ws),(m-ws)(s)00cm,1(s)cm,(m-ws)(s)00.

Let D be the matrix containing the first d1 rows of C~1(m), the first d2 rows of C~2(m), etc., up to the first ds rows of C~s(m), where d1,,ds are nonnegative integers satisfying d1++ds=m-ws. For the special choice (d1,,ds)=(0,,0,m-ws), we have rank(D)=rank(C~s(m))m-ws. Therefore,

ρmC~1(m),,C~s(m)m-ws.

Now we prove the first inequality in (6). If m-ws-t<0, the inequality is trivial.

Otherwise, i.e., if m-wst, we know that

ρkC1(k),,Cs(k)k-t, 8

for any kt, since our net is derived from a digital (ts)-sequence. Here, Cj(k), 1js, denotes the left upper k×k submatrix of Cj. In particular, we observe that for the left upper (m-ws)×(m-ws) submatrices of C1,,Cs,

ρ(m-ws)(C1(m-ws),,Cs(m-ws))m-ws-t.

We now consider arbitrary integers d1,,ds0 with d1++ds=m-ws-t. Let ki(j) denote the i-th row vector of Cj(m-ws)Fb(m-ws)×(m-ws). We know that

k1(1),,kd1(1),k1(2),,kd2(2),,,k1(s),,kds(s) 9

are linearly independent over Fb. Let ci(j) denote the i-th row vector of C~i(m)Fbm×m. We observe that for 1im-ws,

ci(j)=(ki(j),ui(j))Fb1×m,

where the ki(j) are as above and ui(j)Fb1×ws.

The row vectors

c1(1),,cd1(1),c1(2),,cd2(2),,,c1(s),,cds(s) 10

are linearly independent, since otherwise the row vectors in (9), which are projections of ci(j) onto the first m-ws entries, would be linearly dependent. Therefore,

ρmC~1(m),,C~s(m)m-ws-t.

This concludes the proof of (6). Using (4) and the lower bound in (6), we obtain the upper bound for t~.

It remains to show (7).

Let D be the matrix containing the first d1 rows of C~1(m), the first d2 rows of C~2(m), etc., up to the first ds rows of C~s(m), where d1,,ds are nonnegative integers. As above, for the special choice (d1,,ds)=(0,,0,m-ws), we have rank(D)=rank(C~s(m))(m-ws). So,

ρmC~1(m),,C~s(m)m-ws.

However, since we assume that P is a strict digital (tms)-net in this part of the proof, there must exist a choice of (d1,,ds) with d1++ds=m-t+1 such that the corresponding rows of C1(m),,Cs(m) are linearly dependent, and therefore also the corresponding rows of C~1(m),,C~s(m) are linearly dependent. This yields

ρmC~1(m),,C~s(m)m-t,

so we must have

ρmC~1(m),,C~s(m)m-max{t,ws}.

Remark 1

For t=0 and ws<m in Theorem 1, we obtain equality in (6) and therefore

ρmC~1(m),,C~s(m)=m-ws,

and t~=ws.

Remark 2

We now give an example that illustrates that the lower bound in Theorem 1 is sharp.

Assume that Q is a digital (0, 2)-sequence with generating matrices D1 and D2 (examples of Q exist, e.g., by choosing as Q a Niederreiter sequence, see [9]).

From Q, we construct a digital (t, 2)-sequence P, by prepending exactly t zero columns to both D1 and D2. That is, we construct new generating matrices Cj, j{1,2}, such that

graphic file with name 11075_2025_2050_Equ55_HTML.gif

It is easily checked that C1,C2 generate a digital (t, 2)-sequence; indeed, let mt be arbitrarily chosen but fixed. Then the matrices C1(m),C2(m) contain the matrices D1(m-t),D2(m-t) as submatrices. As D1,D2 generate a (0, 2)-sequence, for any d1,d2N0 with d1+d2=m-t the first d1 rows of D1(m-t) together with the first d2 rows of D2(m-t) must be linearly independent, so also the corresponding rows of C1(m) and C2(m) (with zeros prepended) must be linearly independent. This establishes that C1 and C2 generate a (t, 2)-sequence.

Let now mt, and let w1=0, and w2w1 be reduction indices such that m-w2-t0. Then C~1(m)=C1(m), and

graphic file with name 11075_2025_2050_Equ56_HTML.gif

where D2(m×(m-t-w2)) denotes the left upper m×(m-t-w2) submatrix of D2. By Theorem 1, we know that ρmC~1(m),C~2(m)m-t-w2. However, ρmC~1(m),C~2(m)>m-t-w2 cannot hold since the first m-t-w2+1 rows of C~2(m) must be linearly dependent.

This implies that the lower bound in Theorem 1 is sharp.

Remark 3

Next, we provide an example showing that the upper bound (7) for strict digital nets in Theorem 1 is sharp.

We use the same notation as in Remark 2. We again start with the digital (0, 2)-sequence Q. Again, we transform Q into a (ts)-sequence, now called R, with generating matrices E1 and E2. For E1, we take the first generating matrix of P from above, i.e., E1=C1. Furthermore, we choose E2 as

graphic file with name 11075_2025_2050_Equ57_HTML.gif

where D2(t) is the left upper t×t submatrix of D2. First, note that R really is a strict (t, 2)-sequence. Indeed, if we consider the matrix E1(m) for m<t, this matrix only contains zeros, so the quality parameter of R must be at least t. On the other hand, let mt and consider the matrices E1(m) and E2(m). Choose d1,d20 such that d1+d2=m-t, and consider the first d1 rows of E1(m) together with the first d2 rows of E2(m). We distinguish two cases.

  • If d2t, then it is obvious that the first d1 rows of E1(m) together with the first d2 rows of E2(m) are linearly independent, as D1 and D2 generate a (0, 2)-sequence.

  • If d2>t, we proceed as follows. Assume to the contrary that the first d1 rows of E1(m) together with the first d2 rows of E2(m) were not linearly independent. By the structure of E1 and E2, this would immediately imply that the first d1 rows of D1(m-t) together with the first d2-t rows of D2(m-t) are not linearly independent, where d1+d2-t=m-2t, which would be a contradiction to the property that D1 and D2 generate a digital (0, 2)-sequence.

Let now again mt, and let w1=0, and w2w1 be reduction indices such that m-w2-t0. Then E~1(m):=E1(m), and

graphic file with name 11075_2025_2050_Equ58_HTML.gif

We again distinguish two cases.

Case 1: max{t,w2}=w2. We claim that ρmE~1(m),E~2(m)=m-w2. To this end, let d1,d20 such that d1+d2=m-w2, which implies that d1 and d2 are both not larger than m-t. Then, we consider two sub-cases.

  • If d2t, it is clear because of the structure of the matrices that the first d1 rows of E~1(m) together with the first d2 rows of E~2(m) are linearly independent, as D1 and D2 generate a (0, 2)-sequence. This is guaranteed since we know that d1 and d2 are both not larger than m-t.

  • If d2>t, we proceed as follows. Assume to the contrary that the first d1 rows of E~1(m) together with the first d2 rows of E~2(m) were not linearly independent. By the structure of E~1(m) and E~2(m), this would immediately imply that the first d1 rows of D1(m-t) together with the first d2-t rows of D2((m-t)×(m-t-w2)) are not linearly independent, where d1+d2-t=m-t-w2. Note, however, that D1(m-t) contains D1(m-t-w2) as its left upper submatrix, and also D2((m-t)×(m-t-w2)) contains D2(m-t-w2) as its left upper submatrix. By the property that D1 and D2 generate a (0, 2)-sequence, and by the assumption that m-w2t, the first d1 rows of D1(m-t-w2) together with the first d2-t rows of D2(m-t-w2) must be linearly independent. The same must, however, then also hold for the corresponding rows of D1(m-t) and D2((m-t)×(m-t-w2)), which yields a contradiction.

Hence we have shown that ρmE~1(m),E~2(m)m-w2, and by Theorem 1 we must actually have ρmE~1(m),E~2(m)=m-w2.

Case 2: max{t,w2}=t. We claim that ρmE~1(m),E~2(m)=m-t. To this end, let d1,d20 such that d1+d2=m-t. Also here, we distinguish two sub-cases.

  • If d2t, it is obvious that the first d1 rows of E~1(m) together with the first d2 rows of E~2(m) are linearly independent, as D1 and D2 generate a (0, 2)-sequence. This is guaranteed since we know that d1 and d2 are both not larger than m-t.

  • If d2>t, we proceed as follows. Assume to the contrary that the first d1 rows of E~1(m) together with the first d2 rows of E~2(m) were not linearly independent. By the structure of E~1(m) and E~2(m), this would immediately imply that the first d1 rows of D1(m-t) together with the first d2-t rows of D2((m-t)×(m-t-w2)) are not linearly independent, where d1+d2-t=m-2tm-t-w2. Note, however, that D1(m-t) contains D1(m-t-w2) as its left upper submatrix, and also D2((m-t)×(m-t-w2)) contains D2(m-t-w2) as its left upper submatrix. By the property that D1 and D2 generate a (0, 2)-sequence, by the fact that d1+d2m-t-w2, and by the assumption that m-w2t, the first d1 rows of D1(m-t-w2) together with the first d2 rows of D2(m-t-w2) must be linearly independent. The same must, however, then also hold for the corresponding rows of D1(m-t) and D2((m-t)×(m-t-w2)), which yields a contradiction.

In summary, we have shown that (7) is sharp for strict digital nets.

Column reduction for (t,m,e,s)-nets

In [13] Tezuka introduced the concept of (t,m,e,s)-nets, which are a generalization of (tms)-nets. In this section, we briefly look at the quality parameter of column reduced nets under this generalized definition of nets. However, for the rest of the paper, we shall then stick to the notion of (tms)-nets again.

Definition 4

Let e=(e1,,es) and d=(d1,,ds) be integer vectors with ei1 and di0 for i{1,,s}, where s1 is the dimension. Let tm be non-negative integers with 0tm. A point set P[0,1)s with bm points is called a (t,m,e,s)-net in base b if every elementary interval J[0,1)s of volume bt-m and of the form

J=j=1s[ajbejdj,aj+1bejdj)

contains exactly bt points of P, where 0aj<bejdj for j{1,,s} and d satisfies the equation e1d1++esds=m-t.

If we choose e=(1,,1)Ns, we obtain the classical definition of a (tms)-net as given in Definition 1. For (tms)-nets, we have the propagation rule that a (tms)-net in base b is also a (vms)-net in base b for any integer v with tvm. However, with the above definition of (t,m,e,s)-nets, we do not have this propagation rule. In [8], the authors provided a revised definition of (t,m,e,s)-nets which ensures the above mentioned propagation rule. In this section, however, we work with the original definition provided by Tezuka in [13].

We note that all (tms)-nets are also (t,m,e,s)-nets, however for certain values of e, we can obtain a lower t-value for the corresponding (t,m,e,s)-net. In particular, for column-reduced digital nets, we can find certain examples where at least for some choices of e, the reduced net retains the original quality parameter t. Let us give some examples.

Example 1

Let b=2,s=2,m=4 and consider the (0, 4, 2)-net derived from the Sobol’ sequence, given by the generating matrices

C1=1000010000100001,C2=1111010100110001.

Let w1=0 and w2=1, then the resulting column reduced digital net is a (1, 4, 2)-net according to Theorem 1. However, for e=(e1,e2) chosen such that (e1d1,e2d2)=(4,0) is the only solution to the equation e1d1+e2d2=4, we obtain a column reduced net that is still a (0,4,e,2)-net. This is because the only elementary intervals J that satisfy the conditions in Definition 4 are of the form

J=[a1/24,(a1+1)/24)×[0,1).

Thus, the net property depends only on the first coordinates in the point set P, and since we do not set any columns of C1 to zero, i.e., w1=0, the resulting column reduced net is a (0,4,e,2)-net. Some concrete choices for e with the above property are (e1,e2)=(2,3), or (e1,e2)=(1,e2) where e2>4, or (e1,e2)=(4,e2) where e2=3 or e2>4.

In general, given a (tms)-net P derived from a (ts)-sequence and reduction indices 0=w1w2ws, for e=(e1,,es)=(m-t,k,,k), where either 1<k<m-t with gcd(k,m-t)=1 or k>m-t>0, the only solution to the equation j=1sejdj=m-t is d=(1,0,,0). Thus, the column reduced net P~ is an (m-t,m,e,s)-net for the above choice of e.

One might also consider digital (tms)-nets generated by C1,,Cs where the Cj for 2js are derived from a (ts)-sequence but C1 is not necessarily derived from a digital sequence, since we usually choose the first reduction index w1=0. In this case, one could perhaps find more choices of e=(e1,,es) such that the corresponding column reduced digital net is a (t,m,e,s)-net, depending on the reduction indices w2,,ws.

A general and complete analysis of reduced (t,m,e,s)-nets could be an interesting subject for future research.

Projections of column reduced digital nets

Due to the important role of the t-value, one sometimes also considers a slightly refined notion of a (tms)-net, which is then referred to as a ((tu)u[s],m,s)-net, where [s]:={1,,s}. The latter notion means that for any u, u[s], the projection of the net onto those components with indices in u is a (tu,m,u)-net. The notion of a ((tu)u[s],s)-sequence is defined analogously. Moreover, for u, we write u¯:=max(u).

If we assume (which we always do in this paper) that the reduction indices satisfy 0=w1w2ws, then, for any non-empty u[s], the reduction index wu¯ is the largest among all reduction indices corresponding to u. This yields the following adaption of Theorem 1, which obviously can be shown in the same manner.

Corollary 1

Let P be a digital ((tu)u[s],m,s)-net over Fb with generating matrices C1(m),,Cs(m), which has been derived from a digital ((tu)u[s],s)-sequence, where we assume that mt. Let C~1(m),,C~s(m) be the reduced generating matrices with respect to reduction indices 0=w1ws and let (t~u)u[s] be the minimal quality parameters of the projections of the net generated by the C~j(m). Then, for every non-empty u[s],

max{0,m-wu¯-tu}ρm((C~j(m))ju)max{0,m-wu¯},

and t~umin{m,wu¯+tu}.

Furthermore, if, for a non-empty u[s], the projection of P onto the components in u is a strict digital (tu,m,u)-net, it is true that

ρm((C~j(m))ju)max{0,m-max{tu,wu¯}}.

Applications of column reduced digital nets

A reduced matrix product algorithm

In this section, we return to the problem outlined in Section 1. Let P be a digital (tms)-net over Fb, with generating matrices C1(m),,Cs(m). Let w=(wj)j=1sN0s be a sequence of reduction indices with 0=w1w2ws. Let ss be the largest index such that ws<m. Let C~1(m),,C~s(m) be the reduced generating matrices corresponding to w1,,ws, and let Q be the corresponding reduced digital net. Let x0,,xN-1 be the points of Q, where we interpret x0,,xN-1 as column vectors. Let

X=[x0,x1,,xN-1]

be the N×s matrix whose k-th row is the k-th point of Q for 0kN-1.

Let ξj denote the j-th column of X, i.e., X=[ξ1,ξ2,,ξs]. Let A=[a1,,as], where ajR1×τ is the j-th row of A. Then we have

XA=[ξ1,ξ2,,ξs]·[a1,,as]=ξ1a1+ξ2a2++ξsas. 11

We will make use of a certain inherent repetitiveness of the reduced net Q, which we will illustrate by considering a reduction index 0wj<m for 1js, and the corresponding generator matrix C~j(m). The j-th components of the N=bm points of Q (i.e., the j-th column ξj of X) are then given by

ξj=C~j(m)0·(b-1,,b-m),,C~j(m)(bm-1)·(b-1,,b-m)=(Xj,,Xjbwjtimes),

where, as above, we write k to denote the vector of base b digits of length m for k{0,1,bm-1}, and where

Xj=C~j(m)0·(b-1,,b-m),,C~j(m)(bm-wj-1)·(b-1,,b-m).

The reason for this repetitive structure is that, for any wj with 0<wj<m, the last wj columns of C~j(m) are equal to zero, and thus, in the product C~j(m)k, the last wj entries of k become irrelevant. We will exploit this structure within Q to derive a fast matrix-matrix multiplication algorithm to compute XA.

Based on the above observations, it is possible to formulate the following algorithm to compute (11) in an efficient way. Note that for j>s the j-th column of X consists only of zeros, so there is nothing to compute for the entries of X corresponding to these columns.

Algorithm 1.

Algorithm 1

Fast reduced matrix-matrix product using column reduced digital nets.

Remark 4

The number of computations needed for Algorithm 1 is of order

Oj=1sbm-wj(τ+m(m-wj)).

Note that this algorithm also generates the points of the reduced digital net, whereas the standard multiplication or the analogous “row reduced algorithm” [1, Algorithm 4], both require pre-computed points of the digital net as input. Generating the points of a non-reduced digital net requires O(bmsm2) operations, see also [1, Algorithm 3] and the standard non-reduced matrix-matrix multiplication usually requires O(bmsτ) operations. Therefore, Algorithm 1 improves the runtime of both steps. We also point out that the number of operations necessary for Algorithm 1 is independent of s, and only depends on s. If the reduction indices wj grow sufficiently fast, then s can be significantly lower than s.

Remark 5

Let us consider mappings ϕ:[0,1]sR of the form ϕ(x)=(ϕ1(x1),,ϕs(xs)) that we apply simultaneously to all sample points of a given digital net. In this case, we can easily adapt Algorithm 1 such that we obtain a reduced net with points transformed by ϕ, but do not change the order of the computation time outlined in Remark 4. Such an adaption is useful when considering the case D=Rs, as pointed out in the introduction.

Error analysis

In the beginning of the paper we set out the task of approximating the integral (1) by the QMC rule (2). We have shown in the previous sections how to speed up the computation of the products xkA if we choose xk as the points of a column reduced digital net. However, we should also keep in mind the integration error made by using a QMC rule of the form (2) using those xk.

In this section, we restrict ourselves to the case D=[0,1]s, such that we do not need to transform the sample points xk before applying the corresponding QMC rule. In many applications of quasi-Monte Carlo, one considers so-called weighted function spaces such as weighted Sobolev or weighted Korobov spaces (see, e.g., [3, 5, 6]). The idea of studying weighted function spaces goes back to the seminal paper [12] of Sloan and Woźniakowski. The motivation for weighted spaces is that in many applications, different coordinates or different groups of coordinates may have different influence on a multivariate problem. To give a simple example, consider numerical integration of a function f:[0,1]sR, where

f(x1,,xs)=ex1+x2++xs2s.

Clearly, for large s, the first variable has much more influence on this problem than the others. In order to make such observations more precise, one introduces weights, which are nonnegative real numbers γu, one for each set u{1,,s}. Intuitively speaking, the number γu models the influence of the variables with indices in u. Larger values of γu mean more influence, smaller values less influence. Formally, we set γ=1, and we write γ={γu}u{1,,s}. These weights can now be used to modify the norm in a given function space, thereby modifying the set over which a suitable error measure, as for example the worst-case error, of a problem is considered. By making this set smaller according to the weights (in the sense that also here, certain groups of variables may have less influence than others), a problem may thus become easier to handle and even lose the curse of dimensionality, provided that suitable conditions on the weights hold. This effect also corresponds to intuition—if a problem depends on many variables, of which only some have significant influence, it is natural to expect that the problem will be easier to solve than one where all variables have the same influence.

The weighted star discrepancy is (via the well-known Koksma-Hlawka inequality or its weighted version, see, e.g., [3, 6, 10]), a measure of the worst-case quadrature error for a QMC rule with node set Q, with bm nodes, defined as

Dbm,γ(Q):=supx(0,1]smaxu[s]γuΔQ,u(x), 12

where

ΔQ,u(x):=#{(y1,,ys)Q:yj<xj,ju}bm-juxj. 13

Indeed, for certain weighted function classes based on Sobolev spaces of smoothness one, the weighted star discrepancy equals the worst-case quadrature error of a QMC rule with node set Q. Here, by the worst-case error, we mean the supremum of the integration error taken over the unit ball of the function class under consideration. We refer to [3, Section 5.3] for further details on the weighted Koksma-Hlawka inequality.

As shown in [11], we have

Dbm,γ(Q)=maxu[s]supx(0,1]sγuΔQ,u(x)=maxu[s]γusupx(0,1]sΔQ,u(x).

In the latter expression, the suprema over x(0,1]s just yield the values of the star discrepancy of the projections of Q, and thus, one can use existing discrepancy bounds for the projections of Q. Let us proceed as follows. Assume that P is a digital ((tu)u[s],m,s)-net over Fb with m×m generating matrices C1(m),,Cs(m) derived from a digital ((tu)u[s],s)-sequence, where mt. Let P~ be the corresponding column reduced digital net based on the reduction indices 0=w1w2ws, and let (t~u)u[s] be the minimal quality parameters of the projections of P~.

Whenever we consider a u[s] that is not a subset of [s], we know due to Corollary 1 that the quality parameter of the corresponding projection of P~ is m and therefore we can bound its discrepancy only trivially by 1. Whenever we have u[s], however, we can use existing discrepancy bounds for the corresponding net. To this end, we use the results from [7], which are, to our best knowledge, the currently best-known general upper discrepancy bounds for (tms)-nets. This yields, for any non-empty set u[s],

supx(0,1]sΔP~,u(x)1ifu[s],(bt~u/bm)v=0u-1av,b(u)mvifu[s]andu2,bt~u/bmifu[s]andu=1, 14

where

av,b(u)=u-2vb+22u-2-v(b-1)v2vv!(a0,b(2)+u2-4)+u-2v-1b+22u-1-v(b-1)v-12v-1v!a1,b(2),

for 0vu-1, with

a0,b(2)=b+84ifbiseven,b+42ifbisodd,anda1,b(2)=b24(b+1)ifbiseven,b-14ifbisodd.

This then yields

Dbm,γ(P~)maxmaxu[s]u[s]γu,maxu[s]u=1γubt~ubm,maxu[s]u2γubt~ubmv=0u-1av,b(u)mv. 15

Let us analyze the three maxima in the curly brackets in (15) in greater detail. To this end, as also in [1], we restrict ourselves to product weights in the following, i.e., we assume weights γu=juγj with γ1γ2>0.

For the first term, we proceed as in [1], namely we use that wjm if ju\[s], and obtain for v=u[s] that

γuγvγu\v1bmju\v(1+bwj)1bmjuγj(1+bwj). 16

For the second maximum in (15), note that we have u={j} for some j[s], and hence t~umin{m,wj+t{j}} by Corollary 1. Consequently,

maxu[s]u=1γubt~ubmmaxj[s]γjbmin{m,wj+t{j}}bm. 17

For the third maximum in (15), we again use Corollary 1, and obtain

maxu[s]u2γubt~ubmv=0u-1av,b(u)mvmaxu[s]u2γubmin{m,wu¯+tu}bmv=0u-1av,b(u)mv. 18

Using these estimates in (15), we obtain

Dbm,γ(P~)maxmaxu[s]u[s]1bmjuγj(1+bwj),maxj[s]γjbwj+t{j}bm,maxu[s]u2γubmin{m,wu¯+tu}bmv=0u-1av,b(u)mv. 19

Remark 6

A few remarks on (19) are in order. Note that only the first term in the curly brackets in (19) depends on s. The two remaining terms depend on s, which can be independent of s if the reduction indices wj increase sufficiently fast. However, let us give a few further details on these observations.

We may want that the first term

1bmjuγj(1+bwj)1bmj=1sγj(1+bwj)

be bounded by κ/bm for some constant κ>0 independent of s. Let j0N be minimal such that γj1 for all j>j0. Then we impose j=1sγj(1+bwj)γ10j=1s(1+γjbwj)κ. Hence it is sufficient to choose κ>γ10 and for all j[s],

wj:=minlogbκγ101/s-1γj,m. 20

The choice of the wj in (20) depends on s. For sufficiently fast decaying weights γj, it is possible to choose the wj such that they no longer depend on s. Indeed, suppose, e.g., that γj=j-2. Then we could choose the wj such that, for some τ(1,2),

wjminlogbj2-τ,m. 21

This then yields

j=1s(1+γjbwj)expj=1slog(1+γjbwj)expj=1sγjbwjexp(ζ(τ)),

where ζ(·) is the Riemann zeta function. This gives a dimension-independent bound on the term j=1sγj(1+bwj) from above, and hence a dimension-independent bound for all of Dbm,γ(P~).

Regarding the second term in (19), this term only depends on one-dimensional projections of P~. In particular, if we choose the wj as in (21), this expression should be easy to bound from above. This is even more so if the t-values of the one-dimensional projections of the non-reduced net P are low, which may often be the case (in fact, the t-values of one-dimensional projections might even be zero in many examples). Thus we can bound the second term by an expression of the form κ/bm, which only depends on s but not on s.

Regarding the third term in (19), it crucially depends on the weights γ and their interplay with the quality parameters of the projections of P, tu. In particular, small quality parameters, in combination with sufficiently fast decaying weights and a suitable choice of the reduction indices wj, should yield tighter error bounds. Indeed, we could proceed similarly to [7, Corollary 1], and bound the third term in (19) by a term of the form

maxu[s]u2γu1bmcumu-1+O(mu-2),

where cu depends on b,tu and u, but not on m. Note that also the third term only depends on s and not on s, so for sufficiently fast increasing reduction indices wj, the dimension s does not matter. In summary, we obtain

Dbm,γ(P~)maxκbm,κbm,maxu[s]u2γu1bmcumu-1+O(mu-2).

Remark 7

Note that our new result yields an advantage over the corresponding result for row reduced nets in [1]. In that paper, one needs to work with the quality parameters of the projections of the reduced net, which are, in general, not known. In the present paper, we benefit from the combination of the column reduction and the fact that the nets considered here are derived from digital sequences, which guarantees additional structure. Usually, it is computationally involved to determine the t-value of a digital net or sequence from the generating matrices, since many linear independence conditions need to be checked. Here, however, we can use Theorem 1 and Corollary 1, which relate the t-values of P to those of P~, and thus give us an advantage. In particular, if P is obtained from, say, a Sobol’ or a Niederreiter sequence, it should be possible to have t-values that are guaranteed to be reasonably low.

Numerical experiments

In this section, we test the computational performance of column reduced digital nets for matrix products XA, where A is an s×τ matrix, as detailed in Section 4.1. We implemented Algorithm 1 in the Julia programming language (Version 1.9.3).1 In the following plots, we compare the runtime of Algorithm 1 to the standard matrix multiplication and also the matrix multiplication using the points from row reduced digital nets as given in [1, Algorithm 4]. We remark that the reported runtimes are also affected by technical implementation details such as memory efficiency, a detailed discussion of which is out of scope here.

For the generating matrices C1(m),,Cs(m), we used random matrices in Fbm×m, since the matrix product computation itself does not depend on the entries of the matrix, i.e, we get similar relations of runtimes if we use generating matrices of specific digital sequences like Sobol’ or Niederreiter sequences.

In Fig. 1 we see, for fixed b=2,m=12, and τ=20, how the runtime changes as we vary s. We compare this for two different choices of reduction indices wj. We see that in this case, using column reduced digital nets in Algorithm 1 performs better than the use of row reduced digital nets in [1, Algorithm 4] and also the standard matrix multiplication.

Fig. 1.

Fig. 1

m=12,τ=20, varying wj

As the reduction indices wj increase more slowly (as in Fig. 1b), the difference in performance between the standard multiplication and Algorithm 1 reduces. We can see this also theoretically by inserting the weights in Remark 4.

In Fig. 2, we study the behavior for fixed b=2,s=800, and τ=20 as m increases. Note that we use the logarithmic scale for the time but not for m. We observe that also in this case Algorithm 1 seems to perform better than the row reduced case.

Fig. 2.

Fig. 2

s=800,τ=20, varying m

Overall, the numerical tests for the runtime using column reduced digital nets fit our theoretical estimate for the runtime as given in Remark 4 and comparison with the row reduced algorithm reveals that the column reduced algorithm could yield a better performance. Additionally to this practical advantage, column reduced matrices also have a theoretical advantage over row reduced matrices, as pointed out in Remark 7.

Conclusion

Column reduced digital nets have applications in the field of quasi-Monte Carlo methods. We can speed up the matrix-matrix multiplication in the quasi-Monte Carlo method by exploiting the repetitive structure of the points of a column reduced digital net. The bounds for the quality parameter (t-value) of column reduced digital nets have not been studied before.

In our research, we provide an algorithm for the matrix-matrix product using column reduced digital nets, which is faster than the standard matrix multiplication algorithm. In addition, we provide bounds for the t-value for column reduced digital nets. This is very essential for the error analysis of our method and has an advantage over the corresponding result for the row reduced nets in [1].

For future work, one could consider relaxing the conditions we impose on the t-value of the underlying digital sequence. One could also explore in-depth the interplay between column and row reduced digital nets.

Acknowledgements

The authors acknowledge the support of the Austrian Science Fund (FWF) Project 10.55776/P34808. For open access purposes, the authors have applied a CC BY public copyright license to any author accepted manuscript version arising from this submission.

Author Contributions

Both authors have contributed to all sections of the paper to equal parts.

Funding

Open access funding provided by Österreichische Akademie der Wissenschaften.

Data Availability

The supporting numerical experiments in the manuscript have been computed by source code that is available at: https://github.com/Vishnupriya-Anupindi/ReducedDigitalNets.jl.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Dick, J., Ebert, A., Herrmann, L., Kritzer, P., Longo, M.: The fast reduced QMC matrix-vector product. J. Comput. Appl. Math. 440, 115642 (2024) [Google Scholar]
  • 2.Dick, J., Kritzer, P., Leobacher, G., Pillichshammer, F.: A reduced fast component-by-component construction of lattice points for integration in weighted spaces with fast decreasing weights. J. Comput. Appl. Math. 276, 1–15 (2015) [Google Scholar]
  • 3.Dick, J., Kritzer, P., Pillichshammer, F.: Lattice Rules-Numerical Integration, Approximation, and Discrepancy. Springer, Cham (2022) [Google Scholar]
  • 4.Dick, J., Kuo, F.Y., Le Gia, Q.T., Schwab, C.: Fast QMC matrix-vector multiplication. SIAM J. Sci. Comput. 37, A1436–A1450 (2015) [Google Scholar]
  • 5.Dick, J., Kuo, F.Y., Sloan, I.H.: High-dimensional integration–the quasi-Monte Carlo way. Acta Numer. 22, 133–288 (2013) [Google Scholar]
  • 6.Dick, J., Pillichshammer, F.: Digital Nets and Sequences. Discrepancy Theory and Quasi-Monte Carlo Integration. Cambridge University Press, Cambridge (2010) [Google Scholar]
  • 7.Faure, H., Kritzer, P.: New star discrepancy bounds for -nets and -sequences. Monatsh. Math. 172, 55–75 (2013) [Google Scholar]
  • 8.Hofer, R., Niederreiter, H.: A construction of (t, s)-sequences with finite-row generating matrices using global function fields. Finite Fields and Their Applications 21, 97–110 (2013) [Google Scholar]
  • 9.Niederreiter, H.: Low-discrepancy point sets obtained by digital constructions over finite fields. Czechoslovak Math. J. 42, 143–166 (1992) [Google Scholar]
  • 10.Niederreiter, H.: Random number generation and quasi-monte carlo methods. CBMS-NSF Regional Conference Series in Applied Mathematics, 63. Society for Industrial and Applied Mathematics (SIAM), Philadelphia (1992)
  • 11.Pillichshammer, F.: Tractability properties of the weighted star discrepancy of regular grids. J. Complexity 46, 103–112 (2018) [Google Scholar]
  • 12.Sloan, I.H., Woźniakowski, H.: When are quasi-Monte Carlo algorithms efficient for high-dimensional integrals? J. Complexity 14, 1–33 (1998) [Google Scholar]
  • 13.Tezuka, S.: On the discrepancy of generalized Niederreiter sequences. J. Complexity 29, 240–247 (2013) [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The supporting numerical experiments in the manuscript have been computed by source code that is available at: https://github.com/Vishnupriya-Anupindi/ReducedDigitalNets.jl.


Articles from Numerical Algorithms are provided here courtesy of Springer

RESOURCES