Quantum algorithm for multivariate polynomial interpolation

Jianxin Chen; Andrew M Childs; Shih-Han Hung

doi:10.1098/rspa.2017.0480

2018 Jan 17;474(2209):20170480.

PMCID: PMC5806014 PMID: 29434504

Abstract

How many quantum queries are required to determine the coefficients of a degree-d polynomial in n variables? We present and analyse quantum algorithms for this multivariate polynomial interpolation problem over the fields $F_{q}$, $R$ and $C$. We show that $k_{C}$ and $2 k_{C}$ queries suffice to achieve probability 1 for $C$ and $R$, respectively, where $k_{C} = \lceil (1 / (n + 1)) \binom{n + d}{d} \rceil$ except for d=2 and four other special cases. For $F_{q}$, we show that $\lceil (d / (n + d)) \binom{n + d}{d} \rceil$ queries suffice to achieve probability approaching 1 for large field order q. The classical query complexity of this problem is $\binom{n + d}{d}$, so our result provides a speed-up by a factor of n+1, (n+1)/2 and (n+d)/d for $C$, $R$ and $F_{q}$, respectively. Thus, we find a much larger gap between classical and quantum algorithms than in the univariate case, where the speed-up is by a factor of 2. For the case of $F_{q}$, we conjecture that $2 k_{C}$ queries also suffice to achieve probability approaching 1 for large field order q, although we leave this as an open problem.

Keywords: quantum algorithms, query complexity, polynomial interpolation

1. Introduction

Let $f (x_{1}, \dots, x_{n}) \in K [x_{1}, \dots, x_{n}]$ be a polynomial of degree d. Suppose d is known and we are given a black box that computes f on any desired input. The polynomial interpolation problem is to determine all the coefficients of the polynomial by querying the black box.

Classically, a multivariate polynomial can be interpolated by constructing a system of linear equations. Invertibility of the Vandermonde matrix implies that $(\binom{n + d}{d})$ queries are necessary and sufficient to determine all the coefficients. (Note that one must choose the input values carefully to construct a full-rank Vandermonde matrix for n>1 [1].)
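The classical procedure can be sketched in a few lines. The following is a minimal illustration rather than anything taken from the paper: it works over the rationals, uses the simplex grid of exponent vectors as the query set (a standard unisolvent choice, assumed here for convenience), and the black box `f` is a hypothetical example.

```python
from fractions import Fraction
from itertools import product
from math import comb

def exponents(n, d):
    """All exponent vectors j in N^n with j_1 + ... + j_n <= d."""
    return [j for j in product(range(d + 1), repeat=n) if sum(j) <= d]

def monomial(x, j):
    """Evaluate x^j = prod_i x_i^{j_i} exactly."""
    value = Fraction(1)
    for xi, ji in zip(x, j):
        value *= Fraction(xi) ** ji
    return value

def solve(matrix, rhs):
    """Solve a square, invertible linear system by Gauss-Jordan elimination."""
    size = len(matrix)
    rows = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(size):
        pivot = next(r for r in range(col, size) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        rows[col] = [v / rows[col][col] for v in rows[col]]
        for r in range(size):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[col])]
    return [row[-1] for row in rows]

def interpolate(black_box, n, d):
    """Recover all coefficients c_j from comb(n + d, d) classical queries."""
    exps = exponents(n, d)
    points = exps  # assumption: query on the simplex grid, one point per exponent
    vandermonde = [[monomial(x, j) for j in exps] for x in points]
    values = [Fraction(black_box(x)) for x in points]
    return dict(zip(exps, solve(vandermonde, values)))

# Hypothetical black box: f(x1, x2) = 3 + 2*x1 - x1*x2 + 5*x2^2
f = lambda x: 3 + 2 * x[0] - x[0] * x[1] + 5 * x[1] ** 2
coeffs = interpolate(f, n=2, d=2)
```

For a poorly chosen point set the Vandermonde matrix can be singular, in which case the pivot search in `solve` fails; this is the care in choosing inputs that the text refers to.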

Recent work has established tight bounds on the quantum query complexity of interpolating univariate polynomials over a finite field $F_{q}$ . In particular, Childs et al. [2] developed an optimal quantum algorithm that makes (d+1)/2 queries to succeed with bounded error and one more query to achieve success probability 1−O(1/q). They also showed that the success probability of the algorithm is optimal among all algorithms making the same number of queries. Previous work [3,4] shows that no quantum algorithm can succeed with bounded error using fewer queries, so the optimal success probability exhibits a sharp transition as the number of queries is increased.

For multivariate polynomials, Childs et al. [2] conjectured that a straightforward analogue of the univariate algorithm solves the interpolation problem with probability 1−o(1) using $⌊ (1 / (n + 1)) (\binom{n + d}{d}) ⌋ + 1$ queries. However, while that conjecture is natural, the analysis of the algorithm appeared to require solving a difficult problem in algebraic geometry and was left open. (In addition, Montanaro considered the quantum query complexity of interpolating a multilinear polynomial [5], but this is quite different from the general multivariate case.)

To the best of our knowledge, all previous work on quantum algorithms for polynomial interpolation has focused on finite fields. Cryptographic applications of interpolation typically use finite fields, and the multivariate case could lead to new applications in that domain. However, polynomial interpolation over infinite fields is also a natural problem, especially considering the ubiquity of real- and complex-valued polynomials in numerical analysis.

In this paper, we propose an approach to the quantum query complexity of polynomial interpolation in the continuum limit. To obtain a well-defined initial state, the algorithm prepares a superposition over a bounded working region. The bounded region limits the precision that can be achieved due to the uncertainty principle, but the algorithm can be made arbitrarily precise by taking an arbitrarily large region. Using this strategy, we present a quantum algorithm for multivariate polynomial interpolation over the real and complex numbers. To simplify the analysis, we allow the algorithm to work with arbitrarily precise inputs and outputs over $R$ or $C$ ; in practice, sufficiently fine discretization of the space could achieve similar performance. We also consider multivariate polynomial interpolation over finite fields, where our algorithm can be viewed as a generalization of the univariate polynomial interpolation algorithm proposed in [2].

To analyse the success probability of our approach, we relate it to the tensor rank problem. The rank of a given tensor, which is the smallest integer k such that the tensor can be decomposed as a linear combination of k simple tensors (i.e. those that can be written as tensor products), was first introduced nearly a century ago. A half century later, with the advent of principal component analysis on multidimensional arrays, the study of tensor rank attracted further attention. However, it has recently been shown that most tensor problems, including tensor rank, are NP-hard [6–8], and restricting these problems to symmetric tensors does not seem to alleviate their NP-hardness [7,8]. More specifically, tensor rank is NP-hard over any field extension of $Q$ and NP-complete over a finite field $F_{q}$.

Fortunately, analysing the success probability of multivariate polynomial interpolation does not require exactly computing the rank of a symmetric tensor. The number of queries needed to achieve success probability 1 can be translated to the smallest integer k such that almost every symmetric tensor can be decomposed as a linear combination of no more than k simple tensors. In turn, this quantity can be related to properties of certain secant varieties, which lets us take advantage of recent progress in algebraic geometry [9,10].

The success probability of our algorithm behaves differently as a function of the number of queries for the three fields we consider. Specifically, by introducing

$k_{C} (n, d) := \begin{cases} n + 1 & d = 2,\ n \geq 2; \\ \lceil \frac{1}{n + 1} \binom{n + d}{d} \rceil + 1 & (n, d) = (4, 3), (2, 4), (3, 4), (4, 4); \\ \lceil \frac{1}{n + 1} \binom{n + d}{d} \rceil & \text{otherwise}, \end{cases}$ 1.1

we have the following upper bounds on the query complexity:

Theorem 1.1 —

For positive integers d and n, there exists a quantum algorithm for interpolating an n-variate polynomial of degree d over the field $K$ using at most

(1) $(d / (n + d)) (\binom{n + d}{d})$ queries for $K = F_{q}$ , succeeding with probability 1−O(1/q);

(2) $2 k_{C}$ queries for $K = R$ , succeeding with probability 1;

(3) $k_{C}$ queries for $K = C$ , succeeding with probability 1.

Note that these upper bounds can be improved using known results [2] for univariate polynomial interpolation (see the final remark in §3c(iii)).
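To make the bounds of theorem 1.1 concrete, the following short sketch evaluates them for given n and d. The function names are our own, and the finite-field entry uses the fact that $(d / (n + d)) \binom{n + d}{d} = \binom{n + d - 1}{d - 1}$ is always an integer.

```python
from math import comb

def k_complex(n, d):
    """Query count k_C(n, d) as defined in equation (1.1)."""
    generic = -(-comb(n + d, d) // (n + 1))  # ceiling of binom(n+d, d) / (n+1)
    if d == 2 and n >= 2:
        return n + 1
    if (n, d) in {(4, 3), (2, 4), (3, 4), (4, 4)}:
        return generic + 1
    return generic

def query_bounds(n, d):
    """Classical query count and the three quantum upper bounds of theorem 1.1."""
    classical = comb(n + d, d)
    return {
        "classical": classical,
        "F_q": d * classical // (n + d),  # exact: equals comb(n + d - 1, d - 1)
        "R": 2 * k_complex(n, d),
        "C": k_complex(n, d),
    }

bounds = query_bounds(3, 3)
```

For n = 3 and d = 3 this gives 20 classical queries against 10, 10 and 5 quantum queries over $F_{q}$, $R$ and $C$, respectively.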

2. Preliminaries and notations

(a). Notation and definitions

Let $f \in K [x_{1}, \dots, x_{n}]$ be a polynomial of degree at most d over the field $K$. We let $x^{j} := \prod_{i = 1}^{n} x_{i}^{j_{i}}$ for $j \in J$, where $J := \{j \in N^{n} : j_{1} + \dots + j_{n} \leq d\}$ is the set of allowed exponents, which has size $\binom{n + d}{d}$. Where a number is called for (as in $K^{J}$ or $q^{J}$), J denotes this size. Thus, $x^{j}$ is a monomial in $x_{1}, \dots, x_{n}$ of degree $j_{1} + j_{2} + \dots + j_{n}$. With this notation, we write $f (x) = \sum_{j \in J} c_{j} x^{j}$ for some coefficients $\{c_{j} \in K : j \in J\}$.

Access to the function f is given by a black box that performs |x,y〉↦|x,y+f(x)〉 for all $x \in K^{n}$ and $y \in K$ . We will compute the coefficients of f by performing phase queries, which are obtained by phase kickback over $K$ , as detailed in §3a.

For k-dimensional vectors $x, y \in K^{k}$ , we consider the inner product $\cdot : K^{k} \times K^{k} \to K$ defined by $x \cdot y = \sum_{i = 1}^{k} {\bar{x}}_{i} y_{i}$ , where $\bar{x}$ is the complex conjugate of x (where we let $\bar{x} = x$ for $x \in F_{q}$ ). We denote the indicator function for a set $A \subseteq R^{n}$ by $I_{A} (z)$ , which is 1 if z∈A and 0 if z∉A. We denote a ball of radius $r \in R^{+}$ centred at 0 by B(r).

A lattice Λ is a discrete additive subgroup of $R^{n}$ for positive integer n generated by $e_{1}, \dots, e_{n} \in R^{n}$ . For every element x∈Λ, we have $x = \sum_{i = 1}^{n} γ_{i} e_{i}$ for some $γ_{i} \in Z$ for i∈{1,…,n}. A fundamental domain T of Λ centred at zero is a subset of $R^{n}$ such that $T = {\sum_{i = 1}^{n} a_{i} e_{i} : a_{i} \in [- \frac{1}{2}, \frac{1}{2})}$ . The dual lattice of Λ, denoted by $\tilde{Λ}$ , is an additive subgroup of $R^{n}$ generated by $f_{1}, \dots, f_{n} \in R^{n}$ satisfying e_i⋅f_j=δ_ij for i,j∈{1,…,n}.
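As a small illustration of these definitions, the sketch below computes the dual basis of a two-dimensional lattice and translates a point into the fundamental domain T centred at zero. The restriction to two dimensions and the example basis are our own choices for brevity.

```python
from fractions import Fraction
from math import floor

def dual_basis(e1, e2):
    """Dual basis f1, f2 with e_i . f_j = delta_ij for a 2D lattice basis."""
    det = Fraction(e1[0] * e2[1] - e1[1] * e2[0])
    f1 = (e2[1] / det, -e2[0] / det)
    f2 = (-e1[1] / det, e1[0] / det)
    return f1, f2

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def reduce_to_domain(x, e1, e2):
    """Translate x by a lattice vector into T = {a1 e1 + a2 e2 : a_i in [-1/2, 1/2)}."""
    f1, f2 = dual_basis(e1, e2)
    coords = [dot(x, f1), dot(x, f2)]  # coefficients a_i in x = a1 e1 + a2 e2
    coords = [a - floor(a + Fraction(1, 2)) for a in coords]
    return tuple(coords[0] * e1[i] + coords[1] * e2[i] for i in range(2))

e1, e2 = (2, 0), (1, 3)  # illustrative basis
f1, f2 = dual_basis(e1, e2)
reduced = reduce_to_domain((5, 4), e1, e2)
```

Here the point (5, 4) differs from its representative (0, 1) in T by the lattice vector 2e₁+e₂.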

The standard basis over the real numbers is the set ${| x ⟩ : x \in R^{n}}$ for positive integer n. The amplitude of a state |ψ〉 in the standard basis is denoted by ψ(x) or 〈x|ψ〉. The standard basis vectors over real numbers are orthonormal in the sense of the Dirac delta function, i.e. 〈x′|x〉=δ⁽ⁿ⁾(x−x′) for $x, x^{'} \in R^{n}$ .

(b). Algebraic geometry concepts

A subset V of $K^{n}$ is an algebraic set if it is the set of common zeros of a finite collection of polynomials g₁,g₂,…,g_r with $g_{i} \in K [x_{1}, x_{2}, \dots, x_{n}]$ for 1≤i≤r.

A finite union of algebraic sets is an algebraic set, and an arbitrary intersection of algebraic sets is again an algebraic set. Thus by taking the open subsets to be the complements of algebraic sets, we can define a topology, called the Zariski topology, on $K^{n}$ .

A non-empty subset V of a topological space X is called irreducible if it cannot be expressed as the union of two proper (Zariski) closed subsets. The empty set is not considered to be irreducible. An affine algebraic variety is an irreducible closed subset of some $K^{n}$ .

We define projective n-space over $K$, denoted by $P^{n}$, to be the set of equivalence classes of (n+1)-tuples (a₀,…,a_n) of elements of $K$, not all zero, under the equivalence relation given by (a₀,…,a_n)∼(λa₀,…,λa_n) for all $λ \in K$, λ≠0.

A notion of algebraic variety may also be introduced in projective spaces, giving the notion of a projective algebraic variety: a subset $V \subseteq P^{n}$ is an algebraic set if it is the set of common zeros of a finite collection of homogeneous polynomials g₁,g₂,…,g_r with $g_{i} \in K [x_{0}, x_{1}, \dots, x_{n}]$ for 1≤i≤r. We call open subsets of irreducible projective varieties quasi-projective varieties.

For any integers n and d, we define the Veronese map of degree d as

V_{d} : [x_{0} : x_{1} : \dots : x_{n}] \mapsto [x_{0}^{d} : x_{0}^{d - 1} x_{1} : \dots : x_{n}^{d}],

2.1

where the notation with square brackets and colons denotes homogeneous coordinates and the expression in the output of $V_{d}$ ranges over all monomials of degree d in x₀,x₁,…,x_n. The image of $V_{d}$ is an algebraic variety called a Veronese variety.
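The Veronese map is easy to tabulate on a representative of a projective point. The sketch below is illustrative only: it lists the degree-d monomials in the order used in equation (2.1) and works on one chosen representative rather than on an equivalence class.

```python
from itertools import combinations_with_replacement
from math import comb, prod

def veronese(point, d):
    """All degree-d monomials in the homogeneous coordinates of `point`.

    The order is x0^d, x0^(d-1) x1, ..., xn^d, as in equation (2.1).
    """
    return [prod(term) for term in combinations_with_replacement(point, d)]

image = veronese((1, 2, 3), 2)   # n = 2, d = 2
scaled = veronese((2, 4, 6), 2)  # the same projective point, rescaled by 2
```

Rescaling the input by λ rescales every output coordinate by λ^d, so the map is well defined on projective space.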

Finally, for an irreducible algebraic variety V, its kth secant variety σ_k(V) is the Zariski closure of the union of subspaces spanned by k distinct points chosen from V.

For more information about Veronese and secant varieties, refer to Example 2.4 and Example 11.30 in [11].

3. Quantum algorithm for polynomial interpolation

(a). The query model

Using the standard concept of phase kickback, we encode the results of queries in the phase by performing standard queries in the Fourier basis. We briefly explain these queries for the three types of fields we consider.

(i). Finite field $F_{q}$

The order of a finite field can always be written as a prime power $q := p^{r}$. Let $e : F_{q} \to C$ be the exponential function $e (z) := e^{i 2 π Tr (z) / p}$, where the trace function $Tr : F_{q} \to F_{p}$ is defined by $Tr (z) := z + z^{p} + z^{p^{2}} + \dots + z^{p^{r - 1}}$. The Fourier transform over $F_{q}$ is a unitary transformation acting as $| x ⟩ \mapsto (1 / \sqrt{q}) \sum_{y \in F_{q}} e (x y) | y ⟩$ for all $x \in F_{q}$. The k-dimensional quantum Fourier transform (QFT) is given by $| x ⟩ \mapsto (1 / q^{k / 2}) \sum_{y \in F_{q}^{k}} e (x \cdot y) | y ⟩$ for any $x \in F_{q}^{k}$.
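The trace and the character e can be checked numerically on a small example. The sketch below assumes the concrete model $F_{9} = F_{3} [i]$ with i² = −1 (our choice; any irreducible quadratic would do) and verifies the orthogonality relation $\sum_{y} e (x y) = q δ_{x, 0}$ that underlies unitarity of the Fourier transform.

```python
import cmath
from itertools import product

P = 3  # characteristic; elements of F_9 are pairs (a, b) standing for a + b*i
FIELD = [(a, b) for a, b in product(range(P), repeat=2)]

def mul(u, v):
    """Multiply a + b*i and c + d*i using i^2 = -1, reducing mod P."""
    a, b = u
    c, d = v
    return ((a * c - b * d) % P, (a * d + b * c) % P)

def power(u, exponent):
    result = (1, 0)
    for _ in range(exponent):
        result = mul(result, u)
    return result

def trace(z):
    """Tr(z) = z + z^p for r = 2; the result always lies in F_p."""
    zp = power(z, P)
    total = ((z[0] + zp[0]) % P, (z[1] + zp[1]) % P)
    assert total[1] == 0
    return total[0]

def e(z):
    """Character e(z) = exp(2*pi*i*Tr(z)/p)."""
    return cmath.exp(2j * cmath.pi * trace(z) / P)

def character_sum(x):
    """sum_y e(x*y): equals q when x = 0 and vanishes otherwise."""
    return sum(e(mul(x, y)) for y in FIELD)
```

In this model Tr(a+bi)=2a mod 3, since the Frobenius map sends a+bi to a−bi.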

A phase query is simply the Fourier transform of a standard query. By performing an inverse QFT, a query, and then a QFT, we map |x,y〉↦e(yf(x))|x,y〉 for any $x \in F_{q}^{n}$ and $y \in F_{q}$.

As in the univariate case, our algorithm is non-adaptive, making all queries in parallel for a carefully chosen superposition of inputs. With k parallel queries, we generate a phase $\sum_{i = 1}^{k} y_{i} f (x_{i}) = \sum_{i = 1}^{k} \sum_{j \in J} y_{i} x_{i}^{j} c_{j}$ for the input $(x, y) \in F_{q}^{n k} \times F_{q}^{k}$. For convenience, we define $Z : F_{q}^{n k} \times F_{q}^{k} \to F_{q}^{J}$ by $Z {(x, y)}_{j} = \sum_{i = 1}^{k} y_{i} {x_{i}}^{j}$ for $j \in J$, so that $\sum_{i = 1}^{k} y_{i} f (x_{i}) = Z (x, y) \cdot c$.
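As a small worked instance of this definition (the choice n=1, d=2, k=2 is ours, purely for illustration), write $f (x) = c_{0} + c_{1} x + c_{2} x^{2}$. The exponent set is {0,1,2}, and Z collects the y-weighted power sums of the query points:

```latex
Z(x, y) = \bigl( y_1 + y_2,\; y_1 x_1 + y_2 x_2,\; y_1 x_1^2 + y_2 x_2^2 \bigr),
\qquad
Z(x, y) \cdot c
  = c_0 (y_1 + y_2) + c_1 (y_1 x_1 + y_2 x_2) + c_2 (y_1 x_1^2 + y_2 x_2^2)
  = y_1 f(x_1) + y_2 f(x_2).
```

The phase therefore depends on the query inputs only through the vector Z(x,y), which is why the algorithm can be analysed in terms of the range of Z.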

(ii). Real numbers $R$

Let $e : R \to C$ be the exponential function $e (x) := e^{i 2 π x}$. For any function ψ whose Fourier transform exists, the QFT over $R$ acts as

\int_{R} d x ψ (x) | x ⟩ \mapsto \int_{R} d y Ψ (y) | y ⟩,

3.1

where $Ψ (y) = \int_{R} d x e (- x y) ψ (x)$ . By Parseval’s theorem, the QFT is unitary.

As in the finite field case, we construct a phase query by making a standard query in the Fourier basis, giving

\int_{R^{2}} d x d y ψ (x, y) | x, y ⟩ \mapsto \int_{R^{2}} d x d y e (y f (x)) ψ (x, y) | x, y ⟩ .

3.2

An algorithm making k parallel queries generates a phase Z(x,y)⋅c, where we similarly define $Z : R^{n k} \times R^{k} \to R^{J}$ by $Z {(x, y)}_{j} = \sum_{i = 1}^{k} y_{i} {x_{i}}^{j}$ for $j \in J$ .

(iii). Complex numbers $C$

The complex numbers can be viewed as a field extension of the real numbers of degree 2, namely $C = R [\sqrt{- 1}]$. For any positive integer n, let $ϕ_{n} : C^{n} \to R^{2 n}$ be the isomorphism $ϕ_{n} (x) := (ℜ (x_{1}), ℑ (x_{1}), ℜ (x_{2}), ℑ (x_{2}), \dots, ℜ (x_{n}), ℑ (x_{n}))$, which we also denote in boldface by $\mathbf{x}$. A complex number $x \in C$ can be stored in a quantum register as a tensor product of its real and imaginary parts, |x〉=|ℜ(x)〉|ℑ(x)〉.

A complex function $ψ : C^{m} \to C^{n}$ can be seen as a function with 2m variables. Let $ψ (x) = \tilde{ψ} (\mathbf{x})$. By abuse of notation, we will neglect the tilde and write $ψ (x) = ψ (\mathbf{x})$. Let $e : C \to C$ be the exponential function $e (x) := e^{i 2 π ℜ (x)}$. For any function $ψ : C \to C$ whose Fourier transform exists, we define the transform

\int_{R^{2}} d^{2} x ψ (x) | x ⟩ \mapsto \int_{R^{2}} d^{2} y Ψ (y) | y ⟩,

3.3

where $Ψ (y) = \int_{R^{2}} d^{2} x e (- \bar{y} x) ψ (x)$. Note that in general $Ψ (\mathbf{y})$ cannot be written in the form Ψ(y) with a complex variable $y \in C$. To encode the output in the phase, the queries act as

\begin{aligned} \int_{R^{2}} d^{2} x \int_{R^{2}} d^{2} y ψ (x, y) | x, y ⟩ \\ \mapsto \int_{R^{2}} d^{2} x \int_{R^{2}} d^{2} y \int_{R^{2}} d^{2} z ψ (x, y) e (- \bar{y} z) | x, z ⟩ \end{aligned}

3.4

\begin{aligned} \mapsto \int_{R^{2}} d^{2} x \int_{R^{2}} d^{2} y \int_{R^{2}} d^{2} z ψ (x, y) e (- \bar{y} z) | x, z + f (x) ⟩ \end{aligned}

3.5

\begin{aligned} \mapsto \int_{R^{2}} d^{2} x \int_{R^{2}} d^{2} y \int_{R^{2}} d^{2} z \int_{R^{2}} d^{2} u ψ (x, y) e (- \bar{y} z) e (\bar{u} (z + f (x))) | x, u ⟩ \end{aligned}

3.6

\begin{aligned} \mapsto \int_{R^{2}} d^{2} x \int_{R^{2}} d^{2} y ψ (x, y) e (\bar{y} f (x)) | x, y ⟩, \end{aligned}

3.7

where we use the identity $\int_{R^{2}} d^{2} y e (y \bar{(x - x^{'})}) = δ^{(2)} (x - x^{'})$ for $x, x^{'} \in C$ .

An algorithm making k parallel queries generates a phase $\sum_{i = 1}^{k} {\bar{y}}_{i} f (x_{i}) = \sum_{i = 1}^{k} \sum_{j \in J} {\bar{y}}_{i} x_{i}^{j} c_{j}$ . We define $Z : C^{n k} \times C^{k} \to C^{J}$ satisfying $Z {(x, y)}_{j} = \sum_{i = 1}^{k} y_{i} {\bar{x}}_{i}^{j}$ for $j \in J$ , so that $\sum_{i = 1}^{k} {\bar{y}}_{i} f (x_{i}) = Z (x, y) \cdot c$ .
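The role of the complex conjugates can be confirmed numerically. The following sketch draws random complex inputs (the sizes n, d, k and the random seed are arbitrary illustrative choices) and compares the accumulated phase with Z(x,y)⋅c under the conjugate-linear inner product of §2a.

```python
import random
from itertools import product

def exponents(n, d):
    """All exponent vectors j in N^n with j_1 + ... + j_n <= d."""
    return [j for j in product(range(d + 1), repeat=n) if sum(j) <= d]

def monomial(x, j):
    """Evaluate x^j = prod_l x_l^{j_l} for a complex point x."""
    value = complex(1)
    for xl, jl in zip(x, j):
        value *= xl ** jl
    return value

def z_map(xs, ys, exps):
    """Z(x, y)_j = sum_i y_i * conj(x_i)^j."""
    return [sum(y * monomial(x, j).conjugate() for x, y in zip(xs, ys))
            for j in exps]

def inner(u, v):
    """u . v = sum_i conj(u_i) * v_i."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def rand_complex(rng):
    return complex(rng.uniform(-1, 1), rng.uniform(-1, 1))

rng = random.Random(1)
n, d, k = 2, 3, 4  # illustrative sizes
exps = exponents(n, d)
c = [rand_complex(rng) for _ in exps]  # coefficients of a random f
xs = [[rand_complex(rng) for _ in range(n)] for _ in range(k)]
ys = [rand_complex(rng) for _ in range(k)]

f = lambda x: sum(cj * monomial(x, j) for cj, j in zip(c, exps))
phase = sum(y.conjugate() * f(x) for x, y in zip(xs, ys))
via_z = inner(z_map(xs, ys, exps), c)
```

The two quantities agree up to floating-point rounding, because conjugating Z(x,y) inside the inner product restores the unconjugated monomials $x_{i}^{j}$.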

(b). The algorithm

Our algorithm follows the same idea as in [2]: we perform k phase queries in parallel for a carefully chosen superposition of inputs, such that the output states corresponding to distinct polynomials are as distinguishable as possible. For a k-query quantum algorithm, we consider the mapping $Z : K^{n k} \times K^{k} \to K^{J}$ defined in §3a for $K = F_{q}$ , $R$ and $C$ . Childs et al. [2] gave an optimal algorithm for n=1 using a uniform superposition over a unique set of preimages of the range $R_{k} := Z (K^{n k}, K^{k})$ of Z, so we apply the same strategy here. For each z∈R_k, we choose a unique $(x, y) \in K^{n k} \times K^{k}$ such that Z(x,y)=z. Let T_k be some set of unique representatives, so that Z : T_k→R_k is a bijection.

(i). $K = F_{q}$

The algorithm generates a uniform superposition over T_k, performs k phase queries, and computes Z in place, giving

\frac{1}{\sqrt{| T_{k} |}} \sum_{(x, y) \in T_{k}} | x, y ⟩ \mapsto \frac{1}{\sqrt{| T_{k} |}} \sum_{(x, y) \in T_{k}} e (Z (x, y) \cdot c) | x, y ⟩ \mapsto \frac{1}{\sqrt{| R_{k} |}} \sum_{z \in R_{k}} e (z \cdot c) | z ⟩ .

3.8

We then measure in the basis of Fourier states $| \tilde{c} ⟩ := (1 / \sqrt{q^{J}}) \sum_{z \in F_{q}^{J}} e (z \cdot c) | z ⟩$ . A simple calculation shows that the result of this measurement is the correct vector of coefficients with probability |R_k|/q^J.
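The calculation is short: the overlap of the final state with $| \tilde{c} ⟩$ is $(1 / \sqrt{| R_{k} | q^{J}}) \sum_{z \in R_{k}} 1 = \sqrt{| R_{k} | / q^{J}}$. For very small parameters the ratio can also be found by brute force. The sketch below is only an illustration under the assumption that q=p is prime (so field arithmetic is arithmetic mod p); it enumerates every input and counts the distinct values of Z.

```python
from fractions import Fraction
from itertools import product

def exponents(n, d):
    """All exponent vectors j in N^n with j_1 + ... + j_n <= d."""
    return [j for j in product(range(d + 1), repeat=n) if sum(j) <= d]

def z_map(xs, ys, exps, p):
    """Z(x, y)_j = sum_i y_i * x_i^j over the prime field F_p."""
    out = []
    for j in exps:
        total = 0
        for x, y in zip(xs, ys):
            term = y
            for xl, jl in zip(x, j):
                term = term * pow(xl, jl, p) % p
            total = (total + term) % p
        out.append(total)
    return tuple(out)

def success_probability(n, d, k, p):
    """|R_k| / p^J, found by enumerating all (x, y) in F_p^{nk} x F_p^k."""
    exps = exponents(n, d)
    points = list(product(range(p), repeat=n))
    image = set()
    for xs in product(points, repeat=k):
        for ys in product(range(p), repeat=k):
            image.add(z_map(xs, ys, exps, p))
    return Fraction(len(image), p ** len(exps))

one_query = success_probability(n=1, d=1, k=1, p=3)
two_queries = success_probability(n=1, d=1, k=2, p=3)
```

For a univariate linear polynomial over $F_{3}$, one query reaches 7 of the 9 possible vectors, while two queries reach all of them. The enumeration is exponential in k and n and is meant only for toy sizes.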

(ii). $K = R$

We consider a bounded subset $S \subseteq R^{J}$ and a set T_k′ of unique preimages of each element in R_k∩S such that Z(T_k′)=R_k∩S and Z : T_k′→R_k∩S is bijective. The algorithm on input |ψ〉 with support supp(ψ)⊆R_k∩S gives

\begin{aligned} | ψ ⟩ = \int_{R_{k} \cap S} d^{J} z ψ (z) | z ⟩ \mapsto \int_{R_{k} \cap S} d^{J} z ψ (z) | z ⟩ | Z^{- 1} (z) ⟩ \end{aligned}

3.9

\begin{aligned} \mapsto \int_{R_{k} \cap S} d^{J} z ψ (z) e (z \cdot c) | z ⟩ | Z^{- 1} (z) ⟩ \end{aligned}

3.10

\begin{aligned} \mapsto \int_{R_{k} \cap S} d^{J} z ψ (z) e (z \cdot c) | z ⟩ =: | ψ_{c} ⟩ . \end{aligned}

3.11

The choice of S constrains the set of inputs that can be perfectly distinguished by this procedure, as captured by the following lemma.

Lemma 3.1 (Orthogonality) —

For positive integer n, let $m (A) := \int_{A} d^{n} z$ be the measure of the set $A \subseteq R^{n}$ . Let S be a bounded subset of $R^{n}$ with non-zero measure. Let $| \tilde{c} ⟩ = (1 / \sqrt{m (S)}) \int_{S} d^{n} z e (c \cdot z) | z ⟩$ and let U be the maximal subset of $R^{n}$ such that for any c,c′∈U with c≠c′,

$⟨ {\tilde{c}}^{'} | \tilde{c} ⟩ = \frac{1}{m (S)} \int_{S} d^{n} z e ((c - c^{'}) \cdot z) = 0.$ 3.12

Then there is a lattice Λ such that $U \in R^{n} / Λ$ .

Proof. —

By definition, c−c′ must be a zero of the Fourier transform $F (I_{S})$ of the indicator function $I_{S} (z)$ . We denote $Λ := {c : F (I_{S}) (c) = 0} \cup {0}$ and let c₀∈U. Clearly U⊆c₀+Λ as Λ contains all zeros. Since $⟨ \tilde{c + c_{0}} | {\tilde{c}}_{0} ⟩ = 0$ for all c∈Λ∖{0}, we have c₀+Λ⊆U and U=c₀+Λ. If c∈Λ∖{0}, then $⟨ \tilde{c_{0} + c} | {\tilde{c}}_{0} ⟩ = ⟨ {\tilde{c}}_{0} | \tilde{c_{0} - c} ⟩ = 0$ implies that −c∈Λ. If c,c′∈Λ∖{0}, then $⟨ \tilde{c + c_{0}} | \tilde{- c^{'} + c_{0}} ⟩ = ⟨ \tilde{c + c^{'} + c_{0}} | {\tilde{c}}_{0} ⟩ = 0$ implies c+c′∈Λ∖{0}. Therefore, Λ is an additive subgroup of $R^{n}$ .

Now we prove that Λ is a lattice. For ϵ>0, δ∈B(ϵ), and c∈Λ,

$| ⟨ \tilde{c + δ} | \tilde{c} ⟩ |^{2} = {| \int_{S} d^{n} z e (δ \cdot z) |}^{2} \geq {| \int_{S} d^{n} z \cos (2 π δ \cdot z) |}^{2} > 0,$ 3.13

if S⊆B(r) for r<1/(4ϵ), since then |δ⋅z|<1/4 and the cosine is positive on all of S. Thus, B(ϵ) contains exactly one element in Λ and hence Λ is discrete. ▪

Roughly speaking, lemma 3.1 is a consequence of the uncertainty principle: restricting the support to a finite window limits the precision with which we can determine the Fourier transform. In the proof, note that a larger window offers better resolution of the coefficients.
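A one-dimensional example makes the trade-off explicit. Assuming the window S=[−L/2,L/2) (our choice for illustration), the overlap in equation (3.12) is sin(πLΔ)/(πLΔ) with Δ=c−c′, so it vanishes exactly when Δ is a non-zero multiple of 1/L. The sketch below checks this against a direct numerical integration.

```python
import cmath
import math

def overlap(delta, length, steps=20000):
    """Midpoint-rule estimate of (1/m(S)) * int_S e(delta * z) dz
    for the window S = [-length/2, length/2)."""
    h = length / steps
    total = 0j
    for i in range(steps):
        z = -length / 2 + (i + 0.5) * h
        total += cmath.exp(2j * math.pi * delta * z) * h
    return total / length

def closed_form(delta, length):
    """sin(pi * L * delta) / (pi * L * delta), equal to 1 at delta = 0."""
    t = math.pi * delta * length
    return 1.0 if t == 0 else math.sin(t) / t

L = 4.0  # illustrative window size; distinguishable points are spaced 1/L apart
```

With L=4 the perfectly distinguishable coefficients form the lattice (1/4)Z; doubling L halves the spacing.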

We have shown that the set Λ of perfectly distinguishable coefficients forms a lattice. We also require the set ${| \tilde{c} ⟩ : c \in Λ}$ to be a complete basis. Since $⟨ z | \tilde{c} ⟩ = (1 / \sqrt{m (S)}) e (z \cdot c)$ , completeness implies that |z〉 is of the form $\sum_{c \in Λ} e (- z \cdot c) | \tilde{c} ⟩$ up to a normalization constant. More formally, we have the following lemma.

Lemma 3.2 (Completeness) —

For positive integer n, let $m (A) := \int_{A} d^{n} z$ be the measure of the set $A \subseteq R^{n}$ . Let Λ be a discrete additive subgroup of $R^{n}$ . Let S be a bounded set with non-zero measure and $| \tilde{c} ⟩ = (1 / \sqrt{m (S)}) \int_{S} d^{n} z e (z \cdot c) | z ⟩$ . Then ${| \tilde{c} ⟩ : c \in Λ}$ forms a complete basis over support S if and only if S is a fundamental domain of the dual lattice of Λ.

Proof. —

Let $\tilde{Λ}$ be the dual lattice of Λ. We observe that (ignoring the normalization constant)

$\begin{aligned} \sum_{c \in Λ} e (- z \cdot c) | \tilde{c} ⟩ & = \int_{S} d^{J} z^{'} \sum_{c \in Λ} e ((z^{'} - z) \cdot c) | z^{'} ⟩ = \int_{S} d^{J} z^{'} \sum_{z_{0} \in \tilde{Λ}} δ (z^{'} - z - z_{0}) | z^{'} ⟩ \end{aligned}$ 3.14

$\begin{aligned} = \sum_{z_{0} \in \tilde{Λ}} I_{S} (z + z_{0}) | z + z_{0} ⟩ = | (z + \tilde{Λ}) \cap S ⟩ . \end{aligned}$ 3.15

In equation (3.14), we use the identity $\sum_{c \in Λ} e (z \cdot c) = \sum_{z_{0} \in \tilde{Λ}} δ (z - z_{0})$, which holds up to a constant factor [12, Section 7.2]. The set $(z + \tilde{Λ}) \cap S$ cannot be empty, so a fundamental domain of $\tilde{Λ}$ is a subset of S. For $z, z^{'} \in R^{n}$, $⟨ (z + \tilde{Λ}) \cap S | (z^{'} + \tilde{Λ}) \cap S ⟩ = 0$ if $z^{'} \notin z + \tilde{Λ}$, which implies that S is a subset of a fundamental domain of $\tilde{Λ}$. ▪

Lemma 3.2 further requires the bounded set S to be a fundamental domain of $\tilde{Λ}$. Without loss of generality, one may choose S to be a fundamental domain of a lattice centred at zero. In the last step, the algorithm applies the unitary operator

\frac{1}{\sqrt{m (S)}} \sum_{c^{'} \in Λ} \int_{S} d^{J} z e (- z \cdot c^{'}) | c^{'} ⟩ ⟨ z |,

3.16

to the state |ψ_c〉 in equation (3.11). The algorithm outputs c′∈Λ with probability

\frac{1}{m (R_{k} \cap S) m (S)} {| \int_{R_{k} \cap S} d^{J} z ψ (z) e (z \cdot (c - c^{'})) |}^{2} \leq \frac{m (R_{k} \cap S)}{m (S)},

3.17

where the upper bound follows from the Cauchy–Schwarz inequality. The maximum is reached if $ψ (z) = (1 / \sqrt{m (R_{k} \cap S)}) I_{R_{k} \cap S} (z)$ and c happens to be a lattice point. If c∉Λ, the algorithm returns the closest lattice point with high probability.

To achieve arbitrarily high precision, one may want to take $S \to R^{J}$. In this limit, the basis of coefficients is normalized to the Dirac delta function, i.e. $⟨ {\tilde{c}}^{'} | \tilde{c} ⟩ = δ^{(J)} (c - c^{'})$. In this case, $Λ \to R^{J}$ and the unitary operator in equation (3.16) becomes the J-dimensional QFT over the real numbers. However, for the interpolation problem, the success probability m(R_k∩S)/m(S) is not well defined in the limit $S \to R^{J}$ since different shapes for S can give different probabilities. Thus, it is necessary to choose a bounded region, and we leave the optimal choice as an open question.

Though the size of the fundamental domain S affects the resolution of the coefficients, it does not affect the maximal success probability m(R_k∩S)/m(S). This can be seen by scale invariance: for every z∈R_k, there is a preimage (x,y) such that Z(x,y)=z. Then λz∈R_k since Z(x,λy)=λz for any $λ \in R$ . In terms of the bijection ℓ: z↦λz for $λ \in R^{\times}$ , we have ℓ(R_k)=R_k and ℓ(R_k∩S)=R_k∩ℓ(S). Then m(R_k∩ℓ(S))=m(ℓ(R_k∩S))=λ^Jm(R_k∩S) and hence m(R_k∩ℓ(S))/m(ℓ(S))=m(R_k∩S)/m(S). Thus, we can make the precision arbitrarily high by taking S arbitrarily large, and we call m(R_k∩S)/m(S) the success probability of the algorithm.

(iii). $K = C$

We consider a bounded set $S \subseteq C^{J}$ and a set T_k′ of unique preimages of each element in R_k∩S such that Z(T_k′)=R_k∩S and Z : T_k′→R_k∩S is bijective. The algorithm on input |ψ〉 with support supp(ψ)⊆R_k∩S gives

\begin{aligned} | ψ ⟩ = \int_{ϕ (R_{k} \cap S)} d^{2 J} z ψ (z) | z ⟩ & \mapsto \int_{ϕ (R_{k} \cap S)} d^{2 J} z ψ (z) | z ⟩ | ϕ (Z^{- 1} (z)) ⟩ \end{aligned}

3.18

\begin{aligned} \mapsto \int_{ϕ (R_{k} \cap S)} d^{2 J} z ψ (z) e (z \cdot c) | z ⟩ | ϕ (Z^{- 1} (z)) ⟩ \end{aligned}

3.19

\begin{aligned} \mapsto \int_{ϕ (R_{k} \cap S)} d^{2 J} z ψ (z) e (z \cdot c) | z ⟩ =: | ψ_{c} ⟩ . \end{aligned}

3.20

By lemmas 3.1 and 3.2, the set S must be a fundamental domain in $C^{J}$. Let ${| \tilde{c} ⟩ : c \in Λ}$ be the measurement basis. In the last step of the algorithm, we apply the unitary operator

\frac{1}{\sqrt{m (S)}} \sum_{c^{'} \in ϕ (Λ)} \int_{ϕ (S)} d^{2 J} z e (- z \cdot c^{'}) | c^{'} ⟩ ⟨ z |,

3.21

to the state |ψ_c〉 in equation (3.20). The algorithm outputs c′∈Λ with probability

\frac{1}{m (R_{k} \cap S) m (S)} {| \int_{ϕ (R_{k} \cap S)} d^{2 J} z ψ (z) e (z \cdot (c - c^{'})) |}^{2} .

3.22

Again, since |ψ〉 is normalized, equation (3.22) cannot be arbitrarily large. By the Cauchy–Schwarz inequality, equation (3.22) is upper bounded by m(R_k∩S)/m(S); this maximal success probability is obtained if $ψ (z) = (1 / \sqrt{m (R_{k} \cap S)}) I_{ϕ (R_{k} \cap S)} (z)$ and c happens to be a lattice point. If c∉Λ, the algorithm returns the closest lattice point with high probability.

By the same argument as in §3b(ii), we can show scale invariance holds for complex numbers: for ℓ: z↦λz where $z \in C^{J}$ and $λ \in R^{\times}$ , m(R_k∩S)/m(S)=m(R_k∩ℓ(S))/m(ℓ(S)). Thus, we can make the precision of the algorithm arbitrarily high by taking S arbitrarily large without affecting the maximal success probability.

(c). Performance

We have shown in §3b(i) that the optimal success probability is at most |R_k|/q^J for $K = F_{q}$ . For real and complex numbers, we consider a bounded support S in which the algorithm is performed. The success probability of the algorithm with this choice is at most m(R_k∩S)/m(S), as shown in equations (3.17) and (3.22). To establish the query complexity, first we show that if $\dim R_{k} = J$ , the algorithm outputs the coefficients with bounded error.

Lemma 3.3 —

For positive integers n,k,d, let $J := \binom{n + d}{d}$ and let $m (A) := \int_{A} d^{J} z$ be the volume of $A \subseteq R^{J}$. For an infinite field $K$, let $Z : K^{n k} \times K^{k} \to K^{J}$ be defined by $Z {(x, y)}_{j} = \sum_{i = 1}^{k} y_{i} x_{i}^{j}$, and let $R_{k} = Z (K^{n k}, K^{k})$ be the range of Z. If $\dim R_{k} = J$, then m(R_k∩S)/m(S)>0 for any fundamental domain S centred at 0.

Proof. —

R_k is a constructible set for $K = C$ and it is a semialgebraic set for $K = R$ . By [9,10], R_k has non-empty interior if $\dim (R_{k}) = J$ for both cases.

S is a fundamental domain centred at 0 with finite measure, so we only need to prove that m(R_k∩S) is of positive measure, or equivalently, that the interior of R_k and the interior of S have non-empty intersection.

If this is not the case, then no interior point of S lies in the interior of R_k. By scale invariance of R_k, no point of $K^{J}$ other than 0 can then lie in the interior of R_k, which contradicts the fact that R_k has non-empty interior given $\dim (R_{k}) = J$. ▪

Lemma 3.3 shows that for infinite fields, although we perform the algorithm over a bounded support, the query complexity can be understood by considering the dimension of the entire set R_k. Moreover, by invoking recent work on typical ranks, we can establish the minimum number of queries to determine the coefficients almost surely.

Now let $v_{d} (x_{1}, x_{2}, \dots, x_{n})$ be the $\binom{n + d}{d}$-dimensional vector that contains all monomials with variables x₁,…,x_n of degree no more than d as its entries. Let

$X_{n, d} := \{ v_{d} (x_{1}, x_{2}, \dots, x_{n}) : x_{1}, x_{2}, \dots, x_{n} \in K \},$ 3.23

where $K$ is a given ground field. For example, we have

$X_{3, 2} = \{ (x_{1}^{2}, x_{2}^{2}, x_{3}^{2}, x_{1} x_{2}, x_{1} x_{3}, x_{2} x_{3}, x_{1}, x_{2}, x_{3}, 1)^{T} : x_{1}, x_{2}, x_{3} \in K \} .$ 3.24

Our question is to determine the smallest number k such that a generic vector in $K^{(\binom{n + d}{d})}$ can be written as a linear combination of no more than k elements from X_n,d. More precisely, we have $R_{k} = {\sum_{i = 1}^{k} c_{i} v_{i} : c_{i} \in K, v_{i} \in X_{n, d}}$ , and we ask what is the smallest number k such that R_k has full measure in $K^{(\binom{n + d}{d})}$ .

Our approach requires basic knowledge of algebraic geometry—specifically, the concepts of Zariski topology, Veronese variety and secant variety. Formal definitions can be found in §2b. For the reader’s convenience, we also explain these concepts briefly when we first use them.

Now we make two simple observations.

(1) In general, $v_{d} (x_{1}, x_{2}, \dots, x_{n})$ can be treated as a $\binom{n + d}{d}$-dimensional vector that contains all monomials with variables $x_{1}, \dots, x_{n}, x_{n + 1}$ of degree d as its entries, by simply taking the map $(x_{1}, x_{2}, \dots, x_{n}) \mapsto (x_{1} / x_{n + 1}, \dots, x_{n} / x_{n + 1})$ and multiplying by $x_{n + 1}^{d}$. For example, applying this mapping to $X_{3, 2}$ gives
$X_{3, 2}^{'} = \{ (x_{1}^{2}, x_{2}^{2}, x_{3}^{2}, x_{1} x_{2}, x_{1} x_{3}, x_{2} x_{3}, x_{1} x_{4}, x_{2} x_{4}, x_{3} x_{4}, x_{4}^{2})^{T} : x_{1}, x_{2}, x_{3}, x_{4} \in K \} .$
The new set $X_{n, d}^{'}$ is slightly bigger than $X_{n, d}$ since it also contains those points corresponding to $x_{n + 1} = 0$, but this will not affect our calculation since the difference is just a measure zero set in $X_{n, d}^{'}$.
(2) The set $X_{n, d}^{'}$ is the Veronese variety. One may also note that this set is isomorphic to $((x_{1}, x_{2}, \dots, x_{n + 1})^{T})^{\otimes d}$ in the symmetric subspace.
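The smallest case of observation (1), n=1 and d=2, can be written out in full (this example is ours, added for illustration):

```latex
v_2(x_1) = (x_1^2,\; x_1,\; 1)^{T}
\;\longmapsto\;
x_2^{2}\, v_2\!\left(\frac{x_1}{x_2}\right) = (x_1^2,\; x_1 x_2,\; x_2^2)^{T}.
```

Setting $x_{2} = 1$ recovers the original vector, while $x_{2} = 0$ gives the additional points $(x_{1}^{2}, 0, 0)^{T}$ that belong to the homogenized set but not to the original one.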

These observations imply that instead of studying R_k, we can study the new set

$R_{k}^{'} = \{ \sum_{i = 1}^{k} c_{i} v_{i}^{'} : c_{i} \in K, v_{i}^{'} \in X_{n, d}^{'} \} .$ 3.25

In general, we have a sequence of inclusions:

X_{n, d}^{'} = R_{1}^{'} \subseteq R_{2}^{'} \subseteq \dots \subseteq R_{k}^{'} \subseteq \dots \subseteq K^{(\binom{n + d}{d})} .

3.26

By taking the Zariski closure, we also have

{\bar{X}}_{n, d}^{'} = {\bar{R}}_{1}^{'} \subseteq {\bar{R}}_{2}^{'} \subseteq \dots \subseteq {\bar{R}}_{k}^{'} \subseteq \dots \subseteq K^{(\binom{n + d}{d})},

3.27

where ${\bar{R}}_{k}^{'}$ is the kth secant variety of the Veronese variety X_n,d′.

Palatini showed the following [13,14]:

Lemma 3.4 —

If $\dim {\bar{R}}_{k + 1}^{'} \leq \dim {\bar{R}}_{k}^{'} + 1$ , then ${\bar{R}}_{k + 1}^{'}$ is linear.

In particular, this shows that if $\dim {\bar{R}}_{k}^{'} = (\binom{n + d}{d})$ , then ${\bar{R}}_{k}^{'} = K^{(\binom{n + d}{d})}$ .

For an infinite field $K$ , define $k_{K}$ to be the smallest integer such that $m (R_{k_{K}} \cap S) / m (S) = 1$ . Thus, $k_{K}$ represents the minimal number of queries such that our algorithm succeeds with probability 1. For the finite field case $K = F_{q}$ , we only require that $m (R_{k_{F_{q}}} \cap S) / m (S)$ goes to 1 when q tends to infinity.

(i). $K = C$

A theorem due to Alexander & Hirschowitz [15] implies an upper bound on the query complexity of polynomial interpolation over $C$ .

Theorem 3.5 (Alexander–Hirschowitz Theorem, [15]) —

The dimension of ${\bar{R}}_{k}^{'}$ satisfies

$\dim {\bar{R}}_{k}^{'} = \begin{cases} k (n + 1) - \frac{k (k - 1)}{2} & d = 2,\ 2 \leq k \leq n; \\ \binom{n + d}{d} - 1 & (d, n, k) = (3, 4, 7), (4, 2, 5), (4, 3, 9), (4, 4, 14); \\ \min \{ k (n + 1), \binom{n + d}{d} \} & \text{otherwise}. \end{cases}$ 3.28

Thus, the minimum k to make ${\bar{R}}_{k}^{'} = C^{(\binom{n + d}{d})}$ is

$k_{C} (n, d) := \begin{cases} n + 1 & d = 2,\ n \geq 2; \\ \lceil \frac{1}{n + 1} \binom{n + d}{d} \rceil + 1 & (n, d) = (4, 3), (2, 4), (3, 4), (4, 4); \\ \lceil \frac{1}{n + 1} \binom{n + d}{d} \rceil & \text{otherwise}. \end{cases}$ 3.29

By parameter counting, we see that R_k is of full measure in R_k′. It remains to show that R_k′ is of full measure in its Zariski closure ${\bar{R}}_{k}^{'}$ :

Theorem 3.6 —

R_k′ is of full measure in ${\bar{R}}_{k}^{'}$ .

Proof. —

R_k′ is just the image of the map (Q₁,Q₂,…,Q_k)↦(Q₁+Q₂+⋯+Q_k). By Exercise 3.19 in ch. II of [16], R_k′ is a constructible set, so it contains an open subset of each connected component of ${\bar{R}}_{k}^{'}$ . Therefore, its complement is of measure 0. ▪

This immediately implies the following:

Corollary 3.7 —

R_k has measure 0 in $C^{(\binom{n + d}{d})}$ for $k < k_{C} (n, d)$ and measure 1 in $C^{(\binom{n + d}{d})}$ for $k \geq k_{C} (n, d)$ .

Thus, as the integer k increases, m(R_k′∩S)/m(S) suddenly jumps from 0 to 1 at the point $k_{C} (n, d)$ , and so does m(R_k∩S)/m(S). This implies part (3) of theorem 1.1.

(ii). $K = R$

Now consider the case $K = R$ . For d=2, (n+1)-variate symmetric tensors are simply (n+1)×(n+1) symmetric matrices, so a random (n+1)-variate symmetric tensor will be of rank n+1 with probability 1. However, if the order of the symmetric tensors is larger than 2, the situation is much more complicated. For example, a random bivariate symmetric tensor of order 3 will be of two different ranks, 2 and 3, both with positive probabilities.
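The bivariate order-3 example can be explored numerically. The sketch below relies on a standard fact about real binary cubics that we assume here rather than derive: a cubic with non-zero discriminant has real rank 3 when it has three distinct real roots (positive discriminant) and real rank 2 otherwise. The choice of independent Gaussian coefficients as the notion of "random", and the function names, are our own assumptions.

```python
import random


def cubic_discriminant(a: float, b: float, c: float, d: float) -> float:
    """Discriminant of the binary cubic a*x^3 + b*x^2*y + c*x*y^2 + d*y^3."""
    return (18 * a * b * c * d - 4 * b**3 * d + b * b * c * c
            - 4 * a * c**3 - 27 * a * a * d * d)


def real_rank_of_generic_cubic(a: float, b: float, c: float, d: float) -> int:
    """Real rank of a binary cubic, assuming its discriminant is non-zero.

    Positive discriminant means three distinct real roots (rank 3);
    negative discriminant means one real root (rank 2). The measure-zero
    case of zero discriminant is not handled correctly by this sketch.
    """
    return 3 if cubic_discriminant(a, b, c, d) > 0 else 2


def estimate_rank_frequencies(samples: int = 20000, seed: int = 0) -> dict:
    """Monte Carlo estimate of how often each typical rank occurs."""
    rng = random.Random(seed)
    counts = {2: 0, 3: 0}
    for _ in range(samples):
        coeffs = [rng.gauss(0.0, 1.0) for _ in range(4)]
        counts[real_rank_of_generic_cubic(*coeffs)] += 1
    return {rank: count / samples for rank, count in counts.items()}


if __name__ == "__main__":
    print(estimate_rank_frequencies())
```

Under these assumptions both ranks appear with clearly positive empirical frequency, which is the behaviour described above.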

From the perspective of algebraic geometry, it still holds that ${\bar{R}}_{k}^{'} = R^{(\binom{n + d}{d})}$ for $k \geq k_{C} (n, d)$ and for $k < k_{C} (n, d)$ , ${\bar{R}}_{k}^{'}$ is of measure zero in $R^{(\binom{n + d}{d})}$ . It also holds that R_k is of full measure in R_k′. However, the claim that R_k′ has full measure in ${\bar{R}}_{k}^{'}$ no longer holds over $R$ . As we explained in the proof of theorem 3.6, R_k′ is the image of the map (Q₁,Q₂,…,Q_k)↦(Q₁+Q₂+⋯+Q_k). For an algebraically closed field $K$ , it is known that the image of any polynomial map is a constructible set in its Zariski closure. Thus, R_k′ is of full measure in ${\bar{R}}_{k}^{'}$ . Over $R$ , it is easy to verify that the image may not be of full measure in its Zariski closure (a simple counterexample is x↦x², whose image is the non-negative half-line while its Zariski closure is all of $R$ ). Consequently, over $C$ , R_k′ has non-empty interior for a unique value of k, and this value of k is called the generic rank. Over $R$ , R_k′ is just a semialgebraic set and it may have non-empty interior for several values of k, which are called the typical ranks.

For the univariate case, we have the following theorem:

Theorem 3.8 ([17]; [18]) —

For n=1, all integers from $k_{C} = ⌈ (d + 1) / 2 ⌉$ to $k_{R} = d$ are typical ranks.

For the multivariate case n≥2, it still holds that $k_{C} (n, d)$ defined in §3c(i) is the smallest typical rank [10]. According to Bernardi et al. [9], every rank between $k_{C} (n, d)$ and the top typical rank $k_{R} (n, d)$ is also typical. Thus, we only need to study the top typical rank $k_{R} (n, d)$ . Unfortunately, the top typical rank in general is not known. In the literature, considerable effort has been devoted to understanding the maximum possible rank $k_{max} (n, d)$ , which, by definition, is also an upper bound for $k_{R} (n, d)$ . In particular, we have $k_{max} (n, 2) \leq n + 1$ for n≥2, $k_{max} (2, 4) \leq 11$ , $k_{max} (3, 4) \leq 19$ , $k_{max} (4, 4) \leq 29$ , $k_{max} (4, 3) \leq 15$ and $k_{max} (n, d) \leq 2 ⌈ (1 / (n + 1)) (\binom{n + d}{d}) ⌉$ otherwise [10].

The above result implies $k_{R} (n, d) \leq k_{max} (n, d) \leq 2 k_{C} (n, d)$ . We also mention a few other upper bounds on $k_{max} (n, d)$ . Trivially, we have $k_{max} (n, d) \leq (\binom{n + d}{d})$ . In [19,20], this was improved to $k_{max} (n, d) \leq (\binom{n + d}{d}) - n$ . Later work showed that $k_{max} (n, d) \leq (\binom{n + d - 1}{n})$ [21]. Jelisiejew [22] then proved that $k_{max} (n, d) \leq (\binom{n + d - 1}{n}) - (\binom{n + d - 5}{n - 2})$ , and Ballico & De Paris [23] then improved this to $k_{max} (n, d) \leq (\binom{n + d - 1}{n}) - (\binom{n + d - 5}{n - 2}) - (\binom{n + d - 6}{n - 2})$ . For small cases, these bounds may be stronger than the bound $k_{max} (n, d) \leq 2 k_{C} (n, d)$ mentioned above.
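To see how these bounds compare for concrete parameters, the sketch below evaluates the trivial bound, the bound of [19,20], the bound of [21] and the bound quoted from [10] (including its listed special cases) for n≥2. It only evaluates the formulas as stated above and does not check any hypotheses of the cited results; the function names and dictionary keys are ours. The bounds of [22] and [23] are omitted because their binomial coefficients need a convention for small arguments that is not fixed here.

```python
from math import comb


def bound_from_ref_10(n: int, d: int) -> int:
    """Upper bound on k_max(n, d) quoted from [10], for n >= 2."""
    special_cases = {(2, 4): 11, (3, 4): 19, (4, 4): 29, (4, 3): 15}
    if d == 2:
        return n + 1
    if (n, d) in special_cases:
        return special_cases[(n, d)]
    # 2 * ceil(binom(n + d, d) / (n + 1)), using integer arithmetic only.
    return 2 * -(-comb(n + d, d) // (n + 1))


def kmax_bounds(n: int, d: int) -> dict:
    """Evaluate several quoted upper bounds on k_max(n, d) for n >= 2."""
    if n < 2 or d < 1:
        raise ValueError("this sketch covers the multivariate case n >= 2")
    return {
        "trivial": comb(n + d, d),
        "[19,20]": comb(n + d, d) - n,
        "[21]": comb(n + d - 1, n),
        "[10]": bound_from_ref_10(n, d),
    }


if __name__ == "__main__":
    for n, d in [(2, 3), (3, 3), (3, 5)]:
        print((n, d), kmax_bounds(n, d))
```

For (n,d)=(2,3) the bound of [21] gives 6, which is smaller than the value 8 from [10], whereas for (n,d)=(3,5) the bound from [10] gives 28 against 35 from [21].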

To summarize, we have the following, which implies part (2) of theorem 1.1:

Theorem 3.9 —

As the integer k increases from $k_{C} (n, d) - 1$ to $k_{R} (n, d) \leq 2 k_{C} (n, d)$ , m(R_k′∩S)/m(S) forms a strictly increasing sequence from 0 to 1, and so does m(R_k∩S)/m(S).

(iii). $K = F_{q}$

We link the finite field case with the complex case using the Lang–Weil theorem:

Theorem 3.10 (Lang–Weil Theorem, [24]) —

There exists a constant A(n,d,r) depending only on n,d,r such that for any variety $V \subseteq P^{n}$ with dimension r and degree d, if we define V over a finite field $F_{q}$ , the number of points in V must satisfy

$| N - q^{r} | \leq (d - 1) (d - 2) q^{r - 1 / 2} + A (n, d, r) q^{r - 1} .$ 3.30

The Lang–Weil theorem shows that when q is large enough, the number of points in a variety over $F_{q}$ is very close to $q^{\dim V}$ . So it actually tells us that m(R_k′∩S)/m(S)=0 if $k < k_{C} (n, d)$ . It remains unclear whether m(R_k′∩S)/m(S)>0 for $k = k_{C} (n, d)$ . Once again, for the finite field case, when we talk about the measure, we always assume q is sufficiently large. As in the real field case, the main challenge now is to study the measure of R_k′ in ${\bar{R}}_{k}^{'}$ .
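A small brute-force experiment illustrates the Lang–Weil estimate in the simplest non-trivial setting, a plane cubic curve (r=1, d=3), where the leading error term is (d−1)(d−2)q^{1/2}=2q^{1/2}. The specific curve y²z=x³+xz²+z³ and the function name are our own choices for illustration. For a smooth plane cubic, Hasse's theorem gives |N−(q+1)|≤2q^{1/2}, so the constant A can be taken to be 1 in this special case.

```python
from math import sqrt


def count_projective_points(p: int, a: int = 1, b: int = 1) -> int:
    """Count points of y^2 z = x^3 + a x z^2 + b z^3 over F_p, p an odd prime."""
    # square_roots[t] is the number of y in F_p with y^2 = t.
    square_roots = [0] * p
    for y in range(p):
        square_roots[y * y % p] += 1
    affine = sum(square_roots[(x**3 + a * x + b) % p] for x in range(p))
    # Setting z = 0 forces x = 0, so (0 : 1 : 0) is the only point at infinity.
    return affine + 1


if __name__ == "__main__":
    for p in (5, 7, 11, 13, 101):
        n_points = count_projective_points(p)
        print(p, n_points, abs(n_points - p), 2 * sqrt(p) + 1)
```

The printed deviation |N−p| stays below 2p^{1/2}+1 for each prime tested, consistent with the number of points being close to q^{dim V}.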

For the upper bound, recall our notation that v_d(x₁,x₂,…,x_n) is the $\binom{n + d}{d}$-dimensional vector that contains all monomials with degree no more than d as its entries.

Here we make a slight change to the definition: we require every x_i in v_d to be non-zero. We similarly define X_n,d′′ and R_k′′. We prove the following:

Lemma 3.11 —

Let $r_{n, d}$ be the minimum number such that $| R_{r_{n, d}}^{″} | = q^{\binom{n + d}{d}} - O (q^{\binom{n + d}{d} - 1})$. Then $r_{n, d} \leq r_{n - 1, d} + r_{n, d - 1}$.

Proof. —

The proof is by induction on n+d.

For n+d=2, it is easy to verify r_2,2=3≤r_1,2+r_2,1=2+1. Assume lemma 3.11 holds for n+d≤m−1 and consider the pair (n,d) with n+d=m. For the sake of readability, we first explain how the induction works for the specific example (n,d)=(3,2), and then generalize our idea to any (n,d).

The vector

$v_{2} (x_{1}, x_{2}, x_{3}) = {(x_{1}^{2}, x_{2}^{2}, x_{3}^{2}, x_{1} x_{2}, x_{1} x_{3}, x_{2} x_{3}, x_{1}, x_{2}, x_{3}, 1)}^{T} \in X_{3, 2}^{″}$ 3.31

can be rearranged as ${(x_{3}, x_{3} x_{1}, x_{3} x_{2}, x_{3}^{2}, x_{1}^{2}, x_{2}^{2}, x_{1} x_{2}, x_{1}, x_{2}, 1)}^{T}$ . The first four entries can be rewritten as $x_{3}^{2} {(1 / x_{3}, x_{1} / x_{3}, x_{2} / x_{3}, 1)}^{T} = x_{3}^{2} v_{1} (1 / x_{3}, x_{1} / x_{3}, x_{2} / x_{3})$ , and the last six entries form v₂(x₁,x₂).

When (x₁,x₂,x₃) ranges over all 3-tuples in $F_{q} \setminus \{0\}$, (1/x₃,x₁/x₃,x₂/x₃) also ranges over all possible 3-tuples in $F_{q} \setminus \{0\}$. By assumption, if we take linear combinations of r_3,1 vectors chosen from X_3,2′′, the first four entries will range over no fewer than $q^{\binom{3 + 1}{1}} - O (q^{\binom{3 + 1}{1} - 1})$ different vectors in $F_{q}^{\binom{3 + 1}{1}}$.

For any such linear combination, we can add r_2,2 extra vectors from X_3,2 with the restriction that x₃=0, which will guarantee these extra vectors do not affect the first four entries. By assumption, the last six entries will range over no fewer than $q^{\binom{2 + 2}{2}} - O (q^{\binom{2 + 2}{2} - 1})$ different vectors in $F_{q}^{\binom{2 + 2}{2}}$.

Thus, in total, we have $(q^{(\binom{3 + 1}{1})} - O (q^{(\binom{3 + 1}{1}) - 1})) (q^{(\binom{2 + 2}{2})} - O (q^{(\binom{2 + 2}{2}) - 1}))$ different vectors in $F_{q}^{(\binom{3 + 2}{2})}$ if we take linear combinations of r_3,1+r_2,2 vectors from X_3,2′′, which implies r_3,2≤r_3,1+r_2,2.

For general (n,d), the analogous partition of v_d(x₁,x₂,…,x_n) is still valid. The $\binom{n + d}{d} - \binom{n - 1 + d}{d} = \binom{n + d - 1}{d - 1}$ entries involving x_n form $x_{n}^{d} v_{d - 1} (1 / x_{n}, x_{1} / x_{n}, \dots, x_{n - 1} / x_{n})$ (the factor is $x_{3}^{2}$ in the example above, where d=2), and the rest form v_d(x₁,x₂,…,x_n−1). All arguments follow straightforwardly, so we have r_n,d≤r_n−1,d+r_n,d−1 for n+d=m and for any (n,d) by induction. ▪
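The partition used in this proof can be checked mechanically on exponent vectors. The sketch below lists the monomials of v_d(x₁,…,x_n), separates those that involve x_n, and confirms that they coincide with the entries of $x_{n}^{d} v_{d - 1} (1 / x_{n}, x_{1} / x_{n}, \dots, x_{n - 1} / x_{n})$, in agreement with the (3,2) example where the factor is $x_{3}^{2}$. The representation by exponent tuples and all function names are our own.

```python
from itertools import product
from math import comb


def exponents(n: int, d: int) -> list:
    """Exponent vectors of all monomials of degree at most d in n variables."""
    return [e for e in product(range(d + 1), repeat=n) if sum(e) <= d]


def split_by_last_variable(n: int, d: int) -> tuple:
    """Split the entries of v_d(x_1, ..., x_n) by whether x_n appears."""
    with_xn = {e for e in exponents(n, d) if e[-1] > 0}
    without_xn = {e for e in exponents(n, d) if e[-1] == 0}
    return with_xn, without_xn


def scaled_lower_degree_block(n: int, d: int) -> set:
    """Exponents of x_n^d * v_{d-1}(1/x_n, x_1/x_n, ..., x_{n-1}/x_n)."""
    block = set()
    for e in exponents(n, d - 1):
        # e[0] is the power of 1/x_n and e[1:] are the powers of x_i/x_n,
        # so every factor contributes one negative power of x_n.
        power_of_xn = d - sum(e)
        block.add(tuple(e[1:]) + (power_of_xn,))
    return block


if __name__ == "__main__":
    with_x3, without_x3 = split_by_last_variable(3, 2)
    print(len(with_x3), len(without_x3))
    print(scaled_lower_degree_block(3, 2) == with_x3)
```

For (n,d)=(3,2) this reproduces the split into four entries involving x₃ and six entries forming v₂(x₁,x₂).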

Corollary 3.12 —

$r_{n, d} \leq \binom{n + d - 1}{d - 1}$.

Proof. —

We use induction on n+d. For n+d=2, the claim is easy to verify. If it is true for n+d=m, then for n+d=m+1, we have $r_{n, d} \leq r_{n - 1, d} + r_{n, d - 1} \leq \binom{n + d - 2}{d - 1} + \binom{n + d - 2}{d - 2} = \binom{n + d - 1}{d - 1}$. ▪

R_k′′ is obviously a subset of R_k′, so $k_{F_{q}} (n, d) \leq r_{n, d}$ . By combining theorem 3.10 and corollary 3.12, we have the following, which implies part (1) of theorem 1.1:

Corollary 3.13 —

$k_{C} (n, d) \leq k_{F_{q}} (n, d) \leq r_{n, d} \leq (\binom{n + d - 1}{d - 1}) = (d / (n + d)) (\binom{n + d}{d})$ .
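The final equality in corollary 3.13 is a one-line manipulation of factorials, spelled out here for convenience:

$\binom{n + d - 1}{d - 1} = \frac{(n + d - 1)!}{(d - 1)! \, n!} = \frac{d}{n + d} \cdot \frac{(n + d)!}{d! \, n!} = \frac{d}{n + d} \binom{n + d}{d}.$

For instance, (n,d)=(3,2) gives $\binom{4}{1} = 4 = \frac{2}{5} \cdot 10$, which coincides with $k_{C} (3, 2) = n + 1 = 4$, so the chain of inequalities is tight in that case.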

Remark 3.14 —

By combining corollary 3.12 with the fact $r_{n, 2} \geq k_{C} (n, 2)$ , we have r_n,2=n+1.

Remark 3.15 —

It was previously known that r_n,1=1 [25,26] and r_1,d=⌈(d+1)/2⌉ [2]. We can further refine the upper bound using these boundary conditions:

$\begin{aligned} r_{n, d} & \leq \sum_{i = 0}^{d - 2} \binom{n - 2 + i}{i} r_{1, d - i} + \binom{d + n - 3}{d - 1} \\ & \leq \sum_{i = 0}^{d - 2} \binom{n - 2 + i}{i} \frac{d - i + 3}{2} + \binom{d + n - 3}{d - 1} \\ & = \frac{n + d + 2}{2} \binom{n + d - 3}{n - 1} - \frac{n - 1}{2} \binom{n + d - 2}{n} + \binom{d + n - 3}{d - 1} . \end{aligned}$ 3.32
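The refinement in remark 3.15 amounts to unrolling the recurrence of lemma 3.11 down to the boundary values r_n,1=1 and r_1,d=⌈(d+1)/2⌉. The following sketch, with function names of our own choosing, evaluates the recurrence directly and compares it with the first expression on the right-hand side of (3.32) and with the bound of corollary 3.12. It computes upper bounds only and says nothing about the true value of r_n,d.

```python
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def recursive_bound(n: int, d: int) -> int:
    """Upper bound on r_{n,d} from lemma 3.11 with the stated boundary values."""
    if d == 1:
        return 1                 # r_{n,1} = 1
    if n == 1:
        return (d + 2) // 2      # r_{1,d} = ceil((d + 1) / 2)
    return recursive_bound(n - 1, d) + recursive_bound(n, d - 1)


def unrolled_bound(n: int, d: int) -> int:
    """First right-hand expression of (3.32) with exact r_{1,d-i}, for n, d >= 2."""
    total = sum(comb(n - 2 + i, i) * recursive_bound(1, d - i)
                for i in range(d - 1))
    return total + comb(d + n - 3, d - 1)


def corollary_bound(n: int, d: int) -> int:
    """The bound binom(n + d - 1, d - 1) of corollary 3.12."""
    return comb(n + d - 1, d - 1)


if __name__ == "__main__":
    for n, d in [(2, 2), (3, 2), (2, 3), (3, 3), (4, 4)]:
        print((n, d), recursive_bound(n, d), unrolled_bound(n, d),
              corollary_bound(n, d))
```

For (n,d)=(3,3), for example, the recurrence gives 9 while corollary 3.12 gives 10, so the boundary conditions yield a small improvement.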

4. Optimality

In this section, we show that our algorithm is optimal for the case of finite fields. Specifically, we show that no k-query quantum algorithm can succeed with probability greater than |R_k|/q^J. This follows by essentially the same argument as in the univariate case [2].

First we show that the final state of a k-query algorithm is restricted to a subspace of dimension |R_k|. We prove the following:

Lemma 4.1 (cf. Lemma 3 of [2], arXiv version) —

Let $J := \binom{n + d}{d}$, and let |ψ_c〉 be the state of any quantum algorithm after k queries, where the black box contains $c \in F_{q}^{J}$. Then $\dim \operatorname{span} \{ | ψ_{c} ⟩ : c \in F_{q}^{J} \} \leq | R_{k} |$.

Proof. —

Following the same technique as in the proof of Lemma 3 in [2], arXiv version, consider a general k-query quantum algorithm U_kQ_cU_k−1Q_c…Q_cU₁Q_cU₀ acting on a state space of the form |x,y,w〉 for an arbitrary-sized workspace register |w〉. Here Q_c: |x,y〉↦e(yf(x))|x,y〉 for $x \in F_{q}^{n}, y \in F_{q}$ is the phase query. Starting from the initial state |x₀,y₀,w₀〉=|0,0,0〉, we can write the output state in the form

$| ψ_{c} ⟩ = \sum_{z \in R_{k}} e (z \cdot c) | ξ_{z} ⟩,$ 4.1

where with $x = (x_{1}, \dots, x_{k}) \in {(F_{q}^{n})}^{k}$ , $y = (y_{1}, \dots, y_{k}) \in F_{q}^{k}$ , w=(w₁,…,w_k+1) and I an appropriate index set,

$| ξ_{z} ⟩ = \sum_{(x, y) \in Z^{- 1} (z)} \sum_{\begin{matrix} x_{k + 1} \in F_{q}^{n}, \\ y_{k + 1} \in F_{q}, \\ w \in I^{k + 1} \end{matrix}} (\prod_{j = 0}^{k} ⟨ x_{j + 1}, y_{j + 1}, w_{j + 1} | U_{j} | x_{j}, y_{j}, w_{j} ⟩) | x_{k + 1}, y_{k + 1}, w_{k + 1} ⟩ .$ 4.2

Then $\dim \operatorname{span} \{ | ψ_{c} ⟩ : c \in F_{q}^{J} \} \leq \dim \operatorname{span} \{ | ξ_{z} ⟩ : z \in R_{k} \} \leq | R_{k} |$. ▪

We also use the following basic lemma about the distinguishability of a set of quantum states in a space of restricted dimension.

Lemma 4.2 (Lemma 2 of [2], arXiv version) —

Suppose we are given a state |ϕ_c〉 with c∈C chosen uniformly at random. Then the probability of correctly determining c with some orthogonal measurement is at most $\dim \operatorname{span} \{ | ϕ_{c} ⟩ : c \in C \} / | C |$.

Combining these lemmas, the success probability of multivariate interpolation under the uniform distribution over $c \in F_{q}^{J}$ (and hence also in the worst case) is at most |R_k|/q^J.
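For very small parameters the bound |R_k|/q^J can be computed by exhaustive enumeration. The sketch below assumes that R_k is the range of the map Z(x,y)=Σ_j y_j v_d(x_j) over $F_{q}$, which is the form suggested by the phase query e(yf(x)) and equation (4.1); this definition is taken as an assumption here, q is restricted to primes so that arithmetic modulo q models the field, and the function names are ours.

```python
from itertools import product


def monomial_vector(x: tuple, d: int, q: int) -> tuple:
    """v_d(x): all monomials of degree at most d in the entries of x, mod q."""
    vec = []
    for e in product(range(d + 1), repeat=len(x)):
        if sum(e) <= d:
            value = 1
            for xi, ei in zip(x, e):
                value = value * pow(xi, ei, q) % q
            vec.append(value)
    return tuple(vec)


def range_size(n: int, d: int, k: int, q: int) -> int:
    """|R_k| for prime q, with Z(x, y) = sum_j y_j * v_d(x_j) (assumed form)."""
    # All vectors y * v_d(x) obtainable from a single query.
    single = {tuple(y * c % q for c in monomial_vector(x, d, q))
              for x in product(range(q), repeat=n) for y in range(q)}
    zero = (0,) * len(monomial_vector((0,) * n, d, q))
    reachable = {zero}
    for _ in range(k):
        # Add the contribution of one more query to every reachable vector.
        reachable = {tuple((a + b) % q for a, b in zip(u, v))
                     for u in reachable for v in single}
    return len(reachable)


def success_probability_bound(n: int, d: int, k: int, q: int) -> float:
    """The upper bound |R_k| / q^J on the success probability."""
    big_j = len(monomial_vector((0,) * n, d, q))
    return range_size(n, d, k, q) / q ** big_j


if __name__ == "__main__":
    for k in (1, 2, 3):
        print(k, range_size(1, 2, k, 3), success_probability_bound(1, 2, k, 3))
```

With q=3, n=1 and d=2, a single query reaches only 7 of the 27 coefficient vectors, while three queries reach all of them, as expected from classical interpolation at three distinct points.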

Unfortunately, it is unclear how to generalize this argument to the infinite-dimensional case, so we leave lower bounds on the query complexity of polynomial interpolation over $R$ and $C$ as a topic for future work.

5. Conclusion and open problems

In this paper, we studied the number of quantum queries required to determine the coefficients of a degree-d polynomial in n variables over a field $K$ . We proposed a quantum algorithm that works for $K = C$ , $R$ or $F_{q}$ , and we used it to give upper bounds on the quantum query complexity of multivariate polynomial interpolation in each case. Our results show a substantially larger gap between classical and quantum algorithms than the univariate case.

Several open questions remain. Recall that $k_{K}$ represents the minimal number of queries required for our algorithm to succeed with probability 1 over the field $K$ (or with probability approaching 1 for large q if $K = F_{q}$ ). First, for the finite field case $K = F_{q}$ , can we bound $k_{F_{q}}$ by $C k_{C}$ where C is a constant independent of the degree d? For the values of (n,d) for which explicit values of $k_{C}$ , $k_{R}$ and $k_{F_{q}}$ are known, we always have $k_{C} \leq k_{F_{q}} \leq k_{R}$ . For example, $k_{C} (1, d) = k_{F_{q}} (1, d) = ⌈ (d + 1) / 2 ⌉ \leq d = k_{R} (1, d)$ and $k_{C} (n, 2) = k_{F_{q}} (n, 2) = k_{R} (n, 2) = n + 1$ . Thus, it is plausible to conjecture that $k_{F_{q}} (n, d) \leq k_{R} (n, d)$ , which would imply $k_{F_{q}} \leq 2 k_{C}$ .

Another question is whether we can always obtain positive success probability with only $k_{C}$ queries. We know that $k_{C}$ queries are sufficient to achieve positive success probability for $K = C$ and $R$ , but are they also sufficient for $K = F_{q}$ ? Indeed, if they are, then $k_{F_{q}} \leq 2 k_{C}$ follows immediately. To see this, if there is a point p with rank greater than $2 k_{C}$ , then consider a line through p and a point p′ with rank $k_{C}$ . This line has no other points with rank at most $k_{C}$ , since otherwise p would be of rank no more than $2 k_{C}$ , a contradiction. Therefore, the measure of the set of points with rank $k_{C}$ must be less than a fraction 1/q of the whole space, where q is the field order, which contradicts the assumption that $k_{C}$ queries suffice. Thus, there is no point with rank greater than $2 k_{C}$ ; in other words, if $k_{C}$ queries are sufficient to achieve positive success probability, then $2 k_{C}$ queries are sufficient to achieve success probability 1.

While we considered an algorithm with a bounded working region, it is unclear what is the highest success probability that can be achieved by a general k-query algorithm without this restriction (and in particular, whether fewer than $k_{K}$ queries could suffice to solve the problem with high probability). Indeed, even for the algorithm we proposed in §3b, it remains open to understand what choice of the region S leads to the highest success probability. As mentioned in §4, it would be useful to establish lower bounds on the query complexity of polynomial interpolation over infinite fields. Also, as stated in [2], for the univariate case over finite fields, the algorithm is time efficient since the function Z⁻¹(z), i.e. finding a preimage of elements in the range of Z, is efficiently computable. However, for multivariate cases, it remains open whether there is an analogous efficiency analysis.

Finally, Zhandry has placed the quantum algorithm for polynomial interpolation in a broader framework that includes other problems such as polynomial evaluation and extrapolation [27]. It could be interesting to consider these problems for multivariate polynomials and/or over infinite fields.

Acknowledgements

We thank Charles Clark for encouraging us to consider quantum algorithms for polynomial interpolation over the real numbers. J.C. would also like to thank Jun Yu and Chi-Kwong Li for helpful comments in early discussions of the project.

Data accessibility

This work does not have any experimental data.

Authors' contributions

All authors contributed equally to the original ideas, analytical derivations and final writing of this manuscript, and gave final approval for publication.

Competing interests

We have no competing interests.

Funding

This work received support from the Canadian Institute for Advanced Research, the Department of Defense and the National Science Foundation (grant no. 1526380).

References

1. Gasca M, Sauer T. 2000. Polynomial interpolation in several variables. Adv. Comput. Math. 12, 377–410. (doi:10.1023/A:1018981505752)
2. Childs AM, van Dam W, Hung S-H, Shparlinski IE. 2016. Optimal quantum algorithm for polynomial interpolation. In 43rd International Colloquium on Automata, Languages, and Programming (ICALP 2016). Leibniz International Proceedings in Informatics, vol. 55, pp. 16:1–16:13. Dagstuhl, Germany: Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik. Available at arXiv:1509.09271.
3. Kane DM, Kutin SA. 2011. Quantum interpolation of polynomials. Quantum Inf. Comput. 11, 95–103.
4. Meyer DA, Pommersheim J. 2011. On the uselessness of quantum queries. Theor. Comput. Sci. 412, 7068–7074. (doi:10.1016/j.tcs.2011.06.037)
5. Montanaro A. 2012. The quantum query complexity of learning multilinear polynomials. Inf. Process. Lett. 112, 438–442. (doi:10.1016/j.ipl.2012.03.002)
6. Håstad J. 1990. Tensor rank is NP-complete. J. Algorithms 11, 644–654. (doi:10.1016/0196-6774(90)90014-6)
7. Hillar CJ, Lim L-H. 2013. Most tensor problems are NP-hard. J. ACM 60, 45:1–45:39. (doi:10.1145/2512329)
8. Shitov Y. 2016. How hard is the tensor rank? (http://arxiv.org/abs/1611.01559)
9. Bernardi A, Blekherman G, Ottaviani G. 2015. On real typical ranks. (http://arxiv.org/abs/1512.01853)
10. Blekherman G, Teitler Z. 2015. On maximum, typical and generic ranks. Math. Ann. 362, 1021–1031. (doi:10.1007/s00208-014-1150-3)
11. Harris J. 1992. Algebraic geometry: a first course. Graduate Texts in Mathematics, vol. 133. Berlin, Germany: Springer.
12. Hörmander L. 1983. The analysis of linear partial differential operators: Distribution theory and Fourier analysis. Berlin, Germany: Springer.
13. Palatini F. 1909. Sulle varietà algebriche per le quali sono di dimensione minore dell’ordinario, senza riempire lo spazio ambiente, una o alcune delle varietà formate da spazi seganti. Atti. Accad. Torino 44, 362–374.
14. Adlandsvik B. 1987. Joins and higher secant varieties. Math. Scand. 61, 213–222. (doi:10.7146/math.scand.a-12200)
15. Alexander J, Hirschowitz A. 1995. Polynomial interpolation in several variables. J. Algebr. Geom. 4, 201–222.
16. Hartshorne R. 1977. Algebraic geometry. Graduate Texts in Mathematics, vol. 52. Berlin, Germany: Springer.
17. Comon P, Ottaviani G. 2012. On the typical rank of real binary forms. Linear Multilinear Algebra 60, 657–667. (doi:10.1080/03081087.2011.624097)
18. Causa A, Re R. 2011. On the maximum rank of a real binary form. Ann. Mat. Pura Appl. 190, 55–59. (doi:10.1007/s10231-010-0137-2)
19. Geramita AV, Schenck HK. 1998. Fat points, inverse systems, and piecewise polynomial functions. J. Algebra 204, 116–128. (doi:10.1006/jabr.1997.7361)
20. Landsberg JM, Teitler Z. 2010. On the ranks and border ranks of symmetric tensors. Found. Comput. Math. 10, 339–366. (doi:10.1007/s10208-009-9055-3)
21. Białynicki-Birula A, Schinzel A. 2008. Representations of multivariate polynomials by sums of univariate polynomials in linear forms. Colloq. Mathematicum 112, 201–233. (doi:10.4064/cm112-2-2)
22. Jelisiejew J. 2013. An upper bound for the Waring rank of a form. (http://arxiv.org/abs/1305.6957)
23. Ballico E, De Paris A. 2017. Generic power sum decompositions and bounds for the Waring rank. Discrete Comput. Geom. 57, 1–19. (doi:10.1007/s00454-017-9886-7)
24. Lang S, Weil A. 1954. Number of points of varieties in finite fields. Am. J. Math. 76, 819–827. (doi:10.2307/2372655)
25. Bernstein E, Vazirani U. 1997. Quantum complexity theory. SIAM J. Comput. 26, 1411–1473. (doi:10.1137/S0097539796300921)
26. de Beaudrap JN, Cleve R, Watrous J. 2002. Sharp quantum versus classical query complexity separations. Algorithmica 34, 449–461. (doi:10.1007/s00453-002-0978-1)
27. Zhandry M. 2015. Quantum oracle classification: the case of group structure. (http://arxiv.org/abs/1510.08352)
