Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2013;31(4):350–367. doi: 10.1016/j.exmath.2013.01.004

Formal Laurent series in several variables

Ainhoa Aparicio Monforte a, Manuel Kauers b,
PMCID: PMC4587312  PMID: 26478652

Abstract

We explain the construction of fields of formal infinite series in several variables, generalizing the classical notion of formal Laurent series in one variable. Our discussion addresses the field operations for these series (addition, multiplication, and division), the composition, and includes an implicit function theorem.

MSC 2010: primary, 13F25; secondary, 05A15

Keywords: Formal power series

1. Introduction

The purpose of this article is twofold. In the first part (Sections  2, 3, 4), we explain how to construct fields of formal Laurent series in several variables. This part has an expository flavour. The construction we present is not new; similar constructions can already be found in the literature. However, the justification of their validity is usually kept brief or more abstract than necessary. We have found it instructive to formulate the arguments in a somewhat more concrete and expanded way, and we include these proofs here in the hope that this may help to demystify and popularize the use of formal Laurent series in several variables. The results in the second part (Sections 56) seem to be new. We discuss there the circumstances under which we can reasonably define the composition of multivariate formal Laurent series, and we present a version of the implicit function theorem applicable to multivariate formal Laurent series.

Recall the situation of a single variable. The set Kx of formal power series f(x)=n=0anxn with coefficients in some field K forms an integral domain together with the usual addition and multiplication. Such a series f(x) admits a multiplicative inverse g(x)Kx if and only if a00 (see, e.g.,  [14], [10]). If f(x) is any nonzero element of Kx, then not all its coefficients are zero, and if e is the smallest index such that ae0, then we have f(x)=xeh(x) for some h(x)Kx which admits a multiplicative inverse. The object xeh(x)1 qualifies as a multiplicative inverse of f(x). In the case of a single variable, we may therefore define K((x)) as the set of all objects xeh(x) where e is some integer and h(x) is some element of Kx. Then K((x)) together with the natural addition and multiplication forms a field. This is the field of formal Laurent series in the case of one variable.

The case of several variables is more subtle. The set Kx,y of formal power series f(x,y)=n,k=0an,kxnyk in two variables x,y with coefficients in K also forms an integral domain, and it remains true that an element f(x,y)Kx,y admits a multiplicative inverse if and only if a0,00. But in general, it is no longer possible to write an arbitrary power series f(x,y) in the form f(x,y)=xe1ye2h(x,y) where h(x,y)Kx,y admits a multiplicative inverse in Kx,y. As an example, consider the series f(x,y)=x+y=x1y0+x0y1Kx,y. If we want to write f(x,y)=xe1ye2h(x,y) for some h(x,y)Kx,y, we have h(x,y)=x1e1ye2+xe1y1e2. In order for h(x,y) to have a nonzero constant term, we can only choose (e1,e2)=(1,0) or (e1,e2)=(0,1), but for these two choices, h(x,y) is 1+x1y or xy1+1, respectively, and none of them belongs to Kx,y.

There are at least three possibilities to resolve this situation. The first and most direct way is to consider fields of iterated Laurent series  [17, Chapter 2], for instance the field K((x))((y)) of univariate Laurent series in y whose coefficients are univariate Laurent series in x. Clearly this field contains Kx,y, and the multiplicative inverse of x+y in K((x))((y)) is easily found via the geometric series to be

1x+y=1/x1(y/x)=n=0(1)nxn1yn.

Of course, viewing x+y as an element of K((y))((x)) leads to a different expansion.

The second possibility is more abstract. This construction goes back to Malcev  [11] and Neumann  [13] (see  [15], [17] for a more recent discussion). Start with an abelian group G (e.g., the set of all power products x1i1xpip with exponents i1,,ipZ and the usual multiplication) and impose on the elements of G some order which respects multiplication (see Section  3 below for definitions and basic facts). Define K((G)) as the set of all formal sums

a=gGagg

with agK for all gG and the condition that their supports supp(a){gGag0} contain no infinite strictly -decreasing sequence. If addition and multiplication of such series are defined in the natural way, it can be shown that K((G)) is a field (cf. Theorem 5.7 in  [13] or Corollary 3.1–11 in  [17]).

The third possibility is more geometric and goes back to MacDonald  [12]. He considers formal infinite sums of terms of the form ai1,,ipx1i1xpip where the exponent vectors (i1,,ip) are constrained to some fixed cone CRp. It turns out that for every cone C not containing a line, these series form a ring (Theorem 10 below; see Section  2 below for definitions and basic facts concerning cones). MacDonald shows using a multivariate generalization of the Newton–Puiseux method that for every polynomial f(x1,,xp,y)K[x1,,xp,y] one can find a cone such that the corresponding ring contains a series g(x1,,xp) (possibly with fractional exponents) such that f(x1,,xp,g(x1,,xp))=0. The rings of MacDonald are not fields, but Aroca, Cano and Jung  [2], [3] observe that a field can be obtained by taking the union of all the rings for some suitable collection of shifted cones (similar to Theorem 15 below). Again allowing fractional exponents, Aroca et al. show that the fields constructed in this way are even algebraically closed. Their elements can thus be considered as the natural multivariate generalizations of Puiseux series.

The construction we give below is, in a sense, a mixture of the approach by Malcev and Neumann on the one hand, and MacDonald–Aroca–Cano–Jung on the other hand. Our goal was to keep the geometric intuition inherent to the latter while at the same time avoiding any technical considerations related to Newton polygons. Our construction is more specific than Malcev–Neumann’s in that we do not consider arbitrary groups as carriers of the series, and it is more specific than MacDonald–Aroca–Cano–Jung’s in that we do not consider rational exponents. Our series are thus formal infinite sums of terms of the form ai1,,ipx1i1xpip where (i1,,ip) ranges over (some suitable subset of) Zp. A need to reason about such series arises for instance in lattice path counting (see, e.g.,  [6] and references given there), in Ehrhart’s theory of counting integer points in polytopes (see, e.g.,  [5] and references given there), or in MacMahon’s theory of integer partitions (see, e.g.,  [1] and references given there). We want to promote them as natural multivariate generalization of the notion of formal Laurent series.

2. Cones

In general, a cone CRp is a set with the property that whenever uC and c0, then cuC. The cones we consider here have the following special properties.

Definition 1

A cone CRp is called

  • 1.
    finitely generated if there exist v1,,vnRp such that
    C={z1v1+z2v2++znvnz1,z2,,zn0}.
    In this case {v1,,vn} is called a generating set for C.
  • 2.
    rational if it is finitely generated and has a generating set
    {v1,,vn}Zp.
  • 3.

    line-free if for every vC{0} we have vC.

Since we will be only considering rational finitely generated cones in this article, we drop these attributes from now on and only say “cone”. With this convention, cones are obviously closed, they obviously all contain 0, and they are obviously unbounded or equal to {0}. It is also easy to see that all cones are convex (i.e., for all u,vC and for all c[0,1] we have cu+(1c)vC as well), and that u,vC implies u+vC. Finally, when C,D are cones, then so is C+D={u+vuC,vD}. The following facts are less obvious, but also well-known.

Proposition 2

LetKRpbe a closed and convex set.

  • 1.

    Kis unbounded if and only if there existu,vRpwithv0such thatu+cvKfor allc0(i.e.,Kcontains a ray).

  • 2.

    LetwRpandR={cwc0}. Then for allu,vKwe haveu+RKv+RK.

Proof

See statements 1 and 2 in Section 2.5 of Grünbaum [9]. □

In order to give a meaning to an operation (e.g., multiplication) for formal infinite series, we will ensure that every coefficient of the result (e.g., the product) depends only on finitely many coefficients of the operands (e.g., the factors). For some of the operations defined below, it turns out that this property can be shown using the following two lemmas.

Lemma 3

LetCRpbe a cone andARpbe a closed and convex set withCA={0}. Then for everyaRp, the setC(a+A)is bounded.

Proof

Fix aRp and set K=C(a+A)Rp. Assume that K is unbounded. Since C and A are closed and convex, K is also closed and convex, and Proposition 2(1) implies the existence of u,vRp with v0 and u+cvK for all c0. We show that vCA={0} in order to arrive at a contradiction.

Indeed, with c=0 it first follows that uKC. Since also 0C, it follows from Proposition 2(2) that cvC for all c0. In particular vC.

Similarly, Proposition 2(2) applied to the convex set a+A and the points uKa+A and aa+A imply a+cva+A for all c0. Therefore a+va+A, and finally vA. □

Lemma 4

Let CRp be a line-free cone and SCZp . Then there exists a finite subset {s1,,sn} of S such that Si=1n(si+C) .

Proof

If C is the cone generated by the unit vectors (i.e., CZp=Np), then this is the classical Dickson Lemma  [8], [4].

The general case is reduced to this situation as follows. Let {v1,,vk}Zp be a set of generators of C. Then each sS can be written as s=s1v1++skvk for some nonnegative s1,,skQ. Setting nisi(i=1,,k), we have s=n1v1++nkvk+c for some cZp which is a linear combination of the vi with coefficients in [0,1].

Since {z1v1++zkvk:z1,,zk[0,1]} is a bounded set, its intersection with Zp is finite, say {c1,,c}. For a fixed vector c, let NcNk be the set of all vectors (n1,,nk)Nk such that n1v1++nkvk+cS. Then by the original Dickson Lemma, for each of these sets Nc there is a finite subset BcNc such that for all (n1,,nk)Nc there exists (b1,,bk)Bc with (n1,,nk)(b1,,bk)+Nk, viz.  nibi0 for i=1,,k. Then, since C is a cone, we also have n1v1++nkvk+cb1v1++bkvk+c+C.

It finally follows that the finite set i=1{b1v1++bkvk+ci:(b1,,bk)Bci} has the desired property. □

3. Orders

Definition 5

A total order on Zp is called additive if for all i,j,kZp we have

iji+kj+k.

An additive order is called compatible with a cone CRp if 0k for all kCZp.

We take the freedom to write ij instead of ji, and Inline graphic instead of ijij, and similar shorthand notations.

The additivity of an order implies that when v,wZp are such that v,w0, then also av+bw0 for every nonnegative a,b. Note that this is not only true for integers a,b but also for any rational numbers a,b for which av+bwZp. The reason is that for an additive order we have v0dv0 for every positive integer d, which allows us to clear denominators.

Example 6

  • 1.
    Let n=(n1,,np)Rp be some vector whose components are linearly independent over Q. For i,jZn, define
    injinjn,
    where refers to the standard scalar product on Rp. Then n is an additive order.
    Geometrically, inj can be interpreted as follows. The affine hyperplane H=i+n divides Rp into two open half spaces, one towards the direction to which n points and one towards the opposite direction. We have inj if and only if j belongs to the half space in the direction of n.

    graphic file with name fx2.jpg

    The requirement that the coordinates of n be linearly independent over Q ensures that n is indeed a total order, for if i,jZp are such that inj and jni, then ni=nj, so n(ij)=0, and hence, since the coordinates of i and j are integers, i=j.

    If C is a line-free cone and nC, then n is compatible with C. Moreover, it follows from Lemma 3 that for every iZp there exist only finitely many jCZp such that jni. This need not be the case for every additive order, as shown in the following example.

  • 2.

    For i,jZp, the lexicographic order is defined by letting ilexj if and only if i=j or the leftmost nonzero coordinate of the vector ij is negative. This is an additive order.

    If C is a cone which contains no vector (i1,,ip) where any of the coordinates i1,,ip1 is negative, then lex is compatible with C. With this order, it may happen that for a fixed iZn there are infinitely many jCZp with jlexi. For instance, if CR2 contains (1,0) and (0,1), then (u,0)lex(0,1) for every uN.

    However, it is still true that lex is a well-founded order on CZn. We show in Lemma 8 that this is true for every additive order.

The following two lemmas contain the key properties regarding cones and additive orders which we will use below. The first of them is straightforward, and the second is a reformulation of Lemma 4.

Lemma 7

LetC,DRpbe cones and letbe an additive order onZp. Let{v1,,vk}be a set of generators forC.

  • 1.

    IfCis compatible with, thenCis line-free.

  • 2.

    Cis compatible withif and only ifvi0for alli.

  • 3.

    IfC,Dare compatible with, thenC+Dis also compatible with.

Proof

  • 1.

    If C={0}, then C is trivially line-free. If C{0}, take some vC{0}. Then v0, because C is compatible with . Then v0, because is an additive order. Since v0, also v0, and hence vC.

  • 2.

    The direction “” is clear because viC for all i. The direction “” follows from the observation made after Definition 5 that v,w0 and a,b0 implies av+bw0.

  • 3.

    Let {w1,,wm} be a generating set for D. Since C+D is generated by v1,,vk,w1,,wm, the claim follows from part 2. □

Lemma 8

Let be an additive order and C be a cone. If is compatible with C , then is a well-founded order on CZp , i.e., every strictly decreasing sequence Inline graphic of elements in CZp terminates, or equivalently, every subset of CZp contains a -minimal element.

Proof

Let SCZp. By Lemma 4, there exists a finite subset {s1,,sn} of S such that Si=1n(si+C). From the assumption that C is compatible with it follows that when v,wZp are such that vw+C, then wv. Therefore, the -minimum of the finite set {s1,,sn} is also the minimum of S whose existence was to be shown. □

4. Construction

Let K be a field and x1,,xp be indeterminates. We consider formal infinite series of the form

f(x)f(x1,,xp)=kakxk

where the sum runs over all k=(k1,,kp)Zp, the ak are elements of K, and xk is a short-hand notation for x1k1x2k2xpkp.

These objects form a vector space over K together with the natural addition and scalar multiplication, for if

f(x)=kakxkandg(x)=kbkxk

then the coefficient of xk in uf(x)+vg(x) is simply uak+vbk, which is clearly an element of K for any fixed u,vK.

Multiplication is more delicate. In the natural definition

f(x)g(x)k(iaibki)xk,

the inner sum ranges over infinitely many elements aibki of K, which is not meaningful in general. To make this summation finite, we restrict the attention to series f(x)=kakxk whose support suppf(x){kZpak0} is contained in a fixed line-free cone.

Definition 9

Let CRp be a line-free cone. Then we define the set

KCx{f(x)suppf(x)C}.

Using Lemma 3, it can be shown that every coefficient in the product of two elements of KCx is determined by a sum with only finitely many nonzero terms. The support of the product is again contained in C, as is the support of the sum of two elements of KCx. Therefore, we have the following theorem.

Theorem 10

KCxtogether with the natural addition and multiplication forms a ring.

Proof

To see that multiplication is well defined, we need to show that for every kZp there exist only finitely many iZp such that iC and kiC. Since C is line-free, we have CC={0}. We can therefore apply Lemma 3 to C,A=C and a=k, and obtain that C(kC) is bounded. A bounded subset of Rp can only contain finitely many points with integer coordinates, so C(kC)Zp is finite, and this is what was to be shown.

To see that KCx is closed under multiplication, consider some kZp. In order for the coefficient of some term xk in the product of two elements of KCx to be nonzero, there must be at least one iC such that kiC as well. Since C is a cone and cones are closed under addition, kC.

Closure under addition is obvious, and it is also obvious that the neutral elements 0 and 1=x0 belong to KCx. □

When C is the cone consisting of all vectors with nonnegative components, KCx is the usual ring Kx of formal power series in x1,,xp. This ring is an integral domain, and a series f(x)Kx admits a multiplicative inverse if and only if its constant term is different from zero. Both properties generalize to rings KCx for arbitrary line-free cones C. The proof ideas are the same as for the usual formal power series ring Kx.

Theorem 11

IfCRpis a line-free cone, thenKCxis an integral domain.

Proof

Let f(x)=kakxk and g(x)=kbkxk be two nonzero elements of KCx. This means both suppf(x) and suppg(x) are nonempty. Let h(x)=f(x)g(x). We show that h(x) is not zero, i.e., that supph(x) is not empty.

Fix some additive order on Zp which is compatible with C and let mmin(suppf(x)+suppg(x)). If usuppf(x) and vsuppg(x) are such that u+v=m, then we necessarily have u=minsuppf(x) and v=minsuppg(x), because Inline graphic or Inline graphic would imply

Proof

Therefore, the coefficient of xm in h(x) is

iaibmi=isuppf(x)jsuppg(x)i+j=maibj=aubv0,

because au0 and bv0. □

Theorem 12

Let CRp be a line-free cone and f(x)=kakxkKCx . Then there exists g(x)KCx with f(x)g(x)=1 if and only if a00 .

Proof

Assume that a0=0. We show that no multiplicative inverse of f(x) exists. Indeed, if g(x)=kbkxk is any element of KCx then the coefficient of x0 in the product f(x)g(x) is a0b0=0, while for a multiplicative inverse we would need a0b0=1.

Assume now a00. We show that a multiplicative inverse of f(x) exists. Fix an additive order compatible with C. Let g(x)=kbkxk be a series with undetermined coefficients bk. Set b0=1/a0, which is possible because a00. Then the constant term of f(x)g(x) is 1, regardless of the values of the other bk. We now show by noetherian induction on k that there is a unique way to choose the coefficients bi for Inline graphic such that the coefficient of xi in f(x)g(x)1 is equal to 0 for all ik.

Assume as induction hypothesis that this is true for all i with Inline graphic. Then for the coefficient of xk in f(x)g(x) we have

Proof

Since a00 and all the bki on the right hand side are uniquely determined by induction hypothesis, we can (and have to) take Inline graphic. With this (and only this) choice, the coefficient of xk in f(x)g(x) will be zero, as desired. □

In the univariate case, if the constant term of some nonzero series f(x)Kx is zero, we can write f(x)=xeh(x) for some eN and h(x)Kx with h(0)0. Then h(x) has a multiplicative inverse and we find xeh(x)1 as the multiplicative inverse of f(x) if we allow terms with negative exponents. Defining formal Laurent series via K((x))eZxeKx therefore already leads to a field.

In the multivariate case, it is not always possible to write a given f(x)KCx in the form f(x)=xeh(x) for some h(x)KCx, as already illustrated in the introduction. But in cases where this is not possible, we can still write f(x) in the desired form if we allow h(x) to belong to an enlarged ring KCx for a suitably chosen line-free cone C containing the original cone C. Then h(x) has a multiplicative inverse in this ring by Theorem 12, and we can regard xeh(x)1 as the multiplicative inverse of f(x).

Example 13

Consider the power series f(x,y)=x+yKx,y from the introduction. (C is the cone generated by the two unit vectors (1,0) and (0,1) here.) This series can also be viewed as an element of KCx,y, where C is the cone generated by (1,0) and (1,1). Then we have f(x,y)=x1y0h(x,y) with h(x,y)=1+x1y1KCx,y. In this ring, h(x,y) has a multiplicative inverse, and therefore we can regard x1y0h(x,y)1 as the multiplicative inverse of f(x,y).

If a collection of rings is such that for any two rings R1,R2 from the collection, the collection contains some other ring R3 with R1R3 and R2R3, and if for any two rings R1,R2 from the collection, the respective addition and multiplication of these rings coincide on R1R2, then the union over all the rings from the collection forms again a ring in a natural way.

We can therefore make the following definition.

Definition 14

Let be an additive order on Zp. Then we define the sets

KxCCKCxandK((x))eZpxeKx,

where C is the set of all cones CRp which are compatible with .

Theorem 15

If is an additive order on Zp , then Kx is a ring and K((x)) is a field.

Proof

To see that Kx is a ring, consider two rings KC1x,KC2x from the collection, i.e., consider two cones C1,C2 that are compatible with . By Lemma 7, the cone C3C1+C2 is also compatible with , so KC3x also appears in the union. Furthermore, it is clear that addition and multiplication of the rings KC1x and KC2x agree on KC1xKC2x. This shows that Kx is well-defined as a ring.

To see that K((x)) is a field, consider two nonzero elements f(x),g(x)K((x)). We show that their sum, their product, and the multiplicative inverse of f(x) also belong to K((x)). Let A,BRp be cones compatible with and let a,bZp be such that f(x)=xaa(x) and g(x)=xbb(x) for some a(x)KAx and b(x)KBx. Then f(x)g(x) belongs to xa+bKA+Bx and f(x)+g(x) belongs to xmin(a,b)KCx where C is the cone generated by a generating set of A, a generating set of B, and the single vector max(a,b)min(a,b)0. Note that A+B and C are compatible with by Lemma 7. It follows that K((x)) is closed under addition and multiplication.

As for the multiplicative inverse, let f(x)0 and eminsuppf(x). This minimum exists by Lemma 8 and because suppf(x) is nonempty for nonzero f(x). Let {s1,,sn}suppf(x)A be a finite set such that suppf(x)i=1n(si+A). Such a finite set exists by Lemma 4. Let C be the cone generated by a generating set of A, and s1e,,sne. By the choice of e, we have sie0 for all i, so by Lemma 7, the cone C is compatible with . Now we can write f(x)=xeh(x) for some h(x)KCx with nonzero constant term. By Theorem 12 there exists a multiplicative inverse h(x)1KCxKx. Hence f(x)1=xeh(x)1 belongs to K((x)), as claimed. □

Example 16

Consider the univariate polynomial f(x)=1+x. The only two additive orders on Z are the natural order and its reverse.

With respect to the natural order , the smallest exponent of f(x) is 0, so f(x) has a multiplicative inverse in Kx=Kx. Its coefficients can be determined following the proof of Theorem 12, the result being

f(x)1=1x+x2x3+x4x5+.

Let now 1 denote the reversed order, i.e., i1jji. Then the smallest exponent of f(x) with respect to this order is 1. Write f(x)=x(1+x1). The smallest exponent of 1+x1 with respect to 1 is 0, so this series does have a multiplicative inverse in K1x=Kx1. Its first terms are 1x1+x2x3+. Consequently, the multiplicative inverse of f(x) in K1((x)) reads

f(x)1=x1x2+x3x4+.

More generally, the various possible series expansions of a multivariate rational function r(x)=u(x)/v(x)K(x) can be obtained as follows. An exponent vector esuppv(x)Zp qualifies as a minimal element if there exists an affine hyperplane HRp which contains e and which is furthermore such that all other elements of suppv(x) belong to the same of the two open half spaces defined by H. Geometrically, these points e are the corner points in the convex hull of suppv(x). For each such corner point e, the cone C generated by the elements of suppv(x)e is line-free, and there exists a series expansion of r(x) in xeKCx.

The coefficients in these series expansions all satisfy a multivariate linear recurrence equation with constant coefficients, which can be read off from the denominator polynomial. In the univariate case, also the converse is true: every sequence satisfying a linear recurrence with constant coefficients is the coefficient sequence of a series expansion of a rational function. The latter implication is no longer valid in the case of several variables. As worked out by Bousquet-Mélou and Petkovšek  [7], a multivariate power series whose coefficient sequence satisfies a linear recurrence equation with constant coefficients need not be rational, not even algebraic, not even differentially algebraic.

5. Composition

Our next goal is to understand the composition of multivariate Laurent series. In order to formulate the results, it is convenient to adopt the following notation. If f(x)=kakxk is any series, then for any fixed kZp we write [xk]f(x)ak for the coefficient of xk in f(x). Furthermore, if is an additive order and f(x) a nonzero series, we call lexpf(x)minsuppf(x)Zp the leading exponent of f(x), and ltf(x)xlexpf(x) the leading term. We may omit the subscript when the order is clear from the context.

For two univariate formal power series f(x)=k=0akxk and g(x), it is natural to define the composition f(g(x)) as the power series k=0akg(x)k. The latter expression is meaningful provided that g(0)=0 because in this case, g(x)=xh(x) for some power series h(x), and g(x)k=xkh(x)k has zero coefficients for all terms of degree less than k. Therefore, for every nN the coefficient of xn in k=0akg(x)k is in fact the coefficient of xn in the finite sum k=0nakg(x)k. Neumann  [13, Theorem 4.7] and Xin [17, Theorem 3-1.7] prove generalized versions of this criterion for compositions f(g) where f(x)Kx is a univariate power series in the usual sense and g is a Malcev–Neumann series.

We are interested here more generally in compositions f(g1(x),,gm(x)) where f(y) and the gi(x) are formal Laurent series in several variables as defined above. In order to formally define them, fix an additive order on Zp, let U be any set and consider a function c:UK((x)) with the following two properties.

  • 1.

    For all kZp, the set {uUksuppc(u)} is finite.

  • 2.

    There exists a line-free cone CRp such that suppc(u)C for all uU.

We can then define h(x)uUc(u) as the unique element of KCx whose coefficient of xk is equal to uU[xk]c(u) for all kZp. The first requirement ensures that this sum is finite for every kZp, and the second one ensures that the support of h(x) is contained in C.

Composition of Laurent series can be viewed as a special case of the construction just described: let be an additive order on Zq and an additive order on Zp, let f(y)=kakykK((y)) and g1(x),,gq(x)K((x)), let Usuppf(y)Zq and define

c:UK((x)),c(k)akg1(x)k1gq(x)kq.

Then the composition f(g1(x),,gq(x))K((x)) is defined as the sum uUc(u), provided that this sum exists in the sense defined before.

The main result of this section is the following sufficient condition for the existence of the composition.

Theorem 17

LetCRqbe a line-free cone andf(y)KCy. Letbe an additive order onZpandg1(x),,gq(x)K((x)){0}. LetMZp×qbe the matrix whosei-th column consists of the leading exponentlexp(gi(x))(i=1,,q). LetCRpbe a cone containingMC{MxxC}Rpas well assupp(gi(x)/lt(gi(x)))fori=1,,q. Suppose thatCkerM={0}and thatCis line-free. Thenf(g1(x),,gq(x))is well-defined and belongs to the ringKCx.

Proof

We show (1) that for every fixed kZp there are only finitely many (u1,,uq)Zq with ksupp(g1(x)u1gq(x)uq), and (2) that supp(g1(x)u1gq(x)uq)C for all (u1,,uq)suppf(y).

For the second requirement, first observe that the gi(x)/lt(gi(x)) are elements of KCx with nonzero constant term. Therefore, by Theorem 10, Theorem 12, also (g1(x)lt(g1(x)))u1(gq(x)lt(gq(x)))uq is an element of KCx for every choice (u1,,uq)Zp. Second, because of MCC, the exponent vector of the term lt(g1(x))u1lt(gq(x))uq belongs to C for every (u1,,uq)suppf(y)C, so the term itself belongs to KCx. Using once more that KCx is a ring, it follows that supp(g1(x)u1gq(x)uq)C for every (u1,,uq)suppf(y).

For the first requirement, let kZp. Then by Lemma 3 with k,C and C playing the roles of a,C and A, there are only finitely many nZp such that kn+C. For some fixed nZp, consider the set {uCZqMu=n}. If this set is empty, it is trivially finite. If not, fix an element w from the set. Then every other element u of the set can be written as u=w+v for some vkerM: if u,u are two elements of the set, then Mu=Mu=n, so uukerM. Therefore we can write {uCMu=n}=C(w+kerM). By Lemma 3 with w,C and kerM playing the roles of a,C and A, it follows that the set contains only finitely many integer points.

Altogether, we have shown that for all kZp there are only finitely many uCZq such that kMu+C. The claim follows because

supp(g1(x)u1gq(x)uq)Mu+C

by definition of M and C. □

Example 18

  • 1.

    The classical condition for the composition of two power series in a single variable is contained as a special case in Theorem 17. In this case, C and C are the positive halfline and M is a 1×1-matrix whose single entry is positive if and only if g(0)=0 if and only if CkerM={0}.

  • 2.
    Consider a power series f(x)Kx1 with negative exponents (i.e., C is the negative halfline) and let g(x)K((x)){0} be a usual formal Laurent series. Then M is a 1×1-matrix whose single entry is the smallest nonzero exponent appearing in g(x). If this is negative, then MC is the positive halfline, and since also lt(g(x))1g(x)Kx, we can take for C the positive halfline. Therefore f(g(x)) is well-defined. For example, for
    f(x)=12x1+3x24x3+5x46x5+Qx1
    and
    g(x)=x2+x1+1+x+x2+x3+Q((x))
    we have
    f(g(x))=12x2+2x3+3x46x5x6+12x7+Q((x)).
  • 3.
    As an example with several variables, let CR2 be the cone generated by (11) and (01), and let f(x,y)KCx,y. Let be the lexicographic order with xy, and let g1(x,y)=x+y,g2(x,y)=1/(x+y)=x1x2y+x3y2+K((x,y)). Then lt(g1(x,y))=x=x1y0 and lt(g2(x,y))=x1=x1y0, so
    M=(1100).
    The kernel of M is the vector space generated by (11), therefore CkerM={0}.
    MC is the cone generated by M(11)=(20) and M(01)=(10), and the supports of
    g1(x,y)/lt(g1(x,y))=1+x1y2,
    g2(x,y)/lt(g2(x,y))=1x1y+x2y2+
    belong to the cone generated by (12) and (11). We can therefore take for C the cone generated by (12) and (10). Note that this is indeed a line-free cone.

    In conclusion, the composition f(g1(x,y),g2(x,y)) is well-defined.

In Theorem 17, no restrictions are imposed on the series f(y)KCy into which the gi(x) are plugged: if the theorem allows the composition of some fixed set of gi(x)K((x)) into some fixed element f(y) of KCy, then it also allows the composition of these gi(x) into any other element of KCy. We can therefore consider a map Φ:KCyK((x)) which to every f(y)KCy assigns the composition f(g1(x),,gq(x))K((x)). We show next that this map preserves the ring structure, a fact that is not too surprising but also not entirely trivial.

Theorem 19

LetCRqbe a line-free cone,an additive order onZp, andg1(x),,gq(x)K((x)){0}. LetMZp×qandCRpbe defined as in   Theorem  17, and assume, also as in   Theorem  17, that CkerM={0} and that C is line-free. Then the map

Φ:KCyKCx,f(y)f(g1(x),,gq(x))

is a ring homomorphism.

Proof

It is clear that Φ(1)=1, and it is also easy to verify that Φ(a(y)+b(y))=Φ(a(y))+Φ(b(y)) for all a(y),b(y)KCy. We show the case of multiplication in some more detail.

Let a(y)=nanyn,b(y)=nbnynKCy. Then Φ(a(y)),Φ(b(y)), and Φ(a(y)b(y)) all belong to KCx. We show that for all nZpC we have [xn]Φ(a(y)b(y))=[xn]Φ(a(y))Φ(b(y)).

As shown in the proof of Theorem 10, for every kZpC, the set

Ik{iZpiC  and  kiC}ZpC

is finite. Furthermore, as shown in the proof of Theorem 17, for every iZp, the set

Ui{(u1,,uq)isupp(g1(x)u1gq(x)uq)}Zq

is finite.

Now fix an arbitrary nZpC and let UiInUiZq. Then U is finite and we have UnU because 0In. By the definition of multiplication and composition, and because [xn]g1(x)k1gq(x)kq=0 whenever (k1,,kq)U, we can write

[xn]Φ(a(y)b(y))=[xn]k(iakibi)gk(x)=kU(iIkakibi)[xn]gk(x),

where gk(x) is a shorthand notation for g1(x)k1gq(x)kq. Furthermore, for i,jZpC with i+j=n, we can write

[xi]Φ(a(y))=[xi]kakgk(x)=kUiak[xi]gk(x)=kUak[xi]gk(x)

and

[xj]Φ(b(y))=[xj]kbkgk(x)=kUjbk[xj]gk(x)=kUbk[xj]gk(x).

Consequently,

[xn]Φ(a(y))Φ(b(y))=jIn([xnj]Φ(a(y)))([xj]Φ(b(y)))=jIn(kUak[xnj]gk(x))(kUbk[xj]gk(x))=jInkUiIkaki[xnj]gki(x)bi[xj]gi(x)=kUiIk(akibijIn[xnj]gki(x)[xj]gi(x))=kUiIkakibi[xn]gk(x)=[xn]Φ(a(y)b(y)),

where in the fifth step we have used gki(x)gi(x)=gk(x) and the definition of multiplication. All the performed operations are legitimate because all the sums involved are finite. □

One of the consequences of Theorem 19 is an alternative way to determine the coefficients of a multiplicative inverse of a series h(x)KCx with [x0]h(x)=1. For the univariate series f(y)=k0ykKy we know (1y)f(y)=1. Applying Theorem 19 to g(x)=1h(x) gives Φ(1y)Φ(f(y))=1, so

h(x)1=k0(1h(x))k.

Therefore, in order to determine the coefficient of some term xe in h(x)1 we can simply choose a term order compatible with C and sum up all the powers (1h(x))k for which klexp(h(x))e. The coefficient of xe in h(x)1 is then equal to the coefficient of xe in this sum.

6. Equations

Finally, we consider the question under which circumstances an equation f(x,y)=0 can be solved for y in some field of multivariate Laurent series. The results below are variants of the implicit function theorem answering this question. For better readability, we have split the derivation into two theorems, the first serving as a lemma used in the proof of the second. The proof of Theorem 20 follows closely one of the many proofs of the classical implicit function theorem  [16]. In Theorem 21 we then relax the hypothesis by making use of the fact that K((x)) is a field.

Theorem 20

LetCRpbe a line-free cone, and let

f(x,y)=k=0ak(x)ykKCxy

be such that[x0]a0(x)=0anda1(x)=1. Then there exists exactly oneg(x)KCxwith[x0]g(x)=0andf(x,g(x))=0.

Proof

First observe that the composition f(x,g(x)) is legitimate for every g(x)KCx whose constant term is zero. For g(x)=0 this is obvious, and for g(x)0 it follows from Theorem 17 as follows. Regard f(x,y) as an element of KC×Hx,y, where HR denotes the positive halfline. Note that C×HRp+1 is a line-free cone. Taking g1(x)=x1,,gp(x)=xp, and gp+1(x)=g(x), we have M=(I,e)Zp×(p+1) where I is the identity matrix of size p and e=lexp(g(x)). Since H is generated (as cone) by 1, kerM is generated (as vector space) by (e,1), and e belongs to C, we have kerM(C×H)={0}, as required. Because of Theorem 15, there exists a cone CRp containing C and supp(g(x)/lt(g(x))), and since eC implies M(C×H)=C, this cone C also contains MC, as required.

Turning to the claim of the theorem, fix some additive order on Zp which is compatible with C. Consider an ansatz g(x)=kbkxkKCx with b0=0 and otherwise undetermined coefficients bk. We show by noetherian induction that there is precisely one way of choosing the coefficients bk such that [xn]f(x,g(x))=0 for all n0.

Let n0 and suppose as induction hypothesis that the claim is true for every kZp with Inline graphic. The coefficient of xn in f(x,g(x)) is

bn+[xn]a0(x)+k=2[xn]ak(x)g(x)k.

The terms [xn]ak(x)g(x)k only depend on coefficients bk with Inline graphic, because lexp(ak(x))0 and k2 and Inline graphic together imply

Proof

for every kn, and the expression on the left hand side denotes the smallest possible exponent vector for which the corresponding coefficient may depend on bk. By assumption, the coefficients bk for Inline graphic are uniquely determined, and hence in order to have [xn]f(x,g(x))=0, there is one and only one choice for bn, as claimed. □

Theorem 21

Let be an additive order on Zp , let CRp be a cone compatible with , and let

f(x,y)=n=0an(x)ynKCxy

be such that Inline graphic , and lexp(a1(x))lexp(an(x)) for all nN with an(x)0 . Then there exists exactly one g(x)Kx with [x0]g(x)=0 and f(x,g(x))=0 .

Proof

Because of Theorem 11, we have f(x,g(x))=0 if and only if u(x)f(x,g(x))=0 for every u(x)K((x)){0}. It is therefore sufficient to prove the theorem for

f~(x,y)k=0a~k(x)yka1(x)1f(x,y)K((x))y

in place of f(x,y). We show that f~(x,y) satisfies the requirements of Theorem 20. To do so, we need to show that [x0]a~0(x)=0,a~1(x)=1, and that there is some line-free cone C~Rp such that suppa~k(x)C~ for all k0.

Since a~k(x)=a1(x)1ak(x) for all kN by definition, it is immediate that a~1(x)=1, and that a~k(x)=0 for every kN with ak(x)=0. Furthermore, Inline graphic implies Inline graphic, which in turn implies [x0]a0(x)=0. For k2 with ak(x)0, we have by assumption that lexp(a~k(x))=lexp(ak(x))lexp(a1(x))0. Lemma 4 applied to S{lexp(ak(x))k2  with  ak(x)0} yields a finite subset {s1,,sn} of S such that Si=1n(si+C). Let C~ be the cone generated by C, some -compatible cone containing supp(a1(x)1lt(a1(x))), and s1lexp(a1(x)),,snlexp(a1(x)). Then C~ is finitely generated, compatible with (hence also line-free; cf.  Lemma 7), and contains suppa~k(x) for all k2. Therefore, by Theorem 20, there exists exactly one g(x)KC~x with f~(x,g(x))=0. Since Theorem 20 still applies if we replace C~ by any larger cone which is compatible with , it follows that there is exactly one g(x)Kx with f~(x,g(x))=0, as was to be shown. □

The main restriction in the above theorems is that we only allow positive powers of y in f(x,y). We may equivalently allow only negative powers of y, but we have not been able to come up with a version of the implicit function theorem that is applicable to series f(x,y)KCx,y where CRp+1 is such that its projection to the last coordinate is the full real line. Note that there is no such restriction, not even implicitly, in Theorem 17: it may well be possible that f(x,g(x)) can be formed even when f(x,y) contains infinitely many positive and negative powers of y. On the other hand, the following examples show that for such f(x,y) there may be more than one solution g(x) with f(x,g(x))=0, or no solution at all. This indicates that a naive generalization of the implicit function theorem to such series will be false.

Example 22

  • Consider the series
    f(x,y)=n=1(y)n+n=1x2nyn.
    This series belongs to KCx,y where CR2 is the cone generated by (01) and (21). Because of [x0y0]f(x,y)=0 and [x0y1]f(x,y)=10, we might expect that some suitable version of the implicit function theorem guarantees the existence of a unique series g(x)Kx with f(x,g(x))=0. However, it turns out that there are two different solutions:
    g1(x)=x2+x1+x2=x+x2+12x318x5+Kx
    andg2(x)=x2x1+x2=x+x212x3+18x5+Kx,
    where 1+x2=n=0(1/2n)(1)nx2n.
  • Now consider the series
    f(x,y)=n=1(y)n+2n=1x2nyn,
    which belongs to the same ring KCx,y as before.

    Suppose that there is a nonzero g(x) with f(x,g(x))=0. If xe is the leading term of g(x), then in the notation of Theorem 17, we have M=(1e)Z1×2, and MC is the cone generated by e and 2e in R. In order for this cone to be line-free, we must either have e0 and 2e0 or e0 and 2e0. The only candidates for eZ are therefore e=0 or e=1 or e=2.

    But e=0 would imply (01)CkerM, so this case is excluded. Likewise, e=2 would imply (21)CkerM, so this case is excluded as well and the only remaining possibility for a solution g(x) is that its leading term is x1 if we want to use Theorem 17 to secure the existence of f(x,g(x)).

    Make an ansatz g(x)=a1x+Kx for the leading coefficient a1 of g(x). Then g(x)n=a1nxn+ and g(x)n=a1nxn+ for all nN. Therefore, equating coefficients of x1 in
    f(x,g(x))=n=1(g(x))n+2n=1x2ng(x)n=!0
    forces a1+2a11=0, viz. a12=2. Depending on the ground field K, this equation may or may not have a solution. For example, if K=Q, no such a1 exists, and hence no g(x)Qx with f(x,g(x))=0 exists.

Acknowledgement

Both authors were supported by the Austrian Science Fund (FWF) under grant Y464-N18.

Contributor Information

Ainhoa Aparicio Monforte, Email: ainhoaaparicio@gmail.com.

Manuel Kauers, Email: mkauers@risc.jku.at.

References

  • 1.Andrews George E., Paule Peter, Riese Axel. MacMahons partition analysis: the Omega package. European Journal of Combinatorics. 2001;22(7):887–904. [Google Scholar]
  • 2.Aroca Fuensanta, Cano Jose M. Formal solutions of linear PDE’s and convex polyhedra. Journal of Symbolic Computation. 2001;32(6):717–737. [Google Scholar]
  • 3.Aroca Fuensanta, Cano Jose M., Jung Francois R. Proceedings of ISSAC’03. ACM Press; 2003. Power series solutions for non-linear PDE’s; pp. 15–22. [Google Scholar]
  • 4.Becker Thomas, Weispfenning Volker, Kredel Heinz. Springer; 1993. Gröbner Bases. [Google Scholar]
  • 5.Beck Matthias, Robins Sinai. Springer; 2007. Computing the Continuous Discretely: Integer-Point Enumeration in Polyhedra. [Google Scholar]
  • 6.Bousquet-Mélou Mireille, Mishna Marni. Walks with small steps in the quarter plane. Contemporary Mathematics. 2010;520:1–40. [Google Scholar]
  • 7.Bousquet-Mélou Mireille, Petkovšek Marko. Linear recurrences with constant coefficients: the multivariate case. Discrete Mathematics. 2000;225:51–75. [Google Scholar]
  • 8.Dickson Leonard E. Finiteness of the odd perfect and primitive abundant numbers with n distinct prime factors. The American Journal of Mathematics. 1913;35:413–422. [Google Scholar]
  • 9.Grünbaum Branko. John Wiley; 1967. Convex Polytopes. [Google Scholar]
  • 10.Kauers Manuel, Paule Peter. Springer; 2011. The Concrete Tetrahedron. [Google Scholar]
  • 11.Malcev Anatoly I. On embedding of group algebras in a division algebra. Doklady Akademii Nauk SSSR. 1948;60:1499–1501. (in Russian) [Google Scholar]
  • 12.McDonald John. Fiber polytopes and fractional power series. Journal of Pure and Applied Algebra. 1995;104:213–233. [Google Scholar]
  • 13.Neumann Bernhard H. On ordered division rings. Transactions of the American Mathematical Society. 1949;66:202–252. [Google Scholar]
  • 14.Niven Ivan. Formal power series. The American Mathematical Monthly. 1969;76:871–889. [Google Scholar]
  • 15.Passmann Donald S. Wiley-Interscience; 1977. The Algebraic Structure of Group Rings. Reprinted by Dover in 2011. [Google Scholar]
  • 16.Sokal Alan D. A ridiculously simple and explicit implicit function theorem. Seminaire Lotharingien de Combinatoire. 2009;61A [Google Scholar]
  • 17.Guoce Xin, The ring of Malcev–Neumann series and the residue theorem, Ph.D. Thesis, Brandeis University, 2004. arXiv:math.CO/0405133.

RESOURCES