Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2012 Oct;33(7):1449–1466. doi: 10.1016/j.ejc.2012.03.011

Context-free pairs of groups I: Context-free pairs and graphs

Tullio Ceccherini-Silberstein a, Wolfgang Woess b
PMCID: PMC4819043  PMID: 27087724

Abstract

Let G be a finitely generated group, A a finite set of generators and K a subgroup of G. We define what it means for (G,K) to be a context-free pair; when K is trivial, this specializes to the standard definition of G to be a context-free group.

We derive some basic properties of such group pairs. Context-freeness is independent of the choice of the generating set. It is preserved under finite index modifications of G and finite index enlargements of K. If G is virtually free and K is finitely generated then (G,K) is context-free. A basic tool is the following: (G,K) is context-free if and only if the Schreier graph of (G,K) with respect to A is a context-free graph.

1. Introduction

Let G be a finitely generated group and K a (not necessarily finitely generated) subgroup of G. We can choose a finite set AG of generators such that every element of G is of the form g=g1gn, where n0 and g1,,gnA. Thus, A generates G as a semigroup. We shall say that (G,K) is context-free, if–loosely spoken–the language of all words over A that represent an element of K is context-free.

The precise definition needs some preparation. Let Σ be a finite alphabet and ψ:ΣG be a (not necessarily injective) mapping such that A=ψ(Σ) satisfies the above finite generation property for G. Then ψ has a unique extension, also denoted ψ, as a monoid homomorphism ψ:ΣG. Recall that Σ consists of all words w=a1an, where n0 and a1,,anΣ (repetitions allowed). The number n is the length |w| of w. If n=0 this means that w=ϵ, the empty word. This is the neutral element of Σ, and Σ is a free monoid with the binary operation of concatenation of words. The extension of ψ is of course given by

ψ(a1an)=ψ(a1)ψ(an),

where the product on the right hand side is taken in G. Given these ingredients, we shall say that ψ:ΣG is a semigroup presentation of G, referring to the fact that A generates G as a semigroup. A language over Σ is a non-empty subset of Σ.

Definition 1.1

The word problem of (G,K) with respect to ψ is the language

L(G,K,ψ)={wΣ:ψ(w)K}.

We say that the triple (G,K,ψ) is context-free, if L(G,K,ψ) is a context-free language.

A context-free grammar is a quadruple C=(V,Σ,P,S), where V is a finite set of variables, disjoint from the finite alphabet Σ (the terminal symbols), the variable S is the start symbol, and PV×(VΣ) is a finite set of production rules. We write Tu or (Tu)P if (T,u)P. For v,w(VΣ), we write vw if v=v1Tv2 and w=v1uv2, where u,v1,v2(VΣ) and Tu. This is a single derivation step, and it is called rightmost, if v2Σ. A derivation is a sequence v=w0,w1,,wk=w(VΣ) such that wi1wi; we then write vw. A rightmost derivation is one where each step is rightmost. The succession of steps of any derivation TwΣ can be reordered so that it becomes a rightmost derivation. For TV, we consider the language LT={wΣ:Tw}. The language generated by C is L(C)=LS.

A context-free language is a language generated by a context-free grammar. As a basic reference for Language and Automata Theory, we refer to the magnificent monograph of Harrison[6].

The above definition of a context-free pair, or rather triple, (G,K,ψ) makes sense when G is a finitely generated monoid and K is a sub-monoid, but here we are interested in groups. When in addition K={1G}, this leads to the notion of G being a context-free group. In two celebrated papers, Muller and Schupp[11], [12] have carried out a detailed study of context-free groups and more generally, context-free graphs. In particular, context-freeness of a group is independent of the particular choice of the generating set A of G. The main result of [11], in combination with a fundamental theorem of Dunwoody [4], is that a finitely generated group is context-free if and only if it is virtually free, that is, it contains a free subgroup of finite index. (In [11], it is assumed that A=A1 and that ψ:ΣA=ψ(Σ) is one-to-one, but the results carry over immediately to the more general setting where those two properties are not required.)

Previously, Anisimov [1] had shown that the groups whose word problem L(G,{1G},ψ) is regular (see Section 2 for the definition) are precisely the finite groups.

The above mentioned context-free graphs are labelled, rooted graphs with finitely many isomorphism classes of cones. The latter are the connected components of the graph that remain after removing a ball around the root with arbitrary radius. See Section 4 for more precise details. As shown in [12], there is a natural correspondence between such graphs and pushdown automata, which are another tool for generating context-free languages; see Section 3.

Among subsequent work, we mention Pélecq [13] and Sénizergues [16], who studied actions on, resp. quotients of context-free graphs. Group-related examples occur also in Ceccherini-Silberstein and Woess [3].

More recently, Holt et al. [7] have introduced and studied co-context-free groups, which are such that the complement of L(G,{1G},ψ) is context-free, see also Lehnert and Schweitzer [9]. This concept has an obvious extension to co-context-free pairs of groups, resp. graphs, on whose examination we do not (yet) embark.

In the present notes, we collect properties and examples of context-free pairs of groups (G,K).

  • The language L(G,K,ψ) is regular if and only if the index [G:K] of K in G is finite (Proposition 2.4).

  • The property that L(G,K,ψ) is context-free does not depend on the specific choice of the semigroup presentation ψ, so that context-freeness is just a property of the pair (G,K), a consequence of Lemma 3.1.

  • If (G,K) is context-free then L(G,K,ψ) is a deterministic context-free language (see Section 3 for the definition) for any semigroup presentation ψ:ΣG (Corollary 4.8.a).

  • If (G,K) is context-free and H is a finitely generated subgroup of G, then the pair (H,KH) is context-free (Lemma 3.1).

  • If [G:H]< then (G,K) is context-free if and only if (H,KH) is context-free (Proposition 3.3 & Lemma 4.9).

  • If (G,K) is context-free and H is a subgroup of G with KH and [H:K]< then (G,H) is context-free (Lemma 4.9).

  • If K is finite then G is context-free if and only if (G,K) is context-free (Lemma 4.11).

  • If (G,K) is context-free then (G,g1Kg) is context-free for every gG (Corollary 4.8.b).

  • If G is virtually free and K is a finitely generated subgroup of G then (G,K) is context-free (Corollary 5.3).

Several of these properties rely on the following.

  • A fully deterministic, symmetric labelled graph (see Section 2 for definitions) is context-free in the sense of Muller and Schupp if and only if the language of all words which are labels of a path that starts and ends at a given root vertex is context-free (Theorem 4.2, Theorem 4.6).

    The (harder) “if” part is not contained in previous work. It implies the following.

  • The pair (G,K) is context-free if and only if for some ( any) symmetric semigroup presentation ψ:ΣG, the Schreier graph of (G,K) with respect to ψ is a context-free graph. (See again Section 2 for precise definitions).

In a second paper [20], a slightly more general approach to context-freeness of graphs via cuts and tree-sets is given. It allows to show that certain structural properties (“irreducibility”) are preserved under finite-index-modifications of the underlying pair of groups. This is then applied to random walks, leading in particular to results on the asymptotic behaviour of transition probabilities.

In concluding the Introduction, we remark that with the exception of some “elementary” cases, context-free pairs of groups are always pairs with more than one end. Ends of pairs of groups were studied, e.g., by Scott [15], Swarup [18] and Sageev [14]. This leads directly to asking about the interplay between context-freeness of pairs and decomposition as amalgamated products or HNN-extensions. An example at the end of Section 5 shows that there is no immediate answer.

2. Schreier graphs and the regular case

Let Σ be a finite alphabet. A directed graph labelled by Σ is a triple (X,E,), where X is the (finite or countable) set of vertices, EX×Σ×X is the set of oriented, labelled edges and :E(x,a,y)aΣ is the labelling map.

For an edge e=(x,a,y)E, its initial vertex is e=x and its terminal vertex is e+=y, and we say that e is outgoing from x and ingoing into y. If y=x then e is a loop, which is considered both as an outgoing and as an ingoing edge. We allow multiple edges, i.e., edges of the form e1=(x,a1,y) and e2=(x,a2,y) with a1a2, but here we exclude multiple edges where also the labels coincide. The graph is always assumed to be locally finite, that is, every vertex is an initial or terminal vertex of only finitely many edges. We also choose a fixed vertex oX, the root or origin. We shall often just speak of the graph X, keeping in mind the presence of E and .

We call X fully labelled if at every vertex, each aΣ occurs as the label of at least one outgoing edge. We say that X is deterministic if at every vertex all outgoing edges have distinct labels, and fully deterministic if it is fully labelled and deterministic. Finally, we say that X is symmetric or undirected if there is a fixed point free involution aa1 of Σ (i.e., (a1)1=a, excluding the possibility that a1=a) such that for each edge e=(x,a,y)E, also the reversed edge e1=(y,a1,x) belongs to E.

A path in X is a sequence π=e1e2en of edges such that ei+=ei+1 for i=1,,n1. The vertices π=e1 and π+=en+ are the initial and the terminal vertex of π. The number |π|=n is the length of the path. The label of π is (π)=(e1)(e2)(en)Σ. We also admit the empty path starting and ending at a vertex x, whose label is ϵ. Denote by Πx,y=Πx,y(X) the set of all paths π in X with initial vertex π=x and terminal vertex π+=y. The following needs no proof.

Lemma/Definition 2.1

Let (X,E,) be a labelled graph, xX and wΣ. We define Πx(w)={π:π=x,(π)=w}, the set of all paths that start at x and have label w. The set of all terminal vertices of those paths is denoted xw={π+:πΠx(w)}.

Analogously, we define Π¯x(w)={π:π+=x,(π)=w}, the set of all paths that terminate at x and have label w, and write xw={π:πΠ¯x(w)}.

If X is fully labelled, then Πx(w) is always non-empty.

If X is deterministic, then Πx(w) has at most one element, and if that element exists, it is denoted πx(w), while xw just denotes its endpoint.

If X is fully deterministic, then xw is a unique vertex of X for every xX, wΣ.

Finally, if X is symmetric (not necessarily deterministic), then Π¯x(w)=Πx(w1), where for w=a1an, one defines w1=an1a11.

With a labelled, directed graph as above, we can associate various languages. We can, e.g., consider the language

Lx,y=Lx,y(X)={(π):πΠx,y(X)},where x,yX. (1)

Definition 2.2

Let G be a finitely generated group, K a subgroup and ψ:ΣG a semigroup presentation of G. The Schreier graph X=X(G,K,ψ) has vertex set

X=KG={Kg:gG}

(the set of all right K-cosets in G), and the set of labelled, directed edges

E={e=(x,a,y):x=Kg,y=Kgψ(a), where gG,aΣ}.

X is a rooted graph with origin o=K, the right coset corresponding to the neutral element 1G of the group G. The Schreier graph is fully deterministic. It is also strongly connected: for every pair x,yX, there is a path from x to y. (This follows from the fact that ψ(Σ) generates G as a semigroup.) When K={1G} then we write X(G,ψ). This is the Cayley graph of G with respect to ψ, or more loosely speaking, with respect to the set ψ(Σ) of generators.

Note that X can have the loop e=(x,a,x)E with x=Kg. This holds if and only if ψ(a)g1Kg. It can also have the multiple edges e1=(x,a1,y) and e2=(x,a2,y) with x=Kg and a1a2. This occurs if and only if ψ(a2)ψ(a1)1g1Kg. In particular, there might be multiple loops. The following is obvious.

Lemma 2.3

Let K be a subgroup of G and ψ:ΣG be a semigroup presentation of G . Then

L(G,K,ψ)=Lo,o(X)

is the language of all labels of closed paths starting and ending at o=K in the Schreier graph X(G,K,ψ) .

A context-free grammar C=(V,Σ,P,S) and the language L(C) are called linear, if every production rule in P is of the form Tv1Uv2 or Tv, where v,v1,v2Σ and T,UV. If furthermore in this situation one always has v2=ϵ (the empty word), then grammar and language are called right linear or regular.

A finite automaton A consists of a finite directed graph X=(X,E,) with label set Σ and labelling map , together with a root vertex o and a non-empty set FX. The vertices of X are called the states of A, the root o is the initial state, and the elements of F are the final states. The automaton is called (fully) deterministic provided the labelled graph X is (fully) deterministic. The language accepted by A is

L(A)=xFLo,x(X).

If A is deterministic, then for each wL(A) there is a unique path πxFΠo,x(X) such that (π)=w. A state yX is called useful if there is some word wL such that the vertex y lies on a path in xFΠo,x(X) with label w. It is clear that we can remove all useless states and their ingoing and outgoing edges to obtain an automaton which accepts the same language and is reduced: it has only useful states.

It is well known [6, Chapter 2] that a language LΣ is regular if and only if L is accepted by some deterministic finite automaton.

The following, which corresponds to Theorem 1 in [5], generalizes Anisimov’s [1] characterization of groups with regular word problem, and also simplifies its proof, as well as the simpler one of [11, Lemma 1].

Proposition 2.4

Let G be a finitely generated group, K a subgroup and ψ:ΣG a semigroup presentation of G . Then (G,K) has regular word problem with respect to ψ if and only if K has finite index in G .

Proof

Suppose first that the index of K in G is finite. Consider the finite automaton A=(X,o,{o}) where X is the Schreier graph X(G,K,ψ), and the initial and unique final state is o=K (as a vertex of X). Then L(G,K,ψ)=L(A): indeed, wΣ belongs to L(G,K,ψ), i.e. ψ(w)K, if and only if K=Kψ(w). This shows that L(G,K,ψ) is regular.

Conversely, suppose that L=L(G,K,ψ) is regular and accepted by the reduced, deterministic finite automaton A=(X,o,F). For yX there is some word wL such that the vertex y lies on the unique path from o to F with label w. We choose one such w and let wy be the label of the final piece of the path, starting at y and ending at F. We set gy=ψ(wy)1G.

Let gG. There are w,w¯Σ with ψ(w)=g and ψ(w¯)=g1. Thus, ww¯L=L(G,K,ψ), and there is a (unique) path π with label ww¯ from o to some final state. Now consider the initial piece πw of π, that is, the path starting at o whose label is our w that we started with. [Thus, we have proved that such a path πw must exist in X !] Let y be the final state (vertex) of πw. Then clearly wwyL(A), which means that ggy1=ψ(wwy)K. Since ψ(Σ)=G, it follows that

G=yXKgy,

and K has finitely many cosets in G. □

Corollary 2.5

Let G be finitely generated and K a subgroup. Then the property of the pair (G,K) to have a regular word problem is independent of the semigroup presentation of G .

We shall see that the same also holds in the context-free case. Another corollary that we see from the proof of Proposition 2.4 is the following.

Corollary 2.6

Let G be finitely generated and K a subgroup of finite index. Then for any semigroup presentation ψ:ΣG , any reduced deterministic automaton A=(X,o,F) that accepts L(G,K,ψ) has a surjective homomorphism (as a labelled oriented graph with root o ) onto the Schreier graph X(G,K,ψ) . Also, the labelled graph X is fully deterministic.

Proof

Let A=(X,o,F) be deterministic and reduced, as in part 2 of the proof of Proposition 2.4.

Let yX, and recall the construction of the label wy of a path from y to F, and gy=ψ(wy)1G. If v is another path from y to F, and h=ψ(v)1, then we can take wLo,y (which we know to be non-empty) and find that wwy,wvL(G,K,ψ), so that ψ(w)KgyKh. Thus Kgy=Kψ(w)=Kh, and the map κ:XKG,yKgy is well defined. It has the property that when wLo,y, then Kψ(w)=Kgy. The map κ is clearly surjective, and κ(o)=K by construction.

Now let yX and aΣ. Take wLo,y and consider the word wa. Again by part 2 of the proof of Proposition 2.4, there is a unique path πwa in X starting at o with label wa. If y is its final vertex, then there is the edge e=(y,a,z) in X. In this situation, κ(z)=Kψ(wa)=Kgyψ(a)=κ(y)ψ(a). This means that in the Schreier graph, there is the edge with label a from κ(y) to κ(z). Therefore κ is a homomorphism of labelled graphs.  □

The following simple example shows that, in general, the map κ constructed in the proof of the previous corollary is not injective.

Example 2.7

Let G=Z2={1,t} be the group of order two and K={1} the trivial subgroup. Let Σ={a} and consider the presentation ψ:ΣG such that ψ(a)=t. Then L(G,K,ψ)={a2n:n0}.

In Fig. 1 we have represented, from left to right, the Schreier graph X(G,K,ψ) (which is nothing but the Cayley graph of G w.r. to ψ), and two automata A1 and A2. As usual, o denotes the origin, while the sets of final states are F1={o} and F2={o,f}, respectively. We have L(A1)=L(A2)=L(G,K,ψ).

Fig. 1.

Fig. 1

(From left to right) the Schreier graph X(G,K,ψ) described in Example 2.7 and two automata A1 and A2 such that L(A1)=L(A1)=L(G,K,ψ).

3. Pushdown automata

Besides grammars, we shall need another instrument for generating context-free languages. A pushdown automaton is a 7-tuple A=(Q,Σ,Z,δ,q0,Qf,z0), where Q is a finite set of states, Σ the input alphabet as above, Z a finite set of stack symbols, q0Q the initial state, QfQ the set of final states, and z0Z{ϵ} is the start symbol. Finally, the function δ:Q×(Σ{ϵ})×(Z{ϵ})Pfin(Q×Z) is the transition function. Here, Pfin(Q×Z) stands for the collection of all finite subsets of Q×Z.

The automaton works in the following way. At any time, it is in some state pQ, and the stack contains a word ζZ. The automaton reads a word wΣ from the “input tape” letter by letter from left to right. If the current letter of w is a, the state is p and the top (=rightmost) symbol of the stack word ζ is z, then it performs one of the following transitions.

  • (i)

    A selects some (q,ζ)δ(p,a,z), changes into state q, moves to the next position on the input tape (it may be empty if a was the last letter of w), and replaces the rightmost symbol z of ζ by ζ, or

  • (ii)

    A selects some (q,ζ)δ(p,ϵ,z), changes into state q, remains at the current position on the input tape (so that a has to be treated later), and replaces the rightmost symbol z of ζ by ζ.

If both δ(p,a,z) and δ(p,ϵ,z) are empty then A halts.

The automaton is also allowed to continue to work when the stack is empty, i.e., when ζ=ϵ. Then the automaton acts in the same way, by putting ζ on the stack when it has selected (q,ζ)δ(p,a,ϵ) in case (i), resp. (q,ζ)δ(p,ϵ,ϵ) in case (ii).

We say that A accepts a word wΣ if starting in the state q0 with only z0 on the stack and with w on the input tape, after finitely many transitions the automaton can reach a final state with empty stack and empty input tape. The language accepted by A is denoted L(A).

The pushdown automaton is called deterministic if for any pQ, aΣ and zZ{ϵ}, it has at most one option what to do next, that is,

|δ(p,a,z)|+|δ(p,ϵ,z)|1.

(Here, || denotes cardinality.)

It is well known [6] that a language is context-free if and only if it is accepted by some pushdown automaton. A context-free language is called deterministic if it is accepted by a deterministic pushdown automaton. We also remark here that a deterministic context-free language L is un-ambiguous, which means that it is generated by some context-free grammar in which every word of L has precisely one rightmost derivation.

The following lemma is modelled after the indications of [11, Lemma 2]. For the sake of completeness, we include the full proof.

Lemma 3.1

Suppose that G,K,Σ and ψ:ΣG are as above. Let H be a finitely generated subgroup of G , and let Σ be another alphabet and ψ:ΣH be such that F=ψ(Σ) generates H as a semigroup.

Then, if L(G,K,ψ) is context-free, also L(H,KH,ψ) is context-free, and if in addition L(G,K,ψ) is deterministic, then so is L(H,KH,ψ) .

Proof

We start with a pushdown automaton A=(Q,Σ,Z,δ,q0,Qf,z0) that accepts L(G,K,ψ).

For each bΣ, there is u(b)Σ such that ψ(b)=ψ(u(b)), and we may choose u(b) to have length 1. Thus,

w=b1bnL(H,KH,ψ)u(b1)u(bn)L(G,K,ψ).

With this in mind, we modify A in order to obtain a pushdown automaton A that accepts L(H,KH,ψ). Our A has to translate any w=b1bn(Σ) into w=u(b1)u(bn)Σ and to use A in order to check whether wL(G,K,ψ).

Let m+1=max{|u(b)|:bΣ}. If m=0 then the only modification of A needed is to replace Σ by its subset Σ and to use the resulting restriction of the transition function.

Otherwise, we set Σm=ΣΣ2Σm. For vΣ+=Σ{ϵ}, we denote by v+ its subword obtained by deleting the first letter. We define Q=Q(Q×Σm) and A=(Q,Σ,Z,δ,q0,Qf,z0) with the transition function δ as follows. For each pQ and zZ,

δ(p,ϵ,z)=δ(p,ϵ,z),
δ(p,b,z)=δ(p,a,z),ifu(b)=aΣ,
δ(p,b,z)={((q,u(b)+),ζ):(q,ζ)δ(p,a,z)},if u(b)aΣ+,
δ((p,v),ϵ,z)={((q,v),ζ):(q,ζ)δ(p,ϵ,z)}{((q,v+),ζ):(q,ζ)δ(p,a,z)},if vaΣ+,δ((p,a),ϵ,z)={((q,a),ζ):(q,ζ)δ(p,ϵ,z)}δ(p,a,z),ifaΣ.

Thus, the new states of the form (p,v) with 1|v|<m serve to remember the terminal parts v of the words u(b), bΣ. This automaton accepts L(G,K,ψ), and it is deterministic, if A has this property. □

Corollary 3.2

Being context-free is a property of the pair (G,K) that does not depend on the specific choice of the alphabet Σ and the map ψ:ΣG for which ψ(Σ) generates G as a semigroup.

Therefore, it is justified to refer to the context-free pair (G,K) rather than to the triple (G,K,ψ). Furthermore, whenever this is useful, we may restrict attention to the case when the graph X(G,K,ψ) is symmetric: we say that ψ is symmetric, if there is a proper involution aa1 of Σ such that ψ(a1)=ψ(a)1 in G. (Again, it is not necessary to assume that ψ is one-to-one, so that we have that a1a even when ψ(a)2=1G.)

Proposition 3.3

Let G be finitely generated, H be a subgroup with [G:H]< . If K is a subgroup of H then (G,K) is context-free if and only if (H,K) is context-free.

Proof

The “only if” is contained in Lemma 3.1. (Observe that H inherits finite generation from G, since [G:H]<.)

For the converse, we assume that (H,K) is context-free and let ψ:ΣH and ψ:ΣG be semigroup presentations of H and G, respectively. There is a pushdown automaton A=(Q,Σ,Z,δ,q0,Qf,z0) that accepts L(H,K,ψ).

Let F be a set of representatives of the right cosets of H in G, with 1GF. Thus, |F|<, and

G=gFHg,

For every gF and bΣ there is a unique g¯=g¯(g,b)F such that gψ(b)Hg¯. Therefore there is a word u=u(g,b)Σ such that

gψ(b)=ψ(u(g,b))g¯(g,b).

An input word w=b1bn is transformed recursively into u1un, along with the sequence g0,g1,,gn of elements of F that indicate the current H-coset at each step:

g0=1G;uk=u(gk1,bk)andgk=g¯(gk1,bk).

Then ψ(w)K if and only if gn=1G and ψ(u1un)K.

Thus, our new automaton A recalls at each step the current coset Hgk1, which is multiplied on the right by ψ(bk), where bk is the next input letter. Then the new coset is Hg¯(gk1,bk), and A simulates what A does next upon reading u(gk1,bk). Then w is accepted when at the end the coset is H=H1G and A is in a final state.

The simple task to write down this automaton in detail is left to the reader. □

4. Context-free graphs

In this section, we assume that (X,E,) is symmetric. We may think of each pair of oppositely oriented edges (x,a,y) and (y,a1,x) as one non-oriented edge, so that X becomes an ordinary graph with symmetric neighbourhood relation, but possibly multiple edges and loops. If it is in addition fully deterministic, then X is a regular graph, that is, the number of outgoing edges (which coincides with the number of ingoing edges) at each vertex is |Σ|. Attention: if we consider non-oriented edges, then each loop at x has to be counted twice, since it corresponds to two oriented edges of the form (x,a,x) and (x,a1,x). For all our purposes it is natural to require that X is connected: for any pair of vertices x,y there is a path from x to y. The distance d(x,y) is the minimum length (number of edges) of a path from x to y, which defines the integer-valued graph metric. A geodesic path is one whose length is the distance between its endpoints.

We select a finite, non-empty subset F of X and consider the balls B(F,n)={x:d(x,F)n} (where d(x,F)=min{d(x,y):yF}). If we delete B(F,n) then the induced graph XB(F,n) will fall apart into a finite number of connected components, called cones with respect to F. Each cone is a labelled, symmetric graph C with the boundary C consisting of all vertices x in C having a neighbour outside C (i.e., in B(F,n)).

The following notion was introduced in [12] for symmetric, labelled graphs and F={o}.

Definition 4.1

The graph X is called context-free with respect to F if there is only a finite number of isomorphism types of the cones with respect to F as labelled graphs with boundary.

This means that there are finitely many cones C1,,Cr (generally with respect to different radii n) such that for each cone C, we can fix a bijection ϕC from (the vertex set of) C to precisely one of the Ci, this bijection sends C to Ci, and (x,a,y) is an edge with both endpoints in C if and only if its image (ϕC(x),a,ϕC(y)) is an edge of Ci. In this case, we say that C is a cone of type i.

Generally, as in [12], we are interested in the case when F={o} (or any other singleton), but there is at least one point where it will be useful to admit arbitrary finite, non-empty F.

Another natural notion of context-freeness of X with respect to o is to require that the language Lo,o(X) is context-free. We shall see that for deterministic, symmetric graphs this is equivalent with context-freeness with respect to o in the sense of Definition 4.1. One direction of this equivalence is practically contained in [12], but not stated explicitly except for the case of Cayley graphs of groups. The other direction (that context-freeness of Lo,o implies that of the graph) is shown in [12] only for Cayley graphs of groups, which is substantially simpler than the general case treated below in Theorem 4.6.

Theorem 4.2

If the symmetric, labelled graph (X,E,) with label alphabet Σ is context-free with respect to the finite, non-empty set FX , then Lx,y is a context-free language for all x,yX . Furthermore, if the graph X is deterministic, then so is the context-free language Lx,y .

Proof

Just for the purpose of this proof, we write x0,y0 instead of x,y for the vertices for which Lx0,y0 will be shown to be context-free. We may assume without loss of generality that x0,y0 in F. Indeed, if this is not the case, then we can replace F by F=B(F,n), which contains x0 and y0 when n is sufficiently large. The cones with respect to F are also cones with respect to F, so that X is also context-free with respect to F.

Similarly to [12, Lemma 2.3], we construct a deterministic pushdown automaton that accepts Lx0,y0.

We consider also the whole graph X as a cone C0 with boundary F, which we keep apart from the other representatives C1,,Cr of cones.

If C is a cone, then as a component of XB(F,n) for some n0 it must be a successor of another cone C. The latter is the unique component of XB(F,n1) that contains C, when n1, while it is C0=X when n=0. We also call C the predecessor of C.

Different cones of type j{1,,r} may have predecessors of different types. Conversely, a cone C of type i{0,,r} may have none, one or more than one successors of type j, and the number di,j of those successors depends only on i and j. In the representative cone Ci, we choose and fix a numbering of the distinct successors of type j as Ci,jk, k=1,,di,j. If C is any cone with type i then we use the isomorphism ϕC:CCi to transport this numbering to the successors of C that have type j, which allows us to identify the k-th successor of C with type j.

One can visualize the cone structure by a finite, oriented graph Γ with multiple edges and root 0: the vertex set is the set of cone types i{0,,r}, and there are di,j oriented edges, which we denote by ti,jk (k=1,,di,j) from vertex i to vertex j (i0, j1).

Every vertex x of X belongs to the boundary of precisely one cone C=C(x) with respect to F. We define the type i of x as the type of C(x). Under the mapping ϕC, our x corresponds to precisely one element of Ci. We write ϕ(x) for that element, without subscript C, so that ϕ maps X onto iCi. In particular, ϕ(x)=x for every xF.

Let yXF with type j. Then there is i (depending on y) such that every neighbour x of y with d(x,F)=d(y,F)1 has type i, and there is precisely one successor cone Ci,jk of Ci that contains ϕC(x)(y). In this case, we write τ(y)=ti,jk, the second order type of y. Compare with [12]. If y is such that C(y)=C(y) then τ(y)=τ(y).

We now finally construct the required pushdown automaton A. (Comparing with [12], we use more states and stack symbols, which facilitates the description.) The set of states and stack symbols are

Q=i=0rCiandZ=F{ti,jk:i=1,,r,j=0,,r,k=1,,di,j}.

(When di,j=0 then there is no ti,jk.) Note that both sets contain F. In order to generate the language Lx0,y0, where x0,y0F, then we use x0 as the initial state and y0 as the (only) final state. We describe the transition function, which–like Q and Z–does not depend on x0,y0.

We want to read an input word, which has to correspond to the label starting in x0. Inside the subgraph of X induced by F, our A behaves just like that subgraph, seen as a finite automaton.

Outside of F, it works as follows. At the m-th step, the automaton will be in a state that describes the m-th vertex, say x, of that path, by identifying x as above with the element ϕ(x) of Cj, where j is the type of x. The current stack symbol is of the form ti,jk and serves to recall that x lies in the k-th successor cone of type j of a cone with type i. If the next vertex along the path, say y, satisfies d(y,F)=d(x,F)+1, and y has type j then the state is changed to ϕ(y)Cj, and the symbol tj,jk=τ(y) is added to the stack. If d(y,F)=d(x,F), then only the state is changed from ϕ(x) to ϕ(y). Finally, if d(y,F)=d(x,F)1 then the new state is again ϕ(y), while the top symbol on the stack is deleted. Formally, we get the following list of transition rules. If xF=QZ:

δ(x,a,x)={(y,y):(x,a,y)E,yF}{(ϕ(y),xτ(y)):(x,a,y)E,d(y,F)=1}.

If xXF:

δ(ϕ(x),a,τ(x))={(ϕ(y),a,τ(x)τ(y)):(x,a,y)E,d(y,F)=d(x,F)+1}{(ϕ(y),τ(y)=τ(x)):(x,a,y)E,d(y,F)=d(x,F)}{(ϕ(y),ϵ):(x,a,y)E,d(y,F)=d(x,F)1}.

This is a finite collection of transitions, since ϕ() and τ() can take only finitely many different values.

In view of the above explanations, A accepts Lx0,y0. Also, when the graph X is deterministic, then so is A. □

Before proving a converse of Theorem 4.2, we first need some preliminaries, and start by recalling a fact proved in [11], [12], see also Woess [21] and Berstel and Boasson [2].

Lemma 4.3

IfLo,ois context-free then there is a constantMsuch that for each coneCwith respect too, one hasdiam(C)M.

(The diameter is of course taken with respect to the graph metric.) We shall see below how to deduce this, but it is good to know it in advance.

A context-free grammar C=(V,Σ,P,S) is said to have Chomsky normal form (CNF), if (i) every production rule is of the form TUUˆ or Ta, where U,UˆV (not necessarily distinct), resp. aΣ, and (ii) if ϵL(C), then there is the rule Sϵ, and S is not contained in the right hand side of any production rule.

With a slight deviation from [11], we associate with each w=a1anL(C), n2 a labelled (closed) polygon P(w) with length n+1. As a directed graph, it has distinct vertices t0,t1,,tn and labelled edges (ti1,ai,ti), i=1,,n, plus the edge (t0,S,tn). A (diagonal) triangulation of P(w) is a plane triangulation of P(w) obtained by inserting only diagonals. Here, we specify those diagonals as oriented, labelled edges (ti,T,tj), where ti,tj are not neighbours in P(w) and TV. Furthermore, we will never have two diagonals between the same pair of vertices of P(w). (If |w|2 we consider P(w) itself triangulated.) The proof of the following Lemma may help to make the construction of [11] (used for Cayley graphs of groups) more transparent.

Lemma 4.4

If C=(V,Σ,P,S) is in CNF and w=a1anL(C) with n2 then there is a diagonal triangulation of P(w) with the property that whenever (ti,T,tj) is a diagonal edge, then T occurs in a derivation Sw , ji2 and Tai+1aj .

Proof

We start with a fixed derivation Sw, and explain how to build up the triangles step by step. Suppose that TV occurs in our derivation, and that we have a “sub-derivation” TUUˆai+1ak, where U,UˆV. Then there is j{i+1,,k1} such that Uai+1aj and Uˆaj+1ak. In this case, we draw a triangle with three oriented, labelled edges, namely the ‘old’ edge (ti,T,tk) and the two ‘new’ edges (ti,U,tj) and (tj,Uˆ,tk).

If we have the derivation Sa1an, then it uses successive steps of the form TUUˆ with UUˆai+1ak as above. We work through these steps one after the other, starting with ST1Tˆ1, where T1a1ak and Tˆ1ak+1an. The first triangle has the ‘old’ edge (t0,S,tn) and the ‘new’ edges (t0,T1,tk) and (tk,Tˆ1,tn).

At any successive step, we take one of the ‘new’ edges (ti,T,tk), where ki2 and proceed as explained at the beginning, so that we add two ‘new’ edges that make up a triangle together with (ti,T,tk), which is then declared ‘old’. We continue until all derivation steps of the form TUUˆ in our derivation Sw are exhausted. At this point, we have obtained a tiling of triangles that constitute a diagonal triangulation of its outer polygon, whose edges have the form (t0,S,tn) and (ti1,Ui,ti) with UiV, i=1,,n. The only steps of our derivation that we have not yet considered are the terminal ones Uiai. Thus, we conclude by replacing the label Ui of (ti1,Ui,ti) by ai. □

The construction is best understood by considering an example: suppose our rightmost derivation is

ST1Tˆ1T1(T2Tˆ2)T1(T2(T3Tˆ3))T1(T2(T3a6))T1(T2((T4Tˆ4)a6))T1(T2((T4a5)a6))T1(T2((a4a5)a6))T1(a3((a4a5)a6))T1(a3((a4a5)a6))(T5Tˆ5)(a3((a4a5)a6))(T5a2)(a3((a4a5)a6))(a1a2)(a3((a4a5)a6)).

(We have inserted the parentheses to make the rules that we used in each step more visible.) The associated triangulation is as in Fig. 2.

Fig. 2.

Fig. 2

The polygon P(w), with w=a1a2a3a4a5a6, and its (diagonal) triangulation associated with the rightmost derivation S(a1a2)(a3((a4a5)a6)).

The variables of the terminal rules T5a1, Tˆ5a2, T2a3, T4a4, Tˆ4a5 and Tˆ3a6 are not visible in this figure (but we might add them to the boundary edges). Apart from this, one can read the derivation Sw from the diagonalization in a similar way as it can be read from the so-called derivation tree (see e.g. [6, Section 1.6] for the latter).

The following goes back to [11] in the case of (Cayley graphs of) finitely generated groups (recall from Lemma/Definition 2.1 that in case X is deterministic and symmetric, if xX and w=a1a2anΣ, then xw=xan1a21a11X denotes the initial vertex of the path π in X terminating at x with label (π)=w).

Lemma 4.5

LetC=(V,Σ,P,S)be in CNF andL(C)=Lx,y(X), whereXis a deterministic, symmetric graph. Ifw=a1anLx,y(X)and(ti,T,tj)is a diagonal edge in a triangulation ofP(w)as in Lemma 4.4, then the verticesx¯=xa1aiandy¯=xa1ajofXsatisfyd(x¯,y¯)m(T), where

m(T)=min{|w|:wLT}. (2)

Proof

Since X is deterministic, Lemma 2.3 implies that πx(w) exists as the unique path with initial vertex x and label w. In particular, x¯ and y¯ lie on that path. Furthermore, we have y¯=yaj+1an.

Now let vLT with |v|=m(T). Then by Lemma 4.4, T arises in a derivation Sa1aiTaj+1anw. But then we also have Sa1aivaj+1an, a word in Lx,y. By Lemma 2.3, again using that X is symmetric and deterministic, x¯v=yaj+1an=y¯. Therefore, x¯ and y¯ are connected by a path with label v. Its length is m(T). □

Theorem 4.6

Let (X,E,) be a fully deterministic, symmetric graph with label alphabet Σ and root o . If Lo,o is a context-free language, then X is a context-free graph with respect to o , and in particular, Lo,o is deterministic.

Proof

There is a reduced grammar C=(V,Σ,P,S) in CNF that generates Lo,o. Each of the languages LT, TV, is non-empty, only LS contains ϵ, and we define

m=max{m(T):TV}, (3)

where m(T) is as in (2).

Let C be a cone with respect to o such that k=d(o,C)>m.

Construction ofD˜(C). We define D(C) as the subgraph of X induced by all vertices yX with

d(o,x)=d(o,y)+d(x,y)andd(x,y)mfor some xC.

In particular, y lies on some geodesic path from o to C.

Now let x1,x2C, and consider some path πΠx1,x2(C) (i.e., it lies in C). Choose a geodesic path π1 from o to x1 and a geodesic path π2 from x2 to o. Then we can concatenate the three paths to a single path π1ππ2Πo,o. Its label is the word w=(π1)(π)(π2)Lo,o. Set n=|w| and write

w=(a1ak)(ak+1ank)(ank+1an)

where the 3 pieces in the parentheses are (in order) (π1), (π) and (π2). The words (π1), (π) and (π2)S are the labels of three consecutive arcs that fill the boundary of the polygon P(w). (To be precise, along the last edge of the 3rd arc, we are reading the label S in the reversed direction.) By [11, Lemma 5], its triangulation has a triangle which meets each of those arcs. (It may also occur that one corner of the triangle meets two arcs.) Thus, there are i{0,,k} and i{k,,nk} such that the vertices ti and ti of P(w) lie on that triangle. They correspond to the vertices y1=oa1ai and y=oa1ai of X. We either have ii1, or else a diagonal (ti,U,ti) is a side of our triangle. By Lemma 4.5, we get d(y1,y)m(U)m. Thus kid(o,y)i+m, that is, ikm>0. In particular, ti does not lie on the third arc. In the same way, there is j{nk,,nk+m} (and not larger) such that tj is a corner of our triangle. This yields that there must be a “true” diagonal (ti,T,tj) of P(w). We set v1=ai+1ak and v2=ank+1aj, so that x1=y1v1, and let y2=x2ank+1aj. The points y1 and y2 are in D(C), and by Lemma 4.4, Tv1(π)v2.

[It is here that we can see Lemma 4.3, since we deduced that d(x1,x2)3m for all x1,x2C.]

By Lemma 4.4, we also have

Sa1aiTaj+1an,

so that vLT implies a1aivaj+1anLo,o and consequently vLy1,y2, that is, y1v=y2.

We now insert into D(C) the additional labelled edge (y1,v1Tv2,y2), whose label is the word v1Tv2ΣVΣ. We insert all diagonals of the same type that can be obtained in the same way, and write D˜(C) for the resulting “edge-enrichment” of D(C).

Subsuming, we have an edge (y1,v1Tv2,y2) in D˜(C) if and only if the following properties hold.

  • |vi|m (i=1,2) and TV,

  • the path with label v1 starting at y1 and ending at x1=y1v1C is part of a geodesic from o to x1,

  • the path with label v2 starting at x2=y2v2C and ending at y2 is part of a geodesic from x2 to o, and

  • there is a path π in C from x1 to x2 such that Tv1(π)v2,

  • if TvΣ then v is the label of a path in Πy1,y2.

Now, there are only finitely many cones C with respect to o with d(C,o)m. On the other hand, for all cones C with d(C,o)m, there is a bound on the number of vertices of D˜(C), as well as on the number of possible labels on its edges. In particular, there are only finitely many possible isomorphism types of the labelled graphs (D˜(C),C) with “marked” boundary CD˜(C).

We now suppose that C and C are two cones at distance m from o, such that (D˜(C),C) and (D˜(C),C) are isomorphic. We claim that C and C are isomorphic, and this will conclude the proof that there are only finitely many isomorphism types of cones with respect to o.

Let ϕ:D˜(C)D˜(C) be an isomorphism with ϕ(C)=C, and ϕ its inverse mapping. We extend ϕ to a mapping from C to C, also denoted ϕ.

Claim 1

LetxCandvΣ+such that the pathπx(v)lies inCand meetsConly in its initial pointx. Then the pathπx(v)lies inCand meetsConly in its initial pointx=ϕ(x)C.

Proof

If a is the initial letter of v then (always using the notation of Definition 1.1) the first edge of πx(v) is (x,a,xa). We now consider the path πx(v) with label v starting at xC. We first claim that the latter lies in C and only its initial point x is in C. Let (x,a,(x)a) be the first edge of the path. Then (x)a cannot lie in D˜(C), since otherwise (x,a,xa)=(ϕ(x),a,ϕ(x)a) would be an edge in D˜(C), a contradiction. Thus, the path πx(v) goes at least initially into CC.

So now suppose that πx(v) ever returns to C, and let π be its initial part up to the first return. Then v=(πx(v)) is an initial part of v with |v|2, and π is a path within C from x1=x to x2=(x)vC. But then, by construction, D˜(C) must contain an edge (y1,v1Tv2,y2) such that x1=(y1)v1, y2=(x2)v2, and Tv1vv2. Using the isomorphism ϕ:D˜(C)D˜(C), we set yi=ϕ(yi), i=1,2, and x2=ϕ(x2)C. We have of course x1=ϕ(x1). Now we must have the edge (y1,v1Tv2,y2) in D˜(C). But then v1vv2Ly1,y2, and consequently vLx1,x2, that is, x1vC. But this contradicts the fact that πx(v) meets C only in its initial point. We conclude that also the path πx(v) lies in C and meets C only in its initial point, and Claim 1 is verified. □

Now let zCC. Then there are xC and vΣ+ such that z=xv and the path πx(v) from x to z meets C only in its initial point x. By Claim 1, the analogous statement holds for the path πx(v) in C, where x=ϕ(x). The only choice is to define ϕ(z)=z=(x)v, which lies in CC as required. We have to show that ϕ is well-defined. This will follow from the next claim.

Claim 2

Letx1,x2C,v,wΣ+such that the pathsπx1(v)andπx2(w)lie inC, meetConly in their initial points and end at the same point ofCC. Then, settingxi=ϕ(xi), alsoπx1(v)andπx2(w)end at the same point ofCC.

Proof

Let w1 be the “inverse” of w, as defined in Definition 1.1. Then x2w1=x2w, and vw1 is the label of the path from x1 to x2 that we obtain by first following πx1(v) and then the “inverse” of πx2(w). It lies entirely in C, and only its endpoints are in C. By construction, D˜(C) has an edge (y1,v1Tv2,y2) such that y1v1=x1, x2v2=y2 and Tv1vw1v2. We set yi=ϕ(yi), i=1,2. Then (y1,v1Tv2,y2) is an edge of D˜(C). Therefore v1vw1v2Ly1,y2. But this implies that vw1 is the label of a path from x1 to x2, and we know from Claim 1 that it lies in C and has only its endpoints in C. Thus (x1)v=(x2)w1=(x2)w, and Claim 2 is true. □

Thus, ϕ is well defined, and the same works of course also for ϕ by exchanging the roles of Cand C.

Claim 3

The mapϕ:CCis bijective.

Proof

We know that ϕ:CC is bijective and that ϕ(CC)CC. Let zCC, and let xC, vΣ+ such that πx(v) is a path from x to z that intersects C only at the initial point. Setting x=ϕ(x), z=ϕ(z), we know from the construction of ϕ and Claim 1 that πx(v) is a path in C from x to z that meets C only in its initial point. Now the way how ϕ is constructed yields that ϕ(z)=z. Therefore ϕϕ is the identity on C. Exchanging roles, we also get the ϕϕ is the identity on C. This proves Claim 3. □

It is now immediate from the construction that ϕ also preserves the edges and their labels, so that it is indeed an isomorphism between the labelled graphs C and C that sends C to C. This concludes the proof of Theorem 4.6. □

[12, Cor. 2.7] says that if a symmetric labelled graph is context-free with respect to one root o, then it is context-free with respect to any other vertex chosen as the root x. In view of Theorem 4.2, Theorem 4.6, this is also obtained from the following, when the graph is fully deterministic.

Corollary 4.7

Let (X,E,) be a fully deterministic, strongly connected graph with label alphabet Σ . If Lo,o is context-free then Lx,y is deterministic context-free for all x,yX .

Theorem 4.2, Theorem 4.6, together with Lemma 3.1 also imply the following.

Corollary 4.8

Let G be a finitely generated group and K a subgroup.

  • (a)

    The pair (G,K) is context-free if and only if for any symmetric ψ:ΣG , the Schreier graph X(G,K,ψ) is a context-free graph. In this case, the language L(G,K,ψ) is deterministic for every (not necessarily symmetric) semigroup presentation ψ:ΣG .

  • (b)

    If (G,K) is context-free, then also (G,g1Kg) is context-free for every gG .

Proof

(a) is clear. Regarding (b), for the Schreier graph X(G,K,ψ), we have L(G,K,ψ)=Lo,o and L(G,g1Kg,ψ)=Lx,x with x=Kg, gG. Thus, the statement follows from Corollary 4.7. □

Lemma 4.9

Let G be a finitely generated group and K,H be subgroups with KH and [H:K]< .

If (G,K) is context-free then also (G,H) is context-free.

Proof

In the context-free graph X(G,K,ψ), consider the finite set of vertices F={Kh:hH}, containing the root vertex o=oK=K. Then L(G,H,ψ)=xFLo,x is a finite (disjoint) union of context-free languages. Therefore it is context-free by standard facts. □

Remark 4.10

In terms of Schreier graphs, we have the mapping KgHg which is a homomorphism of labelled graphs from X=X(G,K,ψ) onto Y=X(G,H,ψ) which is finite-to-one. The lemma says that in this situation, if X is a context-free graph then so is Y. We do not see an easy direct proof of this fact in terms of graphs, the main problem being how the homomorphism XY interacts with the isomorphisms between the cones of X with respect to the set F. On the other hand, reformulating this in terms of the associated “path languages” with the help of Theorem 4.2, Theorem 4.6, it has become straightforward.

The converse of Lemma 4.9 is not true, that is, when (G,H) is context-free and [H:K]< then (G,K) is not necessarily context-free. See Example 5.6 in the last section. However, we have the following.

Lemma 4.11

If K is a finite subgroup of G then (G,K) is context-free if and only if G is a context-free (i.e. virtually free) group.

Proof

Fix Σ and ψ. Let X=X(G,ψ) be the associated Cayley graph of G, and Y=X(G,K,ψ). We let o be the root of Y, that is, o=K1G as an element of Y (a coset). The group K acts on X by automorphisms of that labelled graph. It leaves the set F=K (now as a set of vertices of X) invariant. The factor graph of X by this action is Y. Write π for the factor mapping. It is |K|-to-one. Each cone of X with respect to F is mapped onto a cone of Y with respect to o, and this mapping sends boundaries of cones of X to boundaries of cones of Y. By assumption, Y is a context-free graph. By Lemma 4.3, there is an upper bound on the number of elements in the latter boundaries. Therefore there also is an upper bound on the number of elements of any of the boundaries of the cones of X with respect to F.

Without going here into the details of the definition of the space of ends of X, we refer to the terminology of Thomassen and Woess [19] and note that the above implies that all ends of X are thin. But then, as proved in [19], G must be a virtually free group. □

One should not tend to believe that in the situation of the last lemma, the Cayley graphs of G are quasi-isometric with the Schreier graphs of (G,K). As a simple counter-example, take for G the infinite dihedral group a,ba2=b2 and for K the 2-element subgroup generated by a.

5. Covers and Schreier graphs

We assume again that (X,E,) is symmetric and fully deterministic. Recall the involution aa1a of Σ. A word in Σ is called reduced if it contains no subword of the form aa1, where aΣ. We write TΣ for the set of all reduced words in Σ. We can equip TΣ with the structure of a labelled graph, whose edges are of the form

(v,a,w)and(w,a1,v),where v,wTΣ,aΣ,va=w. (4)

Thus, the terminal letter of v must be different from a1. Then TΣ is fully deterministic, and it is a tree, that is, it has no closed path whose label is a (non-empty) reduced word. As the root of TΣ, we choose the empty word ϵ. Then TΣ is the universal cover of X. Namely, if we choose (and fix) any vertex oX as the root, then the mapping

Φ:TΣX,Φ(w)=ow, (5)

is a covering map: it is a surjective homomorphism between labelled graphs which is a local isomorphism, that is, it is one-to-one between the sets of outgoing (resp. ingoing) edges of any element wTΣ and its image Φ(w). (Note that this allows the image of an edge to be a loop.) “Universal” means that it covers every other cover of X, but this is not very important for us. The property of wTΣ to be reduced is equivalent with the fact that the path πo(w) in X is non-backtracking, that is, it does not contain two consecutive edges which are the reversal of each other.

We now realize that TΣ is the standard Cayley graph of the free group FΣ, where Σ is the set of free generators together with their inverses. The group product is the following: if v,wTΣFΣ, then vw is obtained from the concatenated word vw by step after step deleting possible subwords of the form aa1 that can arise from that concatenation. The group identity is ϵ, and the inverse of w is w1 as at the end of Definition 1.1. With Φ as in (5), let

K=K(X)=Φ1(o)={wTΣ:πo(w) is a closed path from o to o in X}. (6)

Then, under the indentification TΣFΣ, we clearly have that K is a subgroup of FΣ. The following is known, see e.g. Lyndon and Schupp [10, Ch. III] or (our personal source) Imrich [8].

Proposition 5.1

The graphXis the Schreier graph of the pair of groups(FΣ,K(X))with respect to the semigroup presentationψgiven byψ(a)=a,aΣ.

In ψ(a)=a, we interpret a simultaneously as a letter from the alphabet and as a generator of the free group.

Thus, in reality the study of context-free pairs of groups is the same as the study of fully deterministic, symmetric context-free graphs under a different viewpoint.

The same is not true without assuming symmetry. Indeed, given a semigroup presentation ψ of G, for every aΣ there must be waΣ such ψ(wa)=ψ(a)1, the inverse in G. But then in the Schreier Graph X(G,K,ψ), for any subgroup K of G, we have the following: if (x,a,y)E then ywa=x, that is, there is the oriented path from y to x with label wa. In a general fully deterministic graph this property does not necessarily hold, even if it has the additional property that for each aΣ, there is precisely one incoming edge with label a at every vertex. As an example, consider X={x,y,z}, Σ={a,b} and labelled edges (x,a,y),(x,b,y),(y,a,z),(y,b,x),(z,a,x),(z,b,z).

We return to the situation of Proposition 5.1. As a subgroup of the free group, the group K(X) is itself free. There is a method for finding a set of free generators. First recall the notion of a spanning tree of X. This is a tree T, which as subgraph of X is obtained by deleting edges (but no vertices) of X. Every connected (non-oriented) graph has a spanning tree, for locally finite graphs it can be constructed inductively. Now let T be a spanning tree of X, and consider all edges of X that are not edges of T. They must come in pairs (e,e1). For each pair, we choose one of the two partner edges, and we write E0 for the chosen (oriented) edges. For each eE0, we choose non-backtracking paths in T from o to e and from e+ to o. Together with e (in the middle), they give rise to a non-backtracking path in X that starts and ends at o. Let w(e) be the label on that path. Then the following holds [10], [8].

Proposition 5.2

As elements of FΣ , the w(e) , eE0 , are free generators of K(X) .

Corollary 5.3

Let G be a virtually free group and K a finitely generated subgroup. Then (G,K) is context-free.

Proof

Let F=FΣ be a free subgroup of G of finite index. Then K=KF is a free subgroup of K with [K:K]<. Since K is finitely generated, also K is finitely generated. In the Schreier graph X of (F,K) with respect to the standard labelling by Σ, choose a spanning tree and remaining set E0 of edges, as described above. Since all sets of free generators of K must have the same cardinality, E0 is finite. Thus, X is obtained by adding finitely many edges to a tree. If o is the root vertex of X and n is the largest distance between o and an endpoint of some edge in E0, then every cone C of X with d(C,o)>n is a rooted, labelled tree that is isomorphic to one of the cones of TΣ. Thus, the Schreier graph, resp. (F,K) are context-free. It now follows from Proposition 3.3 and Lemma 4.9 that also (G,K) is context-free. □

We remark here that one can always reduce the study of context-free pairs to free groups and their subgroups. Given (G,K), let F be a finitely generated free group that maps by a homomorphism onto G. Let K be the preimage of K under that homomorphism. Then clearly (G,K) is context-free if and only (F,K) has this property. (This reduction, however, is not very instructive.)

Of course, there are context-free pairs with G free beyond the situation of Corollary 5.3.

Example 5.4

Consider the free group F=a,b and the subgroup K with the infinite set of free generators {akblablak:k,lZ,l0}. The associated Schreier graph with respect to {a±1,b±1} is the comb lattice.

Its vertex set is the set of integer points in the plane. The edges labelled by a are along the x-axis, from (k,0) to (k+1,0), and there is a loop with label a at each point (k,l) with l0. The edges labelled by b are all the upward edges of the grid, that is, all edges from (k,l) to (k,l+1), where (k,l)Z2. To these, we have to add the oppositely oriented edges whose labels are the respective inverses (in Fig. 3, the oppositely oriented edges together with the corresponding labels are omitted for simplicity). The comb lattice is clearly a context-free graph (tree).

Fig. 3.

Fig. 3

The comb lattice described in Example 5.4.

We proceed giving some simple examples. It is very easy to see that context-freeness is not “transitive” in the following sense: if (G,H) and (H,K) are context-free (with G,H finitely generated and KHG) then in general (G,K) will not be context-free.

Example 5.5

Let G=Z2, H=Z×{0}Z and K={(0,0)}. Then H (i.e., (H,K)) is context-free. Of course, this also holds for (G,H), whose Schreier graphs are just the Cayley graphs of Z. But Z2 (i.e., (G,K)) is not context-free.

This also shows that the converse of Lemma 3.1 does not hold in general (while we know that it does hold when [G:H]<). Finally, we construct examples of three groups KHG, where (G,H) is context-free, [H:K]<, and (G,K) is not context-free.

Example 5.6

We construct a family of fully deterministic, symmetric labelled graphs XW, WZ(non-empty), and one such graph Y, so that Y is the factor graph with respect to the action of a 2-element group of automorphisms of each of the labelled graphs XW. While Y will be a context-free graph, many of the graphs XW in our family are not context-free. We then translate this back into the setting of pairs of groups.

The vertex set of XW is Z×{0,1}. The set of labels is Σ={a,b,a1,b1}. The edges are as follows:

((k,0),a,(k+1,0))and((k,1),a,(k+1,1))for allkZ,
((k,0),b,(k+1,0))and((k,1),b,(k+1,1))for allkZW,and
((k,0),b,(k+1,1))and((k,1),b,(k+1,0))for allkW.

The reversed edges carry the respective inverse labels (in Fig. 4, these reversed edges together with the corresponding labels are omitted for simplicity). Since W, there is at least one of the “crosses” (pair of the third type of edges). Therefore XW is connected. In general, it does not have finitely many cone types, i.e., it is not context-free. For example, it is not context-free when W={k(|k|+1):kZ}

For arbitrary W, the two-element group that exchanges each (k,0) with (k,1) acts on XW by label preserving graph automorphisms. The factor graph Y (see Fig. 5) has vertex set Z and edges

(k,a,k+1)and(k,b,k+1)for allkZ,

plus the associated reversed edges (in Fig. 5, these edges together with the corresponding labels are omitted for simplicity). It is clearly a context-free graph.

Now let F=FΣ be the free group (universal cover of XW and Y), and for given W, let KW be the fundamental group of XW at the vertex (0,0). Furthermore, let K be the fundamental group of Y at the vertex 0. Then it is straightforward that KW has index 2 in K. The mapping ψ is the embedding of Σ into FΣ, as above. We then have Y=X(F,K,ψ) and XW=X(F,KW,ψ), providing the required example.

Fig. 4.

Fig. 4

The fully deterministic, symmetric labelled graph XW, with WZ described in Example 5.6 (here W={0,1,3,}). The reverse edges, together with the corresponding labels, are omitted for simplicity.

Fig. 5.

Fig. 5

The factor graph Y of the graph XW from Fig. 4 (cf. Example 5.6). The reverse edges, together with the corresponding labels, are omitted for simplicity.

Example 5.7

At the end of the Introduction, we mentioned the possible interplay with ends. The number of ends e(X) of a symmetric, connected graph is the supremum of the number of connected components of the complement of any finite subgraph. Via Stallings’ [17] celebrated structure theorem, ends of groups (i.e., ends of Cayley graphs) are closely related with amalgamated free products and HNN-extensions. Thus, it is natural to ask the following question.

Let (G1,K) and (G2,K) be two context-free pairs of groups sharing the same subgroup K. Let G=G1KG2 be the amalgamated free product of G1 and G2 over the group K. Is it then true that (G,K) is context-free ? When K is finite, the answer is of course “yes”, because then G1,G2 and G are virtually free. When K is infinite, we have a counter-example. Here is a brief outline.

Let G=a1,a2,b1,b2[a1,b1][a2,b2] be the fundamental group of an orientable surface of genus 2. Let K be the infinite cyclic subgroup generated by the commutator [a1,b1]=[a2,b2]1, and for i=1,2, let Gi be the free group with free generators ai and bi. Then G is the amalgamated free product of G1 and G2 over K.

By Corollary 5.3, the pairs (G1,K) and (G2,K) are context-free. But (G,K) is not context-free. Indeed, let X be the Schreier graph of (G,K) with respect to the above generators and their inverses. It has two ends, see e.g. the outline in the Introduction of [14]. Thus, there is a finite subgraph F of X such that XB(F,n) has exactly two infinite cones for any n. If X were context-free, then the finite upper bound on the number of boundary elements of any cone would yield that X has linear growth, that is |B(F,n)|Cn for all n. This contradicts the fact that G, as well as the Schreier graphs of (G1,K) and (G2,K), have exponential growth.

Acknowledgements

The authors are grateful to Wilfried Imrich, Rögnvaldur G. Möller and Michah Sageev for useful hints and discussions.

The first author was partially supported by a visiting professorship at TU Graz. The second author was partially supported by a visiting professorship at Università di Roma - La Sapienza and the Austrian Science Fund project FWF-P19115-N18.

Contributor Information

Tullio Ceccherini-Silberstein, Email: tceccher@mat.uniroma3.it.

Wolfgang Woess, Email: woess@TUGraz.at.

References

  • 1.Anisimov A.V. Group languages. Kibernetika. 1971;4:18–24. [Google Scholar]
  • 2.Berstel J., Boasson L. vol. B. Elsevier; Amsterdam: 1990. Context-free languages; pp. 59–102. (Handbook of Theoretical Computer Science). [Google Scholar]
  • 3.Ceccherini-Silberstein T., Woess W. Growth and ergodicity of context-free languages. Trans. Amer. Math. Soc. 2002;354:4597–4625. [Google Scholar]
  • 4.Dunwoody M.J. The accessibility of finitely presented groups. Invent. Math. 1985;81:449–457. [Google Scholar]
  • 5.Frougny Ch., Sakarovitch J., Schupp P. Finiteness conditions on subgroups and formal language theory. Proc. London Math. Soc. 1989;58:74–88. [Google Scholar]
  • 6.Harrison M.A. Addison-Wesley; Reading, MA: 1978. Introduction to Formal Language Theory. [Google Scholar]
  • 7.Holt D., Rees S., Röver C., Thomas R. Groups with context-free co-word problem. J. London Math. Soc. 2005;71:643–657. [Google Scholar]
  • 8.Imrich W. Combinatorial Mathematics V. Vol. 622. Springer; Berlin: 1977. Subgroup theorems and graphs. (Lecture Notes in Mathematics). [Google Scholar]
  • 9.Lehnert J., Schweitzer P. The co-word problem for the Higman–Thompson group is context-free. Bull. Lond. Math. Soc. 2007;39:235–241. [Google Scholar]
  • 10.Lyndon R.C., Schupp P.E. Vol. 89. Springer; Berlin: 1977. Combinatorial group theory. (Ergebnisse der Mathematik und ihrer Grenzgebiete). [Google Scholar]
  • 11.Muller D.E., Schupp P.E. Groups, the theory of ends and context-free languages. J. Comput. System Sc. 1983;26:295–310. [Google Scholar]
  • 12.Muller D.E., Schupp P.E. The theory of ends, pushdown automata, and second-order logic. Theoret. Comput. Sci. 1985;37:51–75. [Google Scholar]
  • 13.Pélecq L. Automorphism groups of context-free graphs. Theoret. Comput. Sci. 1996;165:275–293. [Google Scholar]
  • 14.Sageev M. Ends of group pairs and non-positively curved cube complexes. Proc. London Math. Soc. 1995;71:585–617. [Google Scholar]
  • 15.Scott P. Ends of pairs of groups. J. Pure Appl. Algebra. 1977/78;11:179–198. [Google Scholar]
  • 16.Sénizergues G. Automata, languages and programming. vol. 1099. Springer; Berlin: 1996. Semi-groups acting on context-free graphs. pp. 206–218. (Lecture Notes in Comput. Sci.). Paderborn, 1996. [Google Scholar]
  • 17.Stallings J.R. On torsion-free groups with infinitely many ends. Ann. Math. 1968;88:312–334. [Google Scholar]
  • 18.Swarup G.A. On the ends of pairs of groups. J. Pure Appl. Algebra. 1993;87:93–96. [Google Scholar]
  • 19.Thomassen C., Woess W. Vertex-transitive graphs and accessibility. J. Combin. Theory Ser. B. 1993;58:248–268. [Google Scholar]
  • 20.Woess W. Context-free pairs of groups. II - Cuts, tree sets, and random walks. Discrete Math. 2012;312:157–173. doi: 10.1016/j.disc.2011.07.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Woess W. Graphs and groups with tree-like properties. J. Combin. Theory, Ser. B. 1989;47:361–371. [Google Scholar]

RESOURCES