Context-free pairs of groups I: Context-free pairs and graphs

Tullio Ceccherini-Silberstein; Wolfgang Woess

doi:10.1016/j.ejc.2012.03.011

. 2012 Oct;33(7):1449–1466. doi: 10.1016/j.ejc.2012.03.011

Context-free pairs of groups I: Context-free pairs and graphs

Tullio Ceccherini-Silberstein ^a, Wolfgang Woess ^b

PMCID: PMC4819043 PMID: 27087724

Abstract

Let $G$ be a finitely generated group, $A$ a finite set of generators and $K$ a subgroup of $G$ . We define what it means for $(G, K)$ to be a context-free pair; when $K$ is trivial, this specializes to the standard definition of $G$ to be a context-free group.

We derive some basic properties of such group pairs. Context-freeness is independent of the choice of the generating set. It is preserved under finite index modifications of $G$ and finite index enlargements of $K$ . If $G$ is virtually free and $K$ is finitely generated then $(G, K)$ is context-free. A basic tool is the following: $(G, K)$ is context-free if and only if the Schreier graph of $(G, K)$ with respect to $A$ is a context-free graph.

1. Introduction

Let $G$ be a finitely generated group and $K$ a (not necessarily finitely generated) subgroup of $G$ . We can choose a finite set $A \subset G$ of generators such that every element of $G$ is of the form $g = g_{1} \dots g_{n}$ , where $n \geq 0$ and $g_{1}, \dots, g_{n} \in A$ . Thus, $A$ generates $G$ as a semigroup. We shall say that $(G, K)$ is context-free, if–loosely spoken–the language of all words over $A$ that represent an element of $K$ is context-free.

The precise definition needs some preparation. Let $Σ$ be a finite alphabet and $ψ : Σ \to G$ be a (not necessarily injective) mapping such that $A = ψ (Σ)$ satisfies the above finite generation property for $G$ . Then $ψ$ has a unique extension, also denoted $ψ$ , as a monoid homomorphism $ψ : Σ^{*} \to G$ . Recall that $Σ^{*}$ consists of all words $w = a_{1} \dots a_{n}$ , where $n \geq 0$ and $a_{1}, \dots, a_{n} \in Σ$ (repetitions allowed). The number $n$ is the length $| w |$ of $w$ . If $n = 0$ this means that $w = ϵ$ , the empty word. This is the neutral element of $Σ^{*}$ , and $Σ^{*}$ is a free monoid with the binary operation of concatenation of words. The extension of $ψ$ is of course given by

ψ (a_{1} \dots a_{n}) = ψ (a_{1}) \dots ψ (a_{n}),

where the product on the right hand side is taken in $G$ . Given these ingredients, we shall say that $ψ : Σ \to G$ is a semigroup presentation of $G$ , referring to the fact that $A$ generates $G$ as a semigroup. A language over $Σ$ is a non-empty subset of $Σ^{*}$ .

Definition 1.1

The word problem of $(G, K)$ with respect to $ψ$ is the language

$L (G, K, ψ) = {w \in Σ^{*} : ψ (w) \in K} .$

We say that the triple $(G, K, ψ)$ is context-free, if $L (G, K, ψ)$ is a context-free language.

A context-free grammar is a quadruple $C = (V, Σ, P, S)$ , where $V$ is a finite set of variables, disjoint from the finite alphabet $Σ$ (the terminal symbols), the variable $S$ is the start symbol, and $P \subset V \times {(V \cup Σ)}^{*}$ is a finite set of production rules. We write $T ⊢ u$ or $(T ⊢ u) \in P$ if $(T, u) \in P$ . For $v, w \in {(V \cup Σ)}^{*}$ , we write $v ⟹ w$ if $v = v_{1} T v_{2}$ and $w = v_{1} u v_{2}$ , where $u, v_{1}, v_{2} \in {(V \cup Σ)}^{*}$ and $T ⊢ u$ . This is a single derivation step, and it is called rightmost, if $v_{2} \in Σ^{*}$ . A derivation is a sequence $v = w_{0}, w_{1}, \dots, w_{k} = w \in {(V \cup Σ)}^{*}$ such that $w_{i - 1} ⟹ w_{i}$ ; we then write $v \overset{*}{⟹} w$ . A rightmost derivation is one where each step is rightmost. The succession of steps of any derivation $T \overset{*}{⟹} w \in Σ^{*}$ can be reordered so that it becomes a rightmost derivation. For $T \in V$ , we consider the language $L_{T} = {w \in Σ^{*} : T \overset{*}{⟹} w}$ . The language generated by $C$ is $L (C) = L_{S}$ .

A context-free language is a language generated by a context-free grammar. As a basic reference for Language and Automata Theory, we refer to the magnificent monograph of Harrison[6].

The above definition of a context-free pair, or rather triple, $(G, K, ψ)$ makes sense when $G$ is a finitely generated monoid and $K$ is a sub-monoid, but here we are interested in groups. When in addition $K = {1_{G}}$ , this leads to the notion of $G$ being a context-free group. In two celebrated papers, Muller and Schupp[11], [12] have carried out a detailed study of context-free groups and more generally, context-free graphs. In particular, context-freeness of a group is independent of the particular choice of the generating set $A$ of $G$ . The main result of [11], in combination with a fundamental theorem of Dunwoody [4], is that a finitely generated group is context-free if and only if it is virtually free, that is, it contains a free subgroup of finite index. (In [11], it is assumed that $A = A^{- 1}$ and that $ψ : Σ \to A = ψ (Σ)$ is one-to-one, but the results carry over immediately to the more general setting where those two properties are not required.)

Previously, Anisimov [1] had shown that the groups whose word problem $L (G, {1_{G}}, ψ)$ is regular (see Section 2 for the definition) are precisely the finite groups.

The above mentioned context-free graphs are labelled, rooted graphs with finitely many isomorphism classes of cones. The latter are the connected components of the graph that remain after removing a ball around the root with arbitrary radius. See Section 4 for more precise details. As shown in [12], there is a natural correspondence between such graphs and pushdown automata, which are another tool for generating context-free languages; see Section 3.

Among subsequent work, we mention Pélecq [13] and Sénizergues [16], who studied actions on, resp. quotients of context-free graphs. Group-related examples occur also in Ceccherini-Silberstein and Woess [3].

More recently, Holt et al. [7] have introduced and studied co-context-free groups, which are such that the complement of $L (G, {1_{G}}, ψ)$ is context-free, see also Lehnert and Schweitzer [9]. This concept has an obvious extension to co-context-free pairs of groups, resp. graphs, on whose examination we do not (yet) embark.

In the present notes, we collect properties and examples of context-free pairs of groups $(G, K)$ .

•
The language $L (G, K, ψ)$ is regular if and only if the index $[G : K]$ of $K$ in $G$ is finite (Proposition 2.4).
•
The property that $L (G, K, ψ)$ is context-free does not depend on the specific choice of the semigroup presentation $ψ$ , so that context-freeness is just a property of the pair $(G, K)$ , a consequence of Lemma 3.1.
•
If $(G, K)$ is context-free then $L (G, K, ψ)$ is a deterministic context-free language (see Section 3 for the definition) for any semigroup presentation $ψ : Σ \to G$ (Corollary 4.8.a).
•
If $(G, K)$ is context-free and $H$ is a finitely generated subgroup of $G$ , then the pair $(H, K \cap H)$ is context-free (Lemma 3.1).
•
If $[G : H] < \infty$ then $(G, K)$ is context-free if and only if $(H, K \cap H)$ is context-free (Proposition 3.3 & Lemma 4.9).
•
If $(G, K)$ is context-free and $H$ is a subgroup of $G$ with $K \leq H$ and $[H : K] < \infty$ then $(G, H)$ is context-free (Lemma 4.9).
•
If $K$ is finite then $G$ is context-free if and only if $(G, K)$ is context-free (Lemma 4.11).
•
If $(G, K)$ is context-free then $(G, g^{- 1} K g)$ is context-free for every $g \in G$ (Corollary 4.8.b).
•
If $G$ is virtually free and $K$ is a finitely generated subgroup of $G$ then $(G, K)$ is context-free (Corollary 5.3).

Several of these properties rely on the following.

•
A fully deterministic, symmetric labelled graph (see Section 2 for definitions) is context-free in the sense of Muller and Schupp if and only if the language of all words which are labels of a path that starts and ends at a given root vertex is context-free (Theorem 4.2, Theorem 4.6).

The (harder) “if” part is not contained in previous work. It implies the following.
•
The pair $(G, K)$ is context-free if and only if for some ( $⟺$ any) symmetric semigroup presentation $ψ : Σ \to G$ , the Schreier graph of $(G, K)$ with respect to $ψ$ is a context-free graph. (See again Section 2 for precise definitions).

In a second paper [20], a slightly more general approach to context-freeness of graphs via cuts and tree-sets is given. It allows to show that certain structural properties (“irreducibility”) are preserved under finite-index-modifications of the underlying pair of groups. This is then applied to random walks, leading in particular to results on the asymptotic behaviour of transition probabilities.

In concluding the Introduction, we remark that with the exception of some “elementary” cases, context-free pairs of groups are always pairs with more than one end. Ends of pairs of groups were studied, e.g., by Scott [15], Swarup [18] and Sageev [14]. This leads directly to asking about the interplay between context-freeness of pairs and decomposition as amalgamated products or HNN-extensions. An example at the end of Section 5 shows that there is no immediate answer.

2. Schreier graphs and the regular case

Let $Σ$ be a finite alphabet. A directed graph labelled by $Σ$ is a triple $(X, E, ℓ)$ , where $X$ is the (finite or countable) set of vertices, $E \subset X \times Σ \times X$ is the set of oriented, labelled edges and $ℓ : E ∋ (x, a, y) \mapsto a \in Σ$ is the labelling map.

For an edge $e = (x, a, y) \in E$ , its initial vertex is $e^{-} = x$ and its terminal vertex is $e^{+} = y$ , and we say that $e$ is outgoing from $x$ and ingoing into $y$ . If $y = x$ then $e$ is a loop, which is considered both as an outgoing and as an ingoing edge. We allow multiple edges, i.e., edges of the form $e_{1} = (x, a_{1}, y)$ and $e_{2} = (x, a_{2}, y)$ with $a_{1} \neq a_{2}$ , but here we exclude multiple edges where also the labels coincide. The graph is always assumed to be locally finite, that is, every vertex is an initial or terminal vertex of only finitely many edges. We also choose a fixed vertex $o \in X$ , the root or origin. We shall often just speak of the graph $X$ , keeping in mind the presence of $E$ and $ℓ$ .

We call $X$ fully labelled if at every vertex, each $a \in Σ$ occurs as the label of at least one outgoing edge. We say that $X$ is deterministic if at every vertex all outgoing edges have distinct labels, and fully deterministic if it is fully labelled and deterministic. Finally, we say that $X$ is symmetric or undirected if there is a fixed point free involution $a \mapsto a^{- 1}$ of $Σ$ (i.e., ${(a^{- 1})}^{- 1} = a$ , excluding the possibility that $a^{- 1} = a$ ) such that for each edge $e = (x, a, y) \in E$ , also the reversed edge $e^{- 1} = (y, a^{- 1}, x)$ belongs to $E$ .

A path in $X$ is a sequence $π = e_{1} e_{2} \dots e_{n}$ of edges such that $e_{i}^{+} = e_{i + 1}^{-}$ for $i = 1, \dots, n - 1$ . The vertices $π^{-} = e_{1}^{-}$ and $π^{+} = e_{n}^{+}$ are the initial and the terminal vertex of $π$ . The number $| π | = n$ is the length of the path. The label of $π$ is $ℓ (π) = ℓ (e_{1}) ℓ (e_{2}) \dots ℓ (e_{n}) \in Σ^{*}$ . We also admit the empty path starting and ending at a vertex $x$ , whose label is $ϵ$ . Denote by $Π_{x, y} = Π_{x, y} (X)$ the set of all paths $π$ in $X$ with initial vertex $π^{-} = x$ and terminal vertex $π^{+} = y$ . The following needs no proof.

Lemma/Definition 2.1

Let $(X, E, ℓ)$ be a labelled graph, $x \in X$ and $w \in Σ^{*}$ . We define $Π_{x} (w) = {π : π^{-} = x, ℓ (π) = w}$ , the set of all paths that start at $x$ and have label $w$ . The set of all terminal vertices of those paths is denoted $x^{w} = {π^{+} : π \in Π_{x} (w)}$ .

Analogously, we define ${\bar{Π}}_{x} (w) = {π : π^{+} = x, ℓ (π) = w}$ , the set of all paths that terminate at $x$ and have label $w$ , and write $x^{- w} = {π^{-} : π \in {\bar{Π}}_{x} (w)}$ .

If $X$ is fully labelled, then $Π_{x} (w)$ is always non-empty.

If $X$ is deterministic, then $Π_{x} (w)$ has at most one element, and if that element exists, it is denoted $π_{x} (w)$ , while $x^{w}$ just denotes its endpoint.

If $X$ is fully deterministic, then $x^{w}$ is a unique vertex of $X$ for every $x \in X$ , $w \in Σ^{*}$ .

Finally, if $X$ is symmetric (not necessarily deterministic), then ${\bar{Π}}_{x} (w) = Π_{x} (w^{- 1})$ , where for $w = a_{1} \dots a_{n}$ , one defines $w^{- 1} = a_{n}^{- 1} \dots a_{1}^{- 1}$ .

With a labelled, directed graph as above, we can associate various languages. We can, e.g., consider the language

L_{x, y} = L_{x, y} (X) = {ℓ (π) : π \in Π_{x, y} (X)}, where x, y \in X .

(1)

Definition 2.2

Let $G$ be a finitely generated group, $K$ a subgroup and $ψ : Σ \to G$ a semigroup presentation of $G$ . The Schreier graph $X = X (G, K, ψ)$ has vertex set

$X = K ∖ G = {K g : g \in G}$

(the set of all right $K$ -cosets in $G$ ), and the set of labelled, directed edges

$E = {e = (x, a, y) : x = K g, y = K g ψ (a), where g \in G, a \in Σ} .$

$X$ is a rooted graph with origin $o = K$ , the right coset corresponding to the neutral element $1_{G}$ of the group $G$ . The Schreier graph is fully deterministic. It is also strongly connected: for every pair $x, y \in X$ , there is a path from $x$ to $y$ . (This follows from the fact that $ψ (Σ)$ generates $G$ as a semigroup.) When $K = {1_{G}}$ then we write $X (G, ψ)$ . This is the Cayley graph of $G$ with respect to $ψ$ , or more loosely speaking, with respect to the set $ψ (Σ)$ of generators.

Note that $X$ can have the loop $e = (x, a, x) \in E$ with $x = K g$ . This holds if and only if $ψ (a) \in g^{- 1} K g$ . It can also have the multiple edges $e_{1} = (x, a_{1}, y)$ and $e_{2} = (x, a_{2}, y)$ with $x = K g$ and $a_{1} \neq a_{2}$ . This occurs if and only if $ψ (a_{2}) ψ {(a_{1})}^{- 1} \in g^{- 1} K g$ . In particular, there might be multiple loops. The following is obvious.

Lemma 2.3

Let $K$ be a subgroup of $G$ and $ψ : Σ \to G$ be a semigroup presentation of $G$ . Then

$L (G, K, ψ) = L_{o, o} (X)$

is the language of all labels of closed paths starting and ending at $o = K$ in the Schreier graph $X (G, K, ψ)$ .

A context-free grammar $C = (V, Σ, P, S)$ and the language $L (C)$ are called linear, if every production rule in $P$ is of the form $T ⊢ v_{1} U v_{2}$ or $T ⊢ v$ , where $v, v_{1}, v_{2} \in Σ^{*}$ and $T, U \in V$ . If furthermore in this situation one always has $v_{2} = ϵ$ (the empty word), then grammar and language are called right linear or regular.

A finite automaton $A$ consists of a finite directed graph $X = (X, E, ℓ)$ with label set $Σ$ and labelling map $ℓ$ , together with a root vertex $o$ and a non-empty set $F \subset X$ . The vertices of $X$ are called the states of $A$ , the root $o$ is the initial state, and the elements of $F$ are the final states. The automaton is called (fully) deterministic provided the labelled graph $X$ is (fully) deterministic. The language accepted by $A$ is

L (A) = ⋃_{x \in F} L_{o, x} (X) .

If $A$ is deterministic, then for each $w \in L (A)$ there is a unique path $π \in ⋃_{x \in F} Π_{o, x} (X)$ such that $ℓ (π) = w$ . A state $y \in X$ is called useful if there is some word $w \in L$ such that the vertex $y$ lies on a path in $⋃_{x \in F} Π_{o, x} (X)$ with label $w$ . It is clear that we can remove all useless states and their ingoing and outgoing edges to obtain an automaton which accepts the same language and is reduced: it has only useful states.

It is well known [6, Chapter 2] that a language $L \subseteq Σ^{*}$ is regular if and only if $L$ is accepted by some deterministic finite automaton.

The following, which corresponds to Theorem 1 in [5], generalizes Anisimov’s [1] characterization of groups with regular word problem, and also simplifies its proof, as well as the simpler one of [11, Lemma 1].

Proposition 2.4

Let $G$ be a finitely generated group, $K$ a subgroup and $ψ : Σ \to G$ a semigroup presentation of $G$ . Then $(G, K)$ has regular word problem with respect to $ψ$ if and only if $K$ has finite index in $G$ .

Proof

Suppose first that the index of $K$ in $G$ is finite. Consider the finite automaton $A = (X, o, {o})$ where $X$ is the Schreier graph $X (G, K, ψ)$ , and the initial and unique final state is $o = K$ (as a vertex of $X$ ). Then $L (G, K, ψ) = L (A)$ : indeed, $w \in Σ^{*}$ belongs to $L (G, K, ψ)$ , i.e. $ψ (w) \in K$ , if and only if $K = K ψ (w)$ . This shows that $L (G, K, ψ)$ is regular.

Conversely, suppose that $L = L (G, K, ψ)$ is regular and accepted by the reduced, deterministic finite automaton $A = (X, o, F)$ . For $y \in X$ there is some word $w \in L$ such that the vertex $y$ lies on the unique path from $o$ to $F$ with label $w$ . We choose one such $w$ and let $w_{y}$ be the label of the final piece of the path, starting at $y$ and ending at $F$ . We set $g_{y} = ψ {(w_{y})}^{- 1} \in G$ .

Let $g \in G$ . There are $w, \bar{w} \in Σ^{*}$ with $ψ (w) = g$ and $ψ (\bar{w}) = g^{- 1}$ . Thus, $w \bar{w} \in L = L (G, K, ψ)$ , and there is a (unique) path $π$ with label $w \bar{w}$ from $o$ to some final state. Now consider the initial piece $π_{w}$ of $π$ , that is, the path starting at $o$ whose label is our $w$ that we started with. [Thus, we have proved that such a path $π_{w}$ must exist in $X$ !] Let $y$ be the final state (vertex) of $π_{w}$ . Then clearly $w w_{y} \in L (A)$ , which means that $g g_{y}^{- 1} = ψ (w w_{y}) \in K$ . Since $ψ (Σ^{*}) = G$ , it follows that

$G = ⋃_{y \in X} K g_{y},$

and $K$ has finitely many cosets in $G$ . □

Corollary 2.5

Let $G$ be finitely generated and $K$ a subgroup. Then the property of the pair $(G, K)$ to have a regular word problem is independent of the semigroup presentation of $G$ .

We shall see that the same also holds in the context-free case. Another corollary that we see from the proof of Proposition 2.4 is the following.

Corollary 2.6

Let $G$ be finitely generated and $K$ a subgroup of finite index. Then for any semigroup presentation $ψ : Σ \to G$ , any reduced deterministic automaton $A = (X, o, F)$ that accepts $L (G, K, ψ)$ has a surjective homomorphism (as a labelled oriented graph with root $o$ ) onto the Schreier graph $X (G, K, ψ)$ . Also, the labelled graph $X$ is fully deterministic.

Proof

Let $A = (X, o, F)$ be deterministic and reduced, as in part 2 of the proof of Proposition 2.4.

Let $y \in X$ , and recall the construction of the label $w_{y}$ of a path from $y$ to $F$ , and $g_{y} = ψ {(w_{y})}^{- 1} \in G$ . If $v$ is another path from $y$ to $F$ , and $h = ψ {(v)}^{- 1}$ , then we can take $w \in L_{o, y}$ (which we know to be non-empty) and find that $w w_{y}, w v \in L (G, K, ψ)$ , so that $ψ (w) \in K g_{y} \cap K h$ . Thus $K g_{y} = K ψ (w) = K h$ , and the map $κ : X \to K ∖ G, y \mapsto K g_{y}$ is well defined. It has the property that when $w \in L_{o, y}$ , then $K ψ (w) = K g_{y}$ . The map $κ$ is clearly surjective, and $κ (o) = K$ by construction.

Now let $y \in X$ and $a \in Σ$ . Take $w \in L_{o, y}$ and consider the word $w a$ . Again by part 2 of the proof of Proposition 2.4, there is a unique path $π_{w a}$ in $X$ starting at $o$ with label $w a$ . If $y$ is its final vertex, then there is the edge $e = (y, a, z)$ in $X$ . In this situation, $κ (z) = K ψ (w a) = K g_{y} ψ (a) = κ (y) ψ (a)$ . This means that in the Schreier graph, there is the edge with label $a$ from $κ (y)$ to $κ (z)$ . Therefore $κ$ is a homomorphism of labelled graphs. □

The following simple example shows that, in general, the map $κ$ constructed in the proof of the previous corollary is not injective.

Example 2.7

Let $G = Z_{2} = {1, t}$ be the group of order two and $K = {1}$ the trivial subgroup. Let $Σ = {a}$ and consider the presentation $ψ : Σ \to G$ such that $ψ (a) = t$ . Then $L (G, K, ψ) = {a^{2 n} : n \geq 0}$ .

In Fig. 1 we have represented, from left to right, the Schreier graph $X (G, K, ψ)$ (which is nothing but the Cayley graph of $G$ w.r. to $ψ$ ), and two automata $A_{1}$ and $A_{2}$ . As usual, $o$ denotes the origin, while the sets of final states are $F_{1} = {o}$ and $F_{2} = {o, f}$ , respectively. We have $L (A_{1}) = L (A_{2}) = L (G, K, ψ)$ .

Fig. 1 — (From left to right) the Schreier graph $X (G, K, ψ)$ described in Example 2.7 and two automata $A_{1}$ and $A_{2}$ such that $L (A_{1}) = L (A_{1}) = L (G, K, ψ)$ .

3. Pushdown automata

Besides grammars, we shall need another instrument for generating context-free languages. A pushdown automaton is a 7-tuple $A = (Q, Σ, Z, δ, q_{0}, Q_{f}, z_{0})$ , where $Q$ is a finite set of states, $Σ$ the input alphabet as above, $Z$ a finite set of stack symbols, $q_{0} \in Q$ the initial state, $Q_{f} \subset Q$ the set of final states, and $z_{0} \in Z \cup {ϵ}$ is the start symbol. Finally, the function $δ : Q \times (Σ \cup {ϵ}) \times (Z \cup {ϵ}) \to P_{fin} (Q \times Z^{*})$ is the transition function. Here, $P_{fin} (Q \times Z^{*})$ stands for the collection of all finite subsets of $Q \times Z^{*}$ .

The automaton works in the following way. At any time, it is in some state $p \in Q$ , and the stack contains a word $ζ \in Z^{*}$ . The automaton reads a word $w \in Σ^{*}$ from the “input tape” letter by letter from left to right. If the current letter of $w$ is $a$ , the state is $p$ and the top (=rightmost) symbol of the stack word $ζ$ is $z$ , then it performs one of the following transitions.

(i)
$A$ selects some $(q, ζ^{'}) \in δ (p, a, z)$ , changes into state $q$ , moves to the next position on the input tape (it may be empty if $a$ was the last letter of $w$ ), and replaces the rightmost symbol $z$ of $ζ$ by $ζ^{'}$ , or
(ii)
$A$ selects some $(q, ζ^{'}) \in δ (p, ϵ, z)$ , changes into state $q$ , remains at the current position on the input tape (so that $a$ has to be treated later), and replaces the rightmost symbol $z$ of $ζ$ by $ζ^{'}$ .

If both $δ (p, a, z)$ and $δ (p, ϵ, z)$ are empty then $A$ halts.

The automaton is also allowed to continue to work when the stack is empty, i.e., when $ζ = ϵ$ . Then the automaton acts in the same way, by putting $ζ^{'}$ on the stack when it has selected $(q, ζ^{'}) \in δ (p, a, ϵ)$ in case (i), resp. $(q, ζ^{'}) \in δ (p, ϵ, ϵ)$ in case (ii).

We say that $A$ accepts a word $w \in Σ^{*}$ if starting in the state $q_{0}$ with only $z_{0}$ on the stack and with $w$ on the input tape, after finitely many transitions the automaton can reach a final state with empty stack and empty input tape. The language accepted by $A$ is denoted $L (A)$ .

The pushdown automaton is called deterministic if for any $p \in Q$ , $a \in Σ$ and $z \in Z \cup {ϵ}$ , it has at most one option what to do next, that is,

| δ (p, a, z) | + | δ (p, ϵ, z) | \leq 1 .

(Here, $| \cdot |$ denotes cardinality.)

It is well known [6] that a language is context-free if and only if it is accepted by some pushdown automaton. A context-free language is called deterministic if it is accepted by a deterministic pushdown automaton. We also remark here that a deterministic context-free language $L$ is un-ambiguous, which means that it is generated by some context-free grammar in which every word of $L$ has precisely one rightmost derivation.

The following lemma is modelled after the indications of [11, Lemma 2]. For the sake of completeness, we include the full proof.

Lemma 3.1

Suppose that $G, K, Σ$ and $ψ : Σ \to G$ are as above. Let $H$ be a finitely generated subgroup of $G$ , and let $Σ^{'}$ be another alphabet and $ψ^{'} : Σ^{'} \to H$ be such that $F^{'} = ψ^{'} (Σ^{'})$ generates $H$ as a semigroup.

Then, if $L (G, K, ψ)$ is context-free, also $L (H, K \cap H, ψ^{'})$ is context-free, and if in addition $L (G, K, ψ)$ is deterministic, then so is $L (H, K \cap H, ψ^{'})$ .

Proof

We start with a pushdown automaton $A = (Q, Σ, Z, δ, q_{0}, Q_{f}, z_{0})$ that accepts $L (G, K, ψ)$ .

For each $b \in Σ^{'}$ , there is $u (b) \in Σ^{*}$ such that $ψ^{'} (b) = ψ (u (b))$ , and we may choose $u (b)$ to have length $\geq 1$ . Thus,

$w^{'} = b_{1} \dots b_{n} \in L (H, K \cap H, ψ^{'}) ⟺ u (b_{1}) \dots u (b_{n}) \in L (G, K, ψ) .$

With this in mind, we modify $A$ in order to obtain a pushdown automaton $A^{'}$ that accepts $L (H, K \cap H, ψ^{'})$ . Our $A^{'}$ has to translate any $w^{'} = b_{1} \dots b_{n} \in {(Σ^{'})}^{*}$ into $w = u (b_{1}) \dots u (b_{n}) \in Σ^{*}$ and to use $A$ in order to check whether $w \in L (G, K, ψ)$ .

Let $m + 1 = max {| u (b) | : b \in Σ^{'}}$ . If $m = 0$ then the only modification of $A$ needed is to replace $Σ$ by its subset $Σ^{'}$ and to use the resulting restriction of the transition function.

Otherwise, we set $Σ_{m} = Σ \cup Σ^{2} \cup \dots \cup Σ^{m}$ . For $v \in Σ^{+} = Σ^{*} ∖ {ϵ}$ , we denote by $v_{+}$ its subword obtained by deleting the first letter. We define $Q^{'} = Q \cup (Q \times Σ_{m})$ and $A^{'} = (Q^{'}, Σ^{'}, Z, δ^{'}, q_{0}, Q_{f}, z_{0})$ with the transition function $δ^{'}$ as follows. For each $p \in Q$ and $z \in Z$ ,

$δ^{'} (p, ϵ, z) = δ (p, ϵ, z),$

$δ^{'} (p, b, z) = δ (p, a, z), if u (b) = a \in Σ,$

$δ^{'} (p, b, z) = {((q, u {(b)}_{+}), ζ) : (q, ζ) \in δ (p, a, z)}, if u (b) \in a Σ^{+},$

$δ^{'} ((p, v), ϵ, z) = {((q, v), ζ) : (q, ζ) \in δ (p, ϵ, z)} \cup {((q, v_{+}), ζ) : (q, ζ) \in δ (p, a, z)}, if v \in a Σ^{+}, δ^{'} ((p, a), ϵ, z) = {((q, a), ζ) : (q, ζ) \in δ (p, ϵ, z)} \cup δ (p, a, z), if a \in Σ .$

Thus, the new states of the form $(p, v)$ with $1 \leq | v | < m$ serve to remember the terminal parts $v$ of the words $u (b)$ , $b \in Σ^{'}$ . This automaton accepts $L (G, K, ψ^{'})$ , and it is deterministic, if $A$ has this property. □

Corollary 3.2

Being context-free is a property of the pair $(G, K)$ that does not depend on the specific choice of the alphabet $Σ$ and the map $ψ : Σ \to G$ for which $ψ (Σ)$ generates $G$ as a semigroup.

Therefore, it is justified to refer to the context-free pair $(G, K)$ rather than to the triple $(G, K, ψ)$ . Furthermore, whenever this is useful, we may restrict attention to the case when the graph $X (G, K, ψ)$ is symmetric: we say that $ψ$ is symmetric, if there is a proper involution $a \mapsto a^{- 1}$ of $Σ$ such that $ψ (a^{- 1}) = ψ {(a)}^{- 1}$ in $G$ . (Again, it is not necessary to assume that $ψ$ is one-to-one, so that we have that $a^{- 1} \neq a$ even when $ψ {(a)}^{2} = 1_{G}$ .)

Proposition 3.3

Let $G$ be finitely generated, $H$ be a subgroup with $[G : H] < \infty$ . If $K$ is a subgroup of $H$ then $(G, K)$ is context-free if and only if $(H, K)$ is context-free.

Proof

The “only if” is contained in Lemma 3.1. (Observe that $H$ inherits finite generation from $G$ , since $[G : H] < \infty$ .)

For the converse, we assume that $(H, K)$ is context-free and let $ψ : Σ \to H$ and $ψ^{'} : Σ^{'} \to G$ be semigroup presentations of $H$ and $G$ , respectively. There is a pushdown automaton $A = (Q, Σ, Z, δ, q_{0}, Q_{f}, z_{0})$ that accepts $L (H, K, ψ)$ .

Let $F$ be a set of representatives of the right cosets of $H$ in $G$ , with $1_{G} \in F$ . Thus, $| F | < \infty$ , and

$G = ⨄_{g \in F} H g,$

For every $g \in F$ and $b \in Σ^{'}$ there is a unique $\bar{g} = \bar{g} (g, b) \in F$ such that $g ψ^{'} (b) \in H \bar{g}$ . Therefore there is a word $u = u (g, b) \in Σ^{*}$ such that

$g ψ^{'} (b) = ψ (u (g, b)) \bar{g} (g, b) .$

An input word $w = b_{1} \dots b_{n}$ is transformed recursively into $u_{1} \dots u_{n}$ , along with the sequence $g_{0}, g_{1}, \dots, g_{n}$ of elements of $F$ that indicate the current $H$ -coset at each step:

$g_{0} = 1_{G}; u_{k} = u (g_{k - 1}, b_{k}) and g_{k} = \bar{g} (g_{k - 1}, b_{k}) .$

Then $ψ^{'} (w) \in K$ if and only if $g_{n} = 1_{G}$ and $ψ (u_{1} \dots u_{n}) \in K$ .

Thus, our new automaton $A^{'}$ recalls at each step the current coset $H g_{k - 1}$ , which is multiplied on the right by $ψ (b_{k})$ , where $b_{k}$ is the next input letter. Then the new coset is $H \bar{g} (g_{k - 1}, b_{k})$ , and $A^{'}$ simulates what $A$ does next upon reading $u (g_{k - 1}, b_{k})$ . Then $w$ is accepted when at the end the coset is $H = H 1_{G}$ and $A$ is in a final state.

The simple task to write down this automaton in detail is left to the reader. □

4. Context-free graphs

In this section, we assume that $(X, E, ℓ)$ is symmetric. We may think of each pair of oppositely oriented edges $(x, a, y)$ and $(y, a^{- 1}, x)$ as one non-oriented edge, so that $X$ becomes an ordinary graph with symmetric neighbourhood relation, but possibly multiple edges and loops. If it is in addition fully deterministic, then $X$ is a regular graph, that is, the number of outgoing edges (which coincides with the number of ingoing edges) at each vertex is $| Σ |$ . Attention: if we consider non-oriented edges, then each loop at $x$ has to be counted twice, since it corresponds to two oriented edges of the form $(x, a, x)$ and $(x, a^{- 1}, x)$ . For all our purposes it is natural to require that $X$ is connected: for any pair of vertices $x, y$ there is a path from $x$ to $y$ . The distance $d (x, y)$ is the minimum length (number of edges) of a path from $x$ to $y$ , which defines the integer-valued graph metric. A geodesic path is one whose length is the distance between its endpoints.

We select a finite, non-empty subset $F$ of $X$ and consider the balls $B (F, n) = {x : d (x, F) \leq n}$ (where $d (x, F) = min {d (x, y) : y \in F}$ ). If we delete $B (F, n)$ then the induced graph $X ∖ B (F, n)$ will fall apart into a finite number of connected components, called cones with respect to $F$ . Each cone is a labelled, symmetric graph $C$ with the boundary $\partial C$ consisting of all vertices $x$ in $C$ having a neighbour outside $C$ (i.e., in $B (F, n)$ ).

The following notion was introduced in [12] for symmetric, labelled graphs and $F = {o}$ .

Definition 4.1

The graph $X$ is called context-free with respect to $F$ if there is only a finite number of isomorphism types of the cones with respect to $F$ as labelled graphs with boundary.

This means that there are finitely many cones $C_{1}, \dots, C_{r}$ (generally with respect to different radii $n$ ) such that for each cone $C$ , we can fix a bijection $ϕ_{C}$ from (the vertex set of) $C$ to precisely one of the $C_{i}$ , this bijection sends $\partial C$ to $\partial C_{i}$ , and $(x, a, y)$ is an edge with both endpoints in $C$ if and only if its image $(ϕ_{C} (x), a, ϕ_{C} (y))$ is an edge of $C_{i}$ . In this case, we say that $C$ is a cone of type $i$ .

Generally, as in [12], we are interested in the case when $F = {o}$ (or any other singleton), but there is at least one point where it will be useful to admit arbitrary finite, non-empty $F$ .

Another natural notion of context-freeness of $X$ with respect to $o$ is to require that the language $L_{o, o} (X)$ is context-free. We shall see that for deterministic, symmetric graphs this is equivalent with context-freeness with respect to $o$ in the sense of Definition 4.1. One direction of this equivalence is practically contained in [12], but not stated explicitly except for the case of Cayley graphs of groups. The other direction (that context-freeness of $L_{o, o}$ implies that of the graph) is shown in [12] only for Cayley graphs of groups, which is substantially simpler than the general case treated below in Theorem 4.6.

Theorem 4.2

If the symmetric, labelled graph $(X, E, ℓ)$ with label alphabet $Σ$ is context-free with respect to the finite, non-empty set $F \subset X$ , then $L_{x, y}$ is a context-free language for all $x, y \in X$ . Furthermore, if the graph $X$ is deterministic, then so is the context-free language $L_{x, y}$ .

Proof

Just for the purpose of this proof, we write $x_{0}, y_{0}$ instead of $x, y$ for the vertices for which $L_{x_{0}, y_{0}}$ will be shown to be context-free. We may assume without loss of generality that $x_{0}, y_{0}$ in $F$ . Indeed, if this is not the case, then we can replace $F$ by $F^{'} = B (F, n)$ , which contains $x_{0}$ and $y_{0}$ when $n$ is sufficiently large. The cones with respect to $F^{'}$ are also cones with respect to $F$ , so that $X$ is also context-free with respect to $F^{'}$ .

Similarly to [12, Lemma 2.3], we construct a deterministic pushdown automaton that accepts $L_{x_{0}, y_{0}}$ .

We consider also the whole graph $X$ as a cone $C_{0}$ with boundary $F$ , which we keep apart from the other representatives $C_{1}, \dots, C_{r}$ of cones.

If $C$ is a cone, then as a component of $X ∖ B (F, n)$ for some $n \geq 0$ it must be a successor of another cone $C^{-}$ . The latter is the unique component of $X ∖ B (F, n - 1)$ that contains $C$ , when $n \geq 1$ , while it is $C_{0} = X$ when $n = 0$ . We also call $C^{-}$ the predecessor of $C$ .

Different cones of type $j \in {1, \dots, r}$ may have predecessors of different types. Conversely, a cone $C$ of type $i \in {0, \dots, r}$ may have none, one or more than one successors of type $j$ , and the number $d_{i, j}$ of those successors depends only on $i$ and $j$ . In the representative cone $C_{i}$ , we choose and fix a numbering of the distinct successors of type $j$ as $C_{i, j}^{k}$ , $k = 1, \dots, d_{i, j}$ . If $C$ is any cone with type $i$ then we use the isomorphism $ϕ_{C} : C \to C_{i}$ to transport this numbering to the successors of $C$ that have type $j$ , which allows us to identify the $k$ -th successor of $C$ with type $j$ .

One can visualize the cone structure by a finite, oriented graph $Γ$ with multiple edges and root 0: the vertex set is the set of cone types $i \in {0, \dots, r}$ , and there are $d_{i, j}$ oriented edges, which we denote by $t_{i, j}^{k}$ ( $k = 1, \dots, d_{i, j}$ ) from vertex $i$ to vertex $j$ ( $i \geq 0$ , $j \geq 1$ ).

Every vertex $x$ of $X$ belongs to the boundary of precisely one cone $C = C (x)$ with respect to $F$ . We define the type $i$ of $x$ as the type of $C (x)$ . Under the mapping $ϕ_{C}$ , our $x$ corresponds to precisely one element of $\partial C_{i}$ . We write $ϕ (x)$ for that element, without subscript $C$ , so that $ϕ$ maps $X$ onto $⋃_{i} \partial C_{i}$ . In particular, $ϕ (x) = x$ for every $x \in F$ .

Let $y \in X ∖ F$ with type $j$ . Then there is $i$ (depending on $y$ ) such that every neighbour $x$ of $y$ with $d (x, F) = d (y, F) - 1$ has type $i$ , and there is precisely one successor cone $C_{i, j}^{k}$ of $C_{i}$ that contains $ϕ_{C (x)} (y)$ . In this case, we write $τ (y) = t_{i, j}^{k}$ , the second order type of $y$ . Compare with [12]. If $y^{'}$ is such that $C (y^{'}) = C (y)$ then $τ (y^{'}) = τ (y)$ .

We now finally construct the required pushdown automaton $A$ . (Comparing with [12], we use more states and stack symbols, which facilitates the description.) The set of states and stack symbols are

$Q = ⨄_{i = 0}^{r} \partial C_{i} and Z = F \cup {t_{i, j}^{k} : i = 1, \dots, r, j = 0, \dots, r, k = 1, \dots, d_{i, j}} .$

(When $d_{i, j} = 0$ then there is no $t_{i, j}^{k}$ .) Note that both sets contain $F$ . In order to generate the language $L_{x_{0}, y_{0}}$ , where $x_{0}, y_{0} \in F$ , then we use $x_{0}$ as the initial state and $y_{0}$ as the (only) final state. We describe the transition function, which–like $Q$ and $Z$ –does not depend on $x_{0}, y_{0}$ .

We want to read an input word, which has to correspond to the label starting in $x_{0}$ . Inside the subgraph of $X$ induced by $F$ , our $A$ behaves just like that subgraph, seen as a finite automaton.

Outside of $F$ , it works as follows. At the $m$ -th step, the automaton will be in a state that describes the $m$ -th vertex, say $x$ , of that path, by identifying $x$ as above with the element $ϕ (x)$ of $C_{j}$ , where $j$ is the type of $x$ . The current stack symbol is of the form $t_{i, j}^{k}$ and serves to recall that $x$ lies in the $k$ -th successor cone of type $j$ of a cone with type $i$ . If the next vertex along the path, say $y$ , satisfies $d (y, F) = d (x, F) + 1$ , and $y$ has type $j^{'}$ then the state is changed to $ϕ (y) \in C_{j^{'}}$ , and the symbol $t_{j, j^{'}}^{k^{'}} = τ (y)$ is added to the stack. If $d (y, F) = d (x, F)$ , then only the state is changed from $ϕ (x)$ to $ϕ (y)$ . Finally, if $d (y, F) = d (x, F) - 1$ then the new state is again $ϕ (y)$ , while the top symbol on the stack is deleted. Formally, we get the following list of transition rules. If $x \in F = Q \cap Z$ :

$δ (x, a, x) = {(y, y) : (x, a, y) \in E, y \in F} \cup {(ϕ (y), x τ (y)) : (x, a, y) \in E, d (y, F) = 1} .$

If $x \in X ∖ F$ :

$δ (ϕ (x), a, τ (x)) = {(ϕ (y), a, τ (x) τ (y)) : (x, a, y) \in E, d (y, F) = d (x, F) + 1} \cup {(ϕ (y), τ (y) = τ (x)) : (x, a, y) \in E, d (y, F) = d (x, F)} \cup {(ϕ (y), ϵ) : (x, a, y) \in E, d (y, F) = d (x, F) - 1} .$

This is a finite collection of transitions, since $ϕ (\cdot)$ and $τ (\cdot)$ can take only finitely many different values.

In view of the above explanations, $A$ accepts $L_{x_{0}, y_{0}}$ . Also, when the graph $X$ is deterministic, then so is $A$ . □

Before proving a converse of Theorem 4.2, we first need some preliminaries, and start by recalling a fact proved in [11], [12], see also Woess [21] and Berstel and Boasson [2].

Lemma 4.3

If $L_{o, o}$ is context-free then there is a constant $M$ such that for each cone $C$ with respect to $o$ , one has $diam (\partial C) \leq M$ .

(The diameter is of course taken with respect to the graph metric.) We shall see below how to deduce this, but it is good to know it in advance.

A context-free grammar $C = (V, Σ, P, S)$ is said to have Chomsky normal form (CNF), if (i) every production rule is of the form $T ⊢ U \hat{U}$ or $T ⊢ a$ , where $U, \hat{U} \in V$ (not necessarily distinct), resp. $a \in Σ$ , and (ii) if $ϵ \in L (C)$ , then there is the rule $S ⊢ ϵ$ , and $S$ is not contained in the right hand side of any production rule.

With a slight deviation from [11], we associate with each $w = a_{1} \dots a_{n} \in L (C)$ , $n \geq 2$ a labelled (closed) polygon $P (w)$ with length $n + 1$ . As a directed graph, it has distinct vertices $t_{0}, t_{1}, \dots, t_{n}$ and labelled edges $(t_{i - 1}, a_{i}, t_{i})$ , $i = 1, \dots, n$ , plus the edge $(t_{0}, S, t_{n})$ . A (diagonal) triangulation of $P (w)$ is a plane triangulation of $P (w)$ obtained by inserting only diagonals. Here, we specify those diagonals as oriented, labelled edges $(t_{i}, T, t_{j})$ , where $t_{i}, t_{j}$ are not neighbours in $P (w)$ and $T \in V$ . Furthermore, we will never have two diagonals between the same pair of vertices of $P (w)$ . (If $| w | \leq 2$ we consider $P (w)$ itself triangulated.) The proof of the following Lemma may help to make the construction of [11] (used for Cayley graphs of groups) more transparent.

Lemma 4.4

If $C = (V, Σ, P, S)$ is in CNF and $w = a_{1} \dots a_{n} \in L (C)$ with $n \geq 2$ then there is a diagonal triangulation of $P (w)$ with the property that whenever $(t_{i}, T, t_{j})$ is a diagonal edge, then $T$ occurs in a derivation $S \overset{*}{⟹} w$ , $j - i \geq 2$ and $T \overset{*}{⟹} a_{i + 1} \dots a_{j}$ .

Proof

We start with a fixed derivation $S \overset{*}{⟹} w$ , and explain how to build up the triangles step by step. Suppose that $T \in V$ occurs in our derivation, and that we have a “sub-derivation” $T ⊢ U \hat{U} \overset{*}{⟹} a_{i + 1} \dots a_{k}$ , where $U, \hat{U} \in V$ . Then there is $j \in {i + 1, \dots, k - 1}$ such that $U \overset{*}{⟹} a_{i + 1} \dots a_{j}$ and $\hat{U} \overset{*}{⟹} a_{j + 1} \dots a_{k}$ . In this case, we draw a triangle with three oriented, labelled edges, namely the ‘old’ edge $(t_{i}, T, t_{k})$ and the two ‘new’ edges $(t_{i}, U, t_{j})$ and $(t_{j}, \hat{U}, t_{k})$ .

If we have the derivation $S \overset{*}{⟹} a_{1} \dots a_{n}$ , then it uses successive steps of the form $T ⊢ U \hat{U}$ with $U \hat{U} \overset{*}{⟹} a_{i + 1} \dots a_{k}$ as above. We work through these steps one after the other, starting with $S ⊢ T_{1} {\hat{T}}_{1}$ , where $T_{1} \overset{*}{⟹} a_{1} \dots a_{k}$ and ${\hat{T}}_{1} \overset{*}{⟹} a_{k + 1} \dots a_{n}$ . The first triangle has the ‘old’ edge $(t_{0}, S, t_{n})$ and the ‘new’ edges $(t_{0}, T_{1}, t_{k})$ and $(t_{k}, {\hat{T}}_{1}, t_{n})$ .

At any successive step, we take one of the ‘new’ edges $(t_{i}, T, t_{k})$ , where $k - i \geq 2$ and proceed as explained at the beginning, so that we add two ‘new’ edges that make up a triangle together with $(t_{i}, T, t_{k})$ , which is then declared ‘old’. We continue until all derivation steps of the form $T ⊢ U \hat{U}$ in our derivation $S \overset{*}{⟹} w$ are exhausted. At this point, we have obtained a tiling of triangles that constitute a diagonal triangulation of its outer polygon, whose edges have the form $(t_{0}, S, t_{n})$ and $(t_{i - 1}, U_{i}, t_{i})$ with $U_{i} \in V$ , $i = 1, \dots, n$ . The only steps of our derivation that we have not yet considered are the terminal ones $U_{i} ⊢ a_{i}$ . Thus, we conclude by replacing the label $U_{i}$ of $(t_{i - 1}, U_{i}, t_{i})$ by $a_{i}$ . □

The construction is best understood by considering an example: suppose our rightmost derivation is

S ⊢ T_{1} {\hat{T}}_{1} ⟹ T_{1} (T_{2} {\hat{T}}_{2}) ⟹ T_{1} (T_{2} (T_{3} {\hat{T}}_{3})) ⟹ T_{1} (T_{2} (T_{3} a_{6})) ⟹ T_{1} (T_{2} ((T_{4} {\hat{T}}_{4}) a_{6})) ⟹ T_{1} (T_{2} ((T_{4} a_{5}) a_{6})) ⟹ T_{1} (T_{2} ((a_{4} a_{5}) a_{6})) ⟹ T_{1} (a_{3} ((a_{4} a_{5}) a_{6})) ⟹ T_{1} (a_{3} ((a_{4} a_{5}) a_{6})) ⟹ (T_{5} {\hat{T}}_{5}) (a_{3} ((a_{4} a_{5}) a_{6})) ⟹ (T_{5} a_{2}) (a_{3} ((a_{4} a_{5}) a_{6})) ⟹ (a_{1} a_{2}) (a_{3} ((a_{4} a_{5}) a_{6})) .

(We have inserted the parentheses to make the rules that we used in each step more visible.) The associated triangulation is as in Fig. 2.

The variables of the terminal rules $T_{5} ⊢ a_{1}$ , ${\hat{T}}_{5} ⊢ a_{2}$ , $T_{2} ⊢ a_{3}$ , $T_{4} ⊢ a_{4}$ , ${\hat{T}}_{4} ⊢ a_{5}$ and ${\hat{T}}_{3} ⊢ a_{6}$ are not visible in this figure (but we might add them to the boundary edges). Apart from this, one can read the derivation $S \overset{*}{⟹} w$ from the diagonalization in a similar way as it can be read from the so-called derivation tree (see e.g. [6, Section 1.6] for the latter).

The following goes back to [11] in the case of (Cayley graphs of) finitely generated groups (recall from Lemma/Definition 2.1 that in case $X$ is deterministic and symmetric, if $x \in X$ and $w = a_{1} a_{2} \dots a_{n} \in Σ^{*}$ , then $x^{- w} = x^{a_{n}^{- 1} \dots a_{2}^{- 1} a_{1}^{- 1}} \in X$ denotes the initial vertex of the path $π$ in $X$ terminating at $x$ with label $ℓ (π) = w$ ).

Lemma 4.5

Let $C = (V, Σ, P, S)$ be in CNF and $L (C) = L_{x, y} (X)$ , where $X$ is a deterministic, symmetric graph. If $w = a_{1} \dots a_{n} \in L_{x, y} (X)$ and $(t_{i}, T, t_{j})$ is a diagonal edge in a triangulation of $P (w)$ as in Lemma 4.4, then the vertices $\bar{x} = x^{a_{1} \dots a_{i}}$ and $\bar{y} = x^{a_{1} \dots a_{j}}$ of $X$ satisfy $d (\bar{x}, \bar{y}) \leq m (T)$ , where

$m (T) = min {| w | : w \in L_{T}} .$ (2)

Proof

Since $X$ is deterministic, Lemma 2.3 implies that $π_{x} (w)$ exists as the unique path with initial vertex $x$ and label $w$ . In particular, $\bar{x}$ and $\bar{y}$ lie on that path. Furthermore, we have $\bar{y} = y^{- a_{j + 1} \dots a_{n}}$ .

Now let $v \in L_{T}$ with $| v | = m (T)$ . Then by Lemma 4.4, $T$ arises in a derivation $S \overset{*}{⟹} a_{1} \dots a_{i} T a_{j + 1} \dots a_{n} \overset{*}{⟹} w$ . But then we also have $S \overset{*}{⟹} a_{1} \dots a_{i} v a_{j + 1} \dots a_{n}$ , a word in $L_{x, y}$ . By Lemma 2.3, again using that $X$ is symmetric and deterministic, ${\bar{x}}^{v} = y^{- a_{j + 1} \dots a_{n}} = \bar{y}$ . Therefore, $\bar{x}$ and $\bar{y}$ are connected by a path with label $v$ . Its length is $m (T)$ . □

Theorem 4.6

Let $(X, E, ℓ)$ be a fully deterministic, symmetric graph with label alphabet $Σ$ and root $o$ . If $L_{o, o}$ is a context-free language, then $X$ is a context-free graph with respect to $o$ , and in particular, $L_{o, o}$ is deterministic.

Proof

There is a reduced grammar $C = (V, Σ, P, S)$ in CNF that generates $L_{o, o}$ . Each of the languages $L_{T}$ , $T \in V$ , is non-empty, only $L_{S}$ contains $ϵ$ , and we define

$m = max {m (T) : T \in V},$ (3)

where $m (T)$ is as in (2).

Let $C$ be a cone with respect to $o$ such that $k = d (o, \partial C) > m$ .

Construction of $\tilde{D} (C)$ . We define $D (C)$ as the subgraph of $X$ induced by all vertices $y \in X$ with

$d (o, x) = d (o, y) + d (x, y) and d (x, y) \leq m for some x \in \partial C .$

In particular, $y$ lies on some geodesic path from $o$ to $\partial C$ .

Now let $x_{1}, x_{2} \in \partial C$ , and consider some path $π \in Π_{x_{1}, x_{2}} (C)$ (i.e., it lies in $C$ ). Choose a geodesic path $π_{1}$ from $o$ to $x_{1}$ and a geodesic path $π_{2}$ from $x_{2}$ to $o$ . Then we can concatenate the three paths to a single path $π_{1} π π_{2} \in Π_{o, o}$ . Its label is the word $w = ℓ (π_{1}) ℓ (π) ℓ (π_{2}) \in L_{o, o}$ . Set $n = | w |$ and write

$w = (a_{1} \dots a_{k}) (a_{k + 1} \dots a_{n - k}) (a_{n - k + 1} \dots a_{n})$

where the 3 pieces in the parentheses are (in order) $ℓ (π_{1})$ , $ℓ (π)$ and $ℓ (π_{2})$ . The words $ℓ (π_{1})$ , $ℓ (π)$ and $ℓ (π_{2}) S$ are the labels of three consecutive arcs that fill the boundary of the polygon $P (w)$ . (To be precise, along the last edge of the $3^{rd}$ arc, we are reading the label $S$ in the reversed direction.) By [11, Lemma 5], its triangulation has a triangle which meets each of those arcs. (It may also occur that one corner of the triangle meets two arcs.) Thus, there are $i \in {0, \dots, k}$ and $i^{'} \in {k, \dots, n - k}$ such that the vertices $t_{i}$ and $t_{i^{'}}$ of $P (w)$ lie on that triangle. They correspond to the vertices $y_{1} = o^{a_{1} \dots a_{i}}$ and $y^{'} = o^{a_{1} \dots a_{i^{'}}}$ of $X$ . We either have $i^{'} - i \leq 1$ , or else a diagonal $(t_{i}, U, t_{i^{'}})$ is a side of our triangle. By Lemma 4.5, we get $d (y_{1}, y^{'}) \leq m (U) \leq m$ . Thus $k \leq i^{'} \leq d (o, y^{'}) \leq i + m$ , that is, $i \geq k - m > 0$ . In particular, $t_{i}$ does not lie on the third arc. In the same way, there is $j \in {n - k, \dots, n - k + m}$ (and not larger) such that $t_{j}$ is a corner of our triangle. This yields that there must be a “true” diagonal $(t_{i}, T, t_{j})$ of $P (w)$ . We set $v_{1} = a_{i + 1} \dots a_{k}$ and $v_{2} = a_{n - k + 1} \dots a_{j}$ , so that $x_{1} = y_{1}^{v_{1}}$ , and let $y_{2} = x_{2}^{a_{n - k + 1} \dots a_{j}}$ . The points $y_{1}$ and $y_{2}$ are in $D (C)$ , and by Lemma 4.4, $T \overset{*}{⟹} v_{1} ℓ (π) v_{2}$ .

[It is here that we can see Lemma 4.3, since we deduced that $d (x_{1}, x_{2}) \leq 3 m$ for all $x_{1}, x_{2} \in \partial C$ .]

By Lemma 4.4, we also have

$S \overset{*}{⟹} a_{1} \dots a_{i} T a_{j + 1} \dots a_{n},$

so that $v \in L_{T}$ implies $a_{1} \dots a_{i} v a_{j + 1} \dots a_{n} \in L_{o, o}$ and consequently $v \in L_{y_{1}, y_{2}}$ , that is, $y_{1}^{v} = y_{2}$ .

We now insert into $D (C)$ the additional labelled edge $(y_{1}, v_{1} T v_{2}, y_{2})$ , whose label is the word $v_{1} T v_{2} \in Σ^{*} V Σ^{*}$ . We insert all diagonals of the same type that can be obtained in the same way, and write $\tilde{D} (C)$ for the resulting “edge-enrichment” of $D (C)$ .

Subsuming, we have an edge $(y_{1}, v_{1} T v_{2}, y_{2})$ in $\tilde{D} (C)$ if and only if the following properties hold.

•
$| v_{i} | \leq m$ ( $i = 1, 2$ ) and $T \in V$ ,

•
the path with label $v_{1}$ starting at $y_{1}$ and ending at $x_{1} = y_{1}^{v_{1}} \in \partial C$ is part of a geodesic from $o$ to $x_{1}$ ,

•
the path with label $v_{2}$ starting at $x_{2} = y_{2}^{- v_{2}} \in \partial C$ and ending at $y_{2}$ is part of a geodesic from $x_{2}$ to $o$ , and

•
there is a path $π$ in $C$ from $x_{1}$ to $x_{2}$ such that $T \overset{*}{⟹} v_{1} ℓ (π) v_{2}$ ,

•
if $T \overset{*}{⟹} v \in Σ^{*}$ then $v$ is the label of a path in $Π_{y_{1}, y_{2}}$ .

Now, there are only finitely many cones $C$ with respect to $o$ with $d (\partial C, o) \leq m$ . On the other hand, for all cones $C$ with $d (\partial C, o) \geq m$ , there is a bound on the number of vertices of $\tilde{D} (C)$ , as well as on the number of possible labels on its edges. In particular, there are only finitely many possible isomorphism types of the labelled graphs $(\tilde{D} (C), \partial C)$ with “marked” boundary $\partial C \subset \tilde{D} (C)$ .

We now suppose that $C$ and $C^{'}$ are two cones at distance $\geq m$ from $o$ , such that $(\tilde{D} (C), \partial C)$ and $(\tilde{D} (C^{'}), \partial C^{'})$ are isomorphic. We claim that $C$ and $C^{'}$ are isomorphic, and this will conclude the proof that there are only finitely many isomorphism types of cones with respect to $o$ .

Let $ϕ : \tilde{D} (C) \to \tilde{D} (C^{'})$ be an isomorphism with $ϕ (\partial C) = \partial C^{'}$ , and $ϕ^{'}$ its inverse mapping. We extend $ϕ$ to a mapping from $C$ to $C^{'}$ , also denoted $ϕ$ .

Claim 1

Let $x \in \partial C$ and $v \in Σ^{+}$ such that the path $π_{x} (v)$ lies in $C$ and meets $\partial C$ only in its initial point $x$ . Then the path $π_{x^{'}} (v)$ lies in $C^{'}$ and meets $\partial C^{'}$ only in its initial point $x^{'} = ϕ (x) \in \partial C^{'}$ .

Proof

If $a$ is the initial letter of $v$ then (always using the notation of Definition 1.1) the first edge of $π_{x} (v)$ is $(x, a, x^{a})$ . We now consider the path $π_{x^{'}} (v)$ with label $v$ starting at $x^{'} \in \partial C^{'}$ . We first claim that the latter lies in $C^{'}$ and only its initial point $x^{'}$ is in $\partial C^{'}$ . Let $(x^{'}, a, {(x^{'})}^{a})$ be the first edge of the path. Then ${(x^{'})}^{a}$ cannot lie in $\tilde{D} (C^{'})$ , since otherwise $(x, a, x^{a}) = (ϕ^{'} (x^{'}), a, ϕ^{'} {(x^{'})}^{a})$ would be an edge in $\tilde{D} (C)$ , a contradiction. Thus, the path $π_{x^{'}} (v)$ goes at least initially into $C^{'} ∖ \partial C$ .

So now suppose that $π_{x^{'}} (v)$ ever returns to $\partial C^{'}$ , and let $π^{'}$ be its initial part up to the first return. Then $v^{'} = ℓ (π_{x^{'}} (v))$ is an initial part of $v$ with $| v^{'} | \geq 2$ , and $π^{'}$ is a path within $C^{'}$ from $x_{1}^{'} = x^{'}$ to $x_{2}^{'} = {(x^{'})}^{v^{'}} \in \partial C^{'}$ . But then, by construction, $\tilde{D} (C^{'})$ must contain an edge $(y_{1}^{'}, v_{1} T v_{2}, y_{2}^{'})$ such that $x_{1}^{'} = {(y_{1}^{'})}^{v_{1}}$ , $y_{2}^{'} = {(x_{2}^{'})}^{v_{2}}$ , and $T \overset{*}{⟹} v_{1} v^{'} v_{2}$ . Using the isomorphism $ϕ^{'} : \tilde{D} (C^{'}) \to \tilde{D} (C)$ , we set $y_{i} = ϕ^{'} (y_{i}^{'})$ , $i = 1, 2$ , and $x_{2} = ϕ^{'} (x_{2}^{'}) \in \partial C$ . We have of course $x_{1} = ϕ^{'} (x_{1}^{'})$ . Now we must have the edge $(y_{1}, v_{1} T v_{2}, y_{2})$ in $\tilde{D} (C)$ . But then $v_{1} v^{'} v_{2} \in L_{y_{1}, y_{2}}$ , and consequently $v^{'} \in L_{x_{1}, x_{2}}$ , that is, $x_{1}^{v^{'}} \in \partial C$ . But this contradicts the fact that $π_{x} (v)$ meets $\partial C$ only in its initial point. We conclude that also the path $π_{x^{'}} (v)$ lies in $C^{'}$ and meets $\partial C^{'}$ only in its initial point, and Claim 1 is verified. □

Now let $z \in C ∖ \partial C$ . Then there are $x \in \partial C$ and $v \in Σ^{+}$ such that $z = x^{v}$ and the path $π_{x} (v)$ from $x$ to $z$ meets $\partial C$ only in its initial point $x$ . By Claim 1, the analogous statement holds for the path $π_{x^{'}} (v)$ in $C^{'}$ , where $x^{'} = ϕ (x)$ . The only choice is to define $ϕ (z) = z^{'} = {(x^{'})}^{v}$ , which lies in $C^{'} ∖ \partial C^{'}$ as required. We have to show that $ϕ$ is well-defined. This will follow from the next claim.

Claim 2

Let $x_{1}, x_{2} \in \partial C$ , $v, w \in Σ^{+}$ such that the paths $π_{x_{1}} (v)$ and $π_{x_{2}} (w)$ lie in $C$ , meet $\partial C$ only in their initial points and end at the same point of $C ∖ \partial C$ . Then, setting $x_{i}^{'} = ϕ (x_{i})$ , also $π_{x_{1}^{'}} (v)$ and $π_{x_{2}^{'}} (w)$ end at the same point of $C^{'} ∖ \partial C^{'}$ .

Proof

Let $w^{- 1}$ be the “inverse” of $w$ , as defined in Definition 1.1. Then $x_{2}^{- w^{- 1}} = x_{2}^{w}$ , and $v w^{- 1}$ is the label of the path from $x_{1}$ to $x_{2}$ that we obtain by first following $π_{x_{1}} (v)$ and then the “inverse” of $π_{x_{2}} (w)$ . It lies entirely in $C$ , and only its endpoints are in $\partial C$ . By construction, $\tilde{D} (C)$ has an edge $(y_{1}, v_{1} T v_{2}, y_{2})$ such that $y_{1}^{v_{1}} = x_{1}$ , $x_{2}^{v_{2}} = y_{2}$ and $T \overset{*}{⟹} v_{1} v w^{- 1} v_{2}$ . We set $y_{i}^{'} = ϕ (y_{i})$ , $i = 1, 2$ . Then $(y_{1}^{'}, v_{1} T v_{2}, y_{2}^{'})$ is an edge of $\tilde{D} (C^{'})$ . Therefore $v_{1} v w^{- 1} v_{2} \in L_{y_{1}^{'}, y_{2}^{'}}$ . But this implies that $v w^{- 1}$ is the label of a path from $x_{1}^{'}$ to $x_{2}^{'}$ , and we know from Claim 1 that it lies in $C$ and has only its endpoints in $\partial C$ . Thus ${(x_{1}^{'})}^{v} = {(x_{2}^{'})}^{- w^{- 1}} = {(x_{2}^{'})}^{w}$ , and Claim 2 is true. □

Thus, $ϕ$ is well defined, and the same works of course also for $ϕ^{'}$ by exchanging the roles of $C$ and $C^{'}$ .

Claim 3

The map $ϕ : C \to C^{'}$ is bijective.

Proof

We know that $ϕ : \partial C \to \partial C^{'}$ is bijective and that $ϕ (C ∖ \partial C) \subset C^{'} ∖ \partial C$ . Let $z \in C ∖ \partial C$ , and let $x \in \partial C$ , $v \in Σ^{+}$ such that $π_{x} (v)$ is a path from $x$ to $z$ that intersects $\partial C$ only at the initial point. Setting $x^{'} = ϕ (x)$ , $z^{'} = ϕ (z)$ , we know from the construction of $ϕ$ and Claim 1 that $π_{x^{'}} (v)$ is a path in $C^{'}$ from $x^{'}$ to $z^{'}$ that meets $\partial C^{'}$ only in its initial point. Now the way how $ϕ^{'}$ is constructed yields that $ϕ^{'} (z^{'}) = z$ . Therefore $ϕ^{'} ϕ$ is the identity on $C$ . Exchanging roles, we also get the $ϕ ϕ^{'}$ is the identity on $C^{'}$ . This proves Claim 3. □

It is now immediate from the construction that $ϕ$ also preserves the edges and their labels, so that it is indeed an isomorphism between the labelled graphs $C$ and $C^{'}$ that sends $\partial C$ to $\partial C^{'}$ . This concludes the proof of Theorem 4.6. □

[12, Cor. 2.7] says that if a symmetric labelled graph is context-free with respect to one root $o$ , then it is context-free with respect to any other vertex chosen as the root $x$ . In view of Theorem 4.2, Theorem 4.6, this is also obtained from the following, when the graph is fully deterministic.

Corollary 4.7

Let $(X, E, ℓ)$ be a fully deterministic, strongly connected graph with label alphabet $Σ$ . If $L_{o, o}$ is context-free then $L_{x, y}$ is deterministic context-free for all $x, y \in X$ .

Theorem 4.2, Theorem 4.6, together with Lemma 3.1 also imply the following.

Corollary 4.8

Let $G$ be a finitely generated group and $K$ a subgroup.

(a)
The pair $(G, K)$ is context-free if and only if for any symmetric $ψ : Σ \to G$ , the Schreier graph $X (G, K, ψ)$ is a context-free graph. In this case, the language $L (G, K, ψ)$ is deterministic for every (not necessarily symmetric) semigroup presentation $ψ : Σ \to G$ .

(b)
If $(G, K)$ is context-free, then also $(G, g^{- 1} K g)$ is context-free for every $g \in G$ .

Proof

(a) is clear. Regarding (b), for the Schreier graph $X (G, K, ψ)$ , we have $L (G, K, ψ) = L_{o, o}$ and $L (G, g^{- 1} K g, ψ) = L_{x, x}$ with $x = K g$ , $g \in G$ . Thus, the statement follows from Corollary 4.7. □

Lemma 4.9

Let $G$ be a finitely generated group and $K, H$ be subgroups with $K \leq H$ and $[H : K] < \infty$ .

If $(G, K)$ is context-free then also $(G, H)$ is context-free.

Proof

In the context-free graph $X (G, K, ψ)$ , consider the finite set of vertices $F = {K h : h \in H}$ , containing the root vertex $o = o_{K} = K$ . Then $L (G, H, ψ) = ⋃_{x \in F} L_{o, x}$ is a finite (disjoint) union of context-free languages. Therefore it is context-free by standard facts. □

Remark 4.10

In terms of Schreier graphs, we have the mapping $K g \mapsto H g$ which is a homomorphism of labelled graphs from $X = X (G, K, ψ)$ onto $Y = X (G, H, ψ)$ which is finite-to-one. The lemma says that in this situation, if $X$ is a context-free graph then so is $Y$ . We do not see an easy direct proof of this fact in terms of graphs, the main problem being how the homomorphism $X \to Y$ interacts with the isomorphisms between the cones of $X$ with respect to the set $F$ . On the other hand, reformulating this in terms of the associated “path languages” with the help of Theorem 4.2, Theorem 4.6, it has become straightforward.

The converse of Lemma 4.9 is not true, that is, when $(G, H)$ is context-free and $[H : K] < \infty$ then $(G, K)$ is not necessarily context-free. See Example 5.6 in the last section. However, we have the following.

Lemma 4.11

If $K$ is a finite subgroup of $G$ then $(G, K)$ is context-free if and only if $G$ is a context-free (i.e. virtually free) group.

Proof

Fix $Σ$ and $ψ$ . Let $X = X (G, ψ)$ be the associated Cayley graph of $G$ , and $Y = X (G, K, ψ)$ . We let $o$ be the root of $Y$ , that is, $o = K 1_{G}$ as an element of $Y$ (a coset). The group $K$ acts on $X$ by automorphisms of that labelled graph. It leaves the set $F = K$ (now as a set of vertices of $X$ ) invariant. The factor graph of $X$ by this action is $Y$ . Write $π$ for the factor mapping. It is $| K |$ -to-one. Each cone of $X$ with respect to $F$ is mapped onto a cone of $Y$ with respect to $o$ , and this mapping sends boundaries of cones of $X$ to boundaries of cones of $Y$ . By assumption, $Y$ is a context-free graph. By Lemma 4.3, there is an upper bound on the number of elements in the latter boundaries. Therefore there also is an upper bound on the number of elements of any of the boundaries of the cones of $X$ with respect to $F$ .

Without going here into the details of the definition of the space of ends of $X$ , we refer to the terminology of Thomassen and Woess [19] and note that the above implies that all ends of $X$ are thin. But then, as proved in [19], $G$ must be a virtually free group. □

One should not tend to believe that in the situation of the last lemma, the Cayley graphs of $G$ are quasi-isometric with the Schreier graphs of $(G, K)$ . As a simple counter-example, take for $G$ the infinite dihedral group $〈 a, b ∣ a^{2} = b^{2} 〉$ and for $K$ the 2-element subgroup generated by $a$ .

5. Covers and Schreier graphs

We assume again that $(X, E, ℓ)$ is symmetric and fully deterministic. Recall the involution $a \mapsto a^{- 1} \neq a$ of $Σ$ . A word in $Σ^{*}$ is called reduced if it contains no subword of the form $a a^{- 1}$ , where $a \in Σ$ . We write $T_{Σ}$ for the set of all reduced words in $Σ^{*}$ . We can equip $T_{Σ}$ with the structure of a labelled graph, whose edges are of the form

(v, a, w) and (w, a^{- 1}, v), where v, w \in T_{Σ}, a \in Σ, v a = w .

(4)

Thus, the terminal letter of $v$ must be different from $a^{- 1}$ . Then $T_{Σ}$ is fully deterministic, and it is a tree, that is, it has no closed path whose label is a (non-empty) reduced word. As the root of $T_{Σ}$ , we choose the empty word $ϵ$ . Then $T_{Σ}$ is the universal cover of $X$ . Namely, if we choose (and fix) any vertex $o \in X$ as the root, then the mapping

Φ : T_{Σ} \to X, Φ (w) = o^{w},

(5)

is a covering map: it is a surjective homomorphism between labelled graphs which is a local isomorphism, that is, it is one-to-one between the sets of outgoing (resp. ingoing) edges of any element $w \in T_{Σ}$ and its image $Φ (w)$ . (Note that this allows the image of an edge to be a loop.) “Universal” means that it covers every other cover of $X$ , but this is not very important for us. The property of $w \in T_{Σ}$ to be reduced is equivalent with the fact that the path $π_{o} (w)$ in $X$ is non-backtracking, that is, it does not contain two consecutive edges which are the reversal of each other.

We now realize that $T_{Σ}$ is the standard Cayley graph of the free group $F_{Σ}$ , where $Σ$ is the set of free generators together with their inverses. The group product is the following: if $v, w \in T_{Σ} \equiv F_{Σ}$ , then $v \cdot w$ is obtained from the concatenated word $v w$ by step after step deleting possible subwords of the form $a a^{- 1}$ that can arise from that concatenation. The group identity is $ϵ$ , and the inverse of $w$ is $w^{- 1}$ as at the end of Definition 1.1. With $Φ$ as in (5), let

K = K (X) = Φ^{- 1} (o) = {w \in T_{Σ} : π_{o} (w) is a closed path from o to o in X} .

(6)

Then, under the indentification $T_{Σ} \equiv F_{Σ}$ , we clearly have that $K$ is a subgroup of $F_{Σ}$ . The following is known, see e.g. Lyndon and Schupp [10, Ch. III] or (our personal source) Imrich [8].

Proposition 5.1

The graph $X$ is the Schreier graph of the pair of groups $(F_{Σ}, K (X))$ with respect to the semigroup presentation $ψ$ given by $ψ (a) = a$ , $a \in Σ$ .

In $ψ (a) = a$ , we interpret $a$ simultaneously as a letter from the alphabet and as a generator of the free group.

Thus, in reality the study of context-free pairs of groups is the same as the study of fully deterministic, symmetric context-free graphs under a different viewpoint.

The same is not true without assuming symmetry. Indeed, given a semigroup presentation $ψ$ of $G$ , for every $a \in Σ$ there must be $w_{a} \in Σ^{*}$ such $ψ (w_{a}) = ψ {(a)}^{- 1}$ , the inverse in $G$ . But then in the Schreier Graph $X (G, K, ψ)$ , for any subgroup $K$ of $G$ , we have the following: if $(x, a, y) \in E$ then $y^{w_{a}} = x$ , that is, there is the oriented path from $y$ to $x$ with label $w_{a}$ . In a general fully deterministic graph this property does not necessarily hold, even if it has the additional property that for each $a \in Σ$ , there is precisely one incoming edge with label $a$ at every vertex. As an example, consider $X = {x, y, z}$ , $Σ = {a, b}$ and labelled edges $(x, a, y), (x, b, y), (y, a, z), (y, b, x), (z, a, x), (z, b, z)$ .

We return to the situation of Proposition 5.1. As a subgroup of the free group, the group $K (X)$ is itself free. There is a method for finding a set of free generators. First recall the notion of a spanning tree of $X$ . This is a tree $T$ , which as subgraph of $X$ is obtained by deleting edges (but no vertices) of $X$ . Every connected (non-oriented) graph has a spanning tree, for locally finite graphs it can be constructed inductively. Now let $T$ be a spanning tree of $X$ , and consider all edges of $X$ that are not edges of $T$ . They must come in pairs $(e, e^{- 1})$ . For each pair, we choose one of the two partner edges, and we write $E_{0}$ for the chosen (oriented) edges. For each $e \in E_{0}$ , we choose non-backtracking paths in $T$ from $o$ to $e^{-}$ and from $e^{+}$ to $o$ . Together with $e$ (in the middle), they give rise to a non-backtracking path in $X$ that starts and ends at $o$ . Let $w (e)$ be the label on that path. Then the following holds [10], [8].

Proposition 5.2

As elements of $F_{Σ}$ , the $w (e)$ , $e \in E_{0}$ , are free generators of $K (X)$ .

Corollary 5.3

Let $G$ be a virtually free group and $K$ a finitely generated subgroup. Then $(G, K)$ is context-free.

Proof

Let $F = F_{Σ}$ be a free subgroup of $G$ of finite index. Then $K = K \cap F$ is a free subgroup of $K$ with $[K : K] < \infty$ . Since $K$ is finitely generated, also $K$ is finitely generated. In the Schreier graph $X$ of $(F, K)$ with respect to the standard labelling by $Σ$ , choose a spanning tree and remaining set $E_{0}$ of edges, as described above. Since all sets of free generators of $K$ must have the same cardinality, $E_{0}$ is finite. Thus, $X$ is obtained by adding finitely many edges to a tree. If $o$ is the root vertex of $X$ and $n$ is the largest distance between $o$ and an endpoint of some edge in $E_{0}$ , then every cone $C$ of $X$ with $d (\partial C, o) > n$ is a rooted, labelled tree that is isomorphic to one of the cones of $T_{Σ}$ . Thus, the Schreier graph, resp. $(F, K)$ are context-free. It now follows from Proposition 3.3 and Lemma 4.9 that also $(G, K)$ is context-free. □

We remark here that one can always reduce the study of context-free pairs to free groups and their subgroups. Given $(G, K)$ , let $F$ be a finitely generated free group that maps by a homomorphism onto $G$ . Let $K$ be the preimage of $K$ under that homomorphism. Then clearly $(G, K)$ is context-free if and only $(F, K)$ has this property. (This reduction, however, is not very instructive.)

Of course, there are context-free pairs with $G$ free beyond the situation of Corollary 5.3.

Example 5.4

Consider the free group $F = 〈 a, b ∣ 〉$ and the subgroup $K$ with the infinite set of free generators ${a^{k} b^{l} a b^{- l} a^{- k} : k, l \in Z, l \neq 0}$ . The associated Schreier graph with respect to ${a^{\pm 1}, b^{\pm 1}}$ is the comb lattice.

Its vertex set is the set of integer points in the plane. The edges labelled by $a$ are along the $x$ -axis, from $(k, 0)$ to $(k + 1, 0)$ , and there is a loop with label $a$ at each point $(k, l)$ with $l \neq 0$ . The edges labelled by $b$ are all the upward edges of the grid, that is, all edges from $(k, l)$ to $(k, l + 1)$ , where $(k, l) \in Z^{2}$ . To these, we have to add the oppositely oriented edges whose labels are the respective inverses (in Fig. 3, the oppositely oriented edges together with the corresponding labels are omitted for simplicity). The comb lattice is clearly a context-free graph (tree).

Fig. 3 — The comb lattice described in Example 5.4.

We proceed giving some simple examples. It is very easy to see that context-freeness is not “transitive” in the following sense: if $(G, H)$ and $(H, K)$ are context-free (with $G, H$ finitely generated and $K \leq H \leq G$ ) then in general $(G, K)$ will not be context-free.

Example 5.5

Let $G = Z^{2}$ , $H = Z \times {0} ≅ Z$ and $K = {(0, 0)}$ . Then $H$ (i.e., $(H, K)$ ) is context-free. Of course, this also holds for $(G, H)$ , whose Schreier graphs are just the Cayley graphs of $Z$ . But $Z^{2}$ (i.e., $(G, K)$ ) is not context-free.

This also shows that the converse of Lemma 3.1 does not hold in general (while we know that it does hold when $[G : H] < \infty$ ). Finally, we construct examples of three groups $K \leq H \leq G$ , where $(G, H)$ is context-free, $[H : K] < \infty$ , and $(G, K)$ is not context-free.

Example 5.6

We construct a family of fully deterministic, symmetric labelled graphs $X_{W}$ , $W \subset Z$ (non-empty), and one such graph $Y$ , so that $Y$ is the factor graph with respect to the action of a 2-element group of automorphisms of each of the labelled graphs $X_{W}$ . While $Y$ will be a context-free graph, many of the graphs $X_{W}$ in our family are not context-free. We then translate this back into the setting of pairs of groups.

The vertex set of $X_{W}$ is $Z \times {0, 1}$ . The set of labels is $Σ = {a, b, a^{- 1}, b^{- 1}}$ . The edges are as follows:

$((k, 0), a, (k + 1, 0)) and ((k, 1), a, (k + 1, 1)) for all k \in Z,$

$((k, 0), b, (k + 1, 0)) and ((k, 1), b, (k + 1, 1)) for all k \in Z ∖ W, and$

$((k, 0), b, (k + 1, 1)) and ((k, 1), b, (k + 1, 0)) for all k \in W .$

The reversed edges carry the respective inverse labels (in Fig. 4, these reversed edges together with the corresponding labels are omitted for simplicity). Since $W \neq 0̸$ , there is at least one of the “crosses” (pair of the third type of edges). Therefore $X_{W}$ is connected. In general, it does not have finitely many cone types, i.e., it is not context-free. For example, it is not context-free when $W = {k (| k | + 1) : k \in Z}$

For arbitrary $W$ , the two-element group that exchanges each $(k, 0)$ with $(k, 1)$ acts on $X_{W}$ by label preserving graph automorphisms. The factor graph $Y$ (see Fig. 5) has vertex set $Z$ and edges

$(k, a, k + 1) and (k, b, k + 1) for all k \in Z,$

plus the associated reversed edges (in Fig. 5, these edges together with the corresponding labels are omitted for simplicity). It is clearly a context-free graph.

Now let $F = F_{Σ}$ be the free group (universal cover of $X_{W}$ and $Y$ ), and for given $W$ , let $K_{W}$ be the fundamental group of $X_{W}$ at the vertex $(0, 0)$ . Furthermore, let $K$ be the fundamental group of $Y$ at the vertex 0. Then it is straightforward that $K_{W}$ has index 2 in $K$ . The mapping $ψ$ is the embedding of $Σ$ into $F_{Σ}$ , as above. We then have $Y = X (F, K, ψ)$ and $X_{W} = X (F, K_{W}, ψ)$ , providing the required example.

Fig. 4 — The fully deterministic, symmetric labelled graph $X_{W}$ , with $W \subset Z$ described in Example 5.6 (here $W = {0, 1, - 3, \dots}$ ). The reverse edges, together with the corresponding labels, are omitted for simplicity.

Fig. 5 — The factor graph $Y$ of the graph $X_{W}$ from Fig. 4 (cf. Example 5.6). The reverse edges, together with the corresponding labels, are omitted for simplicity.

Example 5.7

At the end of the Introduction, we mentioned the possible interplay with ends. The number of ends $e (X)$ of a symmetric, connected graph is the supremum of the number of connected components of the complement of any finite subgraph. Via Stallings’ [17] celebrated structure theorem, ends of groups (i.e., ends of Cayley graphs) are closely related with amalgamated free products and HNN-extensions. Thus, it is natural to ask the following question.

Let $(G_{1}, K)$ and $(G_{2}, K)$ be two context-free pairs of groups sharing the same subgroup $K$ . Let $G = G_{1} *_{K} G_{2}$ be the amalgamated free product of $G_{1}$ and $G_{2}$ over the group $K$ . Is it then true that $(G, K)$ is context-free ? When $K$ is finite, the answer is of course “yes”, because then $G_{1}, G_{2}$ and $G$ are virtually free. When $K$ is infinite, we have a counter-example. Here is a brief outline.

Let $G = 〈 a_{1}, a_{2}, b_{1}, b_{2} ∣ [a_{1}, b_{1}] [a_{2}, b_{2}] 〉$ be the fundamental group of an orientable surface of genus 2. Let $K$ be the infinite cyclic subgroup generated by the commutator $[a_{1}, b_{1}] = {[a_{2}, b_{2}]}^{- 1}$ , and for $i = 1, 2$ , let $G_{i}$ be the free group with free generators $a_{i}$ and $b_{i}$ . Then $G$ is the amalgamated free product of $G_{1}$ and $G_{2}$ over $K$ .

By Corollary 5.3, the pairs $(G_{1}, K)$ and $(G_{2}, K)$ are context-free. But $(G, K)$ is not context-free. Indeed, let $X$ be the Schreier graph of $(G, K)$ with respect to the above generators and their inverses. It has two ends, see e.g. the outline in the Introduction of [14]. Thus, there is a finite subgraph $F$ of $X$ such that $X ∖ B (F, n)$ has exactly two infinite cones for any $n$ . If $X$ were context-free, then the finite upper bound on the number of boundary elements of any cone would yield that $X$ has linear growth, that is $| B (F, n) | \leq C \cdot n$ for all $n$ . This contradicts the fact that $G$ , as well as the Schreier graphs of $(G_{1}, K)$ and $(G_{2}, K)$ , have exponential growth.

Acknowledgements

The authors are grateful to Wilfried Imrich, Rögnvaldur G. Möller and Michah Sageev for useful hints and discussions.

The first author was partially supported by a visiting professorship at TU Graz. The second author was partially supported by a visiting professorship at Università di Roma - La Sapienza and the Austrian Science Fund project FWF-P19115-N18.

Contributor Information

Tullio Ceccherini-Silberstein, Email: tceccher@mat.uniroma3.it.

Wolfgang Woess, Email: woess@TUGraz.at.

References

1.Anisimov A.V. Group languages. Kibernetika. 1971;4:18–24. [Google Scholar]
2.Berstel J., Boasson L. vol. B. Elsevier; Amsterdam: 1990. Context-free languages; pp. 59–102. (Handbook of Theoretical Computer Science). [Google Scholar]
3.Ceccherini-Silberstein T., Woess W. Growth and ergodicity of context-free languages. Trans. Amer. Math. Soc. 2002;354:4597–4625. [Google Scholar]
4.Dunwoody M.J. The accessibility of finitely presented groups. Invent. Math. 1985;81:449–457. [Google Scholar]
5.Frougny Ch., Sakarovitch J., Schupp P. Finiteness conditions on subgroups and formal language theory. Proc. London Math. Soc. 1989;58:74–88. [Google Scholar]
6.Harrison M.A. Addison-Wesley; Reading, MA: 1978. Introduction to Formal Language Theory. [Google Scholar]
7.Holt D., Rees S., Röver C., Thomas R. Groups with context-free co-word problem. J. London Math. Soc. 2005;71:643–657. [Google Scholar]
8.Imrich W. Combinatorial Mathematics V. Vol. 622. Springer; Berlin: 1977. Subgroup theorems and graphs. (Lecture Notes in Mathematics). [Google Scholar]
9.Lehnert J., Schweitzer P. The co-word problem for the Higman–Thompson group is context-free. Bull. Lond. Math. Soc. 2007;39:235–241. [Google Scholar]
10.Lyndon R.C., Schupp P.E. Vol. 89. Springer; Berlin: 1977. Combinatorial group theory. (Ergebnisse der Mathematik und ihrer Grenzgebiete). [Google Scholar]
11.Muller D.E., Schupp P.E. Groups, the theory of ends and context-free languages. J. Comput. System Sc. 1983;26:295–310. [Google Scholar]
12.Muller D.E., Schupp P.E. The theory of ends, pushdown automata, and second-order logic. Theoret. Comput. Sci. 1985;37:51–75. [Google Scholar]
13.Pélecq L. Automorphism groups of context-free graphs. Theoret. Comput. Sci. 1996;165:275–293. [Google Scholar]
14.Sageev M. Ends of group pairs and non-positively curved cube complexes. Proc. London Math. Soc. 1995;71:585–617. [Google Scholar]
15.Scott P. Ends of pairs of groups. J. Pure Appl. Algebra. 1977/78;11:179–198. [Google Scholar]
16.Sénizergues G. Automata, languages and programming. vol. 1099. Springer; Berlin: 1996. Semi-groups acting on context-free graphs. pp. 206–218. (Lecture Notes in Comput. Sci.). Paderborn, 1996. [Google Scholar]
17.Stallings J.R. On torsion-free groups with infinitely many ends. Ann. Math. 1968;88:312–334. [Google Scholar]
18.Swarup G.A. On the ends of pairs of groups. J. Pure Appl. Algebra. 1993;87:93–96. [Google Scholar]
19.Thomassen C., Woess W. Vertex-transitive graphs and accessibility. J. Combin. Theory Ser. B. 1993;58:248–268. [Google Scholar]
20.Woess W. Context-free pairs of groups. II - Cuts, tree sets, and random walks. Discrete Math. 2012;312:157–173. doi: 10.1016/j.disc.2011.07.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Woess W. Graphs and groups with tree-like properties. J. Combin. Theory, Ser. B. 1989;47:361–371. [Google Scholar]

[br000005] 1.Anisimov A.V. Group languages. Kibernetika. 1971;4:18–24. [Google Scholar]

[br000010] 2.Berstel J., Boasson L. vol. B. Elsevier; Amsterdam: 1990. Context-free languages; pp. 59–102. (Handbook of Theoretical Computer Science). [Google Scholar]

[br000015] 3.Ceccherini-Silberstein T., Woess W. Growth and ergodicity of context-free languages. Trans. Amer. Math. Soc. 2002;354:4597–4625. [Google Scholar]

[br000020] 4.Dunwoody M.J. The accessibility of finitely presented groups. Invent. Math. 1985;81:449–457. [Google Scholar]

[br000025] 5.Frougny Ch., Sakarovitch J., Schupp P. Finiteness conditions on subgroups and formal language theory. Proc. London Math. Soc. 1989;58:74–88. [Google Scholar]

[br000030] 6.Harrison M.A. Addison-Wesley; Reading, MA: 1978. Introduction to Formal Language Theory. [Google Scholar]

[br000035] 7.Holt D., Rees S., Röver C., Thomas R. Groups with context-free co-word problem. J. London Math. Soc. 2005;71:643–657. [Google Scholar]

[br000040] 8.Imrich W. Combinatorial Mathematics V. Vol. 622. Springer; Berlin: 1977. Subgroup theorems and graphs. (Lecture Notes in Mathematics). [Google Scholar]

[br000045] 9.Lehnert J., Schweitzer P. The co-word problem for the Higman–Thompson group is context-free. Bull. Lond. Math. Soc. 2007;39:235–241. [Google Scholar]

[br000050] 10.Lyndon R.C., Schupp P.E. Vol. 89. Springer; Berlin: 1977. Combinatorial group theory. (Ergebnisse der Mathematik und ihrer Grenzgebiete). [Google Scholar]

[br000055] 11.Muller D.E., Schupp P.E. Groups, the theory of ends and context-free languages. J. Comput. System Sc. 1983;26:295–310. [Google Scholar]

[br000060] 12.Muller D.E., Schupp P.E. The theory of ends, pushdown automata, and second-order logic. Theoret. Comput. Sci. 1985;37:51–75. [Google Scholar]

[br000065] 13.Pélecq L. Automorphism groups of context-free graphs. Theoret. Comput. Sci. 1996;165:275–293. [Google Scholar]

[br000070] 14.Sageev M. Ends of group pairs and non-positively curved cube complexes. Proc. London Math. Soc. 1995;71:585–617. [Google Scholar]

[br000075] 15.Scott P. Ends of pairs of groups. J. Pure Appl. Algebra. 1977/78;11:179–198. [Google Scholar]

[br000080] 16.Sénizergues G. Automata, languages and programming. vol. 1099. Springer; Berlin: 1996. Semi-groups acting on context-free graphs. pp. 206–218. (Lecture Notes in Comput. Sci.). Paderborn, 1996. [Google Scholar]

[br000085] 17.Stallings J.R. On torsion-free groups with infinitely many ends. Ann. Math. 1968;88:312–334. [Google Scholar]

[br000090] 18.Swarup G.A. On the ends of pairs of groups. J. Pure Appl. Algebra. 1993;87:93–96. [Google Scholar]

[br000095] 19.Thomassen C., Woess W. Vertex-transitive graphs and accessibility. J. Combin. Theory Ser. B. 1993;58:248–268. [Google Scholar]

[br000100] 20.Woess W. Context-free pairs of groups. II - Cuts, tree sets, and random walks. Discrete Math. 2012;312:157–173. doi: 10.1016/j.disc.2011.07.026. [DOI] [PMC free article] [PubMed] [Google Scholar]

[br000105] 21.Woess W. Graphs and groups with tree-like properties. J. Combin. Theory, Ser. B. 1989;47:361–371. [Google Scholar]

PERMALINK

Context-free pairs of groups I: Context-free pairs and graphs

Tullio Ceccherini-Silberstein

Wolfgang Woess

Abstract

1. Introduction

Definition 1.1

2. Schreier graphs and the regular case

Lemma/Definition 2.1

Definition 2.2

Lemma 2.3

Proposition 2.4

Proof

Corollary 2.5

Corollary 2.6

Proof

Example 2.7

Fig. 1.

3. Pushdown automata

Lemma 3.1

Proof

Corollary 3.2

Proposition 3.3

Proof

4. Context-free graphs

Definition 4.1

Theorem 4.2

Proof

Lemma 4.3

Lemma 4.4

Proof

Fig. 2.

Lemma 4.5

Proof

Theorem 4.6

Proof

Claim 1

Proof

Claim 2

Proof

Claim 3

Proof

Corollary 4.7

Corollary 4.8

Proof

Lemma 4.9

Proof

Remark 4.10

Lemma 4.11

Proof

5. Covers and Schreier graphs

Proposition 5.1

Proposition 5.2

Corollary 5.3

Proof

Example 5.4

Fig. 3.

Example 5.5

Example 5.6

Fig. 4.

Fig. 5.

Example 5.7

Acknowledgements

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases