Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 Jan 7;12038:341–353. doi: 10.1007/978-3-030-40608-0_24

Intersection and Union Hierarchies of Deterministic Context-Free Languages and Pumping Lemmas

Tomoyuki Yamakami 5,
Editors: Alberto Leporati8, Carlos Martín-Vide9, Dana Shapira10, Claudio Zandron11
PMCID: PMC7206636

Abstract

We study the computational complexity of finite intersections and unions of deterministic context-free languages. Earlier, Wotschke (1978) demonstrated that intersections of Inline graphic deterministic context-free languages are in general more powerful than intersections of d deterministic context-free languages for any positive integer d based on the hierarchy separation of Liu and Weiner (1973). The argument of Liu and Weiner, however, works only on bounded languages of particular forms, and therefore Wotschke’s result cannot be extended to disprove any other language to be written in the form of an intersection of d deterministic context-free languages. To deal with the non-membership of a wide range of languages, we circumvent their proof argument and instead devise a new, practical technical tool: a pumping lemma for finite unions of deterministic context-free languages. Since the family of deterministic context-free languages is closed under complementation, this pumping lemma enables us to show a non-membership relation of languages made up with finite intersections of even non-bounded languages as well. We also refer to a relationship to Hibbard’s limited automata.

Keywords: Deterministic pushdown automata, Intersection and union hierarchies, Pumping lemma, Limited automata

A Historical Account and an Overview of Contributions

Intersection and Union Hierarchies and Historical Background

In formal language theory, context-free languages constitute a fundamental family Inline graphic, which is situated in between the family Inline graphic of regular languages and that of context-sensitive languages. It has been well known that this family Inline graphic is closed under an operation of union but not closed under intersection. As a quick example, the language Inline graphic is not context-free but it can be expressed as an intersection of two context-free languages. This non-closure property can be further generalized to any intersection of d (Inline graphic) context-free languages. For later notational convenience, we here write Inline graphic for the family of such languages, namely, the d intersection closure of Inline graphic (see, e.g., [13]). With this notation, the above language Inline graphic belongs to Inline graphic. Similarly, the language Inline graphic over an alphabet Inline graphic falls into Inline graphic because Inline graphic can be expressed as an intersection of d context-free languages of the form Inline graphic (Inline graphic). In 1973, Liu and Weiner [8] gave a contrived proof to their key statement that (*) Inline graphic is outside of Inline graphic for any index Inline graphic. Therefore, the collection Inline graphic truly forms an infinite hierarchy.

Deterministic context-free (dcf) languages have been a focal point in Inline graphic since a systematic study of Ginsburg and Greibach [1]. The importance of such languages can be exemplified by the facts that dcf languages are easy to parse and that every context-free language is simply the homomorphic image of a dcf language. Unlike Inline graphic, the family Inline graphic of dcf languages is closed under neither union nor intersection. We use the terms of d-intersection deterministic context-free (dcf) languages and d-union deterministic context-free (dcf) languages to express intersections of d dcf languages and unions of d dcf languages, respectively. For brevity, we write Inline graphic and Inline graphic respectively for the family of all d-intersection dcf languages and that of all d-union dcf languages, while Wotschke [11, 12] earlier referred Inline graphic to the d-intersection closure of Inline graphic. In particular, we obtain Inline graphic. Since Inline graphic is closed under complementation, it follows that the complement of Inline graphic coincides with Inline graphic. For our convenience, we call two hierarchies Inline graphic and Inline graphic the intersection and union hierarchies of dcf languages, respectively. Concerning these hierarchies, we set Inline graphic to be the intersection closure of Inline graphic, which is Inline graphic. In a similar way, we write Inline graphic for the union closure of Inline graphic, that is, Inline graphic.

Wotschke [11, 12] noted that the aforementioned result (*) of Liu and Weiner leads to the conclusion that Inline graphic truly forms an infinite hierarchy. To be more precise, since the language Inline graphic belongs to Inline graphic, the statement (*) implies Inline graphic, which instantly yields Inline graphic. Wotschke’s argument, nonetheless, heavily relies on the separation result of Liu and Weiner, who employed a notion of stratified semi-linear set to prove the statement (*). Notice that the proof technique of Liu and Weiner was developed only for a particular form of bounded languages1 and it is therefore applicable to specific languages, such as Inline graphic. In fact, the key idea of the proof of Liu and Weiner for Inline graphic is to focus on the number of the occurrences of each base symbol in Inline graphic appearing in each given string w and to translate Inline graphic into a set Inline graphic of Parikh images Inline graphic in order to exploit the semi-linearity of Inline graphic, where Inline graphic expresses the total number of symbols Inline graphic in a string w.

Because of the aforementioned limitation of Liu and Weiner’s proof technique, the scope of their proof cannot be extended to other forms of languages. Simple examples of such languages include Inline graphic, where [d] denotes the set Inline graphic. This is a bounded language expanding Inline graphic but its Parikh images do not have semi-linearity. As another example, let us take a look at a “non-palindrome” language Inline graphic, where Inline graphic expresses the reversal of Inline graphic. This Inline graphic is not even a bounded language. Therefore, Liu and Weiner’s argument is not directly applicable to verify that neither Inline graphic nor Inline graphic belongs to Inline graphic unless we dextrously pick up its core strings that form a certain bounded language. With no such contrived argument, how can we prove Inline graphic and Inline graphic to be outside of Inline graphic? Moreover, given a language, how can we verify that it is not in Inline graphic? We can ask similar questions for d-union dcf languages and the union hierarchy of dcf languages. Ginsburg and Greibach [1] remarked with no proof that the context-free language Inline graphic for any non-unary alphabet Inline graphic is not in Inline graphic. It is natural to call for a formal proof of the remark of Ginsburg and Greibach. Using a quite different language Inline graphic, however, Wotschke [11, 12] actually proved that Inline graphic does not belong to Inline graphic (more strongly, the Boolean closure of Inline graphic) by employing the closure property of Inline graphic under inverse gsm mappings as well as complementation and intersection with regular languages. Wotschke’s proof relies on the following two facts. (i) The language Inline graphic can be expressed as the inverse gsm map of the language Inline graphic, restricted to Inline graphic. (ii) Inline graphic is expressed as the complement of Inline graphic, restricted to a certain regular language. Together with these facts, the final conclusion comes from the aforementioned result (*) of Liu and Weiner because Inline graphic implies Inline graphic by (i) and (ii). To our surprise, the fundamental results on Inline graphic that we have discussed so far are merely “corollaries” of the main result (*) of Liu and Weiner!

For further study on Inline graphic and answering more general non-membership questions to Inline graphic, we need to divert from Liu and Weiner’s contrived argument targeting the statement (*) and to develop a completely different, new, more practical technical tool. The sole purpose of this exposition is, therefore, set to (i) develop a new proof technique, which can be applicable to many other languages, (ii) present an alternative proof for the fact that the intersection and union hierarchies of DCFL are infinite hierarchies, and (iii) present other languages in Inline graphic that do not belong to Inline graphic (in part, verifying Ginsburg and Greibach’s remark for the first time).

In relevance to the union hierarchy of dcf languages, there is another known extension of Inline graphic using a different machine model called limited automata,2 which are originally invented by Hibbard [3] and later discussed extensively in, e.g., [9, 14]. Of all such machines, a d-limited deterministic automaton (or a d-lda, for short) is a deterministic Turing machine that can rewrite each tape cell in between two endmarkers only during the first d visits (except that making a turn of a tape head counts as double visits). We can raise a question of whether there is any relationship between the union hierarchy and d-lda’s.

Overview of Main Contributions

In Sect. 1.1, we have noted that fundamental properties associated with Inline graphic heavily rely on the single separation result (*) of Liu and Weiner. However, Liu and Weiner’s technical tool that leads to their main result does not seem to withstand a wide variety of direct applications. It is thus desirable to develop a new, simple, and practical technical tool that can find numerous applications for a future study on Inline graphic and Inline graphic. Thus, our main contribution of this exposition is to present a simple but powerful, practical technical tool, called the pumping lemma of languages in Inline graphic with Inline graphic, which also enriches our understanding of Inline graphic as well as Inline graphic. Notice that there have been numerous forms of so-called pumping lemmas (or iteration theorems) for variants of context-free languages in the past literature, e.g., [2, 47, 10, 15]. Our pumping lemma is a crucial addition to the list of such lemmas.

For a string x of length n and any number Inline graphic, x[i] stands for the ith symbol of x and Inline graphic for the i repetitions of x.

Lemma 1

(Pumping Lemma for DCFL[d]). Let d be any positive integer and let L be any d-union dcf language over an alphabet Inline graphic. There exist a constant Inline graphic such that, for any Inline graphic strings Inline graphic, if Inline graphic has the form Inline graphic for strings Inline graphic with Inline graphic and Inline graphic for any pair Inline graphic, then there exists two distinct indices Inline graphic for which the following conditions (1)–(2) hold. Let Inline graphic.

  1. If Inline graphic, then either (a) or (b) holds.

    1. There is a factorization Inline graphic with Inline graphic and Inline graphic such that Inline graphic is in L for any number Inline graphic.
    2. There are two factorizations Inline graphic and Inline graphic with Inline graphic and Inline graphic such that Inline graphic is in L for any number Inline graphic.
  2. In the case of Inline graphic, either (a) or (b) holds.

    1. There is a factorization Inline graphic with Inline graphic and Inline graphic such that, for each Inline graphic, Inline graphic is in L for any Inline graphic.
    2. Let Inline graphic and Inline graphic. There are three factorizations Inline graphic, Inline graphic, and Inline graphic with Inline graphic and Inline graphic such that Inline graphic and Inline graphic are in L for any number Inline graphic.

As a special case of Inline graphic, we obtain Yu’s pumping lemma [15, Lemma 1] from Lemma 1. Since there have been few machine-based analyses to prove various pumping lemmas in the past literature, one of the important aspects of this exposition is a clear demonstration of the first alternative proof to Yu’s pumping lemma, which is solely founded on an analysis of behaviors of 1dpda’s instead of derivation trees of Inline graphic grammars as in [15]. The proof of Lemma 1, in fact, exploits early results of [14] on an ideal shape form (Sect. 2.3) together with a new approach of Inline graphic-enhanced machines by analyzing transitions of crossing state-stack pairs (Sect. 2.4). These notions will be explained in Sect. 2 and their basic properties will be explored therein.

Using our pumping lemma (Lemma 1), we can expand the scope of the statement (*) of Liu and Weiner [8] targeting specific bounded languages to other types of languages, including Inline graphic and Inline graphic for each index Inline graphic.

Theorem 1

Let Inline graphic be any index.

  1. The language Inline graphic is not in Inline graphic.

  2. The language Inline graphic is not in Inline graphic.

Since Lemma 1 concerns with Inline graphic, in our proof of Theorem 1, we first take the complements of the above languages, restricted to suitable regular languages, and we then apply Lemma 1 appropriately to them. The proof sketch of this theorem will be given in Sect. 3. From Theorem 1, we instantly obtain the following consequences of Wotschke [11, 12].

Corollary 1

[11, 12] The intersection hierarchy of dcf languages and the union hierarchy of dcf languages are both infinite hierarchies.

Concerning the limitation of Inline graphic and Inline graphic in recognition power, since all unary context-free languages are also regular languages and the family Inline graphic of regular languages is closed under intersection, all unary languages in Inline graphic are regular as well. It is thus easy to find languages that are not in Inline graphic. Such languages, nevertheless, cannot serve themselves to separate Inline graphic from Inline graphic. As noted in Sect. 1.1, Ginsburg and Greibach [1] remarked with no proof that the context-free language Inline graphic does not belong to Inline graphic (as well as Inline graphic). As another direct application of our pumping lemma, we give a formal written proof of their remark.

Theorem 2

The context-free language Pal is not in Inline graphic.

As an immediate consequence of the above theorem, we obtain Wotschke’s separation of Inline graphic from Inline graphic. Here, we stress that, unlike the work of Wotschke [11, 12], our proof does not depend on the main result (*) of Liu and Weiner.

Corollary 2

[11, 12] Inline graphic and Inline graphic.

We turn our interest to limited automata. Let us write Inline graphic for the family of all languages recognized by d-limited deterministic automata, in which their tape heads are allowed to rewrite tape symbols only during the first d accesses (except that, in the case of tape heads making a turn, we treat each turn as double visits). Hibbard [3] demonstrated that Inline graphic for any Inline graphic. A slightly modified language of his, which separates Inline graphic from Inline graphic, also belongs to the Inline graphic-th level of the union hierarchy of dcf languages but not in the Inline graphic-th level. We thus obtain the following separation.

Proposition 1

For any Inline graphic, Inline graphic.

The proofs of all the above assertions will be given after introducing necessary notions and notation in the subsequent section.

Preparations: Notions and Notation

Fundamental Notions and Notation

The set of all natural numbers (including 0) is denoted by Inline graphic. An integer interval Inline graphic for two integers mn with Inline graphic is the set Inline graphic. In particular, for any integer Inline graphic, Inline graphic is abbreviated as [n]. For any string x, |x| indicates the total number of symbols in x. The special symbol Inline graphic is used to denote the empty string of length 0. For a language L over alphabet Inline graphic, Inline graphic denotes Inline graphic, the complement of L. Given a family Inline graphic of languages, Inline graphic expresses the complement family, which consists of languages Inline graphic for any Inline graphic.

Deterministic Pushdown Automata

A one-way deterministic pushdown automaton (or a 1dpda, for short) M is a tuple Inline graphic, where Q is a finite set of inner states, Inline graphic is an input alphabet with Inline graphic, Inline graphic is a stack alphabet, Inline graphic is a deterministic transition function from Inline graphic to Inline graphic, Inline graphic is the initial state in Q, Inline graphic is the bottom marker in Inline graphic, and Inline graphic and Inline graphic are subsets of Q. The symbols Inline graphic and Inline graphic respectively express the left-endmarker and the right-endmarker. Let Inline graphic. We assume that, if Inline graphic is defined, then Inline graphic is undefined for all symbols Inline graphic. Moreover, we require Inline graphic for any Inline graphic and Inline graphic. Each content of a stack is expressed as Inline graphic in which Inline graphic is the topmost stack symbol, Inline graphic is the bottom marker Inline graphic, and all others are placed in order from the top to the bottom of the stack.

Given Inline graphic, a d-intersection deterministic context-free (dcf) language is an intersection of d deterministic context-free (dcf) languages. Let Inline graphic denote the family of all d-intersection dcf languages. Similarly, we define d-union dcf languages and Inline graphic by substituting “union” for “intersection” in the above definitions. Note that Inline graphic because Inline graphic.

For two language families Inline graphic and Inline graphic, the notation Inline graphic (resp., Inline graphic) denotes the family of all languages L for which there are two languages Inline graphic and Inline graphic over the same alphabet satisfying Inline graphic (resp., Inline graphic).

Lemma 2

[11, 12] Inline graphic is closed under union, intersection with Inline graphic. In other words, Inline graphic and Inline graphic. A similar statement holds for Inline graphic.

Lemma 3

Let Inline graphic be any natural number.

  1. Inline graphic iff Inline graphic.

  2. If Inline graphic, then it follows that Inline graphic for any Inline graphic.

From Lemma 3(1) follows Corollary 1, provided that Theorem 1 is true. Theorem 1 itself will be proven in Sect. 3.

Ideal Shape

Let us recall from [14] a special “pop-controlled form” (called an ideal shape), in which the pop operations always take place by first reading an input symbol and then making a series (one or more) of the pop operations without reading any further input symbol. This notion was originally introduced for one-way probabilistic pushdown automata (or 1ppda’s); however, in this exposition, we apply this notion only to 1dpda’s. To be more formal, a 1dpda in an ideal shape is a 1dpda restricted to take only the following transitions. (1) Scanning Inline graphic, preserve the topmost stack symbol (called a stationary operation). (2) Scanning Inline graphic, push a new symbol u (Inline graphic) without changing any other symbol in the stack. (3) Scanning Inline graphic, pop the topmost stack symbol. (4) Without scanning an input symbol (i.e., Inline graphic-move), pop the topmost stack symbol. (5) The stack operations (4) comes only after either (3) or (4).

It was shown in [14] that any 1ppda can be converted into its “error-equivalent” 1ppda in an ideal shape. In Lemma 4, we restate this result for 1dpda’s. We say that two 1dpda’s are (computationally) equivalent if, for any input x, their acceptance/rejection coincide. The push size of a 1ppda is the maximum length of any string pushed into a stack by any single move.

Lemma 4

(Ideal Shape Lemma for 1dpda’s). (cf. [14]) Let Inline graphic. Any n-state 1dpda M with stack alphabet size m and push size e can be converted into another (computationally) equivalent 1dpda N in an ideal shape with Inline graphic states and stack alphabet size Inline graphic.

Boundaries and Crossing State-Stack Pairs

We want to define two basic notions of boundaries and crossing state-stacks. For this purpose, we visualize a single move of a 1dpda M as three consecutive actions: (i) firstly replacing the topmost stack symbol, (ii) updating an inner state, and (iii) thirdly either moving a tape head or staying still.

A boundary is a borderline between two consecutive tape cells. We index all such boundaries from 0 to Inline graphic as follows. The boundary 0 is located at the left of cell 0 and boundary Inline graphic is in between cell i and Inline graphic for every index Inline graphic. When a string xy is written in |xy| consecutive cells, the (xy)-boundary indicates the boundary Inline graphic, which separates between x and y. A boundary block between boundaries Inline graphic and Inline graphic with Inline graphic is a consecutive series of boundaries between Inline graphic and Inline graphic (including Inline graphic and Inline graphic). These Inline graphic and Inline graphic are called ends of this boundary block. For brevity, we write Inline graphic to denote a boundary block between Inline graphic and Inline graphic. For two boundaries Inline graphic with Inline graphic, the Inline graphic-region refers to the consecutive cells located in the boundary block Inline graphic. When an input string x is written in the Inline graphic-region, we conveniently call this region the x-region unless the region is unclear from the context.

The stack height of M at boundary t is the length of the stack content while passing the boundary t. E.g., a stack content Inline graphic has stack height k.

A boundary block Inline graphic is called convex if there is a boundary s between Inline graphic and Inline graphic (namely, Inline graphic) such that there is no pop operation in the Inline graphic-region and there is no push operation in the Inline graphic-region. A boundary block Inline graphic is flat if the stack height does not change in the Inline graphic-region. A boundary block Inline graphic with Inline graphic is pseudo-convex if the stack height at every boundary Inline graphic does not go below Inline graphic, where Inline graphic is the stack height at boundary Inline graphic for any Inline graphic. By their definitions, either convex or flat boundary blocks are also pseudo-convex.

A peak is a boundary t such that the stack heights at the boundaries Inline graphic and Inline graphic are smaller than the stack height at the boundary t. A plateau is a boundary block Inline graphic such that any stack height at a boundary Inline graphic is the same. A hill is a boundary block Inline graphic such that (i) the stack height at the boundary t and the stack height at the boundary Inline graphic coincide, (ii) there is at least one peak at a certain boundary Inline graphic, and (iii) both [ti] and Inline graphic are convex. The height of a hill is the difference between the topmost stack height and the lowest stack height.

Given strings over alphabet Inline graphic, Inline graphic-enhanced strings are strings over the extended alphabet Inline graphic, where Inline graphic is treated as a special input symbol expressing the absence of symbols in Inline graphic. An Inline graphic-enhanced 1dpda (or an Inline graphic-1dpa, for short) is a 1dpda that takes Inline graphic-enhanced strings and works as a standard 1dpda except that a tape head always moves to the right without stopping. This tape head movement is sometimes called “real time”.

Lemma 5

For any 1dpda M, there exists an Inline graphic-1dpda N such that, for any input string x, there is an appropriate Inline graphic-enhanced string Inline graphic for which M accepts (resp., rejects) x iff N accepts (resp., rejects) Inline graphic. Moreover, Inline graphic is identical to x except for the Inline graphic symbol and is uniquely determined from x and M.

Let M be either a 1dpda or an Inline graphic-1dpda, and assume that M is in an ideal shape. A crossing state-stack pair at boundary i is a pair Inline graphic of inner state q and stack content Inline graphic. In a computation of M on input x, a crossing state-stack pair Inline graphic at boundary i refers to the machine’s current status where (1) M is reading an input symbol, say, Inline graphic at cell Inline graphic in a certain state, say, p with the stack content Inline graphic and then M changes its inner state to q, changing a by either pushing another symbol b satisfying Inline graphic or popping a with Inline graphic. Any computation of M on x can be expressed as a series of crossing state-stack pairs at every boundary in the Inline graphic-region.

Two boundaries Inline graphic and Inline graphic with Inline graphic are mutually correlated if there are two crossing state-stack pairs Inline graphic and Inline graphic at the boundaries Inline graphic and Inline graphic, respectively, for which the boundary block Inline graphic is pseudo-convex. Moreover, assume that Inline graphic. Two boundary blocks Inline graphic and Inline graphic are mutually correlated if (i) Inline graphic, Inline graphic, and Inline graphic are all pseudo-convex, (ii) Inline graphic and Inline graphic are crossing state-stack pairs at the boundaries Inline graphic and Inline graphic, respectively, and (iii) Inline graphic and Inline graphic are also crossing state-stack pairs at the boundaries Inline graphic and Inline graphic, respectively, for certain Inline graphic, Inline graphic, and Inline graphic.

If an Inline graphic-1dpda is in an ideal shape, then it pops exactly one stack symbol whenever it reads a single symbol of a given Inline graphic-enhanced input string.

Lemma 6

Let w be any string.

  1. Let Inline graphic with Inline graphic. Let Inline graphic be a factorization such that Inline graphic is the Inline graphic-boundary and Inline graphic is the Inline graphic-boundary. If the boundaries Inline graphic and Inline graphic are mutually correlated and inner states at the boundaries Inline graphic and Inline graphic coincide, then it follows that Inline graphic iff Inline graphic for any Inline graphic.

  2. Let Inline graphic with Inline graphic. Let Inline graphic such that each Inline graphic is Inline graphic-boundary for each Inline graphic. If two boundary blocks Inline graphic and Inline graphic are mutually correlated, inner states at the boundaries Inline graphic and Inline graphic coincide, and inner states at the boundaries Inline graphic and Inline graphic coincide, then it follows that Inline graphic iff Inline graphic for any number Inline graphic.

Proof Sketches of Three Separation Claims

We intend to present the proof sketches of three separation claims (Theorems 1 and 2 and Proposition 1) before verifying the pumping lemma. To understand our proofs better, we demonstrate a simple and easy example of how to apply Lemma 1 to obtain a separation between Inline graphic and Inline graphic.

Proposition 2

Let Inline graphic and let Inline graphic. It then follows that Inline graphic.

Proof

Let Inline graphic. Clearly, Inline graphic belongs to Inline graphic. Assuming Inline graphic, we apply the pumping lemma (Lemma 1) to Inline graphic. There is a constant Inline graphic that satisfies the lemma. Let Inline graphic and consider Inline graphic for each index Inline graphic. Since each Inline graphic belongs to Inline graphic, we can take an index pair Inline graphic with Inline graphic such that Inline graphic and Inline graphic satisfy the conditions of the lemma.

Since Condition (1) of the lemma is immediate, we hereafter consider Condition (2). Let Inline graphic, Inline graphic, and Inline graphic. Firstly, we consider Case (a) with a factorization Inline graphic with Inline graphic and Inline graphic. Since Inline graphic for any number Inline graphic, we conclude that Inline graphic and Inline graphic. Let Inline graphic and Inline graphic for certain numbers Inline graphic. Note that Inline graphic equals Inline graphic. Hence, Inline graphic for a certain Inline graphic. This implies that Inline graphic. We then obtain Inline graphic, which further implies that Inline graphic and Inline graphic. Similarly, from Inline graphic, it follows that Inline graphic. Thus, Inline graphic. This implies Inline graphic and Inline graphic. Since Inline graphic, we obtain a contradiction.

Next, we consider Case (b) with appropriate factorizations Inline graphic, Inline graphic, and Inline graphic with Inline graphic and Inline graphic such that Inline graphic and Inline graphic for any number Inline graphic. Since Inline graphic, we obtain Inline graphic. Assume that Inline graphic for a certain number Inline graphic. This is impossible because Inline graphic has the form Inline graphic and the exponent of b is not of the form rn for any number Inline graphic.

Proof Sketch of Theorem 1(1). Let Inline graphic be any integer and consider Inline graphic over Inline graphic. It is not difficult to check that Inline graphic. Our goal is, therefore, to show that Inline graphic is not in Inline graphic. To lead to a contradiction, we assume that Inline graphic.

Take Inline graphic in Inline graphic and consider Inline graphic, that is, Inline graphic. Note by Lemma 3(2) that, since Inline graphic, we obtain Inline graphic. Take a pumping-lemma constant Inline graphic that satisfies Lemma 1. We set Inline graphic and consider the set Inline graphic, where Inline graphic and Inline graphic for each index Inline graphic. Lemma 1 guarantees the existence of a specific distinct pair Inline graphic with Inline graphic.

By Lemma 1, since Inline graphic, there are two conditions to consider separately. Condition (1) is not difficult. Next, we consider Condition (2). Let Inline graphic, Inline graphic, and Inline graphic. Note that Inline graphic and Inline graphic. There are three factorizations Inline graphic with Inline graphic and Inline graphic, Inline graphic, and Inline graphic satisfying both Inline graphic and Inline graphic for any number Inline graphic. From Inline graphic follows Inline graphic. Let Inline graphic for a certain Inline graphic. In particular, take Inline graphic. Note that Inline graphic has factors Inline graphic and Inline graphic. Thus, we obtain Inline graphic, a clear contradiction.    Inline graphic

We omit from this exposition the proofs of Theorems 1(2), 2, and Proposition 1. These proofs will be included in its complete version.

Proof Sketch of the Pumping Lemma for DCFL[d]

We are now ready to provide the proof of the pumping lemma for Inline graphic (Lemma 1). Our proof has two different parts depending on the value of d. The first part of the proof targets the basis case of Inline graphic. This special case directly corresponds to Yu’s pumping lemma [15, Lemma 1]. To prove his lemma, Yu utilized a so-called left-part theorem of his for Inline graphic grammars. We intend to re-prove Yu’s lemma using only 1dpda’s with no reference to Inline graphic grammars. Our proof argument is easily extendable to one-way nondeterministic pushdown automata (or 1npda’s) and thus to the pumping lemma for Inline graphic. The second part of the proof deals with the general case of Inline graphic. Hereafter, we give the sketches of these two parts.

Basis Case of Inline graphic: Let Inline graphic be any alphabet and take any infinite dcf language L over Inline graphic. Let us consider an appropriate Inline graphic-1dpda Inline graphic in an ideal shape that recognizes L by Lemmas 45. For the desired constant c, we set Inline graphic. Firstly, we take two arbitrary strings xy and Inline graphic over Inline graphic with Inline graphic and Inline graphic.

Our goal is to show that Condition (2) in the basis case of Inline graphic holds. There are four distinct cases to deal with. Hereafter, we intend to discuss them separately. Note that, since M is one-way, every crossing state-stack pair at any boundary in the x-region does not depend on the choice of y and Inline graphic.

Case 1: Consider the case where there are two boundaries Inline graphic with Inline graphic and Inline graphic such that (i) the boundaries Inline graphic and Inline graphic are mutually correlated and (ii) inner states at the boundaries Inline graphic and Inline graphic coincide. In this case, we factorize x into Inline graphic so that Inline graphic and Inline graphic. By Lemma 6(1), it then follows that, for any number Inline graphic, Inline graphic and Inline graphic.

Case 2: Consider the case where there are four boundaries Inline graphic with Inline graphic and Inline graphic and there are Inline graphic, Inline graphic, and Inline graphic for which (i) Inline graphic and Inline graphic are the crossing state-stack pairs respectively at the boundaries Inline graphic and Inline graphic, (ii) Inline graphic and Inline graphic are the crossing state-stack pairs respectively at the boundaries Inline graphic and Inline graphic, and (iii) the boundary block Inline graphic for each index Inline graphic is pseudo-convex. We then take a factorization Inline graphic such that Inline graphic for each Inline graphic. Note that Inline graphic because of Inline graphic and Inline graphic. By an application of Lemma 6(2), we conclude that, for any Inline graphic, Inline graphic for all Inline graphic.

Case 3: Assume that Cases 1–2 fail. For brevity, we set Inline graphic. Consider the case where there is no pop operation in the R-region. Since R-region contains more than Inline graphic boundaries, the R-region includes a certain series of boundaries Inline graphic such that, for certain Inline graphic, Inline graphic, and Inline graphic, there are crossing state-stack pairs of the form Inline graphic at the boundaries Inline graphic, respectively. Note that the boundary blocks Inline graphic are all convex. Clearly, Inline graphic. We choose Inline graphic and Inline graphic so that (i) for each index Inline graphic, Inline graphic and Inline graphic are boundaries in the y-region and in the Inline graphic-region, respectively, satisfying that Inline graphic and Inline graphic, and (ii) for each index Inline graphic, Inline graphic is mutually correlated to Inline graphic in the y-region and also to Inline graphic in the Inline graphic-region. Note that the boundary blocks Inline graphic, Inline graphic are all pseudo-convex. Since Inline graphic, it follows that there is a pair Inline graphic with Inline graphic such that inner states at the boundaries Inline graphic and Inline graphic coincide. Using Lemma 6(2), we can obtain the desired conclusion.

Case 4: Assume that Cases 1–3 fail. In this case, we define a notion of “true gain” in the R-region and estimate its value. Choose Inline graphic and Inline graphic so that Inline graphic, Inline graphic, and the boundary block Inline graphic is pseudo-convex. Let Inline graphic denote the set of boundary blocks Inline graphic with Inline graphic, Inline graphic, Inline graphic for every Inline graphic, and Inline graphic for every Inline graphic such that (i) Inline graphic is pseudo-convex but cannot be flat, (ii) Inline graphic is pseudo-convex (and could be flat), (iii) there are crossing state-stack pairs Inline graphic at the boundaries Inline graphic for every Inline graphic, (iv) the stack height at the boundary Inline graphic is higher than the stack height at the boundary Inline graphic, (v) the boundary Inline graphic is a pit (i.e., the lowest point within its small vicinity). Define the true gain Inline graphic to be Inline graphic. It is possible to prove that Inline graphic. Using this inequality, we can employ an argument similar to Case 3 to obtain the lemma.

General Case of Inline graphic: We begin with proving this case by considering d 1dpda’s Inline graphic. The language recognized by each machine Inline graphic is denoted by Inline graphic. Let us assume that Inline graphic. Take Inline graphic strings Inline graphic in L and assume that each Inline graphic has the form Inline graphic with Inline graphic. Since all Inline graphic’s are in L, define a function f as follows. Let f(k) denote the minimal index Inline graphic satisfying that Inline graphic but Inline graphic for all Inline graphic. Since there are at most d different languages, there are two distinct indices Inline graphic such that Inline graphic. In what follows, we fix such a pair Inline graphic.

Consider the case of Inline graphic and Inline graphic. Take arbitrary factorizations Inline graphic and Inline graphic. We apply the basis case of Inline graphic again and obtain one of the following (a)–(b). (a) There is a factorization Inline graphic with Inline graphic and Inline graphic such that Inline graphic and Inline graphic for any number Inline graphic. (b) There are factorizations Inline graphic, Inline graphic, and Inline graphic such that Inline graphic, Inline graphic, Inline graphic, and Inline graphic for any number Inline graphic.

Footnotes

1

A bounded language satisfies Inline graphic for fixed strings Inline graphic.

2

Hibbard [3] actually defined a rewriting system, called “scan-limited automata.” Later, Pighizzini and Pisoni [9] re-formulated Hibbard’s system as restricted linear automata.

Contributor Information

Alberto Leporati, Email: alberto.leporati@unimib.it.

Carlos Martín-Vide, Email: carlos.martin@urv.cat.

Dana Shapira, Email: shapird@g.ariel.ac.il.

Claudio Zandron, Email: zandron@disco.unimib.it.

Tomoyuki Yamakami, Email: TomoyukiYamakami@gmail.com.

References

  • 1.Ginsburg S, Greibach S. Deterministic context free languages. Inf. Control. 1966;9:620–648. doi: 10.1016/S0019-9958(66)80019-0. [DOI] [Google Scholar]
  • 2.Harrison MA. Iteration theorems for deterministic families of languages. Fundamenta Informaticae. 1986;9:481–508. [Google Scholar]
  • 3.Hibbard TN. A generalization of context-free determinism. Inf. Control. 1967;11:196–238. doi: 10.1016/S0019-9958(67)90513-X. [DOI] [Google Scholar]
  • 4.Igarashi Y. A pumping lemma for real-time deterministic context-free languages. Theor. Comput. Sci. 1985;36:89–97. doi: 10.1016/0304-3975(85)90032-5. [DOI] [Google Scholar]
  • 5.King KN. Iteration theorems for families of strict deterministic languages. Theor. Comput. Sci. 1980;10:317–333. doi: 10.1016/0304-3975(80)90052-3. [DOI] [Google Scholar]
  • 6.Kutrib M, Malcher A, Wotschke D. The Boolean closure of linear context-free languages. Acta Inform. 2008;45:177–191. doi: 10.1007/s00236-007-0068-6. [DOI] [Google Scholar]
  • 7.Li M, Vitányi P. An Introduction to Kolmogorov Complexity and Its Applications. New York: Springer; 1994. [Google Scholar]
  • 8.Liu LY, Weiner P. An infinite hierarchy of intersections of context-free languages. Math. Syst. Theory. 1973;7:185–192. doi: 10.1007/BF01762237. [DOI] [Google Scholar]
  • 9.Pighizzini G, Pisoni A. Limited automata and regular languages. Int. J. Found. Comput. Sci. 2014;25:897–916. doi: 10.1142/S0129054114400140. [DOI] [Google Scholar]
  • 10.Wise DS. A strong pumping lemma for context-free languages. Theor. Comput. Sci. 1976;3:359–369. doi: 10.1016/0304-3975(76)90052-9. [DOI] [Google Scholar]
  • 11.Wotschke D. The Boolean closures of the deterministic and nondeterministic context-free languages. In: Brauer W, editor. GI Gesellschaft für Informatik e. V.; Heidelberg: Springer; 1973. pp. 113–121. [Google Scholar]
  • 12.Wotschke D. Nondeterminism and Boolean operations in pda’s. J. Comput. Syst. Sci. 1978;16:456–461. doi: 10.1016/0022-0000(78)90030-2. [DOI] [Google Scholar]
  • 13.Yamakami T. Oracle pushdown automata, nondeterministic reducibilities, and the hierarchy over the family of context-free languages. In: Geffert V, Preneel B, Rovan B, Štuller J, Tjoa AM, editors. SOFSEM 2014: Theory and Practice of Computer Science; Cham: Springer; 2014. pp. 514–525. [Google Scholar]
  • 14.Yamakami T. Behavioral Strengths and weaknesses of various models of limited automata. In: Catania B, Královič R, Nawrocki J, Pighizzini G, editors. SOFSEM 2019: Theory and Practice of Computer Science; Cham: Springer; 2019. pp. 519–530. [Google Scholar]
  • 15.Yu S. A pumping lemma for deterministic context-free languages. Inf. Process. Lett. 1989;31:47–51. doi: 10.1016/0020-0190(89)90108-7. [DOI] [Google Scholar]

Articles from Language and Automata Theory and Applications are provided here courtesy of Nature Publishing Group

RESOURCES