Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 Jan 7;12038:328–340. doi: 10.1007/978-3-030-40608-0_23

Geometrically Closed Positive Varieties of Star-Free Languages

Ondřej Klíma 12, Peter Kostolányi 13,
Editors: Alberto Leporati8, Carlos Martín-Vide9, Dana Shapira10, Claudio Zandron11
PMCID: PMC7206645

Abstract

A recently introduced operation of geometrical closure on formal languages is investigated. It is proved that the geometrical closure of a language from the positive variety Inline graphic, the level 3/2 of the Straubing-Thérien hierarchy of star-free languages, always falls into the variety Inline graphic, which is a new variety consisting of specific R-trivial languages. As a consequence, each class of regular languages lying between Inline graphic and Inline graphic is geometrically closed.

Keywords: Language varieties, Geometrical closure, Straubing-Thérien hierarchy, R-trivial monoid

Introduction

A geometrical closure is an operation on formal languages introduced recently by Dubernard, Guaiana, and Mignot [8]. It is defined as follows: Take any language L over some k-letter alphabet and consider the set called the figure of L in [8], which consists of all elements of Inline graphic corresponding to Parikh vectors of prefixes of words from L. The geometrical closure of L is the language Inline graphic of all words w such that the Parikh vectors of all the prefixes of w lie in the figure of L. This closure operator was inspired by the previous works of Blanpain, Champarnaud, and Dubernard [4] and Béal et al. [3], in which geometrical languages are studied – using the terminology from later paper [8], these can be described as languages whose prefix closure is equal to their geometrical closure. Note that this terminology was motivated by the fact that a geometrical language is completely determined by its (geometrical) figure. In the particular case of binary alphabets, these (geometrical) figures were illustrated by plane diagrams in [8].

The class of all regular languages can be easily observed not to be geometrically closed – that is, one can find a regular language such that its geometrical closure is not regular [8] (see also the end of Sect. 2). One possible research aim could be to characterise regular languages L for which Inline graphic is regular, or to describe some robust classes of languages with this property. Another problem posed in [8] is to find some subclasses of regular languages that are geometrically closed. As we explain in Sect. 3, non-empty group languages have their geometrical closure equal to the universal language Inline graphic. For this reason, it makes sense to look for more interesting geometrically closed subclasses among star-free languages, which are known to be “group-free”. More precisely, a language L is star-free if and only if the syntactic monoid Inline graphic of L is aperiodic, that is, if Inline graphic does not contain non-trivial groups as subsemigroups.

It is well known that the star-free languages are classified into the Straubing-Thérien hierarchy based on polynomial and Boolean operations. In particular, the variety Inline graphic (i.e., the variety of languages of level 1) is formed by piecewise testable languages and the positive variety Inline graphic is formed by polynomials built from languages of level 1. We refer to the survey paper by Pin [12] for an introduction to the Straubing-Thérien hierarchy of star-free languages and the algebraic theory of regular languages in general. This theory is based on Eilenberg correspondence between varieties of regular languages and pseudovarieties of finite monoids. Note that one well-known instance of Eilenberg correspondence, which plays an essential role in our contribution, is given by the pseudovariety of finite R-trivial monoids, for which the corresponding variety of languages is denoted by Inline graphic. Nevertheless, we emphasise that our contribution is rather elementary, and it does not use sophisticated tools developed in the algebraic theory of regular languages.

It was proved by Dubernard, Guaiana, and Mignot [8] that the class of all binary languages from the positive variety Inline graphic is geometrically closed. They have obtained this result by decomposing the plane diagram of the figure of a given language into specific types of basic subdiagrams, and using this decomposition to construct a regular expression for the language Inline graphic.

We prove a generalisation of the above mentioned result in this contribution. Our approach is to concentrate on the form of languages that may arise as Inline graphic for L taken from Inline graphic. In other words, we do not construct a concrete regular expression for Inline graphic, but we determine what kind of expression exists for such a language. In particular, we introduce a new variety of languages Inline graphic, which is a subvariety of the variety Inline graphic. Note that there is a transparent description of languages from Inline graphic and also an effective characterisation via the so-called acyclic automata (both are recalled in Sect. 4). The variety of languages Inline graphic is then characterised in the same manner: a precise description by specific regular expressions and also an automata-based characterisation are given. The letters LT in the notation Inline graphic refer to a characteristic property of acyclic automata in which “loops are transferred” along paths.

We show that the geometrical closure of a language from the positive variety Inline graphic always falls into the variety Inline graphic. As a consequence, each class of regular languages lying between Inline graphic and Inline graphic is geometrically closed. In particular, the positive variety Inline graphic is geometrically closed regardless of the alphabet, as well as is the variety Inline graphic.

Preliminaries

All automata considered in this paper are understood to be deterministic and finite. An automaton is thus a five-tuple Inline graphic, where Q is a finite set of states, Inline graphic is a non-empty finite alphabet, Inline graphic is a complete transition function, Inline graphic is the unique initial state, and Inline graphic is the set of final states. The minimal automaton of a given language L is denoted by Inline graphic.

By a (positive) variety of languages, we always understand what is called a (positive) Inline graphic-variety in [12]. We recall this notion for a reader’s convenience briefly. A class of languages Inline graphic is an operator, which determines, for each finite non-empty alphabet Inline graphic, a set Inline graphic of languages over Inline graphic. A positive variety is a class of regular languages Inline graphic such that Inline graphic is closed under quotients, finite unions and intersections, and the whole class is closed under preimages in homomorphisms. A positive variety Inline graphic is a variety if each Inline graphic is closed under complementation. Note that an alphabet could be fixed in our contribution, so homomorphisms among different alphabets play no role, and we could consider lattices of languages [9] instead of varieties of languages. However, we prefer to stay in the frame of the theory of (positive) varieties of languages as a primary aim of this paper is to describe robust classes closed under geometrical closure.

Given words uv over an alphabet Inline graphic, we write Inline graphic if u is a prefix of v. We also write, for each Inline graphic,

graphic file with name M47.gif

We call these languages the prefix closure and the prefix reduction of L, respectively. Both are prefix-closed, while Inline graphic and Inline graphic.

Proposition 1

Each positive variety Inline graphic is closed under the operator Inline graphic.

Proof

It is well known that each regular language has finitely many right quotients by words. Thus, for each alphabet Inline graphic and each Inline graphic, the language

graphic file with name M54.gif

is a finite union of right quotients of L, and its membership to Inline graphic follows.    Inline graphic

Let Inline graphic be a linearly ordered alphabet. The Parikh vector of a word w in Inline graphic is then given by Inline graphic, where Inline graphic denotes the number of occurrences of the letter a in w. This notation extends naturally to languages: we write Inline graphic for Inline graphic. We denote by [w] the equivalence class of the kernel relation of Inline graphic, i.e. Inline graphic. Then we also write, for each language Inline graphic,

graphic file with name M66.gif

and we call [L] the commutative closure of L. A language L such that Inline graphic is called commutative. A class of languages Inline graphic is said to be closed under commutation if for each alphabet Inline graphic, the language [L] belongs to Inline graphic whenever Inline graphic.

In the previous paragraph we consider the mapping Inline graphic, where Inline graphic is the set of all non-negative integers. Following the ideas of [8], we introduce some technical notations concerning Inline graphic, whose elements are called vectors. We denote by Inline graphic the null vector of Inline graphic. Let Inline graphic and Inline graphic be vectors and Inline graphic be an index. We write Inline graphic if Inline graphic and, at the same time, Inline graphic for all Inline graphic. Moreover, Inline graphic means that Inline graphic for some index s. A path in Inline graphic is a finite sequence Inline graphic of vectors from Inline graphic such that Inline graphic and Inline graphic for Inline graphic; more specifically, we say that Inline graphic is a path leading to Inline graphic. This means that a path always begins in Inline graphic and each other vector of the path is obtained from the previous one by incrementing exactly one of its coordinates by one. If in addition Inline graphic all belong to a set Inline graphic, we say that Inline graphic is a path in F and write Inline graphic.

Given a word Inline graphic in Inline graphic, we write Inline graphic for the unique path Inline graphic in Inline graphic such that Inline graphic. Conversely, for each path Inline graphic in Inline graphic, there is a unique word w such that Inline graphic. We denote this unique word w by Inline graphic. For each Inline graphic, we denote Inline graphic the set Inline graphic. Note that the language Inline graphic is prefix-closed.

Moreover, we put Inline graphic for each Inline graphic. The set Inline graphic is a connex figure in the sense of [8], i.e., for each Inline graphic, there is a path Inline graphic leading to Inline graphic such that Inline graphic.

Finally, the geometrical closure of L is a language Inline graphic. A class of languages Inline graphic is said to be geometrically closed if the language Inline graphic belongs to Inline graphic whenever L does, for each alphabet Inline graphic.

Note that the class of all regular languages is not geometrically closed, as observed in [8]. For instance, the language Inline graphic is regular, while its geometrical closure Inline graphic is the prefix closure of the Dyck language.

A Characterisation of the Geometrical Closure

We now characterise the operation of geometrical closure via three simpler operations: the prefix closure, the commutative closure, and the prefix reduction. This characterisation is a key to our later considerations.

Proposition 2

If L is a language over Inline graphic, then Inline graphic.

Proof

By definition,

graphic file with name M129.gif

If Inline graphic, then there is a path Inline graphic such that Inline graphic. For an arbitrary prefix u of w, we have Inline graphic for some Inline graphic. It follows that Inline graphic belongs to Inline graphic. Hence Inline graphic and w belongs to Inline graphic.

On the other hand, if w belongs to Inline graphic, then all prefixes u of w belong to Inline graphic. Thus Inline graphic is in Inline graphic for each Inline graphic, and Inline graphic is a path in Inline graphic, implying that w is in Inline graphic.    Inline graphic

As a direct consequence of Propositions 1 and 2, we obtain the following sufficient condition, under which a positive variety of languages is geometrically closed.

Corollary 3

Each positive variety of regular languages closed under prefix reduction and commutation is geometrically closed.

Some positive varieties of languages Inline graphic are geometrically closed for trivial reasons – for instance all Inline graphic such that Inline graphic for all non-empty Inline graphic. Let us observe that this is the case for L whenever Inline graphic. The proof of the following lemma is easy to see. We just note that by an absorbing state we mean a state p satisfying Inline graphic for every Inline graphic.

Lemma 4

Let L be a regular language over an alphabet Inline graphic and Inline graphic be the minimal automaton of L. Then the following conditions are equivalent:

  • (i)

    Inline graphic;

  • (ii)

    for each state p in Inline graphic, there exists a final state reachable from p;

  • (iii)

    every absorbing state p in Inline graphic is final.

The conditions of Lemma 4 are satisfied in particular for all non-empty group languages. The variety Inline graphic, consisting of all languages L such that the syntactic monoid Inline graphic is a group, is geometrically closed as a consequence. This result can be extended to languages of the form Inline graphic, where each Inline graphic is a letter, and each Inline graphic is a non-empty group language. Indeed, for every Inline graphic, there is some Inline graphic such that Inline graphic, and one can find at least one Inline graphic for every Inline graphic. Then u is a prefix of the word Inline graphic. This implies that Inline graphic. We may thus conclude that the variety Inline graphic, consisting of languages of level 1/2 in the group hierarchy, is geometrically closed. (The reader not familiar with the group hierarchy is referred to [12]).

In the rest of the paper, we move our attention to star-free languages.

Languages Recognised by LT-acyclic Automata

We now introduce the class of languages Inline graphic, which plays a central role in our main result. For every alphabet Inline graphic, the set Inline graphic consists of languages which are finite unions of languages of the form

graphic file with name M176.gif 1

The previous definition is similar to definitions of other classes of languages that have already been studied in literature. First of all, if we omit the condition Inline graphic, we get a definition of languages from the variety Inline graphic corresponding to R-trivial monoids, which we recall in more detail later. Let us conclude here just that Inline graphic. Secondly, if we also require Inline graphic in (1) for Inline graphic, then we obtain a variety of languages considered by Pin, Straubing, and Thérien [13] and corresponding to a pseudovariety of finite monoids denoted Inline graphic. Finally, if we drop in (1) the condition Inline graphic and then we generate a variety, then we obtain the variety of languages corresponding to the pseudovariety Inline graphic considered by Almeida [1, p. 236].

Since we want to characterise languages from Inline graphic in terms of automata, we recall the characterisation of languages from Inline graphic first. An automaton Inline graphic is acyclic if every cycle in Inline graphic is a loop. This means that if Inline graphic for some Inline graphic and Inline graphic, then also Inline graphic for every letter a occurring in w. The defining condition means that one can number the states in Q as Inline graphic in such a way that the state Inline graphic, with Inline graphic and Inline graphic, is always greater than or equal to p. For this reason, these automata are called extensive in [11, p. 93]. It is known that they recognise precisely R-trivial languages [6].

We say that an acyclic automaton Inline graphic has a loop transfer property, if Inline graphic implies Inline graphic for every Inline graphic and Inline graphic. We then call Inline graphic an LT-acyclic automaton for short. This means that if there is an a-labelled loop in a state p in an LT-acyclic automaton, then there is also an a-labelled loop in each state reachable from p. We may thus equivalently take Inline graphic in the previous definition. The first aim of this section is to show that languages recognised by LT-acyclic automata are precisely those from Inline graphic. We do so via a series of elementary lemmas.

Lemma 5

For a language L of the form (1), the automaton Inline graphic is LT-acyclic.

Proof

Let L be a language Inline graphic of the form (1). For every Inline graphic, we denote Inline graphic and we also put Inline graphic. Then it is an easy exercise to show that the automaton in Fig. 1 is the minimal automaton of L and that it is an LT-acyclic automaton.    Inline graphic

Fig. 1.

Fig. 1.

An LT-acyclic automaton for the language of the form (1).

Lemma 6

Let L, K be languages over an alphabet Inline graphic recognised by LT-acyclic automata. Then Inline graphic is also recognised by an LT-acyclic automaton.

Proof

The language Inline graphic can be recognised by the direct product of a pair of automata that recognise the languages L and K. It is a routine to check that a finite direct product of LT-acyclic automata is an LT-acyclic automaton.    Inline graphic

The previous two lemmas show that every language from Inline graphic is recognised by an LT-acyclic automaton. The following lemma strengthens this observation by implying that the minimal automaton of a language from Inline graphic is LT-acyclic.

Lemma 7

Let L be a language recognised by an LT-acyclic automaton. Then the minimal automaton of L is also LT-acyclic.

Proof

Let Inline graphic be an LT-acyclic automaton such that Inline graphic. The minimal automaton Inline graphic is a homomorphic image of some subautomaton of Inline graphic [14]. It is clear that a subautomaton of an LT-acyclic automaton is LT-acyclic. Thus we may assume that Inline graphic has all states reachable from the initial state Inline graphic.

Let Inline graphic be a surjective mapping, which is a homomorphism from the automaton Inline graphic onto an automaton Inline graphic. We claim that Inline graphic is acyclic. To prove this claim, let Inline graphic and Inline graphic be such that Inline graphic. Then we choose some state Inline graphic from Inline graphic. For that Inline graphic, we have Inline graphic for every natural number m. Since the sequence Inline graphic contains only finitely many states, there are natural numbers n and m such that Inline graphic. Since Inline graphic is acyclic, we have Inline graphic for every letter a occurring in w. Consequently, Inline graphic. We showed that Inline graphic is acyclic.

Now let Inline graphic and Inline graphic be such that Inline graphic. It follows from the previous paragraph that there is Inline graphic such that Inline graphic. Since Inline graphic is LT-acyclic, we see that Inline graphic for every Inline graphic. Thus Inline graphic. We showed that Inline graphic is an LT-acyclic automaton. In particular, it is true for Inline graphic.    Inline graphic

Let us also prove a converse to the statements established above.

Lemma 8

Let Inline graphic be an LT-acyclic automaton over an alphabet Inline graphic. Then Inline graphic belongs to Inline graphic.

Proof

Let Inline graphic and let R be the set of all valid runs in the automaton Inline graphic, which do not use loops:

graphic file with name M258.gif

We see that the set R is finite. Moreover, for each q in Q, let Inline graphic denote the alphabet Inline graphic. Then

graphic file with name M261.gif

is a language of the form (1) for each Inline graphic in R and

graphic file with name M263.gif

Hence the language Inline graphic belongs to Inline graphic.    Inline graphic

The following theorem provides a summary of the previous lemmas.

Theorem 9

For a language Inline graphic, the following statements are equivalent:

  • (i)

    L belongs to Inline graphic.

  • (ii)

    L is recognised by an LT-acyclic automaton.

  • (iii)

    The minimal automaton of L is LT-acyclic.

Proof

The statement (i) implies (ii) by Lemmas 5 and 6. The statement (ii) implies (iii) by Lemma 7. Finally, (iii) implies (i) by Lemma 8.    Inline graphic

One may prove that Inline graphic is a variety of languages in several different ways. It is possible to prove directly that the class Inline graphic is closed under basic language operations. It is also possible to prove that the class of LT-acyclic automata forms a variety of actions in the sense of [7]. Here we complete the previous characterisation by showing the algebraic counterpart of the class Inline graphic; namely, we characterise the corresponding pseudovariety of finite monoids by pseudoidentities. We do not want to recall the notion of pseudoidentities in general. Let us only recall the implicit operation Inline graphic here. If we substitute for x some element s in a finite monoid M, then the image of Inline graphic is Inline graphic, which is a unique idempotent in the subsemigroup of M generated by s. It could be useful to know that, for a fixed finite monoid M, there is a natural number m such that Inline graphic for each Inline graphic.

Theorem 10

Let Inline graphic be an alphabet, Inline graphic, and Inline graphic the syntactic monoid of L. The following statements are equivalent:

  • (i)

    L belongs to Inline graphic.

  • (ii)

    Inline graphic satisfies the pseudoidentities Inline graphic and Inline graphic.

  • (iii)

    Inline graphic satisfies the pseudoidentity Inline graphic.

Proof

Let Inline graphic be the minimal automaton of the language L. Then Inline graphic can be viewed as the transition monoid of Inline graphic (see [12, p. 692]). Elements of Inline graphic are thus transitions of Inline graphic determined by words from Inline graphic. More formally, for Inline graphic, we denote by Inline graphic the transition given by the rule Inline graphic for each Inline graphic. Let m be a natural number such that Inline graphic for each s in Inline graphic.

Let us prove that (i) implies (ii). Suppose that L belongs to Inline graphic. Then Inline graphic is an LT-acyclic automaton by Theorem 9. In particular, the language L is R-trivial as we already mentioned. Hence, the monoid Inline graphic is R-trivial, i.e., Inline graphic satisfies the pseudoidentity Inline graphic. Next, let xy be mapped to elements in Inline graphic which are given by words Inline graphic. We now need to check that Inline graphic. Since Inline graphic is acyclic, we have Inline graphic for every Inline graphic and Inline graphic occurring in v. Since Inline graphic is an LT-acyclic automaton, the loop labelled by a in state Inline graphic is transferred to every state reachable from Inline graphic. In particular, for every letter a occurring in v, there is a loop labelled by a in the state Inline graphic. The equality Inline graphic follows.

Next, let us show that the pseudoidentity Inline graphic is a consequence of pseudoidentities from item (ii). We may interpret xyz as arbitrary elements of any finite monoid M satisfying these pseudoidentities. Let m be such that Inline graphic for each Inline graphic. Then we use the second pseudoidentity from (ii) repetitively, and we get

graphic file with name M319.gif 2

By the first pseudoidentity from (ii), we get Inline graphic. Then we obtain Inline graphic using the equality (2). Thus we get Inline graphic.

Finally, in order to prove that (iii) implies (i), suppose that Inline graphic satisfies the pseudoidentity Inline graphic. Taking Inline graphic, it follows that Inline graphic satisfies the pseudoidentity Inline graphic. Hence, L is R-trivial and Inline graphic is acyclic. Moreover, let Inline graphic and Inline graphic be such that Inline graphic, and take arbitrary Inline graphic. Then Inline graphic in Inline graphic maps p to Inline graphic. Similarly, Inline graphic in Inline graphic maps p to Inline graphic. However, taking Inline graphic, Inline graphic, and Inline graphic in Inline graphic gives us Inline graphic. Therefore, Inline graphic. So, we see that there is a loop labelled by a in the state Inline graphic. We proved that Inline graphic is an LT-acyclic automaton and L belongs to Inline graphic by Theorem 9.    Inline graphic

Corollary 11

The class Inline graphic is a variety of languages corresponding to the pseudovariety of finite monoids Inline graphic given by

graphic file with name M351.gif

Let us also note that Inline graphic is known to describe the pseudovariety of finite monoids Inline graphic; cf. Almeida [1, p. 212], who attributes this result to Pin. Therefore, Inline graphic.

The Main Result

Let us now return to the geometrical closure and prove the main result of this paper: each class of languages lying between the variety of languages Inline graphic and the positive variety Inline graphic is geometrically closed. This strengthens the result from [8] mentioned in the Introduction.

The route that we take to this result (Theorem 16) consists of three steps:

  1. We recall that the class Inline graphic is closed under commutation [5, 10]. Although it is not necessary to obtain our main result, we refine this observation by proving that a commutative closure of a Inline graphic-language is piecewise testable.

  2. We prove that each commutative Inline graphic-language belongs to Inline graphic.

  3. We observe that the variety Inline graphic is closed under prefix reduction.

These three observations imply that the geometrical closure of a Inline graphic-language belongs to Inline graphic, from which our main result follows easily.

Recall the result of Arfi [2], according to which a language belongs to Inline graphic if and only if it is given by a finite union of languages Inline graphic, where Inline graphic are letters from Inline graphic and Inline graphic are subalphabets of Inline graphic. It follows by a more general result of Guaiana, Restivo, and Salemi [10], or of Bouajjani, Muscholl, and Touili [5] that Inline graphic is closed under commutation, and this observation is a first step to Theorem 16.

Let us show that a commutative closure of a Inline graphic-language is in fact piecewise testable.

Lemma 12

A commutative closure of a Inline graphic-language is piecewise testable.

Proof

Let an alphabet Inline graphic be fixed. It is clear that if Inline graphic are languages, then

graphic file with name M375.gif

As a result, it is enough to prove piecewise testability of [L] for all languages Inline graphic, with Inline graphic and Inline graphic.

Let L be of this form. Denote Inline graphic, and Inline graphic. We claim that

graphic file with name M381.gif 3

Indeed, if w is in [L], then Inline graphic for some Inline graphic, while clearly Inline graphic for each a in Inline graphic, and Inline graphic for each b in Inline graphic. Conversely, let w in Inline graphic be such that Inline graphic for each a in Inline graphic, and Inline graphic for each b in Inline graphic. Then Inline graphic for v in Inline graphic given by Inline graphic, where Inline graphic (Inline graphic) is given as follows: if Inline graphic, then

graphic file with name M399.gif

The word v is in L by construction, hence w belongs to [L].

It remains to observe that the language [L] given by (3) is piecewise testable. However, this language is equal to

graphic file with name M400.gif 4

The language on the right-hand side of (4) is piecewise testable.    Inline graphic

We now proceed to prove that the geometrical closure of each language from Inline graphic belongs to Inline graphic.

Lemma 13

Every commutative language L from Inline graphic belongs to Inline graphic.

Proof

If we take into account the proof of Lemma 12 and the fact that Inline graphic is closed under finite unions, it is enough to prove that every language of the form (3) belongs to Inline graphic. We may also use the expression (4) for that language. For each letter Inline graphic and a natural number m, we may write Inline graphic. This shows that the language Inline graphic belongs to Inline graphic. Since Inline graphic is a variety, we see that also the language Inline graphic belongs to Inline graphic. Altogether, the language (4) belongs to the variety Inline graphic.    Inline graphic

Finally, let us observe that the variety Inline graphic is closed under prefix reduction.

Lemma 14

Let L be a language from Inline graphic for some alphabet Inline graphic. Then Inline graphic belongs to Inline graphic as well.

Proof

Let L be recognised by some LT-acyclic automaton Inline graphic. If Inline graphic, then L does not contain the empty word, and consequently Inline graphic, which belongs to Inline graphic. So we may assume that Inline graphic.

Now, simply saying, we claim that the language Inline graphic is recognised by the automaton Inline graphic constructed from Inline graphic by replacing all non-final states with a single absorbing non-final state Inline graphic. More precisely, we construct an automaton Inline graphic, where Inline graphic is a new state, for which we define Inline graphic for each Inline graphic. Furthermore, for each Inline graphic and Inline graphic, we put Inline graphic if Inline graphic, and Inline graphic otherwise. As Inline graphic contains no cycle other than a loop, the constructed automaton Inline graphic has the same property. Moreover, any state of Inline graphic reachable in Inline graphic from some p in Inline graphic is either reachable from p in Inline graphic, or equal to Inline graphic. As Inline graphic for each c in Inline graphic, this implies that Inline graphic is an LT-acyclic automaton and Inline graphic belongs to Inline graphic by Theorem 9.    Inline graphic

Theorem 15

Let Inline graphic be an alphabet and Inline graphic. Then Inline graphic.

Proof

We have Inline graphic by Proposition 2. As Inline graphic is a positive variety of languages, Inline graphic belongs to Inline graphic whenever L belongs to this set by Proposition 1. The language Inline graphic is thus a commutative Inline graphic-language by [5, 10]. (Note that the language Inline graphic is actually commutative piecewise testable, by Lemma 12.) It follows by Lemma 13 that Inline graphic belongs to Inline graphic, and by Lemma 14 that the language Inline graphic belongs to Inline graphic as well.    Inline graphic

We are now prepared to state the main result of this article merely as an alternative formulation of the theorem above.

Theorem 16

Let Inline graphic be a class of languages containing Inline graphic, which is contained in Inline graphic. Then Inline graphic is geometrically closed.

There are many important (positive) varieties studied in the literature for which the main result can be applied.

Corollary 17

The following classes are geometrically closed: the positive variety Inline graphic, the variety Inline graphic, the variety Inline graphic, the variety of all Inline graphic-recognisable languages, the variety of all Inline graphic-recognisable languages.

The variety of all Inline graphic-recognisable languages coincides with the intersection of Inline graphic and its dual. This class has a natural interpretation in terms of logical descriptions of levels in Straubing-Thérien hierarchy (see Section 5 in [15]).

Conclusions

We have introduced a new variety of languages Inline graphic and we have proved that geometrical closures of languages from Inline graphic fall into Inline graphic. As a consequence, we have seen that many natural classes of star-free languages are geometrically closed, namely those between the variety Inline graphic and the positive variety Inline graphic. On the contrary, the variety of all piecewise testable languages Inline graphic is not geometrically closed. The example is not included in the paper due to space limitations.

There are some interesting questions in connection to the paper. First of all, one may ask how to effectively construct a regular expression for the geometrical closure Inline graphic for a given language L from Inline graphic. Note that it is effectively testable, for a given deterministic finite automaton Inline graphic, whether the language Inline graphic belongs to Inline graphic (see [12, p. 725]). It is not clear to us whether a regular expression for Inline graphic can be effectively computed from Inline graphic.

Nevertheless, the main open question related to the topic is to clarify the behaviour of the geometrical closure outside the class Inline graphic.

Footnotes

The first author was supported by Grant 19-12790S of the Grant Agency of the Czech Republic. The second author was supported by the grant VEGA 2/0165/16.

Contributor Information

Alberto Leporati, Email: alberto.leporati@unimib.it.

Carlos Martín-Vide, Email: carlos.martin@urv.cat.

Dana Shapira, Email: shapird@g.ariel.ac.il.

Claudio Zandron, Email: zandron@disco.unimib.it.

Ondřej Klíma, Email: klima@math.muni.cz.

Peter Kostolányi, Email: kostolanyi@fmph.uniba.sk.

References

  • 1.Almeida, J.: Finite Semigroups and Universal Algebra. World Scientific, Singapore (1994)
  • 2.Arfi M. Opérations polynomiales et hiérarchies de concaténation. Theor. Comput. Sci. 1991;91(1):71–84. doi: 10.1016/0304-3975(91)90268-7. [DOI] [Google Scholar]
  • 3.Béal M-P, Champarnaud J-M, Dubernard J-P, Jeanne H, Lombardy S. Decidability of geometricity of regular languages. In: Yen H-C, Ibarra OH, editors. Developments in Language Theory; Heidelberg: Springer; 2012. pp. 62–72. [Google Scholar]
  • 4.Blanpain, B., Champarnaud, J.M., Dubernard, J.P.: Geometrical languages. In: LATA 2007, pp. 127–138 (2007)
  • 5.Bouajjani A, Muscholl A, Touili T. Permutation rewriting and algorithmic verification. Inf. Comput. 2007;205(2):199–224. doi: 10.1016/j.ic.2005.11.007. [DOI] [Google Scholar]
  • 6.Brzozowski JA, Fich FE. Languages of Inline graphic-trivial monoids. J. Comput. Syst. Sci. 1980;20(1):32–49. doi: 10.1016/0022-0000(80)90003-3. [DOI] [Google Scholar]
  • 7.Chaubard L, Pin JÉ, Straubing H. Actions, wreath products of C-varieties and concatenation product. Theor. Comput. Sci. 2006;356(1–2):73–89. doi: 10.1016/j.tcs.2006.01.039. [DOI] [Google Scholar]
  • 8.Dubernard J-P, Guaiana G, Mignot L. Geometrical closure of binary Inline graphic languages. In: Martín-Vide C, Okhotin A, Shapira D, editors. Language and Automata Theory and Applications; Cham: Springer; 2019. pp. 302–314. [Google Scholar]
  • 9.Gehrke M, Grigorieff S, Pin JÉ. Duality and equational theory of regular languages. In: Aceto L, Damgård I, Goldberg LA, Halldórsson MM, Ingólfsdóttir A, Walukiewicz I, editors. Automata, Languages and Programming; Heidelberg: Springer; 2008. pp. 246–257. [Google Scholar]
  • 10.Guaiana G, Restivo A, Salemi S. On the trace product and some families of languages closed under partial commutations. J. Autom. Lang. Comb. 2004;9(1):61–79. [Google Scholar]
  • 11.Pin JÉ. Varieties of Formal Languages. London: North Oxford Academic Publishers; 1986. [Google Scholar]
  • 12.Pin J-E. Syntactic semigroups. In: Rozenberg G, Salomaa A, editors. Handbook of Formal Languages; Heidelberg: Springer; 1997. pp. 679–746. [Google Scholar]
  • 13.Pin JÉ, Straubing H, Thérien D. Small varieties of finite semigroups and extensions. J. Aust. Math. Soc. 1984;37(2):269–281. doi: 10.1017/S1446788700022084. [DOI] [Google Scholar]
  • 14.Sakarovitch J. Elements of Automata Theory. Cambridge: Cambridge University Press; 2009. [Google Scholar]
  • 15.Tesson, P., Thérien, D.: Diamonds are forever: the variety DA. In: Semigroups, Algorithms, Automata and Languages, pp. 475–499. World Scientific (2002)

Articles from Language and Automata Theory and Applications are provided here courtesy of Nature Publishing Group

RESOURCES