Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 Jan 7;12038:315–327. doi: 10.1007/978-3-030-40608-0_22

The Power of Programs over Monoids in Inline graphic

Nathan Grosshans 12,13,
Editors: Alberto Leporati8, Carlos Martín-Vide9, Dana Shapira10, Claudio Zandron11
PMCID: PMC7206935

Abstract

The model of programs over (finite) monoids, introduced by Barrington and Thérien, gives an interesting way to characterise the circuit complexity class Inline graphic and its subclasses and showcases deep connections with algebraic automata theory. In this article, we investigate the computational power of programs over monoids in Inline graphic, a small variety of finite aperiodic monoids. First, we give a fine hierarchy within the class of languages recognised by programs over monoids from Inline graphic, based on the length of programs but also some parametrisation of Inline graphic. Second, and most importantly, we make progress in understanding what regular languages can be recognised by programs over monoids in Inline graphic. We show that those programs actually can recognise all languages from a class of restricted dot-depth one languages, using a non-trivial trick, and conjecture that this class suffices to characterise the regular languages recognised by programs over monoids in Inline graphic.

Introduction

In computational complexity theory, many hard still open questions concern relationships between complexity classes that are expected to be quite small in comparison to the mainstream complexity class Inline graphic of tractable languages. One of the smallest such classes is Inline graphic, the class of languages decided by Boolean circuits of polynomial length, logarithmic depth and bounded fan-in, a relevant and meaningful class, that has many characterisations but whose internal structure still mostly is a mystery. Indeed, among its most important subclasses, we count Inline graphic, Inline graphic and Inline graphic: all of them are conjectured to be different from each other and strictly within Inline graphic, but despite many efforts for several decades, this could only be proved for the first of those classes.

In the late eighties, Barrington and Thérien [3], building on Barrington’s celebrated theorem [2], gave an interesting viewpoint on those conjectures, relying on algebraic automata theory. They defined the notion of a program over a monoid M: a sequence of instructions (if), associating through function f some element of M to the letter at position i in the input of fixed length. In that way, the program outputs an element of M for every input word, by multiplying out the elements given by the instructions for that word; acceptance or rejection then depends on that outputted element. A language of words of arbitrary length is consequently recognised in a non-uniform fashion, by a sequence of programs over some fixed monoid, one for each possible input length; when that sequence is of polynomial length, it is said that the monoid p-recognises that language. Barrington and Thérien’s discovery is that Inline graphic and almost all of its significant subclasses can each be exactly characterised by p-recognition over monoids taken from some suitably chosen variety of finite monoids (a class of finite monoids closed under basic operations on monoids). For instance, Inline graphic, Inline graphic, Inline graphic and Inline graphic correspond exactly to p-recognition by, respectively, finite monoids, finite aperiodic monoids, finite solvable groups and finite solvable monoids. Understanding the internal structure of Inline graphic thus becomes a matter of understanding what finite monoids from some particular variety are able to p-recognise.

It soon became clear that regular languages play a central role in understanding p-recognition: McKenzie, Péladeau and Thérien indeed observed [12] that finite monoids from a variety Inline graphic and a variety Inline graphic p-recognise the same languages if and only if they p-recognise the same regular languages. Otherwise stated, most conjectures about the internal structure of Inline graphic can be reformulated as a statement about where one or several regular languages lie within that structure. This is why a line of previous works got interested into various notions of tameness, capturing the fact that for a given variety of finite monoids, p-recognition does not offer much more power than classical morphism-recognition when it comes to regular languages (see [8, 10, 11, 13, 14, 2022]).

This paper is a contribution to an ongoing study of what regular languages can be p-recognised by monoids taken from “small” varieties, started with the author’s Ph.D. thesis [7]. In a previous paper by the author with McKenzie and Segoufin [8], a novel notion of tameness was introduced and shown for the “small” variety of finite aperiodic monoids Inline graphic. This allowed them to characterise the class of regular languages p-recognised by monoids from Inline graphic as those recognised by so called quasi-Inline graphic morphisms and represented a first small step towards a new proof that the variety Inline graphic of finite aperiodic monoids is tame. This is a statement equivalent to Furst’s, Saxe’s, Sipser’s [6] and Ajtai’s [1] well-known lower bound result about Inline graphic. In [8], the authors also observed that, while Inline graphic “behaves well” with respect to p-recognition of regular languages, the variety Inline graphic, a subclass of Inline graphic, does, in contrast, “behave badly” in the sense that monoids from Inline graphic do p-recognise regular languages that are not recognised by quasi-Inline graphic morphisms.

Now, Inline graphic is a well-studied and fundamental variety in algebraic automata theory (see, e.g., [15, 16]), corresponding through classical morphism-recognition to the class of regular languages in which membership depends on the presence or absence of a finite set of words as subwords. This paper is a contribution to the understanding of the power of programs over monoids in Inline graphic, a knowledge that certainly does not bring us closer to a new proof of the tameness of Inline graphic (as we are dealing with a strict subvariety of Inline graphic), but that is motivated by the importance of Inline graphic in algebraic automata theory and the unexpected power of programs over monoids in Inline graphic. The results we present in this article are twofold: first, we exhibit a fine hierarchy within the class of languages p-recognised by monoids from Inline graphic, depending on the length of those programs and on a parametrisation of Inline graphic; second, we show that a whole class of regular languages, that form a subclass of dot-depth one languages [15], are p-recognised by monoids from Inline graphic while, in general, they are not recognised by any quasi-Inline graphic morphism. This class roughly corresponds to dot-depth one languages where detection of a given factor does work only when it does not appear too often as a subword. We actually even conjecture that this class of languages with additional positional modular counting (that is, letters can be differentiated according to their position modulo some fixed number) corresponds exactly to all those p-recognised by monoids in Inline graphic, a statement that is interesting in itself for algebraic automata theory.

Organisation of the Paper. Following the present introduction, Sect. 2 is dedicated to the necessary preliminaries. In Sect. 3, we present the results about the fine hierarchy and in Sect. 4 we expose the results concerning the regular languages p-recognised by monoids from Inline graphic. Section 5 gives a short conclusion.

Note. This article is based on unpublished parts of the author’s Ph.D. thesis [7].

Preliminaries

Various Mathematical Materials

We assume the reader is familiar with the basics of formal language theory, semigroup theory and recognition by morphisms, that we might designate by classical recognition; for those, we only specify some things and refer the reader to the two classical references of the domain by Eilenberg [4, 5] and Pin [16].

General Notations and Conventions. Let Inline graphic. We shall denote by Inline graphic the set of all Inline graphic verifying Inline graphic. We shall also denote by [i] the set Inline graphic. Given some set E, we shall denote by Inline graphic the powerset of E. All our alphabets and words will always be finite; the empty word will be denoted by Inline graphic.

Varieties and Languages. A variety of monoids is a class of finite monoids closed under submonoids, Cartesian product and morphic images. A variety of semigroups is defined similarly. When dealing with varieties, we consider only finite monoids and semigroups, each having an idempotent power, a smallest Inline graphic such that Inline graphic for any element x. To give an example, the variety of finite aperiodic monoids, denoted by Inline graphic, contains all finite monoids M such that, given Inline graphic its idempotent power, Inline graphic for all Inline graphic.

To each variety Inline graphic of monoids or semigroups we associate the class Inline graphic of languages such that, respectively, their syntactic monoid or semigroup belongs to Inline graphic. For instance, Inline graphic is well-known to be the class of star-free languages.

Quasi Inline graphic Languages. If S is a semigroup we denote by Inline graphic the monoid S if S is already a monoid and Inline graphic otherwise.

The following definitions are taken from [17]. Let Inline graphic be a surjective morphism from Inline graphic to a finite monoid M. For all k consider the subset Inline graphic of M (where Inline graphic is the set of words over Inline graphic of length k). As M is finite there is a k such that Inline graphic. This implies that Inline graphic is a semigroup. The semigroup given by the smallest such k is called the stable semigroup of Inline graphic. If S is the stable semigroup of Inline graphic, Inline graphic is called the stable monoid of Inline graphic. If Inline graphic is a variety of monoids or semigroups, then we shall denote by Inline graphic the class of such surjective morphisms whose stable monoid or semigroup, respectively, is in Inline graphic and by Inline graphic the class of languages whose syntactic morphism is in Inline graphic.

Programs over Monoids. Programs over monoids form a non-uniform model of computation, first defined by Barrington and Thérien [3], extending Barrington’s permutation branching program model [2]. Let M be a finite monoid and Inline graphic an alphabet. A program P over M on Inline graphic is a finite sequence of instructions of the form (if) where Inline graphic and Inline graphic; said otherwise, it is a word over Inline graphic. The length of P, denoted by Inline graphic, is the number of its instructions. The program P defines a function from Inline graphic to M as follows. On input Inline graphic, each instruction (if) outputs the monoid element Inline graphic. A sequence of instructions then yields a sequence of elements of M and their product is the output P(w) of the program. A language Inline graphic is consequently recognised by P whenever there exists Inline graphic such that Inline graphic.

A language L over Inline graphic is recognised by a sequence of programs Inline graphic over some finite monoid M if for each n, the program Inline graphic is on Inline graphic and recognises Inline graphic. We say Inline graphic is of length s(n) for Inline graphic whenever Inline graphic for all Inline graphic and that it is of length at most s(n) whenever there exists Inline graphic verifying Inline graphic for all Inline graphic.

For Inline graphic and Inline graphic a variety of monoids, we denote by Inline graphic the class of languages recognised by sequences of programs over monoids in Inline graphic of length at most s(n). The class Inline graphic is then the class of languages p-recognised by a monoid in Inline graphic, i.e. recognised by sequences of programs over monoids in Inline graphic of polynomial length.

The following is an important property of Inline graphic.

Proposition 1

([12, Corollary 3.5]). Let Inline graphic be a variety of monoids, then Inline graphic is closed under Boolean operations.

Given two alphabets Inline graphic and Inline graphic, a Inline graphic-program on Inline graphic for Inline graphic is defined just like a program over some finite monoid M on Inline graphic, except that instructions output letters from Inline graphic and thus that the program outputs words over Inline graphic. Let now Inline graphic and Inline graphic. We say that L program-reduces to K if and only if there exists a sequence Inline graphic of Inline graphic-programs (the program-reduction) such that Inline graphic is on Inline graphic and Inline graphic for each Inline graphic. The following proposition shows closure of Inline graphic also under program-reductions.

Proposition 2

([7, Proposition 3.3.12 and Corollary 3.4.3]). Let Inline graphic and Inline graphic be two alphabets. Let Inline graphic be a variety of monoids. Given Inline graphic in Inline graphic and Inline graphic from which there exists a program-reduction to K of length t(n), for Inline graphic, we have that Inline graphic. In particular, when K is recognised (classically) by a monoid in Inline graphic, we have that Inline graphic.

Tameness and the Variety Inline graphic

We won’t introduce any of the proposed notions of tameness but will only state that the main consequence for a variety of monoids Inline graphic to be tame in the sense of [8] is that Inline graphic. This consequence has far-reaching implications from a computational-complexity-theoretic standpoint when Inline graphic happens to be equal to a circuit complexity class. For instance, tameness for Inline graphic implies that Inline graphic, which is equivalent to the fact that Inline graphic does not contain the language Inline graphic of words over Inline graphic containing a number of 1s not divisible by m for any Inline graphic (a central result in complexity theory [1, 6]).

Let us now define the variety of monoids Inline graphic. A finite monoid M of idempotent power Inline graphic belongs to Inline graphic if and only if Inline graphic for all Inline graphic. It is a strict subvariety of the variety Inline graphic, containing all finite monoids M of idempotent power Inline graphic such that Inline graphic for all Inline graphic, itself a strict subvariety of Inline graphic. The variety Inline graphic is a “small” one, well within Inline graphic.

We now give some specific definitions and results about Inline graphic that we will use, based essentially on [9], but also on [16, Chapter 4, Section 1].

For some alphabet Inline graphic and each Inline graphic, let us define the equivalence relation Inline graphic on Inline graphic by Inline graphic if and only if u and v have the same set of k-subwords (subwords of length at most k), for all Inline graphic. The relation Inline graphic is a congruence of finite index on Inline graphic. For an alphabet Inline graphic and a word Inline graphic, we shall write Inline graphic for the language of all words over Inline graphic having u as a subword. In the following, we consider that Inline graphic has precedence over Inline graphic and Inline graphic (but of course not over concatenation).

We define the class of piecewise testable languages Inline graphic as the class of regular languages such that for every alphabet Inline graphic, we associate to Inline graphic the set Inline graphic of all languages over Inline graphic that are Boolean combinations of languages of the form Inline graphic where Inline graphic. In fact, Inline graphic is the set of languages over Inline graphic equal to a union of Inline graphic-classes for some Inline graphic (see [18]). Simon showed [18] that a language is piecewise testable if and only if its syntactic monoid is in Inline graphic, i.e. Inline graphic.

We can define a hierarchy of piecewise testable languages in a natural way. For Inline graphic, let the class of k-piecewise testable languages Inline graphic be the class of regular languages such that for every alphabet Inline graphic, we associate to Inline graphic the set Inline graphic of all languages over Inline graphic that are Boolean combinations of languages of the form Inline graphic where Inline graphic with Inline graphic. We then have that Inline graphic is the set of languages over Inline graphic equal to a union of Inline graphic-classes. Let us define Inline graphic the inclusion-wise smallest variety of monoids containing the quotients of Inline graphic by Inline graphic for any alphabet Inline graphic: we have that a language is k-piecewise testable if and only if its syntactic monoid belongs to Inline graphic, i.e. Inline graphic. (See [9, Section 3].)

Fine Hierarchy

The first part of our investigation of the computational power of programs over monoids in Inline graphic concerns the influence of the length of programs on their computational capabilities.

We say two programs over a same monoid on the same set of input words are equivalent if and only if they recognise the same languages. Tesson and Thérien proved in [23] that for any monoid M in Inline graphic, there exists some Inline graphic such that for any alphabet Inline graphic there is a constant Inline graphic verifying that any program over M on Inline graphic for Inline graphic is equivalent to a program over M on Inline graphic of length at most Inline graphic. Since Inline graphic, any monoid in Inline graphic does also have this property. However, this does not imply that there exists some Inline graphic working for all monoids in Inline graphic, i.e. that Inline graphic collapses to Inline graphic.

In this section, we show on the one hand that, as for Inline graphic, while Inline graphic collapses to Inline graphic for any super-polynomial function Inline graphic, there does not exist any Inline graphic such that Inline graphic collapses to Inline graphic; and on the other hand that Inline graphic does optimally collapse to Inline graphic for each Inline graphic.

Strict Hierarchy

Given Inline graphic, we say that Inline graphic is a k-selector over n if Inline graphic is a function of Inline graphic that associates a subset of [n] to each vector in Inline graphic. For any sequence Inline graphic such that Inline graphic is a k-selector over n for each Inline graphic—a sequence we will call a sequence of k-selectors—, we set Inline graphic, where for each Inline graphic, the language Inline graphic is the set of words over Inline graphic of length Inline graphic that can be decomposed into Inline graphic consecutive blocks Inline graphic of n letters where the first k blocks each contain 1 exactly once and uniquely define a vector Inline graphic in Inline graphic, where for all Inline graphic, Inline graphic is given by the position of the only 1 in Inline graphic (i.e. Inline graphic) and v is such that there exists Inline graphic verifying that Inline graphic is 1. Observe that for any k-selector Inline graphic over 0, we have Inline graphic.

We now proceed similarly to what has been done in Subsection 5.1 in [8] to show, on one hand, that for all Inline graphic, there is a monoid Inline graphic in Inline graphic such that for any sequence of k-selectors Inline graphic, the language Inline graphic is recognised by a sequence of programs over Inline graphic of length at most Inline graphic; and, on the other hand, that for all Inline graphic there is a sequence of k-selectors Inline graphic such that for any finite monoid M and any sequence of programs Inline graphic over M of length at most Inline graphic, the language Inline graphic is not recognised by Inline graphic.

We obtain the following proposition.

Proposition 3

For all Inline graphic, we have Inline graphic. More precisely, for all Inline graphic and Inline graphic, we have Inline graphic.

Collapse

Looking at Proposition 3, it looks at first glance rather strange that, for each Inline graphic, we can only prove strictness of the hierarchy inside Inline graphic up to exponent Inline graphic. We now show, in a way similar to Subsection 5.2 in [8], that in fact Inline graphic does collapse to Inline graphic for all Inline graphic, showing Proposition 3 to be optimal in some sense.

Proposition 4

Let Inline graphic. Let Inline graphic and Inline graphic be an alphabet. Then there exists a constant Inline graphic such that any program over M on Inline graphic for Inline graphic is equivalent to a program over M on Inline graphic of length at most Inline graphic.

In particular, Inline graphic for all Inline graphic.

Regular Languages in Inline graphic

The second part of our investigation of the computational power of programs over monoids in Inline graphic is dedicated to understanding exactly what regular languages can be p-recognised by monoids in Inline graphic.

Non-tameness of Inline graphic

It is shown in [8] that Inline graphic, thus giving an example of a well-known subvariety of Inline graphic for which p-recognition allows to do unexpected things when recognising a regular language. How far does this unexpected power go?

The first thing to notice is that, though none of them is in Inline graphic, all languages of the form Inline graphic and Inline graphic for Inline graphic an alphabet and Inline graphic are in Inline graphic. Indeed, each of them can be recognised by a sequence of constant-length programs over the syntactic monoid of Inline graphic: for every input length, just output the image, through the syntactic morphism of Inline graphic, of the word made of the Inline graphic first or last letters. So, informally stated, programs over monoids in Inline graphic can check for some constant-length beginning or ending of their input words.

But they can do much more. Indeed, the language Inline graphic does not belong to Inline graphic (compute the stable monoid), yet it is in Inline graphic. The crucial insight is that it can be program-reduced in linear length to the piecewise testable language of all words over Inline graphic having ca as a subword but not the subwords cca, caa and cb by using the following trick (that we shall call “feedback-sweeping”) for input length Inline graphic: read the input letters in the order Inline graphic, output the letters read. This has already been observed in [8, Proposition 5].

Lemma 1

Inline graphic.

Using variants of the “feedback-sweeping” reading technique, we can prove that the phenomenon just described is not an isolated case.

Lemma 2

The languages Inline graphic, Inline graphic, Inline graphic, Inline graphic and Inline graphic do all belong to Inline graphic.

Hence, we are tempted to say that there are “much more” regular languages in Inline graphic than just those in Inline graphic, even though it is not clear to us whether Inline graphic or not. But can we show any upper bound on Inline graphic? It turns out that we can, relying on two known results.

First, since Inline graphic, we have Inline graphic, so Theorem 6 in [8], that states Inline graphic, implies that Inline graphic.

Second, let us define an important superclass of the class of piecewise testable languages. Let Inline graphic be an alphabet and Inline graphic (Inline graphic); we define Inline graphic. The class of dot-depth one languages is the class of Boolean combinations of languages of the form Inline graphic, Inline graphic and Inline graphic for Inline graphic an alphabet, Inline graphic and Inline graphic. The inclusion-wise smallest variety of semigroups containing all syntactic semigroups of dot-depth one languages is denoted by Inline graphic and verifies that Inline graphic is exactly the class of dot-depth one languages. (See [11, 15, 19].) It has been shown in [11, Corollary 8] that Inline graphic (if we extend the program-over-monoid formalism in the obvious way to finite semigroups). Now, we have Inline graphic, so that Inline graphic and hence Inline graphic.

To summarise, we have the following.

Proposition 5

Inline graphic.

In fact, we conjecture that the inverse inclusion does also hold.

Conjecture 1

Inline graphic.

Why do we think this should be true? Though, for a given alphabet Inline graphic, we cannot decide whether some word Inline graphic of length at least 2 appears as a factor of any given word w in Inline graphic with programs over monoids in Inline graphic (because Inline graphic), Lemma 2 and the possibilities offered by the “feedback-sweeping” technique give the impression that we can do it when we are guaranteed that u appears at most a fixed number of times in w, which seems somehow to be what dot-depth one languages become when restricted to belong to Inline graphic. This intuition motivates the definition of threshold dot-depth one languages.

Threshold Dot-Depth One Languages

The idea behind the definition of threshold dot-depth one languages is that we take the basic building blocks of dot-depth one languages, of the form Inline graphic for an alphabet Inline graphic, for Inline graphic and Inline graphic, and restrict them so that, given Inline graphic, membership of a word does really depend on the presence of a given word Inline graphic as a factor if and only if it appears less than l times as a subword.

Definition 1

Let Inline graphic be an alphabet. For all Inline graphic and Inline graphic, we define Inline graphic to be the language of words over Inline graphic containing Inline graphic as a subword or u as a factor, i.e. Inline graphic. Then, for all Inline graphic (Inline graphic) and Inline graphic, we define Inline graphic.

Obviously, for each Inline graphic an alphabet, Inline graphic and Inline graphic, the language Inline graphic equals Inline graphic. Over Inline graphic, the language Inline graphic contains all words containing a letter c verifying that in the prefix up to that letter, ababab appears as a subword or ab appears as a factor. Finally, the language Inline graphic over Inline graphic of Lemma 1 is equal to Inline graphic.

We then define a threshold dot-depth one language as any Boolean combination of languages of the form Inline graphic, Inline graphic and Inline graphic for Inline graphic an alphabet, for Inline graphic and Inline graphic.

Confirming the intuition briefly given above, the technique of “feedback-sweeping” can indeed be pushed further to prove that the whole class of threshold dot-depth one languages is contained in Inline graphic, and we dedicate the remainder of this section to prove it. Concerning Conjecture 1, our intuition leads us to believe that, in fact, the class of threshold dot-depth one languages with additional positional modular counting is exactly Inline graphic. We simply refer the interested reader to Section 5.4 of the author’s Ph.D. thesis [7], that contains a partial result supporting this belief, too technical and long to be presented here.

Let us now move on to the proof of the following theorem.

Theorem 1

Every threshold dot-depth one language belongs to Inline graphic.

As Inline graphic is closed under Boolean operations (Proposition 1), our goal is to prove, given an alphabet Inline graphic, given Inline graphic and Inline graphic (Inline graphic), that Inline graphic is in Inline graphic; the case of Inline graphic and Inline graphic for Inline graphic is easily handled (see the discussion at the beginning of Subsect. 4.1). To do this, we need to put Inline graphic in some normal form. It is readily seen that Inline graphic where the Inline graphic’s are defined thereafter.

Definition 2

Let Inline graphic be an alphabet.

For all Inline graphic, Inline graphic and Inline graphic, set Inline graphic.

Building directly a sequence of programs over a monoid in Inline graphic that decides Inline graphic for some alphabet Inline graphic and Inline graphic seems however tricky. We need to split things further by controlling precisely how many times each Inline graphic for Inline graphic appears in the right place when it does less than l times. To do this, we consider, for each Inline graphic, the language Inline graphic defined below.

Definition 3

Let Inline graphic be an alphabet.

For all Inline graphic (Inline graphic), Inline graphic, Inline graphic, we set

graphic file with name 492458_1_En_22_Equ2_HTML.gif

Now, for a given Inline graphic, we are interested in the words of Inline graphic such that for each Inline graphic verifying Inline graphic, the word Inline graphic indeed appears as a factor in the right place. We thus introduce a last language Inline graphic defined as follows.

Definition 4

Let Inline graphic be an alphabet.

For all Inline graphic (Inline graphic), Inline graphic, Inline graphic, we set

graphic file with name 492458_1_En_22_Equ3_HTML.gif

We now have the normal form we were looking for to prove Theorem 1: Inline graphic is equal to the union, over all Inline graphic, of the intersection of Inline graphic and Inline graphic. Though rather intuitive, the correctness of this decomposition is not so straightforward to prove and, actually, we can only prove it when for each Inline graphic, the letters in Inline graphic are all distinct.

Lemma 3

Let Inline graphic be an alphabet, Inline graphic and Inline graphic (Inline graphic) such that for each Inline graphic, the letters in Inline graphic are all distinct. Then,

graphic file with name M281.gif

Our goal now is to prove, given an alphabet Inline graphic, given Inline graphic and Inline graphic (Inline graphic) such that for each Inline graphic, the letters in Inline graphic are all distinct, that for any Inline graphic, the language Inline graphic is in Inline graphic; closure of Inline graphic under union (Proposition 1) consequently entails that Inline graphic. The way Inline graphic and Inline graphic are defined allows us to reason as follows. For each Inline graphic verifying Inline graphic, let Inline graphic be the language of words w over Inline graphic containing Inline graphic as a subword but not Inline graphic and such that Inline graphic with Inline graphic and Inline graphic, where Inline graphic and Inline graphic. If we manage to prove that for each Inline graphic verifying Inline graphic we have Inline graphic, we can conclude that Inline graphic does belong to Inline graphic by closure of Inline graphic under intersection, Proposition 1. The lemma that follows, the main lemma in the proof of Theorem 1, exactly shows this. The proof crucially uses the “feedback sweeping” technique, but note that we actually don’t know how to prove it when we do not enforce that for each Inline graphic, the letters in Inline graphic are all distinct.

Lemma 4

Let Inline graphic be an alphabet and Inline graphic such that its letters are all distinct. For all Inline graphic and Inline graphic, we have

graphic file with name 492458_1_En_22_Equ4_HTML.gif

Proof (Sketch)

Let Inline graphic be an alphabet and Inline graphic such that its letters are all distinct. Let Inline graphic and Inline graphic. We let

graphic file with name 492458_1_En_22_Equ5_HTML.gif

If Inline graphic, the lemma follows trivially because L is piecewise testable and hence belongs to Inline graphic, so we assume Inline graphic.

For each letter Inline graphic, we shall use Inline graphic distinct decorated letters of the form Inline graphic for some Inline graphic, using the convention that Inline graphic; of course, for two distinct letters Inline graphic, we have that Inline graphic and Inline graphic are distinct for all Inline graphic. We denote by A the alphabet of these decorated letters. The main idea of the proof is, for a given input length Inline graphic, to build an A-program Inline graphic over Inline graphic such that, given an input word Inline graphic, it first ouputs the Inline graphic first letters of w and then, for each i going from Inline graphic to n, outputs Inline graphic, followed by Inline graphic (a “sweep” of Inline graphic letters backwards down to position Inline graphic, decorating the letters incrementally) and finally by Inline graphic (a “sweep” forwards up to position i, continuing the incremental decoration of the letters). The idea behind this way of rearranging and decorating letters is that, given an input word Inline graphic, as long as we make sure that w and thus Inline graphic do contain Inline graphic as a subword but not Inline graphic, then Inline graphic can be decomposed as Inline graphic where Inline graphic, Inline graphic, and Inline graphic are minimal, with z containing Inline graphic as a subword for some Inline graphic if and only if Inline graphic. This means we can check whether Inline graphic by testing whether w belongs to some fixed piecewise testable language over A.

As explained before stating the previous lemma, we can now use it to prove the result we were aiming for.

Proposition 6

Let Inline graphic be an alphabet, Inline graphic and Inline graphic (Inline graphic) such that for each Inline graphic, the letters in Inline graphic are all distinct. For all Inline graphic, we have Inline graphic.

We thus derive the awaited corollary.

Corollary 1

Let Inline graphic be an alphabet, Inline graphic and Inline graphic (Inline graphic) such that for each Inline graphic, the letters in Inline graphic are all distinct. Then, Inline graphic.

However, what we really want to obtain is that Inline graphic without putting any restriction on the Inline graphic’s. But, in fact, to remove the constraint that the letters must be all distinct in each of the Inline graphic’s, we simply have to decorate each of the input letters with its position minus 1 modulo a big enough Inline graphic. This finally leads to the following proposition.

Proposition 7

Let Inline graphic be an alphabet, Inline graphic and Inline graphic (Inline graphic). Then Inline graphic.

This finishes to prove Theorem 1 by closure of Inline graphic under Boolean combinations (Proposition 1) and by the discussion at the beginning of Subsect. 4.1.

Conclusion

Although Inline graphic is very small compared to Inline graphic, we have shown that programs over monoids in Inline graphic are an interesting subject of study in that they allow to do quite unexpected things. The “feedback-sweeping” technique allows one to detect presence of a factor thanks to such programs as long as this factor does not appear too often as a subword: this is the basic principle behind threshold dot-depth one languages, that our article shows to belong wholly to Inline graphic.

Whether threshold dot-depth one languages with additional positional modular counting do correspond exactly to the languages in Inline graphic seems to be a challenging question, that we leave open. In his Ph.D. thesis [7], the author proved that all strongly unambiguous monomials (the basic building blocks in Inline graphic) that are imposed to belong to Inline graphic at the same time are in fact threshold dot-depth one languages. However, the proof looks much too complex and technical to be extended to, say, all languages in Inline graphic. New techniques are probably needed, and we might conclude by saying that proving (or disproving) this conjecture could be a nice research goal in algebraic automata theory.

Acknowledgements

The author thanks the anonymous referees for their helpful comments and suggestions.

Contributor Information

Alberto Leporati, Email: alberto.leporati@unimib.it.

Carlos Martín-Vide, Email: carlos.martin@urv.cat.

Dana Shapira, Email: shapird@g.ariel.ac.il.

Claudio Zandron, Email: zandron@disco.unimib.it.

Nathan Grosshans, Email: nathan.grosshans@polytechnique.edu, https://www.di.ens.fr/~ngrosshans/.

References

  • 1.Ajtai, M.: Inline graphic-formulae on finite structures. Ann. Pure Appl. Logic 24(1), 1–48 (1983)
  • 2.Barrington DAM. Bounded-width polynomial-size branching programs recognize exactly those languages in NCInline graphic J. Comput. Syst. Sci. 1989;38(1):150–164. doi: 10.1016/0022-0000(89)90037-8. [DOI] [Google Scholar]
  • 3.Barrington DAM, Thérien D. Finite monoids and the fine structure of NCInline graphic J. ACM. 1988;35(4):941–952. doi: 10.1145/48014.63138. [DOI] [Google Scholar]
  • 4.Eilenberg S. Automata, Languages, and Machines. New York: Academic Press; 1974. [Google Scholar]
  • 5.Eilenberg S. Automata, Languages, and Machines. New York: Academic Press; 1976. [Google Scholar]
  • 6.Furst ML, Saxe JB, Sipser M. Parity, circuits, and the polynomial-time hierarchy. Math. Syst. Theory. 1984;17(1):13–27. doi: 10.1007/BF01744431. [DOI] [Google Scholar]
  • 7.Grosshans, N.: The limits of Nečiporuk’s method and the power of programs over monoids taken from small varieties of finite monoids. Ph.D. thesis, University of Paris-Saclay, France (2018)
  • 8.Grosshans, N., McKenzie, P., Segoufin, L.: The power of programs over monoids in DA. In: MFCS 2017, Aalborg, Denmark, 21–25 August 2017, pp. 2:1–2:20 (2017)
  • 9.Klíma O, Polák L. Hierarchies of piecewise testable languages. Int. J. Found. Comput. Sci. 2010;21(4):517–533. doi: 10.1142/S0129054110007404. [DOI] [Google Scholar]
  • 10.Lautemann C, Tesson P, Thérien D. An algebraic point of view on the Crane Beach property. In: Ésik Z, editor. Computer Science Logic; Heidelberg: Springer; 2006. pp. 426–440. [Google Scholar]
  • 11.Maciel A, Péladeau P, Thérien D. Programs over semigroups of dot-depth one. Theor. Comput. Sci. 2000;245(1):135–148. doi: 10.1016/S0304-3975(99)00278-9. [DOI] [Google Scholar]
  • 12.McKenzie P, Péladeau P, Thérien D. NCInline graphic: the automata-theoretic viewpoint. Comput. Complex. 1991;1:330–359. doi: 10.1007/BF01212963. [DOI] [Google Scholar]
  • 13.Péladeau, P.: Classes de circuits booléens et variétés de monoïdes. Ph.D. thesis, Université Pierre-et-Marie-Curie (Paris-VI), Paris, France (1990)
  • 14.Péladeau P, Straubing H, Thérien D. Finite semigroup varieties defined by programs. Theor. Comput. Sci. 1997;180(1–2):325–339. doi: 10.1016/S0304-3975(96)00297-6. [DOI] [Google Scholar]
  • 15.Pin, J.: The dot-depth hierarchy, 45 years later. In: The Role of Theory in Computer Science - Essays Dedicated to Janusz Brzozowski, pp. 177–202 (2017)
  • 16.Pin J. Varieties of Formal Languages. New York: Plenum Publishing Co.; 1986. [Google Scholar]
  • 17.Pin J, Straubing H. Some results on Inline graphic-varieties. ITA. 2005;39(1):239–262. [Google Scholar]
  • 18.Simon I. Piecewise testable events. In: Brakhage H, editor. Automata Theory and Formal Languages; Heidelberg: Springer; 1975. pp. 214–222. [Google Scholar]
  • 19.Straubing H. Finite semigroup varieties of the form Inline graphic J. Pure Appl. Algebra. 1985;36:53–94. doi: 10.1016/0022-4049(85)90062-3. [DOI] [Google Scholar]
  • 20.Straubing H. When can one finite monoid simulate another? In: Birget JC, Margolis S, Meakin J, Sapir M, editors. Algorithmic Problems in Groups and Semigroups. Boston: Springer; 2000. pp. 267–288. [Google Scholar]
  • 21.Straubing H. Languages defined with modular counting quantifiers. Inf. Comput. 2001;166(2):112–132. doi: 10.1006/inco.2000.2923. [DOI] [Google Scholar]
  • 22.Tesson, P.: Computational complexity questions related to finite monoids and semigroups. Ph.D. thesis, McGill University, Montreal (2003)
  • 23.Tesson P, Thérien D. The computing power of programs over finite monoids. J. Autom. Lang. Comb. 2001;7(2):247–258. [Google Scholar]

Articles from Language and Automata Theory and Applications are provided here courtesy of Nature Publishing Group

RESOURCES