Circular Tessera Codes in the Evolution of the Genetic Code

Elena Fimmel; Martin Starman; Lutz Strüngmann

doi:10.1007/s11538-020-00724-z

. 2020 Apr 4;82(4):48. doi: 10.1007/s11538-020-00724-z

Circular Tessera Codes in the Evolution of the Genetic Code

Elena Fimmel ^1,^✉, Martin Starman ¹, Lutz Strüngmann ¹

PMCID: PMC7128014 PMID: 32248310

Abstract

The origin of the modern genetic code and the mechanisms that have contributed to its present form raise many questions. The main goal of this work is to test two hypotheses concerning the development of the genetic code for their compatibility and complementarity and see if they could benefit from each other. On the one hand, Gonzalez, Giannerini and Rosa developed a theory, based on four-based codons, which they called tesserae. This theory can explain the degeneracy of the modern vertebrate mitochondrial code. On the other hand, in the 1990s, so-called circular codes were discovered in nature, which seem to ensure the maintenance of a correct reading-frame during the translation process. It turns out that the two concepts not only do not contradict each other, but on the contrary complement and enrichen each other.

Keywords: Genetic code, Degeneracy, Circular code, Tessera

Introduction

In 1986, John Maynard Smith stated: “We understand biological phenomena only when we have invented machines with similar properties” (Smith 1986, pp 99–100). This quotation explains the motivation of this work quite well. This paper was written in order to better understand the origin of the genetic code using such a machinery. One possible machine or rather a model which gives a feasible explanation for an important aspect of the evolutionary processes of the genetic code was found by Gonzalez, Giannerini and Rosa. In their work “On the origin of degeneration in the genetic code” (Gonzalez et al. 2019) they focus on the degeneracy of amino acid coding and especially on symmetry as an essential cause and consequence of the natural phenomena of degeneracy (compare also Fimmel and Strüngmann 2016). A famous example, which shows the importance of including symmetry deliberations when considering natural phenomena, can be found in quantum mechanics. Here, symmetry describes more than just the patterns that matter takes – it is used to classify the nature of quantum states. This is by far not the only example of its kind. Noether’s theorem even states a one-to-one connection between fundamental laws of nature - so-called conservation laws- and respective symmetries in nature.

Taking these general considerations into account, Gonzalez, Giannerini, and Rosa argue that none of the theories regarding the origin of the genetic code pays the necessary attention to the idea of symmetry (Gonzalez et al. 2019). As a consequence the concept of tessera codes was developed. The tesserae build a subset of all tetranucleotides, chosen in such a way that the degeneracy of the vertebrate mitochondrial genetic code can be explained from the symmetries of the tesserae (Gonzalez et al. 2012).

The other line of thought adressed by the current work is the theory of circular codes. This theory is intended to explain the property of the noise-immunity of the genetic code, and is based on a proposal by Crick et al. (1957). They argue that the coding of amino acids requires only a subset of codons where the correct reading-frame is automatically and immediately recognizable - the so-called comma-free property. While Crick’s theory was refuted in reality (Nirenberg and Matthaei 1961), 40 years later so-called circular codes were discovered in nature (Arqués and Michel 1996). More specifically, it has been noticed that the set of codons, which, together with their frame-shifts in three potential reading-frames, are the most commonly used across all species, has very remarkable properties in terms of detecting the correct reading-frame (Fimmel and Strüngmann 2018; Fimmel et al. 2016; Michel 2017). The comma-free codes proposed by Crick belong to the same family of circular codes, but within them they have the most distinct error-detecting properties (see, for instance, Fimmel et al. 2018, 2017, 2016, xxxx). The natural circular codes have even more interesting structural properties, which makes it very doubtful that these structures play no role in biological processes (Arqués and Michel 1996, Fimmel and Strüngmann (2018)).

The primary goal of this work is to combine the two concepts, tesserae and circular codes, and see if they could benefit from each other. In this work we specify among other things a construction algorithm for circular tessera codes of maximal length. Furthermore, self-complementary tessera codes are characterized and criteria for their self-complementarity are formulated and proved in the language of graph theory. The growth tables for circular and comma-free tessera codes are also presented for the first time. In summary, one result of the work is that the two concepts under scrutiny—that of tessera codes and circularity—have proved to be mutually compatible and complementary.

Thus, with this work we hope to bring more clarity into the possible role of tesserae in the evolutionary process of the genetic code and the mechanisms behind it.

Definitions and Notations

The genetic code is written with words of three letters called codons, built over an alphabet

\begin{matrix} B : = {U (T), C, A, G} \end{matrix}

of four letters which are called nucleotide bases Uracil (Thymine), Cytosine, Adenine, and Guanine, in short U(T), C, A, G. Clearly, the number of codons is $4^{3} = 64$ and by $| B^{3} |$ we will denote the cardinality of the set $B^{3}$ . Accordingly, the set $B^{2}$ denotes the set of 16 dinucleotides and the set $B^{4}$ contains the 256 tetranucleotides. It is hypothesized that during evolution the genetic code had several ancestors that might have consisted not only of trinucleotides but of dinucleotides or tetranucleotides or even combinations of these (see Baranov et al. 2009; Gonzalez et al. 2012; Seligmann 2014; Patel 2005; Wilhelm and Nikolajewa 2004; Wu et al. 2005). In particular, in Gonzalez et al. (2012) the tessera code was suggested as an ancestral code that might have been the origin of the mitochondrial code (see also Gonzalez et al. 2019). In order to define the tessera code we have to introduce some group theory and how it can be applied in the genetic setting.

Klein Four-Group and Equivalence Classes of Dinucleotides

The symmetric group on a set of elements is usually known as the group of permutations of these elements. Applying this to our genetic alphabet $B$ we define the symmetric group $S_{B}$ as

\begin{matrix} S_{B} = {π : B \to B ∣ π i s b i j e c t i v e} \end{matrix}

with the usual group operation given by composition of functions. Recall that a group $(H, \circ)$ is a set H together with an operation $\circ : H \to H$ such that $\circ$ is associative and H contains a neutral element e as well as inverses $h^{- 1}$ for all $h \in H$ (see Rotman 1995 for more details on groups). The group $S_{B}$ has $4! = 24$ elements and is trivially isomorphic to the symmetric group $S_{4}$ on four elements. We will use standard notation as can be found in Rotman (1995), e.g. we will either write $π = (A, G, C, U)$ or $π : (A, U, C, G) \to (G, A, U, C)$ if $π$ satisfies $π (A) = G, π (U) = A, π (C) = U$ , and $π (G) = C$ . Naturally, any permutation $π : B \to B$ can be applied to n-nucleotides of any length componentwise, i.e. if $x = b_{1} \dots b_{n} \in B^{n}$ , then $π (x) = π (b_{1}) \dots π (b_{n})$ . There is no danger of confusion when denoting the induced bijective map $B^{n} \to B^{n}$ by $π$ again for any natural number n.

In Fimmel et al. (2014), Fimmel et al. (2015) a subgroup L of $S_{B}$ was identified that seems to play an important role in error-detection and error-correction mechanisms during the translation process. This group consists of all permutations from $S_{B}$ that preserve the codon-anticodon relation and can be geometrically interpreted as the symmetry group of a square. In particular, it contains 4 bijective transformations of nucleotide bases that are invariant with respect to the chemical characters of the nucleotides (we will use the notations from Fimmel et al. 2014, 2015). These are the

Identity:

\begin{matrix} I (or i d) : (A, U, C, G) \to (A, U, C, G) ; \end{matrix}

Strong/Weak (SW) or complementary transformation:

\begin{matrix} SW (or c) : (A, U, C, G) \to (U, A, G, C) ; \end{matrix}

Pyrimidine/Purine (YR) transformation:

\begin{matrix} YR (or p) : (A, U, C, G) \to (G, C, U, A) ; \end{matrix}

and Keto/Amino (KM) transformation:

\begin{matrix} KM (or r) : (A, U, C, G) \to (C, G, A, U) . \end{matrix}

In particular, the complementary map c is biologically important since it mirrors the hydrogen bonds $A \leftrightarrow T$ and $C \leftrightarrow G$ of the DNA double helix. Moreover, the transformation r from above carries codons of degeneracy class 4 to codons of degeneracy class less than 4 and vice versa - a symmetry property of the genetic code that was already observed by Rumer (see Fimmel et al. 2014, 2015 for more details). In the sequel we will denote the set of these four transformations as $V = {I, S W, Y R, K M}$ (Fig. 1).

Fig. 1 — Graphical representation of the primeval base symmetries. KM is represented by red, YR by green and SW by blue colored lines (Color figure online)

Equipped with the usual group operation of $S_{B}$ the set $V$ forms a subgroup of the symmetric group $S_{B}$ which is isomorph to the so-called Klein four-group. It can be easily verified that the group $V$ is commutative, i.e. $α \circ β = β \circ α$ for all $α, β \in V$ and that all permutations in $V$ are of order two, i.e applying them twice yields the identity $α \circ α = i d$ for every $α \in V$ .

As we will see in the next section, the group $V$ is used in order to define the class of tesserae in mathematical terms. If we consider $V$ acting on the set of dinucleotides $B^{2}$ we obtain four orbits of size four. Recall that an orbit of an element x (here a dinculeotide) under some group H (here $V$ ) is defined as $[x] = {h (x) : h \in H}$ . Each orbit represents an equivalence class under the natural equivalence relation $d_{1} d_{2} \sim d_{1}^{'} d_{2}^{'}$ if and only if there is $π \in V$ such that $π (d_{1} d_{2}) = d_{1}^{'} d_{2}^{'}$ . An easy observation shows that for each such equivalence class there is a unique transformation $π \in V$ that maps the first nucleotide of a dinucleotide in that class to the second nucleotide, e.g. the map SW for the class $[A U] = {A U, U A, C G, G C}$ . Table 1 below shows the four equivalence classes and the corresponding permutations.

Table 1.

Each column is one of the four equivalence classes of dinucleotides: $Σ_{I} = [A A]$ , $Σ_{SW} = [A U]$ , $Σ_{YR} = [A C]$ , $Σ_{KM} = [A G]$ under the action of $V$ on $B^{2}$

$V$	$Σ_{I}$	$Σ_{SW}$	$Σ_{YR}$	$Σ_{KM}$
I	AA	AU	AC	AG
SW	UU	UA	UG	UC
YR	CC	CG	CA	CU
KM	GG	GC	GU	GA

Open in a new tab

The left most column shows the transformation that sends the first dinucleotide in the class to the second, third and fourth, respectively, e.g. $Y R (A A) = C C$ . The column header are the equivalence classes names. The header index is the unique transformation used for mapping the first nucleotide of a dinucleotide to the second

We are now almost in the position to define the set of tesserae as introduced in Gonzalez et al. (2012). But before we need some more technicalities. Besides the group $S_{B}$ acting as a group of exchanges of bases, there is a second important group which consists of transformations that permute the positions of single bases in a nucleotide sequence. Together with the usual composition $\circ$ of maps these permutations form again a group that once more can be seen as a symmetric group $S_{n}$ . For the convenience of the reader we here only recall the biologically relevant permutations that will be of importance for us: the so-called reversing permutation and the $n - 1$ shift operations $α_{1}, \dots, α_{n - 1}$ . Given an n-nucleotide $x = N_{1} \dots N_{n}$ we define $\overset{\leftarrow}{^{}}$ and $α_{k}$ for $k \leq n - 1$ as

\begin{matrix} \overset{\leftarrow}{N_{1} \dots N_{n}} = N_{n} \dots N_{1}, α_{k} (x) = N_{k + 1} \dots N_{n} N_{1} \dots N_{k} \end{matrix}

which are the n-nucleotides obtained from x by reversing or a shift of k positions, respectively. Explicitely, for $n = 4$ we have

\begin{matrix} \overset{\leftarrow}{N_{1} N_{2} N_{3} N_{4}} = N_{4} N_{3} N_{2} N_{1} \end{matrix}

and

\begin{matrix} α_{1} (N_{1} N_{2} N_{3} N_{4}) = N_{2} N_{3} N_{4} N_{1} \\ α_{2} (N_{1} N_{2} N_{3} N_{4}) = N_{3} N_{4} N_{1} N_{2} α_{3} (N_{1} N_{2} N_{3} N_{4}) = N_{4} N_{1} N_{2} N_{3} \end{matrix}

It is now obvious that the anti-n-nucleotide of some n-nucleotide x can be described as $\overset{\leftarrow}{S W (x)}$ with the complementary map SW from $V$ . For trinucleotides (codons) it is well-known that the anti-codon is always different from the codon. However, if n is even it might happen that $\overset{\leftarrow}{S W (x)} = x$ for some n-nucleotide x. These nucleotide sequences are called self-complementary. For example, if $n = 4$ , then the tetranucleotide ACGU is self-complementary since $\overset{\leftarrow}{S W (A C G U)} = \overset{\leftarrow}{UGCA} = A C G U$ .

Tesserae: Definition and Structure

Tesserae were motivated biologically in an evolutionary context in Gonzalez et al. (2012). Each tessera is a tetranucleotide that has a particular form that comes from the symmetries induced by the group $V$ . Let us give a definition of a tessera in mathematical terms (see also Gonzalez et al. 2012 and Fimmel and Strüngmann 2019):

Definition 2.1

A tessera is a tetranucleotide (four letter word) $t \in B^{4}$ of the form

\begin{matrix} t = N_{1} N_{2} α (N_{1}) α (N_{2}) \end{matrix}

where $N_{1}, N_{2} \in B$ and $α \in V$ . The set of all valid tesserae is denoted by TESS.

The set TESS is also called the tessera code since it is a subset of $B^{4}$ and hence a code in the sense that every concatenation of words from TESS has a unique decomposition over TESS. Clearly, the size of TESS is 64 and so we have $| T E S S | = | B^{3} |$ . Table 2 shows the set of all tesserae together with their generating transformation.

Table 2.

The table of all tessera with the generating transformation

Dinucleotide	id	c	p	r
AA	AAAA	AAUU	AAGG	AACC
CC	CCCC	CCGG	CCUU	CCAA
GG	GGGG	GGCC	GGAA	GGUU
UU	UUUU	UUAA	UUCC	UUGG
AC	ACAC	ACUG	ACGU	ACCA
AG	AGAG	AGUC	AGGA	AGCU
AU	AUAU	AUUA	AUGC	AUCG
CA	CACA	CAGU	CAUG	CAAC
CG	CGCG	CGGC	CGUA	CGAU
CU	CUCU	CUGA	CUUC	CUAG
GA	GAGA	GACU	GAAG	GAUC
GC	GCGC	GCCG	GCAU	GCUA
GU	GUGU	GUCA	GUAC	GUUG
UA	UAUA	UAAU	UACG	UAGC
UC	UCUC	UCAG	UCCA	UCGA
UG	UGUG	UGAC	UGCA	UGGU

Open in a new tab

It is easy to see that a codon $N_{1} N_{2} N_{3} \in B^{3}$ can be uniquely extended to a valid tessera $t e s s (N_{1} N_{2} N_{3}) = N_{1} N_{2} N_{3} N_{4}$ by determing the unique permutation $α \in V$ such that $α (N_{1}) = N_{3}$ and letting $N_{4} = α (N_{3})$ . This shows that the tessera code TESS is 1-error-correcting and it was shown in Fimmel and Strüngmann (2019) that TESS can be obtained as a linear code from $B^{3}$ and by the so-called Plotkin construction from $B^{2}$ - for more details on this see (Fimmel and Strüngmann 2019).

In Gonzalez et al. (2012) the idea of symmetric primeval adaptor molecules that could recognize the normal reading frame in the coding strand in the 3 $'$ –5 $'$ direction, in the complementary strand in the 3 $'$ –5 $'$ direction, in the coding strand in the reverse 5 $'$ –3 $'$ direction and in the complementary strand in the reverse 5 $'$ –3 $'$ direction was utilized to propose an ancient model of tRNA adaptors that explains the reading mechanism and degeneracy distribution of the tesserae. In particular, since there exist self-complementary tesserae, e.g. ACGU, the tessera code allows degeneracy 2 and 4 only. Maintaining the degeneracy an algorithm was suggested in Gonzalez et al. (2019) for passing from the tessera code back to the (mitochondrial) genetic code in the following way: We assign to each of the transformations from $V$ a letter in the genetic alphabet via $I \leftrightarrow A$ , $S W \leftrightarrow U$ , $K M \leftrightarrow C$ and $Y R \leftrightarrow G$ and then perform the following algorithm displayed in Fig. 2.

Fig. 2 — Schematic representation of the mapping between the tessera $b_{1}$ $b_{2}$ $b_{3}$ $b_{4}$ onto the codon $x_{1}$ $x_{2}$ $x_{3}$ . (Color figure online)

For instance, the tessera ACGU will be mapped to the codon CUU since $K M (A) = C$ and $S W (C) = G$ . In the sequel we will denote by $c o d (N_{1} N_{2} N_{3} N_{4})$ the corresponding codon under this algorithm. However, note that the two mappings $t e s s (\cdot)$ and $c o d (\cdot)$ are not inverses of each other.

We now aim for a better description of tesserae. Let us assume that $N_{1} N_{2} N_{3} N_{4}$ is a tessera. By definition there is an element $α \in V$ such that

\begin{matrix} N_{3} N_{4} = α (N_{1} N_{2}) . \end{matrix}

This implies that $N_{1} N_{2}$ and $N_{3} N_{4}$ have to be in the same equivalence class $Σ_{α}$ displayed in Table 1. Thus, the tessera code can be split into four disjoint subsets.

\begin{matrix} T E S S = T E S S_{I} \cup T E S S_{SW} \cup T E S S_{YR} \cup T E S S_{KM} \end{matrix}

where

\begin{matrix} T E S S_{i} = {d_{1} d_{2} \in T E S S ∣ d_{1}, d_{2} \in Σ_{i}} for i \in {I, S W, Y R, K M} . \end{matrix}

Clearly, any subset $X \subseteq T E S S$ has a similar induced decomposition where the components could be empty.

Definition 2.2

Let $X \subseteq B^{4}$ be a tessera code. Then

\begin{matrix} X = X_{I} \cup X_{SW} \cup X_{YR} \cup X_{KM} \end{matrix}

where

\begin{matrix} X_{i} = X \cap T E S S_{i} for i \in {I, S W, Y R, K M} . \end{matrix}

The above decomposition will be used in Sect. 4 for constructing all maximal circular tessera codes.

Graph Theoretical Approach

In this section we recall a graph theory approach from Fimmel et al. (2016) that turned out to be very useful for determining properties of circular codes (see Sect. 3 for the definition of circularity) and extend it to our setting of tesserae. To each subset $X \subseteq B^{n}$ a directed graph $G (X)$ will be associated as the union of disjoint components $C_{j} (X)$ where $1 \leq j \leq ⌊ \frac{n}{2} ⌋$ . The vertices of such a component $C_{j} (X)$ will be initial segments and end segments of n-tuples from X of length l and $n - l$ , respectively.

Definition 2.3

Let $n \in N$ and $X \subseteq B^{n}$ . For $1 \leq j \leq ⌊ \frac{n}{2} ⌋$ we define a graph component $C_{j} (X) = (V_{j} (X), E_{j} (X))$ with set of vertices $V_{j} (X)$ and set of arcs $E_{j} (X)$ as follows:

$V_{j} (X) : = {N_{1} \dots N_{j}, N_{j + 1} \dots N_{n}, N_{1} \dots N_{n - j}, N_{n - j + 1} \dots N_{n} : N_{1} N_{2} N_{3} \dots N_{n} \in X}$
$E_{j} (X) : = {N_{1} \dots N_{j} \to N_{j + 1} \dots N_{n}, N_{1} \dots N_{n - j} \to N_{n - j + 1} \dots N_{n} : N_{1} N_{2} N_{3} \dots N_{n} \in X}$

The graph $C (X)$ associated to X is the union $C (X) = ⋃_{1 \leq j \leq ⌊ \frac{n}{2} ⌋} C_{j} (X)$ of the graphs $C_{j} (X)$ for all $1 \leq j \leq ⌊ \frac{n}{2} ⌋$ . The graph $C (X)$ is called the representing graph of X.

It is easy to see that the graph components $C_{j} (X)$ of a representing graph $G (X)$ are pairwise disjoint since their labels have different lengths. However, the components need not be connected. For the convenience of the reader and for a better illustration we give some examples for $n = 2, 3$ and 4 (Figs. 3, 4 and 5).

Fig. 3 — Graphical representation $G (X)$ of the dinucleotide code X = {*UC, CG, GU, AC, AA*} which has only one component $C_{1} (X)$ . (Color figure online)

Fig. 4 — Graphical representation $G (X)$ of the trinucleotide code X = {*UCA, UAC, CAU, ACA, ACG*} which has only one component $C_{1} (X)$ that is not connected. (Color figure online)

Fig. 5 — Graphical representation $G (X)$ of the tetranucleotide code X = {*AAUC, ACUA, ACUU, CUCU, CUUU*} which has two components $C_{1} (X)$ and $C_{2} (X)$ that are both not connected but have two components themselves. (Color figure online)

Since the tesserae are tetranucleotides it follows that any set of tesserae has two (maybe empty) graph components in their representing graph, one with labels of length 1 and 3 and the other with labels of length 2.

In Fimmel et al. (2016) the graph approach was used to characterize circularity of codes in terms of graph theory. We will consider circular tessera codes in the next section but it seems reasonable to state the corresponding theorem in this section. For the technical definition of circularity see Definition 3.1.

Theorem 2.4

Let $X \subseteq B^{n}$ . Then the following are equivalent:

X is a circular code;
the representing graph $G (X)$ is acyclic, i.e. does not contain any cycle.

In the particular case of tesserae we will use a second graph associated to a set that we shall utilize later on in order to construct maximal circular tessera codes.

Definition 2.5

Let $X \subseteq T E S S$ . The di-cut-graphs $T_{1, 3} (X)$ and $T_{2, 4} (X)$ associated to X are defined as the representing graphs $G (X_{1, 3})$ and $G (X_{2, 4})$ of the sets

\begin{matrix} X_{1, 3} = {N_{1} N_{3} ∣ N_{1} N_{2} N_{3} N_{4} \in X} \end{matrix}

and

\begin{matrix} X_{2, 4} = {N_{2} N_{4} ∣ N_{1} N_{2} N_{3} N_{4} \in X} \end{matrix}

To conclude this section we give an example of a di-cut-graph $T (X)$ of some tessera code X (Fig. 6).

Circular Tessera Codes

In this section we consider circular tessera codes. Simply speaking circularity means that a frame-shift in any concatenation of tesserae from that code will be detected. In the biological setting of the genetic code, a circular set of trinucleotides was first observed in Arqués and Michel (1996) and is supposed to play an important role in error-detection mechanisms during the translational process. We start with the definition of circularity for tesserae.

Definition 3.1

Let $n \in N$ . A tessera code $X \subseteq B^{4}$ is called n-circular if for any set of tessera $t_{i} \in X$ $(1 \leq i \leq n)$ the concatenation $t_{1} \dots t_{m}$ has a unique decomposition into tesserae from the code X for any $m \leq n$ if considered on the circle. We will call a tessera code $X \subseteq B^{4}$ circular, if it is n-circular for all $n \in N$ .

As we had noted before in Theorem 2.4 a tessera code X is circular if and only if its representing graph $G (X)$ is acyclic. Moreover, it is easy to see that the code X is n-circular if and only if for any concatenation $t_{1} \dots t_{m}$ of tesserae $t_{1}, \dots, t_{m}$ from X with $m \leq n$ the shifted sequences $α_{i} (t_{1} \dots t_{m})$ for $i \leq 3$ do not yield a valid sequence in $X^{m}$ , i.e.

\begin{matrix} α_{i} (t_{1} \dots t_{m}) \notin X^{m} \end{matrix}

In particular, a tessera code X is 1-circular if it does not contain the cyclically shifted tesserae of its members, i.e.

\begin{matrix} α_{i} (t) \notin X \end{matrix}

for all $i \leq 3$ and $t \in X$ . Therefore, a circular code can not contain any tessera that equals one of its shifts, e.g. $A C A C = α_{2} (A C A C)$ , and it makes sense to consider the equivalence classes that are formed by tesserae and their circular shifts. If all shifts are different, then this class is called complete. There are 12 such complete equivalence classes, each containing 4 elements. Four other classes each contain one element ${A A A A}, {C C C C}, {G G G G}, {U U U U}$ and six classes each with two elements like ${A C A C, C A C A}$ . Table 5 displays all the complete equivalence classes of tesserae (Table 3).

Table 5.

Numbers of self-complementary circular codes of different code lengths

Code length	1	2	3	4	5	6	7	8	9	10	11	12
Number	12	72	304	996	2580	5408	9264	12708	13696	11232	6144	1584

Open in a new tab

Table 3.

List of complete equivalence classes

Tessera	Shift 1	Shift 2	Shift 3	Class number
$AAUU$	AUUA	$UUAA$	UAAU	$D_{1}$
AAGG	AGGA	GGAA	GAAG	$D_{2}$
AACC	ACCA	CCAA	CAAC	$D_{3}$
$CCGG$	CGGC	$GGCC$	GCCG	$D_{4}$
CCUU	CUUC	UUCC	UCCU	$D_{5}$
UUGG	UGGU	GGUU	GUUG	$D_{6}$
$AGCU$	GCUA	$CUAG$	UAGC	$D_{7}$
$UGCA$	GCAU	$CAUG$	AUGC	$D_{8}$
$GUAC$	UACG	$ACGU$	CGUA	$D_{9}$
AGUC	GUCA	UCAG	CAGU	$D_{10}$
$UCGA$	CGAU	$GAUC$	AUCG	$D_{11}$
ACUG	CUGA	UGAC	GACU	$D_{12}$

Open in a new tab

Self-complementary tesserae are in bold

Since any circular code is also 1-circular and there are only 12 complete equivalence classes, it is obvious that a circular tessera code can contain at most 12 elements.

Definition 3.2

A circular tessera code is called maximal if it contains exactly 12 elements.

We will show in Sect. 4 how to construct all maximal circular tessera codes and now give an example of a 1-circular tessera code that is not 2-circular.

Example 3.3

Let $X = {A C G U, C A U G, G U C A, U G A C}$ . Then X is a 1-circular tessera code but the word ACGUCAUG has two decompositions on a circle

\begin{matrix} A C G U | C A U G and G U C A | U G A C = α_{2} (A C G U C A U G) . \end{matrix}

Thus X is not 2-circular. In particular, the graph component $C_{2} (X)$ of the representing graph of $G (X)$ of X contains a cycle.

Moreover, the example below shows that also the classes of 2- and 3-circular tessera codes are different:

Example 3.4

Let $X = {C A G U, U G C A, G U U G}$ . Then X is a 2-circular (by means of easy computations) but not a 3-circular tessera code since the word CAGUUGCAGUUG has two decompositions on a circle

\begin{matrix} C A G U | U G C A | G U U G and G U U G | C A G U | U G C A \end{matrix}

We show next that the graph component $C_{2} (G)$ being not acyclic is not an accident but in fact it is the only possibility for 1-circular codes not to be circular. In order to do so recall that a cycle in a graph $G$ is a sequence $e_{1} \to \dots \to e_{n} \to e_{1}$ of distinct vertices $e_{i}$ $(i \leq n)$ in $G$ . The length of this cycle is then defined to be n. Note that for $n = 1$ a cycle of length 1 is a loop.

Proposition 3.5

Let X be a tessera code. Then the following hold:

(i)
The maximal length of a cycle in $C_{1} (X)$ is 2; in particular, the maximal length of a path that does not contain a cycle is 1;
(ii)
The maximal length of a cycle in $C_{2} (X)$ is 4; in particular, the maximal length of a path that does not contain a cycle is 3.

Proof

Let X be a tessera code. We first prove (i) by showing that any path in $C_{1} (X)$ of length 2 must contain a cycle. Hence assume that $C_{1} (X)$ contains a path of length 2. Without loss of generality we may assume that it starts with a nucleotide, e.g.

\begin{matrix} N_{1} \to N_{2} N_{3} N_{4} \to N_{5} . \end{matrix}

Then $N_{1} N_{2} N_{3} N_{4}$ and $N_{2} N_{3} N_{4} N_{5}$ are valid tesserae from X. By definition of tesserae the former tells us that there is a transformation $α \in V$ such that $α (N_{2}) = N_{4}$ and $α (N_{3}) = N_{1}$ . The latter however, then implies that also $α (N_{3}) = N_{5}$ and so $N_{1} = N_{5}$ which shows that $α_{1} (N_{1} N_{2} N_{3} N_{4}) = N_{2} N_{3} N_{4} N_{1} \in X$ and $N_{1} \to N_{2} N_{3} N_{4} \to N_{1}$ is a cycle.

We now prove (ii) by showing that any path of length 4 in $C_{2} (X)$ contains a cycle. Assume that $C_{2} (X)$ contains a path of length 4, e.g.

\begin{matrix} N_{1} N_{2} \to N_{3} N_{4} \to N_{5} N_{6} \to N_{7} N_{8} \to N_{9} N_{10} . \end{matrix}

By definition of $G (X)$ there are permutations $π_{1}, π_{2}, π_{3}, π_{4} \in V$ such that

\begin{matrix} π_{1} (N_{1} N_{2}) = N_{3} N_{4}, π_{2} (N_{3} N_{4}) = N_{5} N_{6}, \\ π_{3} (N_{5} N_{6}) = N_{7} N_{8}, π_{4} (N_{7} N_{8}) = N_{9} N_{10} \end{matrix}

If one of the $π_{i}$ is the identity we obtain a cycle of length 1 (a loop). Thus all $π_{i}$ are different from the identity. If $π_{1} = π_{2}$ , then $N_{1} N_{2} = N_{5} N_{6}$ since $π_{1}^{2} = I$ . This gives a cycle of length 2. Thus $π_{1} \neq π_{2}$ and similarly $π_{2} \neq π_{3}$ , $π_{3} \neq π_{4}$ . If $π_{1} \neq π_{3}$ , then the group structure of $V$ implies that $π_{1} \circ π_{2} = π_{3}$ and so $N_{7} N_{8} = N_{1} N_{2}$ , hence we obtain a cycle of length 3. Finally, if $π_{1} = π_{3}$ , then similar arguments as above show that we get a cycle of length 3 or $π_{2} = π_{4}$ holds. Now

\begin{matrix} π_{4} (π_{3} (π_{2} (π_{1} (N_{1} N_{2})))) = π_{2} (π_{1} (π_{2} (π_{1} (N_{1} N_{2})))) = N_{9} N_{10} \end{matrix}

but $V$ is commutative and all elements in $V$ are of order 2, hence

\begin{matrix} N_{9} N_{10} = π_{2} (π_{1} (π_{2} (π_{1} (N_{1} N_{2})))) = N_{1} N_{2} . \end{matrix}

Consequently, the path itself is a cycle of length 4. $□$

As a corollary we obtain an important theorem. Note that part (ii) was also obtained in a bachelor-thesis (Cisowski 2015) with a much more technical proof.

Theorem 3.6

Let X be a tessera code. Then the following hold:

(i)
If X is 1-circular, then $C_{1} (X)$ is acyclic;
(ii)
The following two conditions are equivalent:
1. X is circular;
2. X is 3-circular.

Proof

We first prove (i). By Proposition 3.5 we know that the maximal length of a cycle in $C_{1} (X)$ is 2, hence a cycle would be of the form $N_{1} N_{2} \to N_{3} N_{4} \to N_{1} N_{2}$ which contradicts 1-circularity since $α_{2} (N_{1} N_{2} N_{3} N_{4}) = N_{3} N_{4} N_{1} N_{2}$ .

In order to prove (ii) note that by Proposition 3.5 the maximal length of a cycle in $G (X)$ is 4. However, a cycle of even length 2 is excluded by 1-circularity and of length 4 by 2-circularity since

\begin{matrix} N_{1} N_{2} \to N_{3} N_{4} \to N_{5} N_{6} \to N_{7} N_{8} \to N_{1} N_{2} \end{matrix}

implies that $N_{1} N_{2} N_{3} N_{4} | N_{5} N_{6} N_{7} N_{8}$ has two decompositions - a contradiction. Hence $G (X)$ does not contain any cycle of even length and the maximal length of an odd cycle is 3. By Theorem 2.3 from [13] we conclude that X is circular if and only if it is 3-circular. $□$

We conclude this section with a result that gives a handy criterion for constructing circular tessera codes and some application.

Theorem 3.7

Let $X \subseteq T E S S$ be a tessera code. Then X is circular if

X is 1-circular
One of the di-cut graphs $T_{1, 3} (X)$ and $T_{2, 4} (X)$ is acyclic.

Proof

Assume that X is 1-circular and one of the di-cut graphs $T_{1, 3} (X)$ and $T_{2, 4} (X)$ is acyclic. Without loss of generality we assume that $T_{1, 3} (X)$ is acyclic. Assume that X is not circular. Then Proposition 3.5 and Theorem 3.6 imply that the component $C_{1} (X)$ is acyclic and the maximal length of a cycle in $C_{2} (X)$ is 4. Assume without loss of generality that

\begin{matrix} N_{1} N_{2} \to N_{3} N_{4} \to N_{5} N_{6} \to N_{7} N_{8} \to N_{1} N_{2} \end{matrix}

is a cycle in $G (X)$ . Thus the tesserae $N_{1} N_{2} N_{3} N_{4}, N_{3} N_{4} N_{5} N_{6}, N_{5} N_{6} N_{7} N_{8}$ and $N_{7} N_{8} N_{1} N_{2}$ are in X. By definition of $T_{1, 3} (X)$ is follows that $N_{1} N_{3}, N_{3} N_{5}, N_{5} N_{7}$ and $N_{7} N_{1}$ are dinucleotides in the set $X_{1, 3}$ and hence $N_{1}, N_{3}, N_{5}$ and $N_{7}$ are vertices of $T_{1, 3} (X)$ . Moreover,

\begin{matrix} N_{1} \to N_{3} \to N_{5} \to N_{7} \to N_{1} \end{matrix}

is a cycle in $T (X)$ - a contradiction to the fact that $T_{1, 3} (X)$ is acyclic. $□$

The converse of Theorem 3.7 does not hold as the following example shows. Note, however, that the code $X_{1, 3}$ (respectively $X_{2, 4}$ ) can never contain dinucleotides of the form NN since they would imply that there is a tessera of the form NKNK in X which contradicts 1-circularity.

Example 3.8

Let

\begin{matrix} X = {A G U C, G A A G, C A A C, G G C C, A G C U, U G C A, G U A C, U U A A, C G A U, \\ G A C U, C U U C, G U U G}, \end{matrix}

then X is a maximal circular tessera code but neither $T_{1, 3} (X)$ nor $T_{2, 4} (X)$ is acyclic.

We now state some application of the above results in order to construct maximal circular tessera codes from circular dinucleotide codes. In fact, the constructed codes will even have stronger properties:

Definition 3.9

A circular tessera code $X \subseteq T E S S$ is called a $C^{4}$ -code if also the three shifted codes $α_{1} (X)$ , $α_{2} (X)$ and $α_{3} (X)$ are circular.

Recall from Fimmel et al. (2015) that a maximal circular dinucleotide code $D \subseteq B^{2}$ must be of the form $D = {N_{1} N_{2}, N_{1} N_{3}, N_{1} N_{4}, N_{2} N_{3}, N_{2} N_{4}, N_{3} N_{4}}$ where $N_{1} > N_{2} > N_{3} > N_{4}$ is any linear ordering of the genetic alphabet $B$ .

Proposition 3.10

Let $D = {N_{1} N_{2}, N_{1} N_{3}, N_{1} N_{4}, N_{2} N_{3}, N_{2} N_{4}, N_{3} N_{4}}$ be a maximal circular dinucleotide code. Then

\begin{matrix} X = & {N_{1} N_{1} N_{2} N_{2}, N_{1} N_{1} N_{3} N_{3}, N_{1} N_{1} N_{4} N_{4}} \cup \\ \cup {N_{1} N_{3} N_{2} N_{4}, N_{1} N_{4} N_{2} N_{3}} \cup \\ \cup {N_{1} N_{2} N_{3} N_{3}, N_{1} N_{4} N_{3} N_{2}} \cup \\ \cup {N_{1} N_{3} N_{4} N_{2}, N_{1} N_{2} N_{4} N_{3}} \cup \\ \cup {N_{2} N_{2} N_{3} N_{3}, N_{2} N_{2} N_{4} N_{4}, N_{3} N_{3} N_{4} N_{4}} \end{matrix}

is a maximal tessera $C^{4}$ -code such that $T_{1, 3} (X) = G (D)$ .

Proof

We first prove circularity of the code X. Clearly, $T_{1, 3} (X) = G (D)$ . Since D is circular its graph $G (D)$ is acyclic by Theorem 2.4 and thus we only need to verify that X is 1-circular by Theorem 3.7. But this is clear since the code contains exactly one tessera from each of the twelve complete equivalence classes from Table 5.

Now let $X^{(n)}$ be the nth shift of X for $n \leq 3$ . Then we have

\begin{matrix} X_{1, 3}^{(1)} = & {N_{1} N_{2}, N_{1} N_{3}, N_{1} N_{4}, N_{3} N_{4}, N_{4} N_{3}, N_{2} N_{3}, N_{4} N_{2}, N_{3} N_{2}, N_{2} N_{4}} \\ X_{1, 3}^{(2)} = & {N_{2} N_{1}, N_{3} N_{1}, N_{4} N_{1}, N_{3} N_{2}, N_{4} N_{2}, N_{4} N_{3}} = \overset{\leftarrow}{X_{1, 3}} = \overset{\leftarrow}{D} \\ X_{1, 3}^{(3)} = & {N_{2} N_{1}, N_{3} N_{1}, N_{4} N_{1}, N_{4} N_{3}, N_{3} N_{4}, N_{3} N_{2}, N_{2} N_{4}, N_{4} N_{2}, N_{4} N_{3}} = \overset{\leftarrow}{X_{1, 3}^{(1)}} \end{matrix}

Clearly, $X_{1, 3}^{(2)}$ is a dinucleotide circular code since it is equal to $\overset{\leftarrow}{D}$ , hence its representing graph $G (X_{1, 3}^{(2)}) = T_{1, 3} (X^{(2)})$ is acyclic and as above $X^{(2)}$ is 1-circular. By Theorem 3.7 we conclude that $X^{(2)}$ is a circular code.

It remains to show that also $X^{(1)}$ and $X^{(3)}$ are circular. However, in this case

\begin{matrix} X_{2, 4}^{(1)} = \overset{\leftarrow}{X_{1, 3}} = \overset{\leftarrow}{D} \end{matrix}

which is circular and so Theorem 3.7 implies that also $X^{(1)}$ and also $X^{(3)}$ are circular. Hence X is a $C^{4}$ -code. $□$

We would like to remark that the construction in the above lemma has some flexibility, e.g the tessera of the form $N_{i} N_{i} N_{j} N_{j}$ can be substituted by tessera from the same equivalence class. However, it is not obvious how to construct all maximal circular tessera codes using this method. Nevertheless, in the next section we will give a way to obtain all such codes.

Construction of All Maximal Circular Tessera Codes

This section introduces one possibility to construct all maximal circular tessera codes. Recall that a circular tessera code is maximal if it contains exactly 12 elements. The construction will be accomplished in two major steps. Firstly, for each of the four equivalence classes from Table 1 we define a tournament on four vertices which are representing the single dinucleotides. Finally, we combine the four tournaments constructed in the previous step to construct maximal circular tessera codes. Recall that a tournament is a complete oriented graph (see e.g. Clark and Holton 1991). Figure 7 shows an example of a tournament.

Fig. 7 — An acyclic tournament on four nodes. (Color figure online)

As already proved in Theorem 3.6, the graph component $C_{1} (X)$ associated to a tessera code X has either no path bigger than 1 or X is not circular. Even more precise, if $C_{1} (X)$ is acyclic the code X must not even be 1-circular. Considering that, a construction of a maximal circular tessera code could almost be reduced to the problem of constructing a valid and acyclic $C_{2}$ which represents a correct tessera code X.

Step 1:
In this step we construct four acyclic tournaments which together represent a tessera code $X^{'}$ of length 24 so that $C_{2} (X^{'})$ is acyclic. Note that a tournament on 4 vertices has exactly 6 edges and in order to be acyclic it has to be isomorphic to the tournament given in Fig. 7. Below we will show how to construct tournaments on four vertices that represent a correct (circular) tessera code, i.e. the tournaments will be acyclic. Together they form the desired code $X^{'}$ as
$\begin{matrix} X^{'} = & X_{I} \cup X_{SW} \cup X_{YR} \cup X_{KM} \end{matrix}$ 1

$\begin{matrix} with | X_{I} | = & | X_{SW} | = | X_{YR} | = | X_{KM} | = 6 and, thus, | X^{'} | = 24 \end{matrix}$ 2
As it can be seen from the construction, $C_{2} (X^{'})$ is acyclic as it is the union of acyclic tournaments, while $C_{1} (X^{'})$ is not. Yet, for this initial step we can ignore this fact. Since $C_{2} (X_{I})$ , $C_{2} (X_{SW})$ , $C_{2} (X_{YR})$ and $C_{2} (X_{KM})$ are disjoint it is sufficient that these subgraphs are acyclic to ensure the acyclicity of $C_{2} (X^{'})$ . As mentioned above, each of these subgraphs has to be isomorphic to the graph in Fig. 7.

Let us choose one of the equivalence classes $Σ_{i}, i \in {I, S W, Y R, K M}$ and assign numbers 1, 2, 3, 4 to the dinucleotides of $Σ_{i}$ . Now we draw directed edges from each node to the nodes with a higher number. This way we will obtain four acyclic tournaments, each of them represents a circular tessera code of size 6. This gives 4! possible assignments per subgraph. Hence, there are altogether ${(4!)}^{4} = 331776$ tessera codes of size 24 with an acyclic $C_{2}$ -component.
Step 2:
In this step, we use the 331776 tessera codes, constructed in Step 1, to construct all possible maximal circular tessera codes. Since the $C_{2}$ is already acyclic, it is sufficient to focus on $C_{1}$ .

Lemma 4.1

Let $X^{'}$ be a tessera code constructed as above and

\begin{matrix} t = N_{1} N_{2} N_{3} N_{4} = N_{1} N_{2} γ (N_{1} N_{2}) \in X^{'} \end{matrix}

for some $γ \in V$ . Then the following hold:

$α_{2} (t) \notin X^{'}$
$α_{3} (t) = N_{4} N_{1} N_{2} N_{3} \in X^{'} or α_{1} (t) = N_{2} N_{3} N_{4} N_{1} \in X^{'}$ .

Proof

First we prove (1). Obviously, t is represented by the arrow $N_{1} N_{2} \to N_{3} N_{4}$ in the corresponding tournament. Obviously, $γ \neq i d$ . Let us consider $α_{2} (t) = N_{3} N_{4} N_{1} N_{2}$ . It follows that $α_{2} (t) \notin X^{'}$ since it would be represented in the same tournament by the opposite directed arrow $N_{4} N_{3} \to N_{1} N_{2}$ - a contradiction. Now we claim that one of the remaining shifts of t

\begin{matrix} α_{3} (t) = N_{4} N_{1} N_{2} N_{3} or α_{1} (t) = N_{2} N_{3} N_{4} N_{1} \end{matrix}

is necessarily in the code $X^{'}$ . Let us first assert that the dinucleotides $N_{4} N_{1}$ and $N_{2} N_{3}$ cannot be in the same equivalence class as $N_{1} N_{2}$ and $N_{3} N_{4}$ since in this case $N_{4} = N_{2}$ takes place and, thus, $γ = i d$ . Consequently, one of the arrows $N_{4} N_{1} \to N_{2} N_{3}$ or $N_{2} N_{3} \to N_{4} N_{1}$ is drawn in the corresponding tournament and it follows that $α_{3} (t) \in X^{'}$ or $α_{1} (t) \in X^{'}$ . This proves (2). $□$

The above lemma shows that consequently, $X^{'}$ consists of 12 pairs of cyclically equivalent tesserae. To ensure that the codes are circular, one of the cyclically equivalent tuples must be removed. This has to be done for all 12 cyclically equivalent pairs of tuples in such a code $X^{'}$ . It follows that each of the 331776 codes can be used to construct $2^{12}$ circular codes - with possible repetitions. It remains to prove that all maximal circular tessera codes can be obtained this way. Let X be a auch a maximal code. As shown above, the $C_{2}$ component of each $X_{i}, i \in {I, K M, S W, Y R}$ is a simple directed acyclic graph with a maximum of four nodes. According to Theorem 3.1 (Fimmel et al. 2017), such a graph can be embedded in an acyclic tournament. In Step 1, all possible acyclic tournaments are constructed. Step 2 takes all possible subgraphs of each tournament and combines those. This ensures that all possible maximal circular tessera codes are represented in the construction.

Hence, the total amount of $2^{12} \times {(4!)}^{4} = 1358954496$ constructed maximal circular tessera codes include all maximal circular tessera codes.

The table below gives the exact numbers of circular and even $C^{4}$ - codes (compare 3.9) for all cardinalities from 1 to the maximum 12. Moreover, it also shows that number of comma-free codes. Recall that comma-free codes form a subclass of circular codes.

Definition 4.2

A code $X \subseteq B^{l}$ is called comma-free if any concatenation $x_{1} x_{2}$ does not contain any $x \in X$ as a substring except for $x_{1}$ (as initial segment) and $x_{2}$ (as end segment) themselves.

Clearly, a comma-free code is circular and X is comma-free if and only if it associated graph has no path of length more than 2 (see Fimmel et al. 2016) (Table 4).

Table 4.

Numbers of circular, comma-free and $C^{4}$ -tessera codes of different code lengths

Code length	# 1-circular codes	# Circular codes	# $C^{4}$ -codes	# Comma free codes
1	48	48	48	48
2	1056	1056	1056	1056
3	14080	14048	14016	13952
4	126720	125544	124368	122376
5	811008	791952	773088	745584
6	3784704	3606048	3433584	3214272
7	12976128	11908800	10922112	9816960
8	32440320	28230456	24577404	20952504
9	57671680	46720800	37987120	30297824
10	69206016	51111024	38129856	28015728
11	50331648	33113472	22240992	14790144
12	16777216	9592512	5685408	3351232

Open in a new tab

Self-Complementary Circular Tessera Codes

In this section we will discuss some properties of self-complementary tessera codes. In particular, we will determine all maximal self-complementary comma-free tessera codes and give a graph-theoretical characterization of self-complementarity for tessera codes.

Let us first recall the definition of self-complementarity of a code.

Definition 4.3

Let $X \subseteq B^{ℓ}$ be a $ℓ$ -nucleotide code. We will call X self-complementary if for each $ℓ$ -nucleotide $x \in X$ its anti- $ℓ$ -nucleotide $\overset{\leftarrow}{S W (x)}$ is also in X:

\begin{matrix} x \in X \Leftrightarrow \overset{\leftarrow}{S W (x)} \in X . \end{matrix}

We will also use the notation

\begin{matrix} X = \overset{\leftarrow}{S W (X)} . \end{matrix}

According to the above, a circular tessera code can contain a maximum of 12 tesserae. Such a code can even be self-complementary, as the next example shows.

Example 4.4

The following code $X \subseteq T E S S$ is a self-complementary maximal circular code:

\begin{matrix} X = {A A U U, C C G G, A G C U, U G C A, G U A C, U C G A, A A G G, \\ C C U U, A A C C, G G U U, A G U C, G A C U} . \end{matrix}

The next lemma gives the exact number of self-complementary 1-circular tessera codes.

Lemma 4.5

The maximal size of a self-complementary 1-circular tessera code is 12 and the number of them is 4096.

Proof

Firstly, Example 4.4 shows that there are self-complementary circular codes of size 12 which is maximal. Secondly, inn order to calculate the exact number of self-complementary 1-circular codes, we first ascertain that for 6 conjugacy classes, the respective antitessera of a tessera from that class is found in another conjugacy class: The antitesserae of tesserae from class $D_{2}$ are all in class $D_{5}$ , from class $D_{3}$ in class $D_{6}$ and from class $D_{10}$ in class $D_{12}$ and, of course, vice versa. Thus, we have $4^{3}$ possibilities to choose 6 tesserae from these conjugacy classes for a 1-circular self-complementary tessera code. As for the classes $D_{1}, D_{4}, D_{7}, D_{8}, D_{9}, D_{11}$ , only the self-complementary tesserae can be chosen from these, since the other two form tessera-antitessera pairs and are cyclically equivalent. So we have further $2^{6}$ possibilities for this. Altogether we have $2^{6} \cdot 4^{3} = 4096$ maximal self-complementary 1-circular codes. $□$

The following example shows that not every 1-circular self-complementary tessera code is also circular (even not 2-circular).

Example 4.6

Let us take (self-complementary) tesserae AAUU from the class $D_{1}$ and CCGG from the class $D_{4}$ , as well as GGAA (from $D_{2}$ ) and UUCC (from $D_{5}$ ) which are complementary to each other. Then we have that the word CCGGAAUU has two different decompositions on a circle:

\begin{matrix} C C G G | A A U U and C C | G G A A | U U . \end{matrix}

With an extensive computer calculation the exact numbers of self-complementary circular and comma-free codes of maximal length are calculated:

Lemma 4.7

There are

1584 self-complementary circular (Table 5) and
16 self-complementary comma-free (Table 6)

tessera codes of maximal length.

Table 6.

The list of all self-complementary comma-free tessera codes of maximal length

UUAA	CCAA	AGGA	UCCU	UUGG	CCGG	UCGA	CAUG	ACGU	AGCU	ACUG	CAGU
AAUU	AACC	AGGA	UCCU	GGUU	GGCC	UCGA	CAUG	ACGU	AGCU	ACUG	CAGU
UUAA	CCAA	GAAG	CUUC	UUGG	CCGG	GAUC	CAUG	ACGU	CUAG	ACUG	CAGU
AAUU	AACC	GAAG	CUUC	GGUU	GGCC	GAUC	CAUG	ACGU	CUAG	ACUG	CAGU
UUAA	CCAA	AGGA	UCCU	UUGG	CCGG	UCGA	UGCA	GUAC	AGCU	UGAC	GUCA
AAUU	AACC	AGGA	UCCU	GGUU	GGCC	UCGA	UGCA	GUAC	AGCU	UGAC	GUCA
UUAA	CCAA	GAAG	CUUC	UUGG	CCGG	GAUC	UGCA	GUAC	CUAG	UGAC	GUCA
AAUU	AACC	GAAG	CUUC	GGUU	GGCC	GAUC	UGCA	GUAC	CUAG	UGAC	GUCA
AAUU	ACCA	AAGG	CCUU	UGGU	CCGG	GAUC	UGCA	ACGU	AGCU	GACU	AGUC
UUAA	ACCA	GGAA	UUCC	UGGU	GGCC	GAUC	UGCA	ACGU	AGCU	GACU	AGUC
AAUU	CAAC	AAGG	CCUU	GUUG	CCGG	GAUC	CAUG	GUAC	AGCU	GACU	AGUC
UUAA	CAAC	GGAA	UUCC	GUUG	GGCC	GAUC	CAUG	GUAC	AGCU	GACU	AGUC
AAUU	ACCA	AAGG	CCUU	UGGU	CCGG	UCGA	UGCA	ACGU	CUAG	CUGA	UCAG
UUAA	ACCA	GGAA	UUCC	UGGU	GGCC	UCGA	UGCA	ACGU	CUAG	CUGA	UCAG
AAUU	CAAC	AAGG	CCUU	GUUG	CCGG	UCGA	CAUG	GUAC	CUAG	CUGA	UCAG
UUAA	CAAC	GGAA	UUCC	GUUG	GGCC	UCGA	CAUG	GUAC	CUAG	CUGA	UCAG

Open in a new tab

We now aim for a graph-theoretical characterization of self-complementarity for tessera codes. Let us start with some observations on self-complementary 1-circular tessera codes:

Lemma 4.8

Let $X \subseteq T E S S$ be a self-complementary 1-circular tessera code. Then it holds

\begin{matrix} X_{SW} = \emptyset . \end{matrix}

Proof

Let X be a self-complementary 1-circular tessera code. Then for all $t = d_{1} d_{2} \in X_{SW}$

\begin{matrix} \overset{\leftarrow}{c (t)} = \overset{\leftarrow}{c (d_{2})} \overset{\leftarrow}{c (d_{1})} = d_{2} d_{1} = α_{2} (t) \end{matrix}

where $d_{1}, d_{2} \in Σ_{SW}$ . However, cyclically equivalent tesserae cannot be in the same 1-circular code. $□$

The next property is discovered by examining maximum circular codes of codons (RNA triplets) (Fimmel et al. 2018). Assume that $Y \subset B^{3}$ is a trinucleotide self-complementary code, $G (Y) = (V, E)$ the graph associated to Y. Then the following conditions are true:

$V = \overset{\leftarrow}{c (V)}$
$d^{+} (v) = d^{-} (\overset{\leftarrow}{c (v)})$ for all vertices $v \in V$

where $d^{+} (v)$ of a vertex v denotes the number of outgoing edges (directed edges that start in v) and $d^{-} (v)$ denotes the number of ingoing edges, respectively. It was also shown in Fimmel et al. (2018) that the conditions from above are not sufficient in general to ensure self-complementarity but only for circular codes of size at least 18.

We will show next that in the case of tesserae or dinucleotides, the size of the code does not matter and that one can obtain a similar result. Let us first prove the claim for dinucleotides:

Lemma 4.9

Let $X \subseteq B^{2}$ be a 1-circular dinucleotide code, $G (X) = (V, E)$ its associated graph. X is self-complementary if and only if

$V = c (V)$
$d^{+} (v) = d^{-} (c (v))$ for all vertices $v \in V$

Proof

Let X be a self-complementary dinucleotide code, $l_{1} l_{2} \in X$ for some $l_{1} l_{2} \in B$ . Due to self-complementarity of X we have $c (l_{2}) c (l_{1}) \in X$ which implies that both conditions (1) and (2). Conversely, assume that X is a 1-circular code. Then its associated graph $G (X)$ can be embedded into a tournament on four vertices $A, C, G, U \in B$ (compare Fimmel et al. 2017). Assume that $G (X)$ satisfies the conditions (1) and (2). The presence or absence of the self-complementary dinucleotides AU, UA, CG or GC in X does not affect either the self-complementarity of X or the conditions (1) and (2). Let us focus then on non-self-complementary dinucleotides from X. Suppose without loss of generality that the dinucleotide $A \to C$ is in the code. For conditions (1) and (2) to be met, a dinucleotide $N_{1} U$ and a dinucleotide $G N_{2}$ must be in the code. This can be achieved in three ways:

$N_{1} = G, N_{2} = U$ In this case $A C = \overset{\leftarrow}{c (G U)}$ is valid or
$N_{1} = C, N_{2} = U$ The condition (2) can now only be met if the dinucleotide $A G \in X$ and the code is self-complementary or
$N_{1} = C, N_{2} = A$ The condition (2) can now only be met if the dinucleotide $U G \in X$ and the code is self-complementary

This proves that X is self-complementary. $□$

In the case of tesserae we should additionally consider the condition from the Lemma 4.8 and obtain a handy characterization of self-complementarity.

Theorem 4.10

Let $X \subseteq T E S S$ be a 1-circular tessera code, $C_{2} (X) = (V_{2}, E_{2})$ . X is self-complementary if and only if

$X_{SW} = \emptyset$
$V_{2} = \overset{\leftarrow}{c (V_{2})}$
$d^{+} (v) = d^{-} (\overset{\leftarrow}{c (v)})$ for all vertices $v \in V_{2}$

Proof

One implication is analogous to the proof of Proposition 3.1 in Fimmel et al. (2018) considering Lemma 4.8. Conversely, assume that $X \subseteq T E S S$ is a 1-circular tessera code that satisfies all three conditions (1), (2), (3). It is immediately clear by direct verification that for all equivalence classes $Σ_{i}$ with $i \in {I, S W, Y R, K M}$

\begin{matrix} \overset{\leftarrow}{c (Σ_{i})} = Σ_{i} \end{matrix}

holds, i.e. the dinucleotide codes $Σ_{i}$ are self-complementary. So we can restrict ourselves to the consideration of $C_{2} (X_{i})$ for $i \in {I, S W, Y R, K M}$ . Since X is a 1-circular code each of $C_{2} (X_{i})$ is embedded into a tournament on four nodes.

Secondly, as we can see from Table 1, two of the six tesserae represented in each tournament, except of that corrsponding to $Σ_{SW}$ , are self-complementary:

For $Σ_{I}$ these are AAUU (or UUAA) and CCGG (or GGCC)
For $Σ_{YR}$ these are ACGU (or GUAC) and UGCA (or CAUG)
For $Σ_{KM}$ these are AGCU (or CUAG) and UCGA (or GAUC)

and for each not self-complementary tessera $T = d_{1} d_{2} \in X_{i}$ where $i \in {I, S W, Y R, K M}$ its anti-tessera should be in the same component $X_{i}$ due to the fact that

\begin{matrix} \overset{\leftarrow}{c (U)} = \overset{\leftarrow}{c (d_{2})} \overset{\leftarrow}{c (d_{1})} . \end{matrix}

The rest of the proof can now be done analogously to the proof of Lemma 4.9.

$□$

In the Theorem above, the condition of 1-circularity can not be omitted, as the following example shows:

Example 4.11

Let us consider the following tessera code

\begin{matrix} X = {C U G A, G A C U, A G A G, U C U C} . \end{matrix}

The code is obviously not 1-circular and non-self-complementary since, for instance, $\overset{\leftarrow}{c (A G A G)} = C U C U \notin X$ takes place. But all three conditions from Theorem 4.10 are fulfilled. In the picture below, the round and square nodes represent pairs of reversed-complementary dinucleotides.

graphic file with name 11538_2020_724_Figa_HTML.jpg

We conclude this section with a second theorem that gives a graph-theoretical characterization for tessera codes that are not 1-circular using the graph component $C_{1} (X)$ of a code X.

Theorem 4.12

Let $X \subseteq T E S S$ be a tessera code, $C_{1} (X) = (V_{1}, E_{1})$ . X is self-complementary if and only if

$V_{1} = \overset{\leftarrow}{c (V_{1})}$
$d^{+} (v) = d^{-} (\overset{\leftarrow}{c (v)})$ for all vertices $v \in V_{1}$

Proof

Let us assume that $X \subseteq T E S S$ satisfies properties (1) and (2) from Theorem 4.12. Hence, for any tessera $N_{1} N_{2} N_{3} N_{4} \in X$ we have that $N_{2} N_{3} N_{4} \in V_{1}$ and by property (1) also $c (N_{4} N_{3} N_{2}) \in V_{1}$ . Property (2) then implies that $c (N_{4} N_{3} N_{2}) N_{5} \in X$ for some basis $N_{5}$ . It is clear that $N_{5}$ has to be the complement of $N_{1}$ by the unique definition of tesserae. More precisely, assume that $π \in V$ such that $N_{2} = π (N_{4})$ which implies that $c (N_{2}) = π (c (N_{4}))$ and thus $c (N_{3}) = π (N_{5})$ . Hence $N_{5}$ = $c (N_{1})$ . Therefore $c (N_{4} N_{3} N_{2}) N_{5} = c (\overset{\leftarrow}{N_{1} N_{2} N_{3} N_{4}}) \in X$ and X is self-complementary. $□$

Let us make a final remark: A 1-circular tessera code X represented by a tournament which is built on four dinucleotides of one of the equivalence classes (see Table 1) is self-complementary if and only if the numbers 1, 2, 3, 4 (see paragraph Construct a Tournament) are assigned to dinucleotides so that 1 is complementary to 4 and 2 is complementary to 3, i.e. $d_{1} = \overset{\leftarrow}{c (d_{4})}, d_{2} = \overset{\leftarrow}{c (d_{3})}$ . In order to see this let the order on dinucleotides be defined as described above, $d_{i} d_{j} \in X, i < j, i, j \in {1, 2, 3, 4}$ and

\begin{matrix} \overset{\leftarrow}{c (d_{i} d_{j})} = \overset{\leftarrow}{c (d_{j})} \overset{\leftarrow}{c (d_{i})} = d_{k} d_{l} . \end{matrix}

If $i = 1$ or $j = 4$ then it is obvious that $k < l$ since $k = 1$ or $l = 4$ and $d_{k} d_{l} \in X$ . The only remaining case is $i = 2, j = 3$ . But in this case $k = 2, l = 3$ takes place per definition of the order on dinucleotides and $d_{2} d_{3} \in X$ . The opposite direction: Let $d_{1} = \overset{\leftarrow}{c (d_{2})}$ and, correspondingly, $d_{3} = \overset{\leftarrow}{c (d_{4})}$ . Then $\overset{\leftarrow}{c (d_{1} d_{3})} = d_{4} d_{2} \notin X$ . The case $d_{1} = \overset{\leftarrow}{c (d_{3})}$ is analogous. In both cases X is not a self-complementary code. Here is an example.

Example 4.13

For example, let us consider the class $Σ_{KM}$ . Then one possible self-complementary assignment would be: $1 \to C U$ , $4 \to A G$ , $2 \to U C$ and $3 \to G A$ . The represented code $X_{KM} =$ {CUAG, CUUC, CUGA, UCAG, UCGA, GAAG} is self-complementary.

This shows that in the construction of all maximal circular tessera codes one can also identify and construct all maximal self-complementary circular codes.

Conclusions

In this work we have identified and characterized circular tessera codes and their properties. In Gonzalez et al. (2012) and Gonzalez et al. (2019) Gonzalez, Giannerini and Rosa had proposed an ancestor code of the universal genetic code that is based on 64 tetranucleotides built from dinucleotides by using the Klein four symmetry group. It was hypothesized that this tessera code existed before LUCA and even before the early genetic code that coded for 20 amino acids using all 64 codons. Possible primeval adaptor molecules that could decode the tessera were also modelled and it was shown that the tessera code mirrors exactly the degeneracy distribution of the mitochondrial genetic code.

We have combined the theory of tesserae with the the theory of circular codes that have been studied extensively during the last decades. Circular codes were found by an extensive statistical investigation in Arqués and Michel (1996) and seem to play an important role in the detection and correction mechanisms of the ribosome during translation. Moreover, it was hypothesized in [13] that ancestor codes of the universal genetic code might have used codons from a circular code only. Thus it was reasonable to investigate circular tessera codes which could have existed between a primitive genetic code and the tessera code.

Our results show that circular tessera codes can be of size 12 at most and we have given construction methods for all circular tessera codes of this size. Moreover, the number of circular (comma-free, self-complementary) tessera codes of any size between 1 and 12 have been calculated.

Acknowledgements

Open Access funding provided by Projekt DEAL.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Elena Fimmel, Email: e.fimmel@hs-mannheim.de.

Martin Starman, Email: m.starman@live.com.

Lutz Strüngmann, Email: l.struengmann@hs-mannheim.de.

References

Arqués DG, Michel CJ. A complementary circular code in the protein coding genes. J Theor Biol. 1996;182:45–58. doi: 10.1006/jtbi.1996.0142. [DOI] [PubMed] [Google Scholar]
Baranov PV, Venin M, Provan G. Codon size reduction as the origin of the triplet genetic code. PLoS ONE. 2009;4(5):e5708. doi: 10.1371/journal.pone.0005708. [DOI] [PMC free article] [PubMed] [Google Scholar]
Clark J, Holton DA. A first look at graph theory. Newark: World Scientific; 1991. [Google Scholar]
Cisowski D (2015) Tessera-based encoding of the mitochondrial genome. Bachelor-Thesis, Mannheim
Crick F, Griffith JS, Orgel LE. Codes without commas. Proc Natl Acad Sci USA. 1957;43(5):416–21. doi: 10.1073/pnas.43.5.416. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fimmel E, Michel ChJ, Starman M, Strüngmann L. Self-complementary circular codes in coding theory. Theory Biosci. 2018;37(1):51–65. doi: 10.1007/s12064-018-0259-4. [DOI] [PubMed] [Google Scholar]
Fimmel E, Michel ChJ, Strüngmann L. Diletter circular codes over finite alphabets. Math Biosci. 2017;294:120–129. doi: 10.1016/j.mbs.2017.10.001. [DOI] [PubMed] [Google Scholar]
Fimmel E, Strüngmann L. Mathematical fundamentals for the noise immunity of the Genetic Code. BioSystems. 2018;164:186–198. doi: 10.1016/j.biosystems.2017.09.007. [DOI] [PubMed] [Google Scholar]
Fimmel E, Strüngmann L. Linear codes and the mitochondrial genetic code. BioSystems. 2019;184:103990. doi: 10.1016/j.biosystems.2019.103990. [DOI] [PubMed] [Google Scholar]
Fimmel E, Michel CJ, Strüngmann L. $n$ -nucleotide circular codes in graph theory. Phil Trans A. 2016;374:20150058. doi: 10.1098/rsta.2015.0058. [DOI] [PubMed] [Google Scholar]
Fimmel E, Giannerini S, Gonzalez D, Strüngmann L. Circular codes, symmetries and transformations. J Math Biol. 2014;70(7):1623–44. doi: 10.1007/s00285-014-0806-7. [DOI] [PubMed] [Google Scholar]
Fimmel E, Giannerini S, Gonzalez D, Strüngmann L. Dinucleotide circular codes and bijective transformations. J Theor Biol. 2015;386:159–165. doi: 10.1016/j.jtbi.2015.08.034. [DOI] [PubMed] [Google Scholar]
Fimmel E, Michel Ch. J, Pirot F, Sereni JS, Starman M, Strüngmann L (2020) The relation between k-circularity and circularity of codes, submitted [DOI] [PMC free article] [PubMed]
Fimmel E, Strüngmann L. Yury Borisovich Rumer and his biological papers on the genetic code. Phil Trans R Soc A. 2016;374:20150228. doi: 10.1098/rsta.2015.0228. [DOI] [PubMed] [Google Scholar]
Gonzalez DL, Giannerini S, Rosa R (2012) On the origin of the mitochondrial genetic code: towards a unified mathematical framework for the management of genetic information. In: Nature precedings. 10.1038/npre.2012.7136
Gonzalez DL, Giannerini S, Rosa R (2019) On the origin of degeneracy in the genetic code., In: Interface Focus 9: 20190038. 10.1098/rsfs.2019.0038 [DOI] [PMC free article] [PubMed]
Michel CJ. The maximal $C^{3}$ self-complementary trinucleotide circular code $X$ in genes of bacteria, archaea, eukaryotes, plasmids and viruses. Life. 2017;7(20):1–16. doi: 10.3390/life7020020. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nirenberg MW, Matthaei JH. The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides. Proc Natl Acad Sci USA. 1961;47:1588–1602. doi: 10.1073/pnas.47.10.1588. [DOI] [PMC free article] [PubMed] [Google Scholar]
Patel A. The triplet genetic code had a doublet predecessor. J theor Biol. 2005;233:527–532. doi: 10.1016/j.jtbi.2004.10.029. [DOI] [PubMed] [Google Scholar]
Rotman JJ. An introduction to the theory of groups. Berlin: Springer; 1995. [Google Scholar]
Seligmann H. Putative anticodons in mitochondrial tRNA sidearm loops: Pocketknife tRNAs? J Theor Biol. 2014;7(340):155–63. doi: 10.1016/j.jtbi.2013.08.030. [DOI] [PubMed] [Google Scholar]
Smith JM. The problems of biology. Oxford: Oxford University Press; 1986. [Google Scholar]
Wilhelm T, Nikolajewa S. A new classification scheme of the genetic code. J Mol Evol. 2004;59(5):598–605. doi: 10.1007/s00239-004-2650-7. [DOI] [PubMed] [Google Scholar]
Wu HL, Bagby S, van den Elsen JM. Evolution of the genetic triplet code via two types of doublet codons. J Mol Evol. 2005;61(1):54–64. doi: 10.1007/s00239-004-0224-3. [DOI] [PubMed] [Google Scholar]

[CR1] Arqués DG, Michel CJ. A complementary circular code in the protein coding genes. J Theor Biol. 1996;182:45–58. doi: 10.1006/jtbi.1996.0142. [DOI] [PubMed] [Google Scholar]

[CR2] Baranov PV, Venin M, Provan G. Codon size reduction as the origin of the triplet genetic code. PLoS ONE. 2009;4(5):e5708. doi: 10.1371/journal.pone.0005708. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] Clark J, Holton DA. A first look at graph theory. Newark: World Scientific; 1991. [Google Scholar]

[CR4] Cisowski D (2015) Tessera-based encoding of the mitochondrial genome. Bachelor-Thesis, Mannheim

[CR5] Crick F, Griffith JS, Orgel LE. Codes without commas. Proc Natl Acad Sci USA. 1957;43(5):416–21. doi: 10.1073/pnas.43.5.416. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] Fimmel E, Michel ChJ, Starman M, Strüngmann L. Self-complementary circular codes in coding theory. Theory Biosci. 2018;37(1):51–65. doi: 10.1007/s12064-018-0259-4. [DOI] [PubMed] [Google Scholar]

[CR7] Fimmel E, Michel ChJ, Strüngmann L. Diletter circular codes over finite alphabets. Math Biosci. 2017;294:120–129. doi: 10.1016/j.mbs.2017.10.001. [DOI] [PubMed] [Google Scholar]

[CR8] Fimmel E, Strüngmann L. Mathematical fundamentals for the noise immunity of the Genetic Code. BioSystems. 2018;164:186–198. doi: 10.1016/j.biosystems.2017.09.007. [DOI] [PubMed] [Google Scholar]

[CR9] Fimmel E, Strüngmann L. Linear codes and the mitochondrial genetic code. BioSystems. 2019;184:103990. doi: 10.1016/j.biosystems.2019.103990. [DOI] [PubMed] [Google Scholar]

[CR10] Fimmel E, Michel CJ, Strüngmann L. $n$ -nucleotide circular codes in graph theory. Phil Trans A. 2016;374:20150058. doi: 10.1098/rsta.2015.0058. [DOI] [PubMed] [Google Scholar]

[CR11] Fimmel E, Giannerini S, Gonzalez D, Strüngmann L. Circular codes, symmetries and transformations. J Math Biol. 2014;70(7):1623–44. doi: 10.1007/s00285-014-0806-7. [DOI] [PubMed] [Google Scholar]

[CR12] Fimmel E, Giannerini S, Gonzalez D, Strüngmann L. Dinucleotide circular codes and bijective transformations. J Theor Biol. 2015;386:159–165. doi: 10.1016/j.jtbi.2015.08.034. [DOI] [PubMed] [Google Scholar]

[CR13] Fimmel E, Michel Ch. J, Pirot F, Sereni JS, Starman M, Strüngmann L (2020) The relation between k-circularity and circularity of codes, submitted [DOI] [PMC free article] [PubMed]

[CR14] Fimmel E, Strüngmann L. Yury Borisovich Rumer and his biological papers on the genetic code. Phil Trans R Soc A. 2016;374:20150228. doi: 10.1098/rsta.2015.0228. [DOI] [PubMed] [Google Scholar]

[CR15] Gonzalez DL, Giannerini S, Rosa R (2012) On the origin of the mitochondrial genetic code: towards a unified mathematical framework for the management of genetic information. In: Nature precedings. 10.1038/npre.2012.7136

[CR16] Gonzalez DL, Giannerini S, Rosa R (2019) On the origin of degeneracy in the genetic code., In: Interface Focus 9: 20190038. 10.1098/rsfs.2019.0038 [DOI] [PMC free article] [PubMed]

[CR17] Michel CJ. The maximal $C^{3}$ self-complementary trinucleotide circular code $X$ in genes of bacteria, archaea, eukaryotes, plasmids and viruses. Life. 2017;7(20):1–16. doi: 10.3390/life7020020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] Nirenberg MW, Matthaei JH. The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides. Proc Natl Acad Sci USA. 1961;47:1588–1602. doi: 10.1073/pnas.47.10.1588. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] Patel A. The triplet genetic code had a doublet predecessor. J theor Biol. 2005;233:527–532. doi: 10.1016/j.jtbi.2004.10.029. [DOI] [PubMed] [Google Scholar]

[CR20] Rotman JJ. An introduction to the theory of groups. Berlin: Springer; 1995. [Google Scholar]

[CR21] Seligmann H. Putative anticodons in mitochondrial tRNA sidearm loops: Pocketknife tRNAs? J Theor Biol. 2014;7(340):155–63. doi: 10.1016/j.jtbi.2013.08.030. [DOI] [PubMed] [Google Scholar]

[CR22] Smith JM. The problems of biology. Oxford: Oxford University Press; 1986. [Google Scholar]

[CR23] Wilhelm T, Nikolajewa S. A new classification scheme of the genetic code. J Mol Evol. 2004;59(5):598–605. doi: 10.1007/s00239-004-2650-7. [DOI] [PubMed] [Google Scholar]

[CR24] Wu HL, Bagby S, van den Elsen JM. Evolution of the genetic triplet code via two types of doublet codons. J Mol Evol. 2005;61(1):54–64. doi: 10.1007/s00239-004-0224-3. [DOI] [PubMed] [Google Scholar]

PERMALINK

Circular Tessera Codes in the Evolution of the Genetic Code

Elena Fimmel

Martin Starman

Lutz Strüngmann

Abstract

Introduction

Definitions and Notations

Klein Four-Group and Equivalence Classes of Dinucleotides

Fig. 1.

Table 1.

Tesserae: Definition and Structure

Definition 2.1

Table 2.

Fig. 2.

Definition 2.2

Graph Theoretical Approach

Definition 2.3

Fig. 3.

Fig. 4.

Fig. 5.

Theorem 2.4

Definition 2.5

Fig. 6.

Circular Tessera Codes

Definition 3.1

Table 5.

Table 3.

Definition 3.2

Example 3.3

Example 3.4

Proposition 3.5

Proof

Theorem 3.6

Proof

Theorem 3.7

Proof

Example 3.8

Definition 3.9

Proposition 3.10

Proof

Construction of All Maximal Circular Tessera Codes

Fig. 7.

Lemma 4.1

Proof

Definition 4.2

Table 4.

Self-Complementary Circular Tessera Codes

Definition 4.3

Example 4.4

Lemma 4.5

Proof

Example 4.6

Lemma 4.7

Table 6.

Lemma 4.8

Proof

Lemma 4.9

Proof

Theorem 4.10

Proof

Example 4.11

Theorem 4.12

Proof

Example 4.13

Conclusions

Acknowledgements

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases