Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 May 26;12086:237–250. doi: 10.1007/978-3-030-48516-0_18

On Tree Substitution Grammars

Andreas Maletti , Kevin Stier ‡,
Editors: Nataša Jonoska8, Dmytro Savchuk9
PMCID: PMC7247875

Abstract

Tree substitution grammars are formal models that are used extensively in natural language processing. It is demonstrated that their expressive power is located strictly between the local tree grammars and the regular tree grammars. A decision procedure for the problem of determining whether a tree substitution grammar generates a local tree language is provided. Unfortunately, the class of tree substitution languages is neither closed under union, nor intersection, nor complements. Indeed unions of tree substitution languages even generate an infinite hierarchy. However, all finite and all co-finite tree languages are tree substitution languages.

Introduction

Trees are a fundamental data structure in computer science and are used in many application areas like natural language processing [12], database theory [1], and compiler construction [17]. All the mentioned applications as well as others [6, 7] require effective representations of sets of trees, also called tree languages. These requirements triggered detailed investigations of various classes of tree languages since the 1960s and by now there exists an abundance of models [5].

The most robust of those classes of tree languages are the regular tree languages [6, 7], which are generated by finite-state tree automata, which are a natural extension of the finite-state string automata that generate the regular string languages [18]. Most standard problems are decidable for the regular tree languages and they generally enjoy the same nice algorithmic properties as the regular string languages. The main feature of those automata are their finitely many states, which enable most of the positive properties. However, these states are not exhibited directly in the trees generated. In application areas like natural language processing, in which representations of tree languages have to be inferred from finite sets of trees, practitioners often resorted to simpler models, in which the representation can more readily be induced from the sample.

Tree substitution grammars were originally introduced as a special case of tree-adjoining grammars [9, 11], in which no adjunction is allowed. This restriction proved useful in the lexicalization of context-free grammars [10]. However, tree substitution grammar soon became popular in the parsing community [15] under the approach called data-oriented parsing [3] and were the formal model of many state-of-the-art parsers [16]. Similarly, synchronous tree substitution grammars, which are the same as the syntax-directed translation schemes of [2], are used in many statistical machine translation models [4, 8, 13, 14]. Despite the multitude of applications, a fundamental study of their expressive power is missing. Rather they are attributed properties like “extended domain of locality”, which provides some intuition, but has no formal definition.

A tree substitution grammar G is essentially a finite set F of tree fragments together with a set R of permissible root labels. Those tree fragments can be arbitrarily tall or large, which distinguishes tree substitution grammars from local tree grammars [6, 7]. In addition, the fragments can contain leaves that are labeled by internal symbols. Leaves with such labels are called open and can be expanded further by fragments of F that have the same symbol as root label. Indeed G generates trees from a permissible root label of R by successively expanding open leaves with fragments of F until no open leaves remain. The set of all trees derivable in this manner is called the tree language generated by G. The tree languages that can be generated by some tree substitution grammar are called the tree substitution languages.

In this contribution we start a fundamental study of the expressive power of tree substitution grammars. We show that tree substitution grammars are strictly more expressive than local tree grammars [6, 7], but strictly less expressive than finite-state tree automata (see Corollary 10). This, in particular, yields that most standard decision problems are also decidable for tree substitution languages because they are regular. In addition, it is decidable to determine whether a given tree substitution language is local (see Theorem 8). The decidability status of the related question whether a given regular tree language is a tree substitution language remains open. It is interesting to note that all finite and co-finite tree languages are tree substitution languages (see Theorem 6), which makes them much more useful for the approximation of finite samples of trees than the local tree languages, which do not contain all finite tree languages.

We also investigate the closure properties of the tree substitution languages. Unfortunately, they are neither closed under union (see Theorem 9), nor under intersection (see Theorem 13), nor under complement (see Theorem 14). In fact, unions of tree substitution languages even form a strict hierarchy (see Theorem 11), so unions of k tree substitution languages are strictly less expressive than unions of Inline graphic tree substitution languages. A similar hierarchy is significantly more difficult to prove for intersections and remains an open problem because intersections break the “extended domain of locality” (as shown in the proof of Theorem 13) and can manage a non-explicit information transport over unbounded distances in the trees. Indeed the trivial union construction, which just takes the union of the fragments of the individual tree substitution grammars Inline graphic, does yield a tree substitution grammar G that can generate each tree that can be generated by some Inline graphic. However, G might over-generalize in the sense that it may also generate trees that cannot be generated by any Inline graphic. This property is utilized in grammar induction to generalize beyond the seen data. Overall, the expressive power of tree substitution grammars is interesting and offers new challenging problems because they are used extensively in real-world applications despite their brittle expressive power. It is exactly this absence of good closure properties, which requires separate arguments for each individual problem and thus makes several problems challenging as outlined in the open problems section.

Preliminaries

We denote the set of nonnegative integers (including 0) by Inline graphic. For every Inline graphic, we use the subset Inline graphic. An alphabet A is simply a finite set and Inline graphic is the set of all finite words over A, where Inline graphic containing k factors A and Inline graphic, of which Inline graphic is called the empty word. The length Inline graphic of a word Inline graphic with Inline graphic is Inline graphic; i.e. the number of symbols making up w. Given words Inline graphic, their concatenation is written v.w or simply vw. We write Inline graphic provided that there exists Inline graphic such that Inline graphic. The relation Inline graphic is actually a partial order, called the prefix order.

Let S be a set and Inline graphic be a relation. The identity on S is the relation Inline graphic. Given another relation Inline graphic, the composition Inline graphic is given by Inline graphic. The relation R is reflexive if Inline graphic, and it is transitive if Inline graphic. The reflexive, transitive closure of R is Inline graphic and the transitive closure of R is Inline graphic, where Inline graphic and Inline graphic containing k times the relation R.

A ranked alphabet Inline graphic is a pair consisting of an alphabet Inline graphic and a mapping Inline graphic that assigns a rank to each symbol of Inline graphic. We usually denote a ranked alphabet Inline graphic by just Inline graphic alone when the ranks are clear. We also write Inline graphic to indicate that Inline graphic. Moreover, for every Inline graphic, we let Inline graphic. Given a ranked alphabet Inline graphic and a set Z, the set Inline graphic of Inline graphic" trees indexed by Z is the smallest set T such that Inline graphic and Inline graphic for every Inline graphic, Inline graphic, and Inline graphic. We abbreviate Inline graphic simply to Inline graphic, and any subset Inline graphic is called tree language. It is co-finite if Inline graphic is finite.

Next, we recall some common notions and notations for trees. In the following, let Inline graphic be a tree for a ranked alphabet Inline graphic and a set Z. The set Inline graphic of positions of t is inductively defined by Inline graphic for all Inline graphic, and Inline graphic for every Inline graphic, Inline graphic, and Inline graphic. The height of t is defined by Inline graphic, and the size of t is defined by Inline graphic. A leaf is a position Inline graphic such that Inline graphic. We denote the subset of leaves of Inline graphic by Inline graphic. Given a position Inline graphic, the label t(p) of t at p and the subtree Inline graphic of t at p are defined by Inline graphic for all Inline graphic, and

graphic file with name M67.gif

for all Inline graphic, Inline graphic, and Inline graphic. Finally, the replacement Inline graphic of the leaf Inline graphic by another tree Inline graphic is given by Inline graphic for every Inline graphic, and Inline graphic for every Inline graphic, Inline graphic, Inline graphic, Inline graphic, and Inline graphic.

We reserve the use of the special symbol Inline graphic. A tree Inline graphic is a context, if there exists exactly one Inline graphic with Inline graphic; i.e., there is exactly one occurrence of Inline graphic in t. The set of all such contexts is denoted by Inline graphic. Given a context Inline graphic and a tree Inline graphic, the substitution c[t] of t into c yields the tree Inline graphic, where p is the unique position Inline graphic with Inline graphic. Note that given Inline graphic, also Inline graphic. Similarly, we write Inline graphic for Inline graphic containing the context c a total of k times.

Finally, let us recall regular tree grammars (RTGs) [6, 7]. An RTG is a tuple Inline graphic, where Q is a finite set of states such that Inline graphic, Inline graphic is a ranked alphabet of input symbols, Inline graphic is a set of initial states, and Inline graphic is a finite set of productions. We also write productions (qt) as Inline graphic. The derivation relation for Inline graphic is defined for every Inline graphic by Inline graphic if and only if there exists a production Inline graphic and a context Inline graphic such that Inline graphic and Inline graphic. The tree language generated by G is Inline graphic. A tree language L is regular if there exists an RTG G such that Inline graphic. The class of regular tree languages is denoted by Inline graphic. We note that Inline graphic coincides with the class of tree languages generated by tree automata [6, 7].

Tree Substitution Grammars

Let us start with the formal definition of tree substitution grammars (TSGs) taken essentially from the natural language processing community [10, 11]. TSGs have been applied to various tasks including parsing [16] and machine translation [19]. Consequently, the definitions of TSGs vary, but our definition captures the essence of the notion, while still being convenient to work with.

Definition 1

A tree substitution grammar (TSG) is a tuple Inline graphic, in which Inline graphic is a ranked alphabet of input symbols, Inline graphic is a set of root labels, and Inline graphic is a finite set of fragments. The TSG G is a local tree grammar (LTG) if Inline graphic for all Inline graphic.

Example 2

Consider the ranked alphabet Inline graphic and the TSG Inline graphic with the fragments displayed in Fig. 1. Clearly, this TSG is not an LTG due to the third and fourth fragment.

Fig. 1.

Fig. 1.

Fragments of the TSG of Example 2.

Next we present the derivation semantics for a TSG Inline graphic. Essentially we start the derivation process with a tree consisting solely of a root label of R and then iteratively replace a leaf by a fragment of F with the same root label. This process can be repeated until no replacements are possible anymore. If the such obtained tree t contains only leaves that are labeled by nullary symbols, then t is part of the tree language generated by G.

Definition 3

Let Inline graphic be a TSG. For any two trees Inline graphic, we write Inline graphic if there exists a fragment Inline graphic and a context Inline graphic such that Inline graphic and Inline graphic. The TSG G generates the tree language Inline graphic.

Example 4

Let Inline graphic and consider the TSG Inline graphic with the fragments displayed in Fig. 3. The derivation presented in Fig. 3 illustrates that a derived tree can contain several leaves that still need to be independently replaced. More precisely, both occurrences of Inline graphic in the tree Inline graphic are independently replaced in the displayed derivation.

Fig. 3.

Fig. 3.

Fragments of the TSG G of Example 4 and example derivation steps.

Example 5

Consider the TSG G from Example 2. A few derivation steps are displayed in Fig. 2. Let Inline graphic and Inline graphic. Overall, this TSG generates the tree language

graphic file with name M137.gif

Fig. 2.

Fig. 2.

Example derivation steps using the TSG G of Example 2.

Two TSGs G and Inline graphic are equivalent if Inline graphic. A tree language L is a tree substitution language if there exists a TSG G such that Inline graphic, and it is local [6, 7] if there exists a local tree grammar G such that Inline graphic. The classes of all tree substitution languages and all local tree languages are denoted by Inline graphic and Inline graphic, respectively.

Expressive Power

In this section, we investigate the expressive power of tree substitution grammars and start with some simple tree languages that are contained in Inline graphic. To this end, let Inline graphic and Inline graphic be the classes of all finite and all co-finite tree languages, respectively.

Theorem 6

Inline graphic.

Proof

Every finite tree language Inline graphic is trivially a tree substitution language via the TSG Inline graphic with Inline graphic.

Now, let Inline graphic be a co-finite tree language and Inline graphic be the finitely many trees outside L. Moreover, let Inline graphic be larger than the height of the tallest tree from Inline graphic. We construct the TSG Inline graphic with

  • Inline graphic and

  • Inline graphic.

Clearly, F is finite. Now we prove Inline graphic. For Inline graphic it is sufficient to show that Inline graphic for every Inline graphic. Obviously, the fragments of F are either in L or have height at least n, which proves Inline graphic. We prove the converse Inline graphic by contradiction, so suppose that there exists Inline graphic with Inline graphic. Then there also exists a smallest Inline graphic with Inline graphic. Since all trees Inline graphic with Inline graphic can be generated directly using a single fragment from F, we must have Inline graphic. Let

graphic file with name M171.gif

be the short positions that are prefixes to long positions, and let Inline graphic be the maximal (with respect to Inline graphic) elements of P. We construct the unique tree Inline graphic with positions

graphic file with name M175.gif

and labels Inline graphic for all Inline graphic. In other words, we obtain f by cutting all paths in Inline graphic that have length more than 2n at length n. Obviously, Inline graphic. In addition, we observe that Inline graphic for all Inline graphic. For every Inline graphic, we thus obtain Inline graphic and Inline graphic since Inline graphic and Inline graphic is the smallest counterexample. However, this yields that Inline graphic as well as Inline graphic for all Inline graphic. Altogether Inline graphic, which proves that Inline graphic contradicting the assumption.    Inline graphic

Next we relate the class of tree substitution languages to the well-known classes of local and regular tree languages, respectively. Unsurprisingly, they are situated strictly between them, but the second strictness will be established later (see Corollary 10).

Theorem 7

Inline graphic.

Proof

The first inclusion holds by definition. For the latter, let Inline graphic be a TSG and Inline graphic a new symbol. We construct an RTG Inline graphic such that Inline graphic. To this end, we use copies Inline graphic of the input symbols of Inline graphic as states. The productions are given by Inline graphic with

graphic file with name M201.gif

where Inline graphic is inductively defined by

graphic file with name M203.gif

for every Inline graphic and Inline graphic for all Inline graphic, Inline graphic, and Inline graphic. Clearly any derivation Inline graphic of G yields a corresponding derivation Inline graphic of Inline graphic. Together with Inline graphic for all Inline graphic and the new initial states, we obtain Inline graphic. The converse is proved similarly.

The first inclusion is strict because Inline graphic by Theorem 6, but it is well-known [6, 7] that Inline graphic.    Inline graphic

The inclusion Inline graphic immediately yields that most interesting problems are decidable for tree substitution languages. For example, the emptiness, finiteness, inclusion, and equivalence problems are all decidable because they are decidable for regular tree languages [6, 7]. We proceed with a subclass definability problem: Is it decidable whether an effectively presented tree substitution language is local? Whenever we speak about an effectively presented tree substitution language L, we assume that we are actually given a tree substitution grammar G such that Inline graphic. Let Inline graphic be a TSG. A fragment Inline graphic is useless if G and Inline graphic are equivalent. The TSG G is reduced if no fragment Inline graphic is useless. Clearly, for every TSG we can construct an equivalent reduced TSG.

Theorem 8

For every effectively presented Inline graphic, it is decidable whether Inline graphic.

Proof

Let Inline graphic be a reduced tree substitution grammar such that Inline graphic. We construct the local tree grammar Inline graphic with

graphic file with name M229.gif

Obviously, Inline graphic and all fragments of Inline graphic are essential for this property. Consequently, L is local if and only if Inline graphic. Since both Inline graphic and L are regular by Theorem 7 and inclusion is decidable for regular tree languages [6, 7], we obtain the desired statement.    Inline graphic

Closure Properties

In this section, we investigate the closure properties of the class of tree substitution languages. More specifically, we investigate the Boolean operations and the hierarchy for union. Unfortunately, the results are all negative, but they and, in particular, their proofs shed additional light on the expressive power of tree substitution languages. Let us start with union.

Theorem 9

Inline graphic is not closed under union.

Proof

Consider the ranked alphabet Inline graphic and the LTGs

graphic file with name M239.gif

which generate the local tree languages (see Fig. 4)

graphic file with name M240.gif

with Inline graphic. Now suppose that their union Inline graphic is a tree substitution language; i.e., Inline graphic. Hence there exists a TSG Inline graphic such that Inline graphic. Let Inline graphic be such that Inline graphic. Since Inline graphic, there must exist a derivation Inline graphic and Inline graphic. Since Inline graphic at least two derivation steps are required, so Inline graphic for some Inline graphic, which yields the subderivation Inline graphic. In the same manner we consider the tree Inline graphic, for which the derivation Inline graphic for some Inline graphic and the subderivation Inline graphic must exist. However, exchanging the subderivations yields the derivation

graphic file with name M259.gif

which shows Inline graphic contradicting Inline graphic.    Inline graphic

Fig. 4.

Fig. 4.

The tree languages Inline graphic and Inline graphic used in the proof of Theorem 9.

Since the class of regular tree languages is closed under union [6, 7], we obtain the following corollary from Theorems 7 and 9.

Corollary 10

Inline graphic.

We demonstrated that the union of two tree substitution languages need not be a tree substitution language. Next, we ask ourselves whether additional unions increase the expressive power even further. For every Inline graphic let

graphic file with name M265.gif

be the class of those tree languages that can be presented as unions of k tree substitution languages. Since Inline graphic (see Theorem 6), we obtain Inline graphic, Inline graphic, and Inline graphic for every Inline graphic. Next, we show that the mentioned inclusion is actually strict, so that we obtain an infinite hierarchy.

Theorem 11

Inline graphic-TSL Inline graphic-TSL for all Inline graphic.

Proof

The statement is clear for Inline graphic, so let Inline graphic. Consider the ranked alphabet Inline graphic and the TSG Inline graphic for every Inline graphic, where

graphic file with name M279.gif

and Inline graphic with Inline graphic. Clearly, Inline graphic with Inline graphic. The tree substitution language Inline graphic and the tree Inline graphic are illustrated in Fig. 5.

Fig. 5.

Fig. 5.

Illustration of the tree substitution languages used in the proof of Theorem 11.

Obviously, Inline graphic-TSL and those individual tree languages are infinite and pairwise disjoint. For the sake of a contradiction, assume that Inline graphic-TSL; i. e. there exist Inline graphic such that Inline graphic. The pigeonhole principle establishes that there exist Inline graphic and Inline graphic with Inline graphic such that Inline graphic and Inline graphic are infinite. Let Inline graphic be a TSG such that Inline graphic. Let Inline graphic. Since Inline graphic is infinite, there exists Inline graphic such that Inline graphic. Similarly, there exists Inline graphic such that Inline graphic because Inline graphic is infinite. Inspecting the derivations for those trees there exist Inline graphic such that

graphic file with name M305.gif

Exchanging the subderivations we obtain

graphic file with name M306.gif

and thus Inline graphic, which is a contradiction because Inline graphic.    Inline graphic

Corollary 12

(of Theorem 11).

graphic file with name M310.gif

Let us move on to intersection. Unfortunately, Inline graphic is not closed under intersection, but intersections of Inline graphic become quite powerful. In particular, they allow information to be transported over unbounded distances, which can be observed from the proof.

Theorem 13

Inline graphic is not closed under intersection.

Proof

Recall the ranked alphabet Inline graphic and the TSG G of Example 2 as well as the contexts Inline graphic and Inline graphic from Example 5. Additionally, let Inline graphic with Inline graphic displayed in Fig. 6. The generated tree substitution languages L(G) and Inline graphic are

graphic file with name M322.gif

respectively, which are also illustrated in Fig. 7. Their intersection

graphic file with name 492976_1_En_18_Equ19_HTML.gif

contains only trees, in which all left children along the spine carry the same label. This tree language is not a tree substitution language, which can be proved using the subderivation exchange technique used in the proof of Theorem 9.    Inline graphic

Fig. 6.

Fig. 6.

Fragments of the TSG Inline graphic used in the proof of Theorem 13.

Fig. 7.

Fig. 7.

Tree substitution languages L(G) and Inline graphic used in the proof of Theorem 13.

Note how the intersection achieves a global synchronization in the proof of Theorem 13. This power makes the investigation of the intersection hierarchy difficult. We leave the strictness of the intersection hierarchy as an open problem and conclude by considering the complement.

Theorem 14

Inline graphic is not closed under complements.

Proof

Consider the ranked alphabet Inline graphic and the LTG Inline graphic with fragments

graphic file with name M327.gif

The generated tree language is illustrated in Fig. 8. Now suppose that its complement Inline graphic is a tree substitution language; i.e., Inline graphic. Hence there exists a TSG Inline graphic such that Inline graphic. Let Inline graphic be such that Inline graphic. Since Inline graphic (see Fig. 8) there must exist a derivation Inline graphic and Inline graphic. Since Inline graphic at least two derivation steps are required, so Inline graphic for some Inline graphic, which yields the subderivation Inline graphic. Similarly, we consider the tree Inline graphic (see Fig. 8), for which the derivation Inline graphic for some Inline graphic and the subderivation Inline graphic must exist. However, exchanging the subderivations yields the derivation

graphic file with name M345.gif

which shows Inline graphic contradicting Inline graphic.    Inline graphic

Fig. 8.

Fig. 8.

Trees used in the proof of Theorem 14.

Open Problems

We showed that it is decidable whether a given tree substitution language is local. It remains open if we can also decide whether a given regular tree language is a tree substitution language. Progress on this problem will probably provide additional fine-grained insight into the expressive power of tree substitution grammars in comparison to the regular tree grammars.

Another open problem concerns the intersection hierarchy. We showed that unions of tree substitution languages can progressively express more and more tree languages. A similar hierarchy also exists for intersections of tree substitution languages and we showed that the intersection of two tree substitution languages is not necessarily a tree substitution languages. However, it remains open whether there is an infinite intersection hierarchy or whether it collapses at some level.

Acknowledgements

The authors gratefully acknowledge the financial support of the Research Training Group 1763 (QuantLA: Quantitative Logics and Automata), which is funded by the German Research Foundation (DFG). In addition, the authors would like to thank the anonymous reviewers for the careful reading of the manuscript and their valuable feedback.

Footnotes

K. Stier—Supported by DFG Research Training Group 1763 (QuantLA).

Contributor Information

Nataša Jonoska, Email: jonoska@mail.usf.edu.

Dmytro Savchuk, Email: savchuk@usf.edu.

Andreas Maletti, Email: maletti@informatik.uni-leipzig.de.

Kevin Stier, Email: stier@informatik.uni-leipzig.de.

References

  • 1.Abiteboul S, Hull R, Vianu V. Foundations of Databases. Boston: Addison Wesley; 1994. [Google Scholar]
  • 2.Aho AV, Ullman JD. The Theory of Parsing, Translation, and Compiling. Upper Saddle River: Prentice Hall; 1972. [Google Scholar]
  • 3.Bod R. The data-oriented parsing approach: theory and application. In: Fulcher J, Jain LC, editors. Computational Intelligence: A Compendium. Heidelberg: Springer; 2008. pp. 307–348. [Google Scholar]
  • 4.Chiang, D., Knight, K.: An introduction to synchronous grammars (2006). Tutorial at 44th ACL. https://www3.nd.edu/~dchiang/papers/synchtut.pdf
  • 5.Comon, H., et al.: Tree automata techniques and applications (2007). http://tata.gforge.inria.fr
  • 6.Gécseg F, Steinby M. Tree languages. In: Rozenberg G, Salomaa A, editors. Handbook of Formal Languages. Heidelberg: Springer; 1997. pp. 1–68. [Google Scholar]
  • 7.Gécseg, F., Steinby, M.: Tree Automata. arXiv, 2nd edn. (2015). https://arxiv.org/abs/1509.06233
  • 8.Howcroft, D.M., Klakow, D., Demberg, V.: Toward Bayesian synchronous tree substitution grammars for sentence planning. In: Proceedings of the 11th NLG, pp. 391–396. ACL (2018)
  • 9.Joshi AK, Levy LS, Takahashi M. Tree adjunct grammars. J. Comput. Syst. Sci. 1975;10(1):136–163. doi: 10.1016/S0022-0000(75)80019-5. [DOI] [Google Scholar]
  • 10.Joshi, A.K., Schabes, Y.: Tree-adjoining grammars and lexicalized grammars. Technical report, MS-CIS-91-22, University of Pennsylvania (1991)
  • 11.Joshi AK, Schabes Y. Tree-adjoining grammars. In: Rozenberg G, Salomaa A, editors. Handbook of Formal Languages. Heidelberg: Springer; 1997. pp. 69–123. [Google Scholar]
  • 12.Jurafsky D, Martin JH. Speech and Language Processing. 2. Upper Saddle River: Prentice Hall; 2008. [Google Scholar]
  • 13.Maletti, A.: Why synchronous tree substitution grammars? In: Proceedings of the 2010 NAACL, pp. 876–884. ACL (2010)
  • 14.Maletti A. An alternative to synchronous tree substitution grammars. J. Nat. Lang. Eng. 2011;17(2):221–242. doi: 10.1017/S1351324911000027. [DOI] [Google Scholar]
  • 15.Sangati F, Keller F. Incremental tree substitution grammar for parsing and sentence prediction. Trans. ACL. 2013;1:111–124. [Google Scholar]
  • 16.Shindo, H., Miyao, Y., Fujino, A., Nagata, M.: Bayesian symbol-refined tree substitution grammars for syntactic parsing. In: Proceedings of the 50th ACL, pp. 440–448. ACL (2012)
  • 17.Wilhelm R, Seidl H, Hack S. Compiler Design. Heidelberg: Springer; 2013. [Google Scholar]
  • 18.Yu S. Regular languages. In: Rozenberg G, Salomaa A, editors. Handbook of Formal Languages. Heidelberg: Springer; 1997. pp. 41–110. [Google Scholar]
  • 19.Zhang J, Zhai F, Zong C. Syntax-based translation with bilingually lexicalized synchronous tree substitution grammars. IEEE Trans. Audio Speech Lang. Proc. 2013;21(8):1586–1597. doi: 10.1109/TASL.2013.2255283. [DOI] [Google Scholar]

Articles from Developments in Language Theory are provided here courtesy of Nature Publishing Group

RESOURCES