A classic result in language theory is Medvedev’s theorem for trees, stating that any regular tree language can be defined by the projection of a local tree language, i.e., of a language defined by its tiles of height 2, a.k.a. di-grams. The simple proof of the statement is based on a local alphabet whose size (linearly) depends on the number of states of the tree automaton recognizing the original language. We prove a new extended version of Medvedev’s theorem for trees, by using a k-locally testable tree language defined by tiles of height k≥ 2 (k-grams). The size of the local alphabet is just the double of the original one, hence it is independent from the complexity of the tree automaton. This result relies on an encoding of the states traversed by a tree automaton, by means of binary comma-free codes carefully laid on tree paths. We thus generalize from words to trees our recent extended Medvedev’s theorem for word languages that was based on strictly locally testable word languages. By applying the result to the syntax trees of context-free grammars, we characterize them by k-locally testable, binary-labeled trees.

Homomorphic Characterization of Tree Languages Based on Comma-Free Encoding

Crespi Reghizzi S.;San Pietro P.
2021-01-01

Abstract

A classic result in language theory is Medvedev’s theorem for trees, stating that any regular tree language can be defined by the projection of a local tree language, i.e., of a language defined by its tiles of height 2, a.k.a. di-grams. The simple proof of the statement is based on a local alphabet whose size (linearly) depends on the number of states of the tree automaton recognizing the original language. We prove a new extended version of Medvedev’s theorem for trees, by using a k-locally testable tree language defined by tiles of height k≥ 2 (k-grams). The size of the local alphabet is just the double of the original one, hence it is independent from the complexity of the tree automaton. This result relies on an encoding of the states traversed by a tree automaton, by means of binary comma-free codes carefully laid on tree paths. We thus generalize from words to trees our recent extended Medvedev’s theorem for word languages that was based on strictly locally testable word languages. By applying the result to the syntax trees of context-free grammars, we characterize them by k-locally testable, binary-labeled trees.
2021
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
978-3-030-68194-4
978-3-030-68195-1
File in questo prodotto:
File Dimensione Formato  
978-3-030-68195-1_19.pdf

Accesso riservato

: Publisher’s version
Dimensione 401.37 kB
Formato Adobe PDF
401.37 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1233583
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact