Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages

Alexandra Butoi, Tim Vieira, Ryan Cotterell, David Chiang


Abstract
The class of tree-adjoining languages can be characterized by various two-level formalisms, consisting of a context-free grammar (CFG) or pushdown automaton (PDA) controlling another CFG or PDA. These four formalisms are equivalent to tree-adjoining grammars (TAG), linear indexed grammars (LIG), pushdown-adjoining automata (PAA), and embedded pushdown automata (EPDA). We define semiring-weighted versions of the above two-level formalisms, and we design new algorithms for computing their stringsums (the weight of all derivations of a string) and allsums (the weight of all derivations). From these, we also immediately obtain stringsum and allsum algorithms for TAG, LIG, PAA, and EPDA. For LIG, our algorithm is more time-efficient by a factor of 𝒪(n|𝒩|) (where n is the string length and |𝒩| is the size of the nonterminal set) and more space-efficient by a factor of 𝒪(|𝛤|) (where 𝛤 is the size of the stack alphabet) than the algorithm of Vijay-Shanker and Weir (1989). For EPDA, our algorithm is both more space-efficient and time-efficient than the algorithm of Alonso et al. (2001) by factors of 𝒪(|𝛤|2) and 𝒪(|𝛤|3), respectively. Finally, we give the first PAA stringsum and allsum algorithms.
Anthology ID:
2023.emnlp-main.328
Volume:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5394–5416
Language:
URL:
https://aclanthology.org/2023.emnlp-main.328
DOI:
10.18653/v1/2023.emnlp-main.328
Bibkey:
Cite (ACL):
Alexandra Butoi, Tim Vieira, Ryan Cotterell, and David Chiang. 2023. Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 5394–5416, Singapore. Association for Computational Linguistics.
Cite (Informal):
Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages (Butoi et al., EMNLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.emnlp-main.328.pdf
Video:
 https://aclanthology.org/2023.emnlp-main.328.mp4