Martin Forst


2011

pdf bib
A Cascaded Classification Approach to Semantic Head Recognition
Lukas Michelbacher | Alok Kothari | Martin Forst | Christina Lioma | Hinrich Schütze
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing

2009

pdf bib
Human Evaluation of a German Surface Realisation Ranker
Aoife Cahill | Martin Forst
Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009)

pdf bib
TBL-Improved Non-Deterministic Segmentation and POS Tagging for a Chinese Parser
Martin Forst | Ji Fang
Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009)

2007

pdf bib
Filling Statistics with Linguistics – Property Design for the Disambiguation of German LFG Parses
Martin Forst
ACL 2007 Workshop on Deep Linguistic Processing

pdf bib
Stochastic Realisation Ranking for a Free Word Order Language
Aoife Cahill | Martin Forst | Christian Rohrer
Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07)

2006

pdf bib
Improving coverage and parsing quality of a large-scale LFG for German
Christian Rohrer | Martin Forst
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

We describe experiments in parsing the German TIGER Treebank. In parsing the complete treebank, 86.44% of the sentences receive full parses; 13.56% receive fragment parses. We discuss the methods used to enhance coverage and parsing quality and we present an evaluation on a gold standard, to our knowledge the first one for a deep grammar of German. Considering the selection performed by our current version of a stochastic disambiguation component, we achieve an f-score of 84.2%, the upper and lower bounds being 87.4% and 82.3% respectively.

pdf bib
The importance of precise tokenizing for deep grammars
Martin Forst | Ronald M. Kaplan
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

We present a non-deterministic finite-state transducer that acts as a tokenizer and normalizer for free text that is input to a broad-coverage LFG of German. We compare the basic tokenizer used in an earlier version of the grammar and the more sophisticated tokenizer that we now use. The revised tokenizer increases the coverage of the grammar in terms of full parses from 68.3% to 73.4% on sentences 8,001 through 10,000 of the TiGer Corpus.

2004

pdf bib
Towards a Dependency-Based Gold Standard for German Parsers. The TIGER Dependency Bank
Martin Forst | Núria Bertomeu | Berthold Crysmann | Frederik Fouvry | Silvia Hansen-Schirra | Valia Kordoni
Proceedings of the 5th International Workshop on Linguistically Interpreted Corpora

2003

pdf bib
Treebank Conversion - Establishing a testsuite for a broad-coverage LFG from the TIGER treebank
Martin Forst
Proceedings of 4th International Workshop on Linguistically Interpreted Corpora (LINC-03) at EACL 2003