Iustina Ilisei


2010

Romanian Zero Pronoun Distribution: A Comparative Study
Claudiu Mihăilă | Iustina Ilisei | Diana Inkpen
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

Anaphora resolution is still a challenging research field in natural language processing, lacking an algorithm that correctly resolves anaphoric pronouns. Anaphoric zero pronouns pose an even greater challenge, since this category is not lexically realised. Thus, their resolution is conditioned by their prior identification stage. This paper reports on the distribution of zero pronouns in Romanian in various genres: encyclopaedic, legal, literary, and news-wire texts. For this purpose, the RoZP corpus has been created, containing almost 50000 tokens and 800 zero pronouns which are manually annotated. The distribution patterns are compared across genres, and exceptional cases are presented in order to facilitate the methodological process of developing a future zero pronoun identification and resolution algorithm. The evaluation results emphasise that zero pronouns appear frequently in Romanian, and their distribution depends largely on the genre. Additionally, possible features are revealed for their identification, and a search scope for the antecedent has been determined, increasing the chances of correct resolution.

2009

A Rule-Based Approach to the Identification of Spanish Zero Pronouns
Luz Rello | Iustina Ilisei
Proceedings of the Student Research Workshop

Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning
Iustina Ilisei | Viktor Pekar | Silvia Bernardini
Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning