Combination of Contextualized and Non-Contextualized Layers for Lexical Substitution in French

Kévin Espasa, Emmanuel Morin, Olivier Hamon


Abstract
Lexical substitution task requires to substitute a target word by candidates in a given context. Candidates must keep meaning and grammatically of the sentence. The task, introduced in the SemEval 2007, has two objectives. The first objective is to find a list of substitutes for a target word. This list of substitutes can be obtained with lexical resources like WordNet or generated with a pre-trained language model. The second objective is to rank these substitutes using the context of the sentence. Most of the methods use vector space models or more recently embeddings to rank substitutes. Embedding methods use high contextualized representation. This representation can be over contextualized and in this way overlook good substitute candidates which are more similar on non-contextualized layers. SemDis 2014 introduced the lexical substitution task in French. We propose an application of the state-of-the-art method based on BERT in French and a novel method using contextualized and non-contextualized layers to increase the suggestion of words having a lower probability in a given context but that are more semantically similar. Experiments show our method increases the BERT based system on the OOT measure but decreases on the BEST measure in the SemDis 2014 benchmark.
Anthology ID:
2022.lrec-1.747
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
6914–6921
Language:
URL:
https://aclanthology.org/2022.lrec-1.747
DOI:
Bibkey:
Cite (ACL):
Kévin Espasa, Emmanuel Morin, and Olivier Hamon. 2022. Combination of Contextualized and Non-Contextualized Layers for Lexical Substitution in French. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 6914–6921, Marseille, France. European Language Resources Association.
Cite (Informal):
Combination of Contextualized and Non-Contextualized Layers for Lexical Substitution in French (Espasa et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.747.pdf