Unsupervised Semantic Variation Prediction using the Distribution of Sibling Embeddings

Taichi Aida, Danushka Bollegala


Abstract
Languages are dynamic entities, where the meanings associated with words constantly change with time. Detecting the semantic variation of words is an important task for various NLP applications that must make time-sensitive predictions. Existing work on semantic variation prediction have predominantly focused on comparing some form of an averaged contextualised representation of a target word computed from a given corpus. However, some of the previously associated meanings of a target word can become obsolete over time (e.g. meaning of gay as happy), while novel usages of existing words are observed (e.g. meaning of cell as a mobile phone).We argue that mean representations alone cannot accurately capture such semantic variations and propose a method that uses the entire cohort of the contextualised embeddings of the target word, which we refer to as the sibling distribution. Experimental results on SemEval-2020 Task 1 benchmark dataset for semantic variation prediction show that our method outperforms prior work that consider only the mean embeddings, and is comparable to the current state-of-the-art. Moreover, a qualitative analysis shows that our method detects important semantic changes in words that are not captured by the existing methods.
Anthology ID:
2023.findings-acl.429
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
6868–6882
Language:
URL:
https://aclanthology.org/2023.findings-acl.429
DOI:
10.18653/v1/2023.findings-acl.429
Bibkey:
Cite (ACL):
Taichi Aida and Danushka Bollegala. 2023. Unsupervised Semantic Variation Prediction using the Distribution of Sibling Embeddings. In Findings of the Association for Computational Linguistics: ACL 2023, pages 6868–6882, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Unsupervised Semantic Variation Prediction using the Distribution of Sibling Embeddings (Aida & Bollegala, Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-acl.429.pdf