Parameter-Efficient Finetuning for Robust Continual Multilingual Learning

Kartikeya Badola, Shachi Dave, Partha Talukdar


Abstract
We introduce and study the problem of Continual Multilingual Learning (CML) where a previously trained multilingual model is periodically updated using new data arriving in stages. If the new data is present only in a subset of languages, we find that the resulting model shows improved performance only on the languages included in the latest update (and a few closely related languages) while its performance on all the remaining languages degrade significantly. We address this challenge by proposing LAFT-URIEL, a parameter-efficient finetuning strategy which aims to increase the number of languages on which the model improves after an update, while reducing the magnitude of loss in performance for the remaining languages. LAFT-URIEL uses linguistic knowledge to balance overfitting and knowledge sharing across languages, allowing for an additional 25% of task languages to see an improvement in performance after an update, while also reducing the average magnitude of losses on the remaining languages by 78% relative.
Anthology ID:
2023.findings-acl.619
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
9763–9780
Language:
URL:
https://aclanthology.org/2023.findings-acl.619
DOI:
10.18653/v1/2023.findings-acl.619
Bibkey:
Cite (ACL):
Kartikeya Badola, Shachi Dave, and Partha Talukdar. 2023. Parameter-Efficient Finetuning for Robust Continual Multilingual Learning. In Findings of the Association for Computational Linguistics: ACL 2023, pages 9763–9780, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Parameter-Efficient Finetuning for Robust Continual Multilingual Learning (Badola et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-acl.619.pdf