Beware of Model Collapse! Fast and Stable Test-time Adaptation for Robust Question Answering

Yi Su, Yixin Ji, Juntao Li, Hai Ye, Min Zhang


Abstract
Although pre-trained language models (PLM) have achieved great success in question answering (QA), their robustness is still insufficient to support their practical applications, especially in the face of distribution shifts. Recently, test-time adaptation (TTA) has shown great potential for solving this problem, which adapts the model to fit the test samples at test time. However, TTA sometimes causes model collapse, making almost all the model outputs incorrect, which has raised concerns about its stability and reliability. In this paper, we delve into why TTA causes model collapse and find that the imbalanced label distribution inherent in QA is the reason for it. To address this problem, we propose Anti-Collapse Fast test-time adaptation (Anti-CF), which utilizes the source model‘s output to regularize the update of the adapted model during test time. We further design an efficient side block to reduce its inference time. Extensive experiments on various distribution shift scenarios and pre-trained language models (e.g., XLM-RoBERTa, BLOOM) demonstrate that our method can achieve comparable or better results than previous TTA methods at a speed close to vanilla forward propagation, which is 1.8× to 4.4× speedup compared to previous TTA methods.
Anthology ID:
2023.emnlp-main.803
Volume:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
12998–13011
Language:
URL:
https://aclanthology.org/2023.emnlp-main.803
DOI:
10.18653/v1/2023.emnlp-main.803
Bibkey:
Cite (ACL):
Yi Su, Yixin Ji, Juntao Li, Hai Ye, and Min Zhang. 2023. Beware of Model Collapse! Fast and Stable Test-time Adaptation for Robust Question Answering. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 12998–13011, Singapore. Association for Computational Linguistics.
Cite (Informal):
Beware of Model Collapse! Fast and Stable Test-time Adaptation for Robust Question Answering (Su et al., EMNLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.emnlp-main.803.pdf
Video:
 https://aclanthology.org/2023.emnlp-main.803.mp4