Reliable and Interpretable Drift Detection in Streams of Short Texts

Ella Rabinovich, Matan Vetzler, Samuel Ackerman, Ateret Anaby Tavor


Abstract
Data drift is the change in model input data that is one of the key factors leading to machine learning models performance degradation over time. Monitoring drift helps detecting these issues and preventing their harmful consequences. Meaningful drift interpretation is a fundamental step towards effective re-training of the model. In this study we propose an end-to-end framework for reliable model-agnostic change-point detection and interpretation in large task-oriented dialog systems, proven effective in multiple customer deployments. We evaluate our approach and demonstrate its benefits with a novel variant of intent classification training dataset, simulating customer requests to a dialog system. We make the data publicly available.
Anthology ID:
2023.acl-industry.42
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Sunayana Sitaram, Beata Beigman Klebanov, Jason D Williams
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
438–446
Language:
URL:
https://aclanthology.org/2023.acl-industry.42
DOI:
10.18653/v1/2023.acl-industry.42
Bibkey:
Cite (ACL):
Ella Rabinovich, Matan Vetzler, Samuel Ackerman, and Ateret Anaby Tavor. 2023. Reliable and Interpretable Drift Detection in Streams of Short Texts. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), pages 438–446, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Reliable and Interpretable Drift Detection in Streams of Short Texts (Rabinovich et al., ACL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.acl-industry.42.pdf