CUPID: Curriculum Learning Based Real-Time Prediction using Distillation

Arindam Bhattacharya; Ankith Ms; Ankit Gandhi; Vijay Huddar; Atul Saroop; Rahul Bhagat

doi:10.18653/v1/2023.acl-industry.69

CUPID: Curriculum Learning Based Real-Time Prediction using Distillation

Arindam Bhattacharya, Ankith Ms, Ankit Gandhi, Vijay Huddar, Atul Saroop, Rahul Bhagat

Abstract

Relevance in E-commerce Product Search is crucial for providing customers with accurate results that match their query intent. With recent advancements in NLP and Deep Learning, Transformers have become the default choice for relevance classification tasks. In such a setting, the relevance model uses query text and product title as input features, and estimates if the product is relevant for the customer query. While cross-attention in Transformers enables a more accurate relevance prediction in such a setting, its high evaluation latency makes it unsuitable for real-time predictions in which thousands of products must be evaluated against a user query within few milliseconds. To address this issue, we propose CUPID: a Curriculum learning based real-time Prediction using Distillation that utilizes knowledge distillation within a curriculum learning setting to learn a simpler architecture that can be evaluated within low latency budgets. In a bi-lingual relevance prediction task, our approach shows an 302 bps improvement on English and 676 bps improvement for low-resource Arabic, while maintaining the low evaluation latency on CPUs.

Anthology ID:: 2023.acl-industry.69
Volume:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Sunayana Sitaram, Beata Beigman Klebanov, Jason D Williams
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 720–728
Language:
URL:: https://aclanthology.org/2023.acl-industry.69
DOI:: 10.18653/v1/2023.acl-industry.69
Bibkey:
Cite (ACL):: Arindam Bhattacharya, Ankith Ms, Ankit Gandhi, Vijay Huddar, Atul Saroop, and Rahul Bhagat. 2023. CUPID: Curriculum Learning Based Real-Time Prediction using Distillation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), pages 720–728, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: CUPID: Curriculum Learning Based Real-Time Prediction using Distillation (Bhattacharya et al., ACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.acl-industry.69.pdf

PDF Cite Search