DRAVIDIAN LANGUAGE@ LT-EDI 2024:Pretrained Transformer based Automatic Speech Recognition system for Elderly People

Abirami. J; Aruna Devi. S; Dharunika Sasikumar; Bharathi B

Abstract

This study builds an automatic speech recognition (ASR) system tailored to the Tamil language. The dataset consists of audio recordings collected from vulnerable populations in the Tamil region, including elderly men and women and transgender individuals. The ASR system is built on the pre-trained model Rajaram1996/wav2vec2-large-xlsr-53-tamil, which is fine-tuned on a variety of datasets containing typical Tamil voices. The system is then run on a dedicated test dataset, and the resulting transcriptions are submitted for assessment. Performance is evaluated using the Word Error Rate (WER); our system achieves a WER of 37.733.
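As an illustration of the evaluation metric named in the abstract, the following is a minimal sketch of how a word error rate can be computed from a reference transcript and a system transcript. It uses the standard definition (word-level edit distance divided by the number of reference words); the function name `word_error_rate` and the whitespace tokenization are assumptions made for this example, not the authors' evaluation script, and the abstract does not say how the shared task normalized text before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; tokens are split on whitespace.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)
```

The function returns a fraction, so multiplying by 100 gives a percentage. Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.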

Anthology ID: 2024.ltedi-1.31
Original: 2024.ltedi-1.31v1
Version 2: 2024.ltedi-1.31v2
Volume: Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion
Month: March
Year: 2024
Address: St. Julian's, Malta
Editors: Bharathi Raja Chakravarthi, Bharathi B, Paul Buitelaar, Thenmozhi Durairaj, György Kovács, Miguel Ángel García Cumbreras
Venues: LTEDI | WS
Publisher: Association for Computational Linguistics
Pages: 244–248
URL: https://aclanthology.org/2024.ltedi-1.31
Cite (ACL): Abirami. J, Aruna Devi. S, Dharunika Sasikumar, and Bharathi B. 2024. DRAVIDIAN LANGUAGE@ LT-EDI 2024:Pretrained Transformer based Automatic Speech Recognition system for Elderly People. In Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion, pages 244–248, St. Julian's, Malta. Association for Computational Linguistics.
Cite (Informal): DRAVIDIAN LANGUAGE@ LT-EDI 2024:Pretrained Transformer based Automatic Speech Recognition system for Elderly People (J et al., LTEDI-WS 2024)
PDF: https://aclanthology.org/2024.ltedi-1.31.pdf
