Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews

Gaurish Thakkar, Nives Mikelic Preradovic, Marko Tadić


Abstract
This paper introduces Cro-FiReDa, a sentiment-annotated dataset of Croatian movie reviews. The dataset contains over 10,000 sentences and is annotated at the sentence level. In addition to describing the overall annotation process, we present benchmark results obtained with a transformer-based fine-tuning approach.
Anthology ID:
2023.bsnlp-1.4
Volume:
Proceedings of the 9th Workshop on Slavic Natural Language Processing 2023 (SlavicNLP 2023)
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Jakub Piskorski, Michał Marcińczuk, Preslav Nakov, Maciej Ogrodniczuk, Senja Pollak, Pavel Přibáň, Piotr Rybak, Josef Steinberger, Roman Yangarber
Venue:
BSNLP
Publisher:
Association for Computational Linguistics
Pages:
25–31
URL:
https://aclanthology.org/2023.bsnlp-1.4
DOI:
10.18653/v1/2023.bsnlp-1.4
Cite (ACL):
Gaurish Thakkar, Nives Mikelic Preradovic, and Marko Tadić. 2023. Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews. In Proceedings of the 9th Workshop on Slavic Natural Language Processing 2023 (SlavicNLP 2023), pages 25–31, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews (Thakkar et al., BSNLP 2023)
PDF:
https://aclanthology.org/2023.bsnlp-1.4.pdf
Video:
https://aclanthology.org/2023.bsnlp-1.4.mp4