Information Extraction from Polish Radiology Reports Using Language Models

Aleksander Obuchowski, Barbara Klaudel, Patryk Jasik


Abstract
Radiology reports are vital to directing patient care. They are usually delivered in free-text form, which makes them prone to errors such as omitted radiological findings and difficult-to-comprehend mental shortcuts. Although structured reporting is the recommended method, its adoption continues to be limited because radiologists find structured reports too limiting and burdensome. In this paper, we propose a model that is meant to preserve the benefits of free text while moving towards a structured report. The model automatically parametrizes Polish radiology reports using language models. The models were trained on a large dataset of 1200 chest computed tomography (CT) reports annotated by multiple medical experts with 44 observation tags. Experimental analysis shows that models based on language models are able to achieve satisfactory results despite being pre-trained on general-domain corpora. Overall, the model achieves an F1 score of 81% and is able to successfully parametrize the most common radiological observations, allowing for potential adaptation in clinical practice. Our model is publicly available.
Anthology ID:
2023.bsnlp-1.14
Volume:
Proceedings of the 9th Workshop on Slavic Natural Language Processing 2023 (SlavicNLP 2023)
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Jakub Piskorski, Michał Marcińczuk, Preslav Nakov, Maciej Ogrodniczuk, Senja Pollak, Pavel Přibáň, Piotr Rybak, Josef Steinberger, Roman Yangarber
Venue:
BSNLP
Publisher:
Association for Computational Linguistics
Pages:
113–122
URL:
https://aclanthology.org/2023.bsnlp-1.14
DOI:
10.18653/v1/2023.bsnlp-1.14
Cite (ACL):
Aleksander Obuchowski, Barbara Klaudel, and Patryk Jasik. 2023. Information Extraction from Polish Radiology Reports Using Language Models. In Proceedings of the 9th Workshop on Slavic Natural Language Processing 2023 (SlavicNLP 2023), pages 113–122, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
Information Extraction from Polish Radiology Reports Using Language Models (Obuchowski et al., BSNLP 2023)
PDF:
https://aclanthology.org/2023.bsnlp-1.14.pdf
Video:
https://aclanthology.org/2023.bsnlp-1.14.mp4