PSILENCE: A Pseudonymization Tool for International Law

Luis Adrián Cabrera-Diego, Akshita Gheewala


Abstract
Since the announcement of the GDPR, the pseudonymization of legal documents has become a high-priority task in many legal organizations. This means that for making public a document, it is necessary to redact the identity of certain entities, such as witnesses. In this work, we present the first results obtained by PSILENCE, a pseudonymization tool created for redacting semi-automatically international arbitration documents in English. PSILENCE has been built using a Named Entity Recognition (NER) system, along with a Coreference Resolution system. These systems allow us to find the people that we need to redact in a clustered way, but also to propose the same pseudonym throughout one document. This last aspect makes it easier to read and comprehend a redacted legal document. Different experiments were done on four different datasets, one of which was legal, and the results are promising, reaching a Macro F-score of up to 0.72 on the legal dataset.
Anthology ID:
2024.caldpseudo-1.4
Volume:
Proceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo 2024)
Month:
March
Year:
2024
Address:
St. Julian’s, Malta
Editors:
Elena Volodina, David Alfter, Simon Dobnik, Therese Lindström Tiedemann, Ricardo Muñoz Sánchez, Maria Irena Szawerna, Xuan-Son Vu
Venues:
CALD-pseudo | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
25–36
Language:
URL:
https://aclanthology.org/2024.caldpseudo-1.4
DOI:
Bibkey:
Cite (ACL):
Luis Adrián Cabrera-Diego and Akshita Gheewala. 2024. PSILENCE: A Pseudonymization Tool for International Law. In Proceedings of the Workshop on Computational Approaches to Language Data Pseudonymization (CALD-pseudo 2024), pages 25–36, St. Julian’s, Malta. Association for Computational Linguistics.
Cite (Informal):
PSILENCE: A Pseudonymization Tool for International Law (Cabrera-Diego & Gheewala, CALD-pseudo-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.caldpseudo-1.4.pdf