Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity

Gabriel Simmons

doi:10.18653/v1/2023.acl-srw.40

Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity

Abstract

Large Language Models (LLMs) have demonstrated impressive capabilities in generating fluent text, as well as tendencies to reproduce undesirable social biases. This work investigates whether LLMs reproduce the moral biases associated with political groups in the United States, an instance of a broader capability herein termed moral mimicry. This work explores this hypothesis in the GPT-3/3.5 and OPT families of Transformer-based LLMs. Using tools from Moral Foundations Theory, this work shows that these LLMs are indeed moral mimics. When prompted with a liberal or conservative political identity, the models generate text reflecting corresponding moral biases. This study also explores the relationship between moral mimicry and model size, and similarity between human and LLM moral word use.

Anthology ID:: 2023.acl-srw.40
Volume:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Vishakh Padmakumar, Gisela Vallejo, Yao Fu
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 282–297
Language:
URL:: https://aclanthology.org/2023.acl-srw.40
DOI:: 10.18653/v1/2023.acl-srw.40
Bibkey:
Cite (ACL):: Gabriel Simmons. 2023. Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), pages 282–297, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity (Simmons, ACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.acl-srw.40.pdf

PDF Cite Search