Lingxi: A Diversity-aware Chinese Modern Poetry Generation System

Xinran Zhang, Maosong Sun, Jiafeng Liu, Xiaobing Li


Abstract
Chinese modern poetry generation has been a challenging task. One issue is the Chinese word segmentation (CWS) which is critical to comprehend the Chinese language but was not always considered in common tokenization methods. Another is the decoding (sampling) method which may induce repetition and boredom and severely lower the diversity of the generated poetry. To address these issues, we present Lingxi, a diversity-aware Chinese modern poetry generation system. For the CWS issue, we propose a novel framework that incorporates CWS in the tokenization process. The proposed method can achieve a high vocabulary coverage rate with a reasonable vocabulary size. For the decoding method and the diversity issue, we propose a novel sampling algorithm that flattens the high likelihood part of the predicted distribution of the language model to emphasize the comparatively low-likelihood words and increase the diversity of generated poetry. Empirical results show that even when the top 60% of cumulative probability mass of the predicted distribution is flattened, our method achieves comparable or even better performance than baseline sampling methods. Our system is available at http://lingxi.website.
Anthology ID:
2023.acl-demo.6
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Danushka Bollegala, Ruihong Huang, Alan Ritter
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
63–75
Language:
URL:
https://aclanthology.org/2023.acl-demo.6
DOI:
10.18653/v1/2023.acl-demo.6
Bibkey:
Cite (ACL):
Xinran Zhang, Maosong Sun, Jiafeng Liu, and Xiaobing Li. 2023. Lingxi: A Diversity-aware Chinese Modern Poetry Generation System. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 63–75, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Lingxi: A Diversity-aware Chinese Modern Poetry Generation System (Zhang et al., ACL 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.acl-demo.6.pdf
Video:
 https://aclanthology.org/2023.acl-demo.6.mp4