Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document

Xiangnan Chen, Qian Xiao, Juncheng Li, Duo Dong, Jun Lin, Xiaozhong Liu, Siliang Tang


Abstract
Visual Relation Extraction (VRE) is a powerful means of discovering relationships between entities within visually-rich documents. Existing methods often focus on manipulating entity features to find pairwise relations, yet neglect the more fundamental structural information that links disparate entity pairs together. Without global structure information, a model may struggle to learn long-range relations and can easily predict conflicting results. To alleviate these limitations, we propose a GlObal Structure knowledge-guided relation Extraction (GOSE) framework. GOSE begins by generating preliminary relation predictions on entity pairs extracted from a scanned image of the document. Global structural knowledge is then captured from the predictions of the preceding iteration and incorporated into the representations of the entities. This “generate-capture-incorporate” cycle is repeated multiple times, allowing entity representations and global structure knowledge to be mutually reinforced. Extensive experiments validate that GOSE not only outperforms existing methods in the standard fine-tuning setting but also shows superior cross-lingual learning capabilities, and even yields stronger data-efficient performance in the low-resource setting.
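The “generate-capture-incorporate” cycle described in the abstract can be illustrated with a small toy loop. This is only a sketch under stated assumptions, not the GOSE implementation: the abstract does not specify how predictions are scored, how global structure is captured, or how it is fused back into the entities. The functions `generate`, `capture`, `incorporate`, and `run_cycle`, the dot-product scoring, the score-weighted averaging, and the mixing weight `alpha` are all hypothetical stand-ins chosen to keep the example self-contained.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def generate(entities):
    """Toy 'generate' step: score every ordered entity pair.

    Assumption: sigmoid of a dot product stands in for a learned relation head.
    """
    n = len(entities)
    return [
        [
            0.0 if i == j
            else sigmoid(sum(a * b for a, b in zip(entities[i], entities[j])))
            for j in range(n)
        ]
        for i in range(n)
    ]


def capture(entities, scores):
    """Toy 'capture' step: summarise structure from the current predictions.

    Assumption: each entity receives a score-weighted average of the others.
    """
    dim = len(entities[0])
    context = []
    for row in scores:
        total = sum(row) or 1.0  # avoid dividing by zero for a single entity
        context.append(
            [sum(w * e[d] for w, e in zip(row, entities)) / total for d in range(dim)]
        )
    return context


def incorporate(entities, context, alpha=0.5):
    """Toy 'incorporate' step: blend the captured context into each entity."""
    return [
        [(1 - alpha) * e + alpha * c for e, c in zip(ent, ctx)]
        for ent, ctx in zip(entities, context)
    ]


def run_cycle(entities, iterations=3, alpha=0.5):
    """Repeat generate -> capture -> incorporate, then return final predictions."""
    for _ in range(iterations):
        scores = generate(entities)
        context = capture(entities, scores)
        entities = incorporate(entities, context, alpha)
    return entities, generate(entities)


# Hypothetical 2-dimensional entity features, purely for illustration.
toy_entities = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
final_entities, final_scores = run_cycle(toy_entities)
```

The point of the sketch is only the control flow: predictions made in one pass feed a structure summary, which updates the entity representations used for the next pass.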
Anthology ID:
2023.findings-emnlp.107
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2023
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
1587–1598
URL:
https://aclanthology.org/2023.findings-emnlp.107
DOI:
10.18653/v1/2023.findings-emnlp.107
Cite (ACL):
Xiangnan Chen, Qian Xiao, Juncheng Li, Duo Dong, Jun Lin, Xiaozhong Liu, and Siliang Tang. 2023. Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 1587–1598, Singapore. Association for Computational Linguistics.
Cite (Informal):
Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document (Chen et al., Findings 2023)
PDF:
https://aclanthology.org/2023.findings-emnlp.107.pdf