Gradient-Based Adversarial Factual Consistency Evaluation for Abstractive Summarization

摘要

Neural abstractive summarization systems have gained significant progress in recent years. However, excessive abstractiveness inevitably leads to factual errors, which poses a challenge to the robustness of the systems. In this paper, we proposed an efficient weak-supervised adversarial data augmentation approach to form the factual consistency dataset. Based on the artificial dataset, we train an evaluation model that can not only make accurate and robust factual consistency discrimination but is also capable of making interpretable factual errors tracing by backpropagated gradient distribution on token embeddings. Experiments and analysis conduct on public annotated summarization and factual consistency datasets demonstrate our approach effective and reasonable.

会议
EMNLP 2021
曾致远
曾致远
硕士研究生

自然语言理解,文本生成

徐蔚然
徐蔚然
副教授,硕士生导师,博士生导师

信息检索,模式识别,机器学习