Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems

摘要

Recent advances in neural approaches greatly improve task-oriented dialogue (TOD) systems which assist users to accomplish their goals. However, such systems rely on costly manually labeled dialogs which are not available in practical scenarios. In this paper, we present our models for Track 2 of the SereTOD 2022 challenge, which is the first challenge of building semi-supervised and reinforced TOD systems on a large-scale real-world Chinese TOD dataset MobileCS. We build a knowledge-grounded dialog model to formulate dialog history and local KB as input and predict the system response. And we perform semi-supervised pre-training both on the labeled and unlabeled data. Our system achieves the first place both in the automatic evaluation and human interaction, especially with higher BLEU (+7.64) and Success (+13.6%) than the second place.

会议
EMNLP2022 workshop (SereTOD)
曾伟豪
曾伟豪
硕士研究生
何可清
硕士研究生

对话系统,摘要,预训练

王泽晨
王泽晨
硕士研究生
傅大源
傅大源
硕士研究生
董冠霆
董冠霆
硕士研究生

自然语言理解

耿若彤
耿若彤
硕士研究生

摘要生成

王霈
王霈
硕士研究生
徐蔚然
徐蔚然
副教授,硕士生导师,博士生导师

信息检索,模式识别,机器学习