Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery

摘要

Generalized intent discovery aims to extend a closed-set in-domain intent classifier to an open-world intent set including in-domain and out-of-domain intents. The key challenges lie in pseudo label disambiguation and representation learning. Previous methods suffer from a coupling of pseudo label disambiguation and representation learning, that is, the reliability of pseudo labels relies on representation learning, and representation learning is restricted by pseudo labels in turn. In this paper, we propose a decoupled prototype learning framework (DPL) to decouple pseudo label disambiguation and representation learning. Specifically, we firstly introduce prototypical contrastive representation learning (PCL) to get discriminative representations. And then we adopt a prototype-based label disambiguation method (PLD) to obtain pseudo labels. We theoretically prove that PCL and PLD work in a collaborative fashion and facilitate pseudo label disambiguation. Experiments and analysis on three benchmark datasets show the effectiveness of our method.

会议
ACL 2023
牟宇滔
牟宇滔
硕士研究生

任务型对话系统,自然语言理解

宋晓帅
宋晓帅
硕士研究生
何可清
硕士研究生

对话系统,摘要,预训练

曾晨
曾晨
硕士研究生
王霈
王霈
硕士研究生
徐蔚然
徐蔚然
副教授,硕士生导师,博士生导师

信息检索,模式识别,机器学习