Disentangling Confidence Score Distribution for Out-of-Domain Intent Detection with Energy-Based Learning

Abstract

Detecting Out-of-Domain (OOD) or unknown intents from user queries is essential in a task-oriented dialog system. Traditional softmax-based confidence scores are susceptible to the overconfidence issue. In this paper, we propose a simple but strong energy-based score function to detect OOD where the energy scores of OOD samples are higher than IND samples. Further,given a small set of labeled OOD samples, we introduce an energy-based margin objective for supervised OOD detection to explicitly distinguish OOD samples from INDs. Comprehensive experiments and analysis prove our method helps disentangle confidence score distributions of IND and OOD data.

Publication
EMNLP 2022 workshop
Yanan Wu
Yanan Wu
Postgraduate Student

Spoken Language Understading

Zhiyuan Zeng
Zhiyuan Zeng
Postgraduate Student

Spoken Language Understanding, Text Generation

Keqing He
Postgraduate Student

Dialogue System, Summarization, Pre-training Language Model

Yutao Mou
Yutao Mou
Postgraduate Student

Task-oriented Dialogue System, Spoken Language Understading

Pei Wang
Pei Wang
Postgraduate Student
Yuanmeng Yan
Yuanmeng Yan
Postgraduate Student

Spoken Language Understanding, Pre-training Language Model

Weiran Xu
Weiran Xu
Associate Professor, Master Supervisor, Ph.D Supervisor

Information Retrieval, Pattern Recognition, Machine Learning