Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task

Abstract

We utilize a multi-level data augmentation method (character, word, and sentence levels) to construct a candidate data pool, and carefully design two automatic task demonstration construction strategies (instance-level and entity-level) with various prompt templates. Our aim is to assess how well various robustness methods of LLMs perform in real-world noisy scenarios. The experiments demonstrate that current open-source LLMs generally achieve limited perturbation robustness. Based on these experimental observations, we make some forward-looking suggestions to fuel research in this direction.
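For illustration, the sketch below shows how character- and word-level perturbations could be applied to clean slot filling utterances to build a candidate data pool, in the spirit of the multi-level augmentation described above. This is a minimal, hypothetical example rather than the released code; the function names, noise types, and rates are assumptions.

```python
# Minimal sketch (not the authors' released code) of multi-level input
# perturbation for slot filling utterances. Noise choices are illustrative.
import random


def char_level_noise(text: str, rate: float = 0.1) -> str:
    """Randomly drop letters to simulate typos (character-level noise)."""
    return "".join(c for c in text if not (c.isalpha() and random.random() < rate))


def word_level_noise(text: str, rate: float = 0.1) -> str:
    """Randomly swap adjacent words to simulate disordered input (word-level noise)."""
    words = text.split()
    for i in range(len(words) - 1):
        if random.random() < rate:
            words[i], words[i + 1] = words[i + 1], words[i]
    return " ".join(words)


def build_candidate_pool(utterances: list[str]) -> list[str]:
    """Apply each perturbation level to every clean utterance to form a candidate data pool."""
    pool = []
    for utt in utterances:
        pool.append(char_level_noise(utt))
        pool.append(word_level_noise(utt))
        # Sentence-level noise (e.g., paraphrasing) would typically require an
        # external model and is omitted from this sketch.
    return pool


if __name__ == "__main__":
    clean = ["book a flight from boston to denver tomorrow morning"]
    for noisy in build_candidate_pool(clean):
        print(noisy)
```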

Publication
NLPCC 2023
Guanting Dong
Postgraduate Student

Spoken Language Understanding and related applications

Jinxu Zhao
Postgraduate Student
Tingfeng Hui
Postgraduate Student
Daichi Guo
Postgraduate Student

Debiasing, Class Imbalance

ZhuomaGongQue
Research Intern
Keqing He
Postgraduate Student

Dialogue System, Summarization, Pre-training Language Model

Zechen Wang
Postgraduate Student
Weiran Xu
Associate Professor, Master Supervisor, Ph.D. Supervisor

Information Retrieval, Pattern Recognition, Machine Learning