Enhance Robustness of Language Models Against Variation Attack through Graph Integration

Abstract

The widespread use of pre-trained language models (PLMs) in natural languageprocessing (NLP) has greatly improved performance outcomes. However, thesemodels' vulnerability to adversarial attacks (e.g., camouflaged hints from drugdealers), particularly in the Chinese language with its rich characterdiversity/variation and complex structures, hatches vital apprehension. In thisstudy, we propose a novel method, CHinese vAriatioN Graph Enhancement (CHANGE),to increase the robustness of PLMs against character variation attacks inChinese content. CHANGE presents a novel approach for incorporating a Chinesecharacter variation graph into the PLMs. Through designing differentsupplementary tasks utilizing the graph structure, CHANGE essentially enhancesPLMs' interpretation of adversarially manipulated text. Experiments conductedin a multitude of NLP tasks show that CHANGE outperforms current languagemodels in combating against adversarial attacks and serves as a valuablecontribution to robust language model research. These findings contribute tothe groundwork on robust language models and highlight the substantialpotential of graph-guided pre-training strategies for real-world applications.

Quick Read (beta)

loading the full paper ...