. 2024 Mar 17;14:6420. doi: 10.1038/s41598-024-56259-z

Table 1.

Adversarial text attack methods and their main ideas across different levels.

Attack level	Attack	Main idea
Character-level	Generating Adversarial Text Against Real-world Applications (TextBugger)²³	Greedy word substitution and character manipulation
	Universal Adversarial Triggers for Attacking and Analyzing NLP (UAT)¹	Gradient-based word or character manipulation
	Visually Attacking and Shielding NLP Systems (VIPER) ²⁴	Visually similar character substitution
	Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers (DeepWordBug) ²⁵	Greedy character manipulation
	TextFooler (TF)²⁶	Greedy word substitution
Word-level	White-Box Adversarial Examples for Text Classification (HotFlip)²⁷	Gradient-based word or character substitution
	Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency (PWWS)²⁸	Greedy word substitution
	Generating Natural Language Adversarial Examples (Genetic)²⁹	Genetic algorithm-based word substitution
	Word-level Textual Adversarial Attacking as Combinatorial Optimization (SememePSO)³⁰	Particle swarm optimization-based word substitution
	Adversarial Attack Against BERT Using BERT (BERT-ATTACK)³¹	Greedy contextualized word substitution
	BERT-based Adversarial Examples for Text Classification (BAE)³²	Greedy contextualized word substitution and insertion
	Semantically Equivalent Adversarial Rules for Debugging NLP Models (SEA)³³	Rule-based paraphrasing
Sentence-level	Adversarial Example Generation with Syntactically Controlled Paraphrase Networks (SCPN)³⁴	Paraphrasing
Sentence-level	Generating Natural Adversarial Examples (GAN)³⁵	Text generation by encoder–decoder