hub / github.com/QData/TextAttack

github.com/QData/TextAttack @v0.3.11 sqlite

repository ↗ · DeepWiki ↗ · release v0.3.11 ↗

1,245 symbols 4,187 edges 270 files 544 documented · 44%

README

TextAttack 🐙

Generating adversarial examples for NLP models

[TextAttack Documentation on ReadTheDocs]

About • Setup • Usage • Design

TextAttack Demo GIF

About

TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP.

If you're looking for information about TextAttack's menagerie of pre-trained models, you might want the TextAttack Model Zoo page.

Slack Channel

For help and realtime updates related to TextAttack, please join the TextAttack Slack!

Why TextAttack?

There are lots of reasons to use TextAttack:

Understand NLP models better by running different adversarial attacks on them and examining the output
Research and develop different NLP adversarial attacks using the TextAttack framework and library of components
Augment your dataset to increase model generalization and robustness downstream
Train NLP models using just a single command (all downloads included!)

Setup

Installation

You should be running Python 3.6+ to use this package. A CUDA-compatible GPU is optional but will greatly improve code speed. TextAttack is available through pip:

pip install textattack

Once TextAttack is installed, you can run it via command-line (textattack ...) or via python module (python -m textattack ...).

Tip: TextAttack downloads files to ~/.cache/textattack/ by default. This includes pretrained models, dataset samples, and the configuration file config.yaml. To change the cache path, set the environment variable TA_CACHE_DIR. (for example: TA_CACHE_DIR=/tmp/ textattack attack ...).

Usage

Help: `textattack --help`

TextAttack's main features can all be accessed via the textattack command. Two very common commands are textattack attack <args>, and textattack augment <args>. You can see more information about all commands using

textattack --help

or a specific command using, for example,

textattack attack --help

The examples/ folder includes scripts showing common TextAttack usage for training models, running attacks, and augmenting a CSV file.

The documentation website contains walkthroughs explaining basic usage of TextAttack, including building a custom transformation and a custom constraint..

Running Attacks: `textattack attack --help`

The easiest way to try out an attack is via the command-line interface, textattack attack.

Tip: If your machine has multiple GPUs, you can distribute the attack across them using the --parallel option. For some attacks, this can really help performance. (If you want to attack Keras models in parallel, please check out examples/attack/attack_keras_parallel.py instead)

Here are some concrete examples:

TextFooler on BERT trained on the MR sentiment classification dataset:

textattack attack --recipe textfooler --model bert-base-uncased-mr --num-examples 100

DeepWordBug on DistilBERT trained on the Quora Question Pairs paraphrase identification dataset:

textattack attack --model distilbert-base-uncased-cola --recipe deepwordbug --num-examples 100

Beam search with beam width 4 and word embedding transformation and untargeted goal function on an LSTM:

textattack attack --model lstm-mr --num-examples 20 \
 --search-method beam-search^beam_width=4 --transformation word-swap-embedding \
 --constraints repeat stopword max-words-perturbed^max_num_words=2 embedding^min_cos_sim=0.8 part-of-speech \
 --goal-function untargeted-classification

Tip: Instead of specifying a dataset and number of examples, you can pass --interactive to attack samples inputted by the user.

Attacks and Papers Implemented ("Attack Recipes"): `textattack attack --recipe [recipe_name]`

We include attack recipes which implement attacks from the literature. You can list attack recipes using textattack list attack-recipes.

To run an attack recipe: textattack attack --recipe [recipe_name]

TextAttack Overview

Attack Recipe Name	Goal Function	ConstraintsEnforced	Transformation	Search Method	Main Idea
Attacks on classification tasks, like sentiment classification and entailment:
`a2t`	_{Untargeted {Classification, Entailment}}	_{Percentage of words perturbed, Word embedding distance, DistilBERT sentence encoding cosine similarity, part-of-speech consistency}	_{Counter-fitted word embedding swap (or) BERT Masked Token Prediction}	_{Greedy-WIR (gradient)}	_{from (["Towards Improving Adversarial Training of NLP Models" (Yoo et al., 2021)](https://arxiv.org/abs/2109.00544))}
`alzantot`	_{Untargeted {Classification, Entailment}}	_{Percentage of words perturbed, Language Model perplexity, Word embedding distance}	_{Counter-fitted word embedding swap}	_{Genetic Algorithm}	_{from (["Generating Natural Language Adversarial Examples" (Alzantot et al., 2018)](https://arxiv.org/abs/1804.07998))}
`bae`	_{Untargeted Classification}	_{USE sentence encoding cosine similarity}	_{BERT Masked Token Prediction}	_Greedy-WIR	_{BERT masked language model transformation attack from (["BAE: BERT-based Adversarial Examples for Text Classification" (Garg & Ramakrishnan, 2019)](https://arxiv.org/abs/2004.01970)).}
`bert-attack`	_{Untargeted Classification}	_{USE sentence encoding cosine similarity, Maximum number of words perturbed}	_{BERT Masked Token Prediction (with subword expansion)}	_Greedy-WIR	_{(["BERT-ATTACK: Adversarial Attack Against BERT Using BERT" (Li et al., 2020)](https://arxiv.org/abs/2004.09984))}
`checklist`	_{{Untargeted, Targeted} Classification}	_{checklist distance}	_{contract, extend, and substitutes name entities}	_Greedy-WIR	_{Invariance testing implemented in CheckList . (["Beyond Accuracy: Behavioral Testing of NLP models with CheckList" (Ribeiro et al., 2020)](https://arxiv.org/abs/2005.04118))}
`clare`	_{Untargeted {Classification, Entailment}}	_{USE sentence encoding cosine similarity}	_{RoBERTa Masked Prediction for token swap, insert and merge}	_Greedy	_{["Contextualized Perturbation for Textual Adversarial Attack" (Li et al., 2020)](https://arxiv.org/abs/2009.07502))}
`deepwordbug`	_{{Untargeted, Targeted} Classification}	_{Levenshtein edit distance}	_{{Character Insertion, Character Deletion, Neighboring Character Swap, Character Substitution}}	_Greedy-WIR	_{Greedy replace-1 scoring and multi-transformation character-swap attack (["Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers" (Gao et al., 2018)](https://arxiv.org/abs/1801.04354)}
`faster-alzantot`	_{Untargeted {Classification, Entailment}}	_{Percentage of words perturbed, Language Model perplexity, Word embedding distance}	_{Counter-fitted word embedding swap}	_{Genetic Algorithm}	_{Modified, faster version of the Alzantot et al. genetic algorithm, from (["Certified Robustness to Adversarial Word Substitutions" (Jia et al., 2019)](https://arxiv.org/abs/1909.00986))}
`hotflip` (word swap)	_{Untargeted Classification}	_{Word Embedding Cosine Similarity, Part-of-speech match, Number of words perturbed}	_{Gradient-Based Word Swap}	_{Beam search}	_{(["HotFlip: White-Box Adversarial Examples for Text Classification" (Ebrahimi et al., 2017)](https://arxiv.org/abs/1712.06751))}
`iga`	_{Untargeted {Classification, Entailment}}	_{Percentage of words perturbed, Word embedding distance}	_{Counter-fitted word embedding swap}	_{Genetic Algorithm}	_{Improved genetic algorithm -based word substitution from (["Natural Language Adversarial Attacks and Defenses in Word Level (Wang et al., 2019)"](https://arxiv.org/abs/1909.06723)}
`input-reduction`	_{Input Reduction}		_{Word deletion}	_Greedy-WIR	_{Greedy attack with word importance ranking , Reducing the input while maintaining the prediction through word importance ranking (["Pathologies of Neural Models Make Interpretation Difficult" (Feng et al., 2018)](https://arxiv.org/pdf/1804.07781.pdf))}
`kuleshov`	_{Untargeted Classification}	_{Thought vector encoding cosine similarity, Language model similarity probability}	_{Counter-fitted word embedding swap}	_{Greedy word swap}	_{(["Adversarial Examples for Natural Language Classification Problems" (Kuleshov et al., 2018)](https://openreview.net/pdf?id=r1QZ3zbAZ))}
`pruthi`	_{Untargeted Classification}	_{Minimum word length, Maximum number of words perturbed}	_{{Neighboring Character Swap, Character Deletion, Character Insertion, Keyboard-Based Character Swap}}	_{Greedy search}	_{simulates common typos (["Combating Adversarial Misspellings with Robust Word Recognition" (Pruthi et al., 2019)](https://arxiv.org/abs/1905.11268)}
`pso`	_{Untargeted Classification}		_{HowNet Word Swap}	_{Particle Swarm Optimization}	_{(["Word-level Textual Adversarial Attacking as Combinatorial Optimization" (Zang et al., 2020)](https://www.aclweb.org/anthology/2020.acl-main.540/))}
`pwws`	_{Untargeted Classification}		_{WordNet-based synonym swap}	_{Greedy-WIR (saliency)}	_{Greedy attack with word importance ranking based on word saliency and synonym swap scores (["Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency" (Ren et al., 2019)](https://www.aclweb.org/anthology/P19-1103/))}
`textbugger` : (black-box)	_{Untargeted Classification}	_{USE sentence encoding cosine similarity}	_{{Character Insertion, Character Deletion, Neighboring Character Swap, Character Substitution}}	_Greedy-WIR	_{([(["TextBugger: Generating Adversarial Text Against Real-world Applications" (Li et al., 2018)](https://arxiv.org/abs/1812.05271)).}
`textfooler`	_{Untargeted {Classification, Entailment}}	_{Word Embedding Distance, Part-of-speech match, USE sentence encoding cosine}

Core symbols most depended-on inside this repo

from_pretrained

called by 51

textattack/models/helpers/t5_for_text_to_text.py

called by 44

textattack/models/wrappers/pytorch_model_wrapper.py

load

called by 32

textattack/shared/checkpoint.py

replace_word_at_index

called by 26

textattack/shared/attacked_text.py

strip

called by 25

textattack/models/wrappers/model_wrapper.py

augment

called by 24

textattack/augmentation/augmenter.py

delete_word_at_index

called by 18

textattack/shared/attacked_text.py

text_window_around_index

called by 16

textattack/shared/attacked_text.py

Shape

Method 897

Class 228

Function 116

Route 4

Languages

Python100%

Modules by API surface

textattack/shared/attacked_text.py40 symbols

textattack/constraints/grammaticality/language_models/google_language_model/lm_data_utils.py25 symbols

textattack/shared/word_embeddings.py24 symbols

textattack/augmentation/recipes.py24 symbols

textattack/shared/utils/strings.py21 symbols

tests/test_attacked_text.py20 symbols

textattack/goal_functions/goal_function.py19 symbols

textattack/attack.py17 symbols

textattack/trainer.py16 symbols

textattack/loggers/attack_log_manager.py16 symbols

textattack/constraints/semantics/sentence_encoders/infer_sent/infer_sent_model.py15 symbols

textattack/shared/checkpoint.py14 symbols

Dependencies from manifests, versioned

bert-score0.3.5 · 1×

datasets2.4.0 · 1×

numpy1.21.0 · 1×

pandas1.0.1 · 1×

pinyin0.4.0 · 1×

scipy1.4.1 · 1×

torch1.7.0 · 1×

transformers4.30.0 · 1×

For agents

$ claude mcp add TextAttack \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact

github.com/QData/TextAttack @v0.3.11 sqlite

TextAttack 🐙

About

Slack Channel

Why TextAttack?

Setup

Installation

Usage

Help: textattack --help

Running Attacks: textattack attack --help

Attacks and Papers Implemented ("Attack Recipes"): textattack attack --recipe [recipe_name]

Core symbols most depended-on inside this repo

Shape

Languages

Modules by API surface

Dependencies from manifests, versioned

For agents

Help: `textattack --help`

Running Attacks: `textattack attack --help`

Attacks and Papers Implemented ("Attack Recipes"): `textattack attack --recipe [recipe_name]`