Measuring if attention is explanation with ROAR

Last update: Nov 13, 2022

Related tags

Overview

NLP ROAR Interpretability

Official code for: Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining

Install

git clone https://github.com/AndreasMadsen/nlp-roar-interpretability.git
cd nlp-roar-interpretability
python -m pip install -e .

Experiments

Tasks

There are scripts for each dataset. Note that some tasks share a dataset. Use this list to identify how to train a model for each task.

SST: python experiments/stanford_sentiment.py
SNLI: python experiments/stanford_nli.py
IMDB: python experiments/imdb.py
MIMIC (Diabetes): python experiments/mimic.py --subset diabetes
MIMIC (Anemia): python experiments/mimic.py --subset anemia
bABI-1: python experiments/babi.py --task 1
bABI-2: python experiments/babi.py --task 2
bABI-3: python experiments/babi.py --task 3

Parameters

Each of the above scripts stanford_sentiment, stanford_nli, imdb, mimic, and babi take the same set of CLI arguments. You can learn about each argument with --help. The most important arguments which will allow you to run the experiments presented in the paper are:

--importance-measure: this specifies which importance measure is used. It can be either random, mutual-information, attention , gradient, or integrated-gradient.
--seed: specifies the seed used to initialize the model.
--roar-strategy: should ROAR masking be done absoloute (count) or relative (quantile),
--k: the proportion of tokens in % to mask if --roar-strategy quantile is used. The number of tokens if --roar-strategy count is used.
--recursive: indicates that model to use for computing the importance measure has --k set to --k - --recursive-step-size instead of 0 as used in classic ROAR.

Note, for --k > 0, the reference model must already be trained. For example, in the non-recursive case, this means that a model trained with --k 0 must already available.

Running on a HPC setup

For downloading dataset dependencies we provide a download.sh script.

Additionally, we provide script for submitting all jobs to a Slurm queue, in batch_jobs/. Note again, that the ROAR script assume there are checkpoints for the baseline --k 0 models.

The jobs automatically use $SCRATCH/nlproar as the presistent dir.

MIMIC

See https://mimic.physionet.org/gettingstarted/access/ for how to access MIMIC. You will need to download DIAGNOSES_ICD.csv.gz and NOTEEVENTS.csv.gz and place them in mimic/ relative to your presistent dir.

Measuring if attention is explanation with ROAR

Related tags

Overview

NLP ROAR Interpretability

Install

Experiments

Tasks

Parameters

Running on a HPC setup

MIMIC

Owner

Andreas Madsen

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

[CVPR 2022] Deep Equilibrium Optical Flow Estimation

Pixray is an image generation system

BigbrotherBENL - Face recognition on the Big Brother episodes in Belgium and the Netherlands.

HMLLDB is a collection of LLDB commands to assist in the debugging of iOS apps.

Road Crack Detection Using Deep Learning Methods

A smaller subset of 10 easily classified classes from Imagenet, and a little more French

Python script that analyses the given datasets and comes up with the best polynomial regression representation with the smallest polynomial degree possible

Neuron class provides LNU (Linear Neural Unit), QNU (Quadratic Neural Unit), RBF (Radial Basis Function), MLP (Multi Layer Perceptron), MLP-ELM (Multi Layer Perceptron - Extreme Learning Machine) neurons learned with Gradient descent or LeLevenberg–Marquardt algorithm

Implementations for the ICLR-2021 paper: SEED: Self-supervised Distillation For Visual Representation.

HMLET (Hybrid-Method-of-Linear-and-non-linEar-collaborative-filTering-method)

Official code for paper "ISNet: Costless and Implicit Image Segmentation for Deep Classifiers, with Application in COVID-19 Detection"

Official PaddlePaddle implementation of Paint Transformer

GLANet - The code for Global and Local Alignment Networks for Unpaired Image-to-Image Translation arxiv

A python implementation of Deep-Image-Analogy based on pytorch.

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Code in conjunction with the publication 'Contrastive Representation Learning for Hand Shape Estimation'

PyTorch implementation for Graph Contrastive Learning with Augmentations

Multi-Scale Geometric Consistency Guided Multi-View Stereo

PyExplainer: A Local Rule-Based Model-Agnostic Technique (Explainable AI)