Anchor

This repository has code for the paper High-Precision Model-Agnostic Explanations.

An anchor explanation is a rule that sufficiently “anchors” the prediction locally – such that changes to the rest of the feature values of the instance do not matter. In other words, for instances on which the anchor holds, the prediction is (almost) always the same.

At the moment, we support explaining individual predictions for text classifiers or classifiers that act on tables (numpy arrays of numerical or categorical data). If there is enough interest, I can include code and examples for images.

The anchor method is able to explain any black box classifier, with two or more classes. All we require is that the classifier implements a function that takes in raw text or a numpy array and outputs a prediction (integer)

Installation

The Anchor package is on pypi. Simply run:

pip install anchor-exp

Or clone the repository and run:

python setup.py install

If you want to use AnchorTextExplainer, you have to run the following:

python -m spacy download en_core_web_lg

And if you want to use BERT to perturb inputs (recommended), also install transformers:

pip install torch transformers spacy && python -m spacy download en_core_web_sm

Examples

See notebooks folder for tutorials. Note that from version 0.0.1.0, it only works on python 3.

Citation

Here is the bibtex if you want to cite this work.

Code for "High-Precision Model-Agnostic Explanations" paper

Related tags

Overview

Anchor

Installation

Examples

Citation

Owner

Marco Tulio Correia Ribeiro

Visualizer for neural network, deep learning, and machine learning models

⬛ Python Individual Conditional Expectation Plot Toolbox

Python Library for Model Interpretation/Explanations

An intuitive library to add plotting functionality to scikit-learn objects.

A data-driven approach to quantify the value of classifiers in a machine learning ensemble.

Interpretability and explainability of data and machine learning models

Quickly and easily create / train a custom DeepDream model

A Practical Debugging Tool for Training Deep Neural Networks

pytorch implementation of "Distilling a Neural Network Into a Soft Decision Tree"

Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.

FairML - is a python toolbox auditing the machine learning models for bias.

tensorboard for pytorch (and chainer, mxnet, numpy, ...)

Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)

Net2Vis automatically generates abstract visualizations for convolutional neural networks from Keras code.

Visualization Toolbox for Long Short Term Memory networks (LSTMs)

A game theoretic approach to explain the output of any machine learning model.

Lime: Explaining the predictions of any machine learning classifier

Visualizer for neural network, deep learning, and machine learning models

Implementation of linear CorEx and temporal CorEx.

👋🦊 Xplique is a Python toolkit dedicated to explainability, currently based on Tensorflow.