This repository contains the code for TACL2021 paper: SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Last update: Jan 03, 2023

Related tags

Deep Learning summac

Overview

SummaC: Summary Consistency Detection

This repository contains the code for TACL2021 paper: SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

We release: (1) the trained SummaC models, (2) the SummaC Benchmark and data loaders, (3) training and evaluation scripts.

Trained SummaC Models

The two trained models SummaC-ZS and SummaC-Conv are implemented in model_summac.py (link):

SummaC-ZS does not require a model file (as the model is zero-shot and not trained): it can be used as seen at the bottom of the model_summac.py.
SummaC-Conv requires a start_file which contains the trained weight for the convolution layer. The default start_file used to compute results is available in this repository ( summac_conv_vitc_sent_perc_e.bin download link).

Example use

from model_summac import SummaCZS

model = SummaCZS(granularity="sentence", model_name="vitc")

document = """Scientists are studying Mars to learn about the Red Planet and find landing sites for future missions.
One possible site, known as Arcadia Planitia, is covered instrange sinuous features.
The shapes could be signs that the area is actually made of glaciers, which are large masses of slow-moving ice.
Arcadia Planitia is in Mars' northern lowlands."""

summary1 = "There are strange shape patterns on Arcadia Planitia. The shapes could indicate the area might be made of glaciers. This makes Arcadia Planitia ideal for future missions."
summary2 = "There are strange shape patterns on Arcadia Planitia. The shapes could indicate the area might be made of glaciers."

score1 = model.score([document], [summary1])
print("Summary Score 1 consistency: %.3f" % (score1["scores"][0])) # Prints: 0.587

score2 = model.score([document], [summary2])
print("Summary Score 2 consistency: %.3f" % (score2["scores"][0])) # Prints: 0.877

To load all the necessary files: (1) clone this repository, (2) add the reposity to Python path: export PYTHONPATH="${PYTHONPATH}:/path/to/summac/"

SummaC Benchmark

The SummaC Benchmark consists of 6 summary consistency datasets that have been standardized to a binary classification task. The datasets included are:

% Positive is the percentage of positive (consistent) summaries. IAA is the inter-annotator agreement (Fleiss Kappa). Source is the dataset used for the source documents (CNN/DM or XSum). # Summarizers is the number of summarizers (extractive and abstractive) included in the dataset. # Sublabel is the number of labels in the typology used to label summary errors.

The data-loaders for the benchmark are included in utils_summac_benchmark.py (link). Because the dataset relies on previously published work, the dataset requires the manual download of several datasets. For each of the 6 tasks, the link and instruction to download are present as a comment in the file. Once all the files have been compiled, the benchmark can be loaded and standardized by running:

from utils_summac_benchmark import SummaCBenchmark
benchmark_validation = SummaCBenchmark(benchmark_folder="/path/to/summac_benchmark/", cut="val")

Note: we have a plan to streamline the process by further improving to automatically download necessary files if not present, if you would like to participate please let us know. If encoutering an issue in the manual download process, please contact us.

Cite the work

If you make use of the code, models, or algorithm, please cite our paper. Bibtex to come.

Contributing

If you'd like to contribute, or have questions or suggestions, you can contact us at [email protected]. All contributions welcome, for example helping make the benchmark more easily downloadable, or improving model performance on the benchmark.

This repository contains the code for TACL2021 paper: SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Related tags

Overview

SummaC: Summary Consistency Detection

Trained SummaC Models

Example use

SummaC Benchmark

Cite the work

Contributing

Owner

Philippe Laban

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

Multi-Anchor Active Domain Adaptation for Semantic Segmentation (ICCV 2021 Oral)

The official code of Anisotropic Stroke Control for Multiple Artists Style Transfer

Pocsploit is a lightweight, flexible and novel open source poc verification framework

Deep Learning Theory

验证码识别深度学习 tensorflow 神经网络

CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution

Implementation of ML models like Decision tree, Naive Bayes, Logistic Regression and many other

Keras-retinanet - Keras implementation of RetinaNet object detection.

Prototypical python implementation of the trust-region algorithm presented in Sequential Linearization Method for Bound-Constrained Mathematical Programs with Complementarity Constraints by Larson, Leyffer, Kirches, and Manns.

Unofficial implementation of Perceiver IO: A General Architecture for Structured Inputs & Outputs

Implementation of Neonatal Seizure Detection using EEG signals for deploying on edge devices including Raspberry Pi.

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

HINet: Half Instance Normalization Network for Image Restoration

YOLOX-CondInst - Implement CondInst which is a instances segmentation method on YOLOX

PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)

Accurate identification of bacteriophages from metagenomic data using Transformer

Simulation of Self Driving Car

Self-Supervised depth kalilia

A object detecting neural network powered by the yolo architecture and leveraging the PyTorch framework and associated libraries.

This repository contains the code for TACL2021 paper: SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Related tags

Overview

SummaC: Summary Consistency Detection

Trained SummaC Models

Example use

SummaC Benchmark

Cite the work

Contributing

Owner

Philippe Laban

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

Multi-Anchor Active Domain Adaptation for Semantic Segmentation (ICCV 2021 Oral)

The official code of Anisotropic Stroke Control for Multiple Artists Style Transfer

Pocsploit is a lightweight, flexible and novel open source poc verification framework

Deep Learning Theory

验证码识别 深度学习 tensorflow 神经网络

CondLaneNet: a Top-to-down Lane Detection Framework Based on Conditional Convolution

Implementation of ML models like Decision tree, Naive Bayes, Logistic Regression and many other

Keras-retinanet - Keras implementation of RetinaNet object detection.

Prototypical python implementation of the trust-region algorithm presented in Sequential Linearization Method for Bound-Constrained Mathematical Programs with Complementarity Constraints by Larson, Leyffer, Kirches, and Manns.

Unofficial implementation of Perceiver IO: A General Architecture for Structured Inputs & Outputs

Implementation of Neonatal Seizure Detection using EEG signals for deploying on edge devices including Raspberry Pi.

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

HINet: Half Instance Normalization Network for Image Restoration

YOLOX-CondInst - Implement CondInst which is a instances segmentation method on YOLOX

PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)

Accurate identification of bacteriophages from metagenomic data using Transformer

Simulation of Self Driving Car

Self-Supervised depth kalilia

A object detecting neural network powered by the yolo architecture and leveraging the PyTorch framework and associated libraries.

验证码识别深度学习 tensorflow 神经网络