Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Last update: Oct 14, 2022

Overview

About this repository

This repo contains an Pytorch implementation for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks. The code framework is based on TextBox.

Environment

python >= 3.8.11
torch >= 1.6.0

Run install.sh to install other requirements.

Dataset

The processed dataset can be downloaded from Google Drive. Once finished, unzip the datafiles (train.src, train.tgt, ...) to ./data.

An overview of dataset: train: 287113 cases, dev: 13368 cases, test: 11490 cases

Paramters

# overall settings
data_path: 'data/'
checkpoint_dir: 'saved/'
generated_text_dir: 'generated/'
# dataset settings
max_vocab_size: 50000
src_len: 400
tgt_len: 100

# model settngs
decoding_strategy: 'beam_search'
beam_size: 4
is_attention: True
is_pgen: True
is_coverage: True
cov_loss_lambda: 1.0

Log file is located in ./log, more details can be found in yamls.

Note: Distributed Data Parallel (DDP) is not supported yet.

Train & Evaluation

From scratch run `fire.py`.

if __name__ == '__main__':
    config = Config(config_dict={'test_only': False,
                                 'load_experiment': None})
    train(config)

If you want to resume from a checkpoint, just set the 'load_experiment': './saved/$model_name$.pth'. Similarly, when 'test_only' is set to True, 'load_experiment' is required.

Results

The best model is trained on a TITAN Xp GPU (8GB usage).

Training loss

Ablation study

Model	Rouge-1	Rouge-2	Rouge-L
Seq2Seq	22.17	7.20	20.97
Seq2Seq+attn	29.35	12.58	27.38
Seq2Seq+attn+pgen	36.04	15.87	32.92
Seq2Seq+attn+pgen+coverage	39.52	17.85	36.40

Note: The architecture of the Seq2Seq model is based on lstm, I hope I can replace it with transformer in the future.

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Related tags

Overview

About this repository

Environment

Dataset

Paramters

Train & Evaluation

From scratch run `fire.py`.

Results

Training loss

Ablation study

Owner

wxDai

Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

Implementation of the state of the art beat-detection, downbeat-detection and tempo-estimation model

FactSeg: Foreground Activation Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery (TGRS)

A simple rest api that classifies pneumonia infection weather it is Normal, Pneumonia Virus or Pneumonia Bacteria from a chest-x-ray image.

Self-Supervised Learning of Event-based Optical Flow with Spiking Neural Networks

Repository for Driving Style Recognition algorithms for Autonomous Vehicles

PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Numbering permanent and deciduous teeth via deep instance segmentation in panoramic X-rays

TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

Novel and high-performance medical image classification pipelines are heavily utilizing ensemble learning strategies

ML models and internal tensors 3D visualizer

A curated list of awesome Model-Based RL resources

R-package accompanying the paper "Dynamic Factor Model for Functional Time Series: Identification, Estimation, and Prediction"

YOLOX_AUDIO is an audio event detection model based on YOLOX

VoxHRNet - Whole Brain Segmentation with Full Volume Neural Network

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

Plug-n-Play Reinforcement Learning in Python with OpenAI Gym and JAX

This is an open solution to the Home Credit Default Risk challenge 🏡

Pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Related tags

Overview

About this repository

Environment

Dataset

Paramters

Train & Evaluation

From scratch run fire.py.

Results

Training loss

Ablation study

Owner

wxDai

Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.

Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild

Implementation of the state of the art beat-detection, downbeat-detection and tempo-estimation model

FactSeg: Foreground Activation Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery (TGRS)

A simple rest api that classifies pneumonia infection weather it is Normal, Pneumonia Virus or Pneumonia Bacteria from a chest-x-ray image.

Self-Supervised Learning of Event-based Optical Flow with Spiking Neural Networks

Repository for Driving Style Recognition algorithms for Autonomous Vehicles

PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Numbering permanent and deciduous teeth via deep instance segmentation in panoramic X-rays

TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video

Novel and high-performance medical image classification pipelines are heavily utilizing ensemble learning strategies

ML models and internal tensors 3D visualizer

A curated list of awesome Model-Based RL resources

R-package accompanying the paper "Dynamic Factor Model for Functional Time Series: Identification, Estimation, and Prediction"

YOLOX_AUDIO is an audio event detection model based on YOLOX

VoxHRNet - Whole Brain Segmentation with Full Volume Neural Network

Angora is a mutation-based fuzzer. The main goal of Angora is to increase branch coverage by solving path constraints without symbolic execution.

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

Plug-n-Play Reinforcement Learning in Python with OpenAI Gym and JAX

This is an open solution to the Home Credit Default Risk challenge 🏡

From scratch run `fire.py`.