A Structured Self-attentive Sentence Embedding

Last update: Nov 28, 2022

Overview

Structured Self-attentive sentence embeddings

Implementation for the paper A Structured Self-Attentive Sentence Embedding, which was published in ICLR 2017: https://arxiv.org/abs/1703.03130 .

USAGE:

For binary sentiment classification on imdb dataset run : python classification.py "binary"

For multiclass classification on reuters dataset run : python classification.py "multiclass"

You can change the model parameters in the model_params.json file Other tranining parameters like number of attention hops etc can be configured in the config.json file.

If you want to use pretrained glove embeddings , set the use_embeddings parameter to "True" ,default is set to False. Do not forget to download the glove.6B.50d.txt and place it in the glove folder.

Implemented:

Classification using self attention
Regularization using Frobenius norm
Gradient clipping
Visualizing the attention weights

Instead of pruning ,used averaging over the sentence embeddings.

Visualization:

After training, the model is tested on 100 test points. Attention weights for the 100 test data are retrieved and used to visualize over the text using heatmaps. A file visualization.html gets saved in the visualization/ folder after successful training. The visualization code was provided by Zhouhan Lin (@hantek). Many thanks.

Below is a shot of the visualization on few datapoints.

Training accuracy 93.4% Tested on 1000 points with 90.2% accuracy

A Structured Self-attentive Sentence Embedding

Related tags

Overview

Structured Self-attentive sentence embeddings

USAGE:

Implemented:

Visualization:

Owner

Kaushal Shetty

Source code for Acorn, the precision farming rover by Twisted Fields

Pytorch and Torch testing code of CartoonGAN

The pytorch implementation of DG-Font: Deformable Generative Networks for Unsupervised Font Generation

A package for music online and offline rhythmic information analysis including music Beat, downbeat, tempo and meter tracking.

[NeurIPS 2021] "Delayed Propagation Transformer: A Universal Computation Engine towards Practical Control in Cyber-Physical Systems"

🤖 A Python library for learning and evaluating knowledge graph embeddings

🚀 An end-to-end ML applications using PyTorch, W&B, FastAPI, Docker, Streamlit and Heroku

nfelo: a power ranking, prediction, and betting model for the NFL

Code for Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid

Inkscape extensions for figure resizing and editing

A python implementation of Deep-Image-Analogy based on pytorch.

Official PyTorch implementation of N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras (ICCV 2021)

Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines

The official implementation code of "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction."

Transformer Tracking (CVPR2021)

PyTorch implementation of the REMIND method from our ECCV-2020 paper "REMIND Your Neural Network to Prevent Catastrophic Forgetting"

PyTorch implementation of TSception V2 using DEAP dataset

Full Stack Deep Learning Labs

3D2Unet: 3D Deformable Unet for Low-Light Video Enhancement (PRCV2021)

Deep Reinforcement Learning for Multiplayer Online Battle Arena