Code for layerwise detection of linguistic anomaly paper (ACL 2021)

Last update: Dec 07, 2022

Layerwise Anomaly

This repository contains the source code and data for our ACL 2021 paper: "How is BERT surprised? Layerwise detection of linguistic anomalies" by Bai Li, Zining Zhu, Guillaume Thomas, Yang Xu, and Frank Rudzicz.

Citation

If you use our work in your research, please cite:

Li, B., Zhu, Z., Thomas, G., Xu, Y., and Rudzicz, F. (2021) How is BERT surprised? Layerwise detection of linguistic anomalies. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL).

@inproceedings{li2021layerwise,
  author = "Li, Bai and Zhu, Zining and Thomas, Guillaume and Xu, Yang and Rudzicz, Frank",
  title = "How is BERT surprised? Layerwise detection of linguistic anomalies",
  booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL)",
  publisher = "Association for Computational Linguistics",
  year = "2021",
}

Dependencies

The project was developed with the following library versions. Running with other versions may crash or produce incorrect results.

Python 3.7.5
CUDA Version: 11.0
torch==1.7.1
transformers==4.5.1
numpy==1.19.0
pandas==0.25.3
scikit-learn==0.22
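
Since other versions may crash or give incorrect results, it can help to compare the installed packages against the pinned list before running anything. The following is a minimal sketch and not part of the repository: the names `PINNED`, `installed_version`, and `find_mismatches` are hypothetical, and the check assumes that a local build suffix such as `+cu110` can be ignored when comparing versions.

```python
# Pinned versions copied from the dependency list above.
PINNED = {
    "torch": "1.7.1",
    "transformers": "4.5.1",
    "numpy": "1.19.0",
    "pandas": "0.25.3",
    "scikit-learn": "0.22",
}


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        from importlib import metadata  # available from Python 3.8
    except ImportError:
        metadata = None
    if metadata is not None:
        try:
            return metadata.version(package)
        except metadata.PackageNotFoundError:
            return None
    import pkg_resources  # fallback for Python 3.7 (needs setuptools)
    try:
        return pkg_resources.get_distribution(package).version
    except pkg_resources.DistributionNotFound:
        return None


def find_mismatches(pinned, lookup=installed_version):
    """Return {package: (expected, found)} for every package that differs."""
    mismatches = {}
    for package, expected in pinned.items():
        found = lookup(package)
        # Assumption: drop a local build suffix such as "+cu110" before comparing.
        base = found.split("+")[0] if found else None
        if base != expected:
            mismatches[package] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for package, (expected, found) in find_mismatches(PINNED).items():
        print(f"{package}: expected {expected}, found {found or 'not installed'}")
```

The script prints one line per package that is missing or differs from the pinned version, and prints nothing when everything matches.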

Setup Instructions

1. Clone this repo: git clone https://github.com/SPOClab-ca/layerwise-anomaly
2. Download BNC Baby (4-million-word sample) from this link and extract it into data/bnc/
3. Run the BNC preprocessing script: python scripts/process_bnc.py --bnc_dir=data/bnc/download/Texts --to=data/bnc.pkl
4. Clone the BLiMP repo: cd data && git clone https://github.com/alexwarstadt/blimp
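
Before launching the experiments, it may be worth confirming that the setup steps produced the files the later commands expect. The sketch below is illustrative only and not part of the repository: `EXPECTED_PATHS` and `missing_paths` are hypothetical names, and the paths are the ones that appear in the commands in this README, taken relative to the repository root.

```python
from pathlib import Path

# Paths that appear in this README's commands, relative to the repository root.
EXPECTED_PATHS = [
    "data/bnc/download/Texts",  # extracted BNC Baby texts
    "data/bnc.pkl",             # output of scripts/process_bnc.py
    "data/blimp/data",          # data directory of the cloned BLiMP repo
]


def missing_paths(root, expected=EXPECTED_PATHS):
    """Return the expected paths that do not exist under root."""
    root = Path(root)
    return [path for path in expected if not (root / path).exists()]


if __name__ == "__main__":
    for path in missing_paths("."):
        print(f"missing: {path}")
```

Run from the repository root, this prints each path that is still missing and prints nothing once setup is complete.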

GMM experiments on BLiMP (Figure 2 and Appendix A)

PYTHONPATH=. time python scripts/blimp_anomaly.py \
  --bnc_path=data/bnc.pkl \
  --blimp_path=data/blimp/data/ \
  --out=blimp_result
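
For readers unfamiliar with the general idea behind GMM-based anomaly scoring, the sketch below fits a Gaussian mixture to a set of vectors and scores new vectors by negative log-likelihood. It is a toy illustration under stated assumptions, not the code in scripts/blimp_anomaly.py: random vectors stand in for model hidden states, and `fit_density` and `surprisal` are hypothetical helper names.

```python
import numpy as np
from sklearn.mixture import GaussianMixture


def fit_density(train_vectors, n_components=1, seed=0):
    """Fit a Gaussian mixture to the training vectors."""
    gmm = GaussianMixture(
        n_components=n_components,
        covariance_type="full",
        random_state=seed,
    )
    gmm.fit(train_vectors)
    return gmm


def surprisal(gmm, vectors):
    """Negative log-likelihood of each vector; higher means more anomalous."""
    return -gmm.score_samples(vectors)


# Assumption: random 8-dimensional vectors stand in for hidden states.
rng = np.random.default_rng(0)
train = rng.normal(size=(500, 8))
gmm = fit_density(train)

typical = np.zeros((1, 8))        # near the centre of the training data
unusual = np.full((1, 8), 6.0)    # far from anything seen in training

typical_score = surprisal(gmm, typical)[0]
unusual_score = surprisal(gmm, unusual)[0]
print(f"typical: {typical_score:.2f}, unusual: {unusual_score:.2f}")
```

The vector far from the training data receives the higher score, which is the property an anomaly detector of this kind relies on.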

Frequency correlation (Figure 3 and Appendix B)

Run the notebooks/FreqSurprisal.ipynb notebook.

Surprisal gap experiments (Figure 4)

PYTHONPATH=. time python scripts/run_surprisal_gaps.py \
  --bnc_path=data/bnc.pkl \
  --out=surprisal_gaps

Accuracy scores (Table 2)

PYTHONPATH=. time python scripts/run_accuracy.py \
  --model_name=roberta-base \
  --anomaly_model=gmm

Run unit tests

PYTHONPATH=. pytest tests
