Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

Last update: Aug 02, 2021


Overview

Wietse de Vries • Martijn Bartelds • Malvina Nissim • Martijn Wieling

Adapting Monolingual Models: Data can be Scarce when Language Similarity is High

This repository contains everything that is needed to replicate the results in the paper:

📝 Adapting Monolingual Models: Data can be Scarce when Language Similarity is High

Models

The best fine-tuned models for Gronings and West Frisian are available on the HuggingFace model hub:

Lexical layers

These models are identical to BERTje, but with different lexical layers (bert.embeddings.word_embeddings).

🤗 GroNLP/bert-base-dutch-cased (Dutch; source language)
🤗 GroNLP/bert-base-dutch-cased-gronings (Gronings)
🤗 GroNLP/bert-base-dutch-cased-frisian (West Frisian)
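
As a rough illustration, the models listed above could be loaded with the `transformers` library as sketched below. The hub IDs are the ones listed above; the short language keys and the helper names `load_lexical_model` and `embedding_shape` are hypothetical and not part of this repository. The sketch assumes `transformers` and PyTorch are installed and that the model hub is reachable.

```python
# Hub IDs copied from the list above. The short language keys are
# made up for this sketch and are not used by the repository itself.
LEXICAL_MODELS = {
    "dutch": "GroNLP/bert-base-dutch-cased",
    "gronings": "GroNLP/bert-base-dutch-cased-gronings",
    "frisian": "GroNLP/bert-base-dutch-cased-frisian",
}


def load_lexical_model(language):
    """Download the tokenizer and encoder for one language (needs network access)."""
    # Imported lazily so the mapping above can be used without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    model_id = LEXICAL_MODELS[language]
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    return tokenizer, model


def embedding_shape(model):
    """Return (vocabulary size, hidden size) of the model's lexical layer."""
    # get_input_embeddings() returns the word embedding module of the encoder.
    return tuple(model.get_input_embeddings().weight.shape)
```

Calling `load_lexical_model("gronings")` and passing the returned model to `embedding_shape` is one way to inspect the lexical layer of each variant without depending on the exact attribute path.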

POS tagging

These models share the same fine-tuned Transformer layers and classification head. Each one uses the retrained lexical layer of the corresponding model above.

🤗 GroNLP/bert-base-dutch-cased-upos-alpino (Dutch)
🤗 GroNLP/bert-base-dutch-cased-upos-alpino-gronings (Gronings)
🤗 GroNLP/bert-base-dutch-cased-upos-alpino-frisian (West Frisian)
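
A minimal sketch of how one of these taggers might be used through the `transformers` token-classification pipeline follows. The hub IDs come from the list above; the language keys and the helpers `build_tagger` and `to_pairs` are hypothetical. It assumes that each hub repository ships a matching tokenizer and that `transformers` and PyTorch are installed. Without an aggregation strategy the pipeline returns one prediction per wordpiece, so a word split into several pieces yields several entries.

```python
# Hub IDs copied from the list above. The language keys are made up for this sketch.
POS_MODELS = {
    "dutch": "GroNLP/bert-base-dutch-cased-upos-alpino",
    "gronings": "GroNLP/bert-base-dutch-cased-upos-alpino-gronings",
    "frisian": "GroNLP/bert-base-dutch-cased-upos-alpino-frisian",
}


def build_tagger(language):
    """Create a token-classification pipeline for one language (needs network access)."""
    # Imported lazily so the helpers below work without transformers installed.
    from transformers import pipeline

    return pipeline("token-classification", model=POS_MODELS[language])


def to_pairs(predictions):
    """Reduce raw pipeline output to (wordpiece, tag) pairs."""
    # Without aggregation, each prediction is a dict with "word" and "entity" keys.
    return [(p["word"], p["entity"]) for p in predictions]
```

With network access, `to_pairs(build_tagger("frisian")(sentence))` would give the predicted tag for each wordpiece of `sentence`.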

Development

Conda/mamba dependencies are listed in environment.yml. This repository contains all scripts and configs that are needed to replicate the results in the paper. A more extensive usage guide will be provided later.

BibTeX entry

The paper is to appear in Findings of ACL 2021. The preprint can be cited as:

@misc{devries2021adapting,
      title={{Adapting Monolingual Models: Data can be Scarce when Language Similarity is High}}, 
      author={Wietse de Vries and Martijn Bartelds and Malvina Nissim and Martijn Wieling},
      year={2021},
      eprint={2105.02855},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}


