This is a library for training and applying sparse fine-tunings with torch and transformers.

Last update: Dec 30, 2022

Related tags

Overview

This is a library for training and applying sparse fine-tunings with torch and transformers. Please refer to our paper Composable Sparse Fine-Tuning for Cross Lingual Transfer for background.

Installation

First, install Python 3.9 and PyTorch >= 1.9 (earlier versions may work but haven't been tested), e.g. using conda:

conda create -n sft python=3.9
conda activate sft
conda install pytorch cudatoolkit=11.1 -c pytorch -c conda-forge

Then download and install composable-sft:

git clone https://github.com/cambridgeltl/composable-sft.git
cd composable-sft
pip install -e .

Using pre-trained SFTs

Pre-trained SFTs can be downloaded directly and applied to models as follows:

from transformers import AutoConfig, AutoModelForTokenClassification
from sft import SFT

config = AutoConfig.from_pretrained(
    'bert-base-multilingual-cased',
    num_labels=17,
)

model = AutoModelForTokenClassification.from_pretrained(
    'bert-base-multilingual-cased',
    config=config,
)

language_sft = SFT('cambridgeltl/mbert-lang-sft-bxr-small') # SFT for Buryat
task_sft = SFT('cambridgeltl/mbert-task-sft-pos') # SFT for POS tagging

# Apply SFTs to pre-trained mBERT TokenClassification model
language_sft.apply(model)
task_sft.apply(model)

For a full list of pre-trained SFTs available, see MODELS

Example Scripts

Example scripts are provided in examples/ to show how to train SFTs using LT-SFT and evaluate them.

Citation

If you use this software, please cite the following paper:

@misc{ansell2021composable,
      title={Composable Sparse Fine-Tuning for Cross-Lingual Transfer},
      author={Alan Ansell and Edoardo Maria Ponti and Anna Korhonen and Ivan Vuli\'{c}},
      year={2021},
      eprint={2110.07560},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

This is a library for training and applying sparse fine-tunings with torch and transformers.

Related tags

Overview

Installation

Using pre-trained SFTs

Example Scripts

Citation

Owner

Cambridge Language Technology Lab

Doing the asl sign language classification on static images using graph neural networks.

Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand

Script that attempts to force M1 macs into RGB mode when used with monitors that are defaulting to YPbPr.

Official PyTorch implementation of "Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks" (AAAI 2022)

Official PyTorch implementation of "Adversarial Reciprocal Points Learning for Open Set Recognition"

Unofficial implementation of "Coordinate Attention for Efficient Mobile Network Design"

Orchestrating Distributed Materials Acceleration Platform Tutorial

Official repository for "Orthogonal Projection Loss" (ICCV'21)

Score refinement for confidence-based 3D multi-object tracking

Repo for CVPR2021 paper "QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information"

Phy-Q: A Benchmark for Physical Reasoning

State of the art Semantic Sentence Embeddings

Project repo for Learning Category-Specific Mesh Reconstruction from Image Collections

pcnaDeep integrates cutting-edge detection techniques with tracking and cell cycle resolving models.

Show-attend-and-tell - TensorFlow Implementation of "Show, Attend and Tell"

[CVPR'21] Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration

Learning Continuous Signed Distance Functions for Shape Representation

基于AlphaPose的TensorRT加速

This repo is for segmentation of T2 hyp regions in gliomas.

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers