Adversarial Adaptation with Distillation for BERT Unsupervised Domain Adaptation

Last update: Nov 30, 2022

Overview

Knowledge Distillation for BERT Unsupervised Domain Adaptation

Official PyTorch implementation | Paper: arXiv:2010.11478

Abstract

A pre-trained language model, BERT, has brought significant performance improvements across a range of natural language processing tasks. Since the model is trained on a large corpus of diverse topics, it shows robust performance for domain shift problems in which data distributions at training (source data) and testing (target data) differ while sharing similarities. Despite its great improvements compared to previous models, it still suffers from performance degradation due to domain shifts. To mitigate such problems, we propose a simple but effective unsupervised domain adaptation method, adversarial adaptation with distillation (AAD), which combines the adversarial discriminative domain adaptation (ADDA) framework with knowledge distillation. We evaluate our approach in the task of cross-domain sentiment classification on 30 domain pairs, advancing the state-of-the-art performance for unsupervised domain adaptation in text sentiment classification.
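The knowledge distillation half of AAD can be illustrated with a minimal sketch. The abstract does not spell out the loss, so the code below assumes the generic temperature-scaled distillation loss (KL divergence between softened teacher and student distributions, scaled by the squared temperature). The function names `log_softmax` and `distillation_loss` and the default temperature of 2.0 are illustrative assumptions, not taken from the repository, and the adversarial (ADDA) part is not shown.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_total = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_total for s in scaled]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Assumption: this is the generic distillation loss, used here only
    to illustrate the idea; the repository's exact loss may differ.
    """
    if len(teacher_logits) != len(student_logits):
        raise ValueError("teacher and student must have the same number of classes")
    log_p = log_softmax(teacher_logits, temperature)  # teacher (fixed)
    log_q = log_softmax(student_logits, temperature)  # student (being trained)
    kl = sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    return temperature ** 2 * kl


if __name__ == "__main__":
    # Hypothetical two-class sentiment logits for a single review.
    teacher = [2.0, -1.0]
    print(distillation_loss(teacher, [2.0, -1.0]))  # identical logits give zero loss
    print(distillation_loss(teacher, [-1.0, 2.0]))  # disagreement gives a positive loss
```

The loss is zero when the student reproduces the teacher's logits and grows as their predictions diverge, so minimizing it keeps the adapted model close to the model trained on source data.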

Requirements

pandas
pytorch
transformers

Run the test

$ python main.py --pretrain --adapt --src books --tgt dvd

How to cite

@article{ryu2020knowledge,
  title={Knowledge Distillation for BERT Unsupervised Domain Adaptation},
  author={Ryu, Minho and Lee, Kichun},
  journal={arXiv preprint arXiv:2010.11478},
  year={2020}
}

Owner

Minho Ryu
