PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Last update: Nov 12, 2021

Overview

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Abstract

NLP applications for code-mixed (CM) or mix-lingual text have gained a significant momentum recently, the main reason being the prevalence of language mixing in social media communications in multi-lingual societies like India, Mexico, Europe, parts of USA etc. Word embeddings are basic building blocks of any NLP system today, yet, word embedding for CM languages is an unexplored territory. The major bottleneck for CM word embeddings is switching points, where the language switches. These locations lack in contextually and statistical systems fail to model this phenomena due to high variance in the seen examples. In this paper we present our initial observations on applying switching point based positional encoding techniques for CM language, specifically Hinglish (Hindi - English). Results are only marginally better than SOTA, but it is evident that positional encoding could be an effective way to train position sensitive language models for CM text.

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

@inproceedings{ali-etal-relative,
title = {PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages},
author = {Mohsin Ali and Kandukuri Sai Teja and Sumanth Manduru and Parth Patwa and Amitava Das}
booktitle =  {Proceedings of the AAAI Conference on Artificial Intelligence},
year = {2022},}

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Related tags

Overview

PESTO: Switching Point based Dynamic and Relative Positional Encoding for Code-Mixed Languages

Abstract

PESTO Architecture

Switch Point Attention

If you find this useful, please cite our paper below:

Owner

Mohsin Ali, Mohammed

Transfer Learning for Pose Estimation of Illustrated Characters

Repository for "Toward Practical Monocular Indoor Depth Estimation" (CVPR 2022)

The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth, in ICCV 2021 (oral)

Official implementation of the paper "Steganographer Detection via a Similarity Accumulation Graph Convolutional Network"

Code for AutoNL on ImageNet (CVPR2020)

The sixth place winning solution (6/220) in 2021 Gaofen Challenge.

Physics-informed Neural Operator for Learning Partial Differential Equation

Repository for the electrical and ICT benchmark model developed in the ERIGrid 2.0 project.

An efficient and effective learning to rank algorithm by mining information across ranking candidates. This repository contains the tensorflow implementation of SERank model. The code is developed based on TF-Ranking.

Topic Modelling for Humans

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

PyTorch implementation of adversarial patch

A simplified framework and utilities for PyTorch

This is the code for the paper "Contrastive Clustering" (AAAI 2021)

Machine learning, in numpy

Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"

R-Drop: Regularized Dropout for Neural Networks

Hyper-parameter optimization for sklearn