Multilingual Emotion classification using BERT (fine-tuning). Published at the WASSA workshop (ACL2022).

Last update: Sep 17, 2022

Overview

XLM-EMO: Multilingual Emotion Prediction in Social Media Text

Abstract

Detecting emotion in text allows social and computational scientists to study how people behave and react to online events. However, developing these tools for different languages requires data that is not always available. This paper collects the available emotion detection datasets across 19 languages. We train a multilingual emotion prediction model for social media data, XLM-EMO. The model shows competitive performance in a zero-shot setting, suggesting it is helpful in the context of low-resource languages. We release our model to the community so that interested researchers can directly use it.

See the paper for additional details:

Bianchi, F., Nozza, & D., Hovy. "XLM-EMO: Multilingual Emotion Prediction in Social Media Text". In Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (Forthcoming). Association for Computational Linguistics, 2022. Link.

Free software: MIT license

Installing

pip install -U xlm-emo

Important: If you want to use CUDA you need to install the correct version of the CUDA systems that matches your distribution, see PyTorch.

Features

from xlm_emo.classifier import  EmotionClassifier
ec = EmotionClassifier()

ec.predict(["senti testa di cazzo", "I am very happy"])

>> ["anger", "joy"]

Models

Model	Link	Macro F1 on Test Set
XLM-EMO-T	https://huggingface.co/MilaNLProc/xlm-emo-t	0.85
XLM-EMO-B	TBD	TBD
XLM-EMO-L	TBD	TBD

Reference

If you use this tool please cite the following paper:

@inproceedings{bianchi-etal-2022-xlmemo,
title = {{XLM-EMO}: Multilingual Emotion Prediction in Social Media Text},
author = "Bianchi, Federico and Nozza, Debora and Hovy, Dirk",
booktitle = "Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis",
year = "2022",
publisher = "Association for Computational Linguistics"
}

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

Multilingual Emotion classification using BERT (fine-tuning). Published at the WASSA workshop (ACL2022).

Related tags

Overview

XLM-EMO: Multilingual Emotion Prediction in Social Media Text

Abstract

Installing

Features

Models

Reference

Credits

Owner

MilaNLP

A simple chatbot based on chatterbot that you can use for anything has basic features

Toy example of an applied ML pipeline for me to experiment with MLOps tools.

Ελληνικά νέα (Python script) / Greek News Feed (Python script)

Outreachy TFX custom component project

A Transformer Implementation that is easy to understand and customizable.

This project is part of Eleuther AI's quest to create a massive repository of high quality text data for training language models.

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

NumPy String-Indexed is a NumPy extension that allows arrays to be indexed using descriptive string labels

Part of Speech Tagging using Hidden Markov Model (HMM) POS Tagger and Brill Tagger

A fast, efficient universal vector embedding utility package.

Python library for interactive topic model visualization. Port of the R LDAvis package.

Library for fast text representation and classification.

Official PyTorch implementation of SegFormer

Kerberoast with ACL abuse capabilities

Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

FB ID CLONER WUTHOT CHECKPOINT, FACEBOOK ID CLONE FROM FILE

Transformation spoken text to written text

multi-label，classifier，text classification，多标签文本分类，文本分类，BERT，ALBERT，multi-label-classification，seq2seq，attention，beam search

ADCS - Automatic Defect Classification System (ADCS) for SSMC