TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Last update: Feb 07, 2022

Related tags

Overview

TFPNER

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Named entity recognition (NER), which aims at identifying real-world entity mentions from texts, is a fundamental task in natural language processing with a wide range of applications. Previous approaches mainly focus on the original pure sentence but the Part of speech (POS) contains rich semantic information and contribute to the success of the Natural Language Processing task. To further improve the performance of the NER task, we proposed the five methods that employed POS tags fused with the original tokens based on the BERT model to achieve the NER task, including concatenating token and POS as one or two sentences, adding POS embedding as one of the embedding elements, model ensemble, and conduct the multi-attention between the token representations and POS representations. In this work, we addressed the CoNLL-2003 and Groningen Meaning Bank (GMB) datasets which can provide both NER tags and POS tags. From our experiments on two datasets, part of the proposed methods can show performance improvement in comparison with the baseline methods.

This is the project I worked with Haoqing Tang, the extraordinary computer scientist in CV & NLP area, during the interesting and memorable Master study period.

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Related tags

Overview

TFPNER

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

This is the project I worked with Haoqing Tang, the extraordinary computer scientist in CV & NLP area, during the interesting and memorable Master study period.

Owner

2021 2학기 데이터크롤링 기말프로젝트

txtai: Build AI-powered semantic search applications in Go

Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

Code Implementation of "Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction".

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Findings of ACL 2021

Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

MRC approach for Aspect-based Sentiment Analysis (ABSA)

LewusBot - Twitch ChatBot built in python with twitchio library

This repo is to provide a list of literature regarding Deep Learning on Graphs for NLP

A demo of chinese asr

This repository contains the code for "Generating Datasets with Pretrained Language Models".

ProteinBERT is a universal protein language model pretrained on ~106M proteins from the UniRef90 dataset.

End-to-end image captioning with EfficientNet-b3 + LSTM with Attention

Get list of common stop words in various languages in Python

Multilingual word vectors in 78 languages

超轻量级bert的pytorch版本，大量中文注释，容易修改结构，持续更新

基于“Seq2Seq+前缀树”的知识图谱问答

Amazon Multilingual Counterfactual Dataset (AMCD)

A simple Streamlit App to classify swahili news into different categories.