A repository of materials for the CS-332 NLP tutorials

Last update: Feb 15, 2022

Overview

CS-332-NLP

Tutorial 1:
- Introduction
- Corpus
- Regular expressions
- Tokenization

Tutorial 2:
- Normalization
- Parsing
- Morphemes
- Stemming
- Lemmatization
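
As a rough illustration of how some of the listed topics fit together (regular expressions, tokenization, normalization, stemming), here is a minimal sketch using only the Python standard library. It is not taken from the tutorial materials: the names `TOKEN_PATTERN`, `tokenize`, `normalize`, and `naive_stem` are hypothetical, and the suffix-stripping rule is a toy stand-in for a real stemmer such as Porter's.

```python
import re

# Hypothetical pattern: a word (optionally with one internal apostrophe part)
# or a single non-space, non-word character such as punctuation.
TOKEN_PATTERN = re.compile(r"\w+(?:'\w+)?|[^\w\s]")


def tokenize(text):
    """Split text into word tokens and punctuation tokens using a regex."""
    return TOKEN_PATTERN.findall(text)


def normalize(tokens):
    """Lowercase tokens and drop tokens that contain no word characters."""
    return [t.lower() for t in tokens if re.search(r"\w", t)]


def naive_stem(token):
    """Strip at most one common English suffix, keeping at least 3 characters.

    This is an assumption-laden toy rule, not a linguistically sound stemmer.
    """
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


tokens = tokenize("The cats weren't chasing mice.")
stems = [naive_stem(t) for t in normalize(tokens)]
print(tokens)
print(stems)
```

Note that the toy stemmer leaves the irregular plural "mice" untouched and reduces "chasing" to a non-word stem. Handling such cases properly is where lemmatization, which maps a word to its dictionary form, differs from stemming.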

Acknowledgements

Jurafsky, D., & Martin, J. H. Speech and Language Processing (2nd and 3rd editions).
Marcus, M. P., Santorini, B., & Marcinkiewicz, M. A. (1994). Building a large annotated corpus of English: The Penn Treebank. Using Large Corpora, 273.
http://su.diva-portal.org/smash/record.jsf?pid=diva2%3A686162&dswid=9114

Owner

Alok Singh
