Natural Language Processing at EDHEC, 2022

Last update: Feb 04, 2022

Related tags

Overview

Natural Language Processing

Here you will find the teaching materials for the "Natural Language Processing" course at EDHEC Business School, 2022

What is the course about?

The course is designed as an introduction to the basics of natural language processing for analyzing unstructured, user-generated content. It is for beginners to the topic (and NLP in general), but it will be helpful to have basic knowledge of Python and a familarity with data science techniques.

Topics covered include:

text preprocessing in Python,
collecting your own data from Twitter and Reddit,
content analysis,
text embeddings, and
supervised learning with text data.

What materials are available here?

The sildes will be posted on the course BlackBoard page. They mostly serve as a high-level introduction to the examples and exercies (in Colab notebooks), which are linked to from the slides themselves. Copies of the Colab notebooks can also be found in the folder called /colab in this repository.

Can I work through the material on my own?

If you didn't attend the class, you can certainly work through the materials on your own (the Colab notebooks are designed to be readable and doable for individuals working at their own pace). The slides posted on BlackBoard will guide you through the content. The notebooks are intendend to be worked through in order. Each one will have examples to view and 1 or 2 practice exercises to complete.

Aknowledgements

I would like to aknowledge Steve Wilson at Oakland University for making his DS3 workshop materials publically available with an MIT license.

Natural Language Processing at EDHEC, 2022

Related tags

Overview

Natural Language Processing

What is the course about?

What materials are available here?

Can I work through the material on my own?

Aknowledgements

Owner

NSFW A chatbot based on GPT2-chitchat

Задания КЕГЭ по информатике 2021 на Python

This is an incredibly powerful calculator that is capable of many useful day-to-day functions.

A python gui program to generate reddit text to speech videos from the id of any post.

Translates basic English sentences into the Huna language (hoo-NAH)

A Python wrapper for simple offline real-time dictation (speech-to-text) and speaker-recognition using Vosk.

BERN2: an advanced neural biomedical namedentity recognition and normalization tool

Various Algorithms for Short Text Mining

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.

A Structured Self-attentive Sentence Embedding

Include MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion

[ICLR'19] Trellis Networks for Sequence Modeling

Easy to start. Use deep nerual network to predict the sentiment of movie review.

Convolutional 2D Knowledge Graph Embeddings resources

Part of Speech Tagging using Hidden Markov Model (HMM) POS Tagger and Brill Tagger

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Predict the spans of toxic posts that were responsible for the toxic label of the posts

Speech to text streamlit app

Natural Language Processing at EDHEC, 2022

Related tags

Overview

Natural Language Processing

What is the course about?

What materials are available here?

Can I work through the material on my own?

Aknowledgements

Owner

**NSFW** A chatbot based on GPT2-chitchat

Задания КЕГЭ по информатике 2021 на Python

This is an incredibly powerful calculator that is capable of many useful day-to-day functions.

A python gui program to generate reddit text to speech videos from the id of any post.

Translates basic English sentences into the Huna language (hoo-NAH)

A Python wrapper for simple offline real-time dictation (speech-to-text) and speaker-recognition using Vosk.

BERN2: an advanced neural biomedical namedentity recognition and normalization tool

Various Algorithms for Short Text Mining

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.

A Structured Self-attentive Sentence Embedding

Include MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion

[ICLR'19] Trellis Networks for Sequence Modeling

Easy to start. Use deep nerual network to predict the sentiment of movie review.

Convolutional 2D Knowledge Graph Embeddings resources

Part of Speech Tagging using Hidden Markov Model (HMM) POS Tagger and Brill Tagger

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Predict the spans of toxic posts that were responsible for the toxic label of the posts

Speech to text streamlit app

NSFW A chatbot based on GPT2-chitchat