Twewy-discord-chatbot - Build a Discord AI Chatbot that Speaks like Your Favorite Character

Last update: Dec 30, 2022

Overview

Build a Discord AI Chatbot that Speaks like Your Favorite Character!

This is a Discord AI Chatbot that uses the Microsoft DialoGPT conversational model fine-tuned on the game transcript of The World Ends With You (TWEWY). Read my tutorial on freeCodeCamp or watch my video tutorial on YouTube. I've also made a JavaScript version of the tutorial using Discord.js.

I trained the model on the lines of my favorite quirky character, Joshua (left in the image below). He has about 700 lines across the entire game.
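As a rough illustration of how a character's lines can be turned into training examples, the sketch below pairs each of the character's lines with the few lines that precede it. This is one common way to prepare data for fine-tuning a conversational model, not necessarily what the notebook does. The function name `build_context_rows`, the `n_context` parameter, and the sample transcript are all made up for illustration.

```python
from typing import List, Tuple


def build_context_rows(
    transcript: List[Tuple[str, str]],
    character: str,
    n_context: int = 3,
) -> List[List[str]]:
    """Return one row per line spoken by `character`.

    Each row is [response, previous line, line before that, ...],
    so the most recent context comes first. Lines that do not have
    `n_context` predecessors are skipped.
    """
    rows = []
    for i, (speaker, line) in enumerate(transcript):
        if speaker != character or i < n_context:
            continue
        # Collect the n_context lines spoken just before this one.
        context = [text for _, text in transcript[i - n_context:i]]
        rows.append([line] + context[::-1])
    return rows


# Placeholder transcript (hypothetical, not actual game dialogue).
transcript = [
    ("A", "line 1"),
    ("B", "line 2"),
    ("A", "line 3"),
    ("Joshua", "line 4"),
    ("A", "line 5"),
    ("Joshua", "line 6"),
]
rows = build_context_rows(transcript, "Joshua", n_context=2)
```

With roughly 700 lines for the character, this kind of preparation yields at most about 700 rows, which is a small dataset for fine-tuning.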

Here is a demo of the Discord bot in action.

You can also directly chat with the model hosted on Hugging Face's Model Hub.

Structure of this Project

model_train_upload_workflow.ipynb: Notebook to be run in Google Colab to train and upload the model to Hugging Face's Model Hub
discord_bot.py: Script to be imported into a Repl.it Python Discord.py project
discord_bot.js: Script to be imported into a Repl.it JavaScript Discord.js project
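To give a sense of how a bot script might talk to a model hosted on the Model Hub, here is a minimal sketch that builds an HTTP request and parses the reply using only the Python standard library. It is not the contents of discord_bot.py. The endpoint pattern, the `{"inputs": {"text": ...}}` payload shape, and the `generated_text` response field are assumptions about Hugging Face's hosted inference service for conversational models and may not match the current service. The helper names and the fallback message are hypothetical.

```python
import json
import urllib.request

# Assumed endpoint pattern for the hosted inference service.
API_URL_TEMPLATE = "https://api-inference.huggingface.co/models/{model_id}"


def build_request(model_id: str, token: str, text: str) -> urllib.request.Request:
    """Build a POST request carrying the user's message as JSON."""
    # Assumed payload shape for a conversational model.
    payload = json.dumps({"inputs": {"text": text}}).encode("utf-8")
    return urllib.request.Request(
        API_URL_TEMPLATE.format(model_id=model_id),
        data=payload,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_reply(response_body: bytes, fallback: str = "Hmm... something is not right.") -> str:
    """Pull the generated text out of the JSON response, if present."""
    data = json.loads(response_body.decode("utf-8"))
    if isinstance(data, dict):
        return data.get("generated_text") or fallback
    return fallback


def query(model_id: str, token: str, text: str) -> str:
    """Send one message to the hosted model and return its reply."""
    with urllib.request.urlopen(build_request(model_id, token, text)) as resp:
        return extract_reply(resp.read())
```

In a Discord bot, a function like `query` would be called from the message handler with the incoming message text, and the returned string would be sent back to the channel. Keeping request building and response parsing separate from the network call makes both easy to test without a token.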

Owner

Lynn Zheng
