SAINT PyTorch implementation

Last update: Dec 25, 2022

Overview

SAINT-pytorch

A Simple pyTorch implementation of "Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing" based on https://arxiv.org/abs/2002.07033.

SAINT: Separated Self-AttentIve Neural Knowledge Tracing. SAINT has an encoder-decoder structure where exercise and response embedding sequence separately enter the encoder and the decoder respectively, which allows to stack attention layers multiple times.

SAINT model architecture

Usage

import torch
import torch.nn as nn
import torch.nn.functional as F
import numpy as np
import copy

from saint import saint, random_data

seq_len = 100
total_ex = 1200
total_cat = 234
total_in = 2

in_ex, in_cat, in_de = random_data(64, 
                                seq_len , 
                                total_ex, 
                                total_cat, 
                                total_in)


model = saint(dim_model=128,
            num_en=6,
            num_de=6,
            heads_en=8,
            heads_de=8,
            total_ex=total_ex,
            total_cat=total_cat,
            total_in=total_in )

outs = model(in_ex, in_cat, in_de)

print(outs.shape)
# torch.Size([64, 100, 1])

Parameters

dim_model: int.
Dimension of model ( embeddings, attention, linear layers).
num_en: int.
Number of encoder layers.
num_de: int.
Number of decoder layers.
heads_en: int.
Number of heads in multi-head attention block in each layer of encoder.
heads_de: int.
Number of heads in multi-head attention block in each layer of decoder.
total_ex: int.
Total number of unique excercise.
total_cat: int.
Total number of unique concept categories.
total_in: int.
Total number of unique interactions.

todo

change positional embedding to sine.

Citations

@article{choi2020towards,
  title={Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing},
  author={Choi, Youngduck and Lee, Youngnam and Cho, Junghyun and Baek, Jineon and Kim, Byungsoo and Cha, Yeongmin and Shin, Dongmin and Bae, Chan and Heo, Jaewe},
  journal={arXiv preprint arXiv:2002.07033},
  year={2020}
}

@misc{vaswani2017attention,
    title   = {Attention Is All You Need},
    author  = {Ashish Vaswani and Noam Shazeer and Niki Parmar and Jakob Uszkoreit and Llion Jones and Aidan N. Gomez and Lukasz Kaiser and Illia Polosukhin},
    year    = {2017},
    eprint  = {1706.03762},
    archivePrefix = {arXiv},
    primaryClass = {cs.CL}
}

SAINT PyTorch implementation

Related tags

Overview

SAINT-pytorch

SAINT model architecture

Usage

Parameters

todo

Citations

Owner

Arshad Shaikh

A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021

JaQuAD: Japanese Question Answering Dataset

This script just scrapes the most recent Nepali news from Kathmandu Post and notifies the user about current events at regular intervals.It sends out the most recent news at random!

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

aMLP Transformer Model for Japanese

NSFW A chatbot based on GPT2-chitchat

Sentiment-Analysis and EDA on the IMDB Movie Review Dataset

Codename generator using WordNet parts of speech database

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP. Democratize AI for everyone.

This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.

使用Mask LM预训练任务来预训练Bert模型。训练垂直领域语料的模型表征，提升下游任务的表现。

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Text Analysis & Topic Extraction on Android App user reviews

code for modular summarization work published in ACL2021 by Krishna et al

2021海华AI挑战赛·中文阅读理解·技术组·第三名

Natural Language Processing Specialization

iBOT: Image BERT Pre-Training with Online Tokenizer

Espial is an engine for automated organization and discovery of personal knowledge

SAINT PyTorch implementation

Related tags

Overview

SAINT-pytorch

SAINT model architecture

Usage

Parameters

todo

Citations

Owner

Arshad Shaikh

A PyTorch implementation of paper "Learning Shared Semantic Space for Speech-to-Text Translation", ACL (Findings) 2021

JaQuAD: Japanese Question Answering Dataset

This script just scrapes the most recent Nepali news from Kathmandu Post and notifies the user about current events at regular intervals.It sends out the most recent news at random!

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

aMLP Transformer Model for Japanese

**NSFW** A chatbot based on GPT2-chitchat

Sentiment-Analysis and EDA on the IMDB Movie Review Dataset

Codename generator using WordNet parts of speech database

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP. Democratize AI for everyone.

This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.

使用Mask LM预训练任务来预训练Bert模型。训练垂直领域语料的模型表征，提升下游任务的表现。

Code for CodeT5: a new code-aware pre-trained encoder-decoder model.

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Text Analysis & Topic Extraction on Android App user reviews

code for modular summarization work published in ACL2021 by Krishna et al

2021海华AI挑战赛·中文阅读理解·技术组·第三名

Natural Language Processing Specialization

iBOT: Image BERT Pre-Training with Online Tokenizer

Espial is an engine for automated organization and discovery of personal knowledge

NSFW A chatbot based on GPT2-chitchat