The official implementation of Variable-Length Piano Infilling (VLI).

Last update: Sep 01, 2022

Overview

Variable-Length-Piano-Infilling

Paper: Variable-Length Music Score Infilling via XLNet and Musically Specialized Positional Encoding

VLI is a Transformer-based model for music score infilling: it generates a polyphonic music sequence that fills the gap between given past and future contexts. The model can infill a variable number of notes over time spans of different lengths.
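
To make the task concrete, here is a minimal sketch of how an infilling example can be framed: a token sequence is split into a past context, a gap to be filled, and a future context. The token names (`Bar`, `Pitch_60`, ...) and the helper functions are hypothetical illustrations, not the vocabulary or API of this repository, and no model is involved; the `generated` list merely stands in for model output.

```python
from typing import List, Tuple


def split_for_infilling(
    tokens: List[str], gap_start: int, gap_end: int
) -> Tuple[List[str], List[str], List[str]]:
    """Split a token sequence into (past, target, future) around a gap.

    Hypothetical helper for illustration only.
    """
    if not 0 <= gap_start <= gap_end <= len(tokens):
        raise ValueError("gap indices must satisfy 0 <= start <= end <= len(tokens)")
    past = tokens[:gap_start]
    target = tokens[gap_start:gap_end]
    future = tokens[gap_end:]
    return past, target, future


def assemble(past: List[str], infill: List[str], future: List[str]) -> List[str]:
    """Join the contexts with an infilled middle segment of any length."""
    return past + infill + future


# Hypothetical event tokens; the real vocabulary may look different.
tokens = ["Bar", "Pitch_60", "Pitch_64", "Bar", "Pitch_62", "Bar", "Pitch_67", "Pitch_72"]
past, target, future = split_for_infilling(tokens, gap_start=3, gap_end=5)

# Stand-in for model output: it need not have the same length as the original gap.
generated = ["Bar", "Pitch_65", "Pitch_69", "Pitch_71"]
result = assemble(past, generated, future)
```

The point of the sketch is that the infilled segment is free to differ in length from the removed one, while the past and future contexts are kept unchanged.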

Installation

Clone and install the modified Hugging Face Transformers package.
Clone this repo and install the required packages.

git clone https://github.com/reichang182/Variable-Length-Piano-Infilling.git
cd Variable-Length-Piano-Infilling
pip install -r requirement.txt

Download and unzip the AIlabs.tw Pop1K7 dataset. (Download link: here).

Training & Testing

# Prepare data
python prepare_data.py \
	--midi-folder datasets/midi/midi_synchronized/ \
	--save-folder ./

# Train the model
python train.py --train

# Test the trained model
python train.py
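
The baseline commands in this README read `dictionary.pickle` and `worded_data.pickle` from the repository root, which suggests (as an assumption, not something stated here) that the data preparation step writes these files to the save folder. Their internal structure is not documented in this README, so the sketch below only reports the top-level type and length of each file as a quick sanity check before training. The helper name `describe_pickle` is hypothetical.

```python
import pickle
from pathlib import Path


def describe_pickle(path):
    """Return (type name, length or None) of the object stored in a pickle file.

    Only unpickle files you created yourself; loading an untrusted pickle
    can execute arbitrary code.
    """
    with open(path, "rb") as f:
        obj = pickle.load(f)
    size = len(obj) if hasattr(obj, "__len__") else None
    return type(obj).__name__, size


if __name__ == "__main__":
    # File names taken from the baseline commands; their location is an assumption.
    for name in ("dictionary.pickle", "worded_data.pickle"):
        p = Path(name)
        if p.exists():
            print(name, describe_pickle(p))
        else:
            print(name, "not found; run prepare_data.py first")
```

Run it from the folder passed as `--save-folder`. If either file is reported as missing, the data preparation step has not produced it in that location.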

Baselines

The code for the baselines in our paper is in the baselines folder. We implemented ILM and FELIX following their respective papers, building on the Transformer-XL and BERT implementations in Hugging Face Transformers. They are trained and tested with the same commands as our model above, with the dictionary and data files passed explicitly.

# cd baselines/ILM or cd baselines/FELIX

# Train the model
python train.py --train \
	--dict-file ../../dictionary.pickle \
	--data-file ../../worded_data.pickle

# Test the trained model
python train.py \
	--dict-file ../../dictionary.pickle \
	--data-file ../../worded_data.pickle

Architecture

Results

The training NLL-loss curves of ours and the baseline models.

The objective metrics evaluated on the music pieces generated by VLI (ours), ILM, and FELIX, and on the real music.

Results of the user study: mean opinion scores on a 1–5 scale for M (melodic fluency), R (rhythmic fluency), and I (impression), and the percentage of votes for F (favorite), from all the participants (‘all’) or only the music professionals (‘pro’).
