Users can transcribe their favorite piano recordings to MIDI files after installation

Last update: Dec 17, 2022

Related tags

Overview

Piano transcription inference

This toolbox is a piano transcription inference package that can be easily installed. Users can transcribe their favorite piano recordings to MIDI files after installation. To see how the piano transcription system is trained, please visit: https://github.com/bytedance/piano_transcription.

Demos

Here is a demo of our piano transcription system: https://www.youtube.com/watch?v=5U-WL0QvKCg

Installation

The piano transcription system is developed with Python 3.7 and PyTorch 1.4.0 (Should work with other versions, but not fully tested). Install PyTorch following https://pytorch.org/. Users should have ffmpeg installed to transcribe mp3 files.

pip install piano_transcription_inference

Installation is finished!

Usage

Want to try it out but don't want to install anything? We have set up a Google Colab.

python3 example.py --audio_path='resources/cut_liszt.mp3' --output_midi_path='cut_liszt.mid' --cuda

This will download the pretrained model from https://zenodo.org/record/4034264.

Users could also execute the inference code line by line:

from piano_transcription_inference import PianoTranscription, sample_rate, load_audio

# Load audio
(audio, _) = load_audio(audio_path, sr=sample_rate, mono=True)

# Transcriptor
transcriptor = PianoTranscription(device='cuda', checkpoint_path=None)  # device: 'cuda' | 'cpu'

# Transcribe and write out to MIDI file
transcribed_dict = transcriptor.transcribe(audio, 'cut_liszt.mid')

Visualization of piano transcription

Demo. Lang Lang: Franz Liszt - Love Dream (Liebestraum) [audio] [transcribed_midi]

FAQs

This repo support Linux and Mac. Windows has not been tested.

If users met "audio.exceptions.NoBackendError", then check if ffmpeg is installed.

If users met the problem of "Killed". This is caused by there are not sufficient memory.

Applications

We have built a large-scale classical piano MIDI dataset https://github.com/bytedance/GiantMIDI-Piano using our piano transcription system.

Cite

[1] High-resolution Piano Transcription with Pedals by Regressing Onsets and Offsets Times, [To appear], 2020

Users can transcribe their favorite piano recordings to MIDI files after installation

Related tags

Overview

Piano transcription inference

Demos

Installation

Usage

Visualization of piano transcription

FAQs

Applications

Cite

Owner

Reading list for research topics in sound event detection

controls volume using hand gestures

Code to work with wave files!

Sparse Beta-Divergence Tensor Factorization Library

In this project we can see how we can generate automatic music using character RNN.

Pythonic bindings for FFmpeg's libraries.

Codes for "Efficient Long-Range Attention Network for Image Super-resolution"

Audio pitch-shifting & re-sampling utility, based on the EMU SP-1200

Pianote - An application that helps musicians practice piano ear training

ianZiPu is a way to write notation for Guqin (古琴) music.

praudio provides audio preprocessing framework for Deep Learning audio applications

A Python wrapper for the high-quality vocoder "World"

DaisyXmusic ❤ A bot that can play music on Telegram Group and Channel Voice Chats

Convert complex chord names to midi notes

A Python library and tools AUCTUS A6 based radios.

digital audio workstation, instrument and effect plugins, wave editor

Make an audio file (really) long-winded

Scalable audio processing framework written in Python with a RESTful API

pyo is a Python module written in C to help digital signal processing script creation.

FPGA based USB 2.0 high speed audio interface featuring multiple optical ADAT inputs and outputs

Users can transcribe their favorite piano recordings to MIDI files after installation

Related tags

Overview

Piano transcription inference

Demos

Installation

Usage

Visualization of piano transcription

FAQs

Applications

Cite

Owner

Reading list for research topics in sound event detection

controls volume using hand gestures

Code to work with wave files!

Sparse Beta-Divergence Tensor Factorization Library

In this project we can see how we can generate automatic music using character RNN.

﻿﻿Pythonic bindings for FFmpeg's libraries.

Codes for "Efficient Long-Range Attention Network for Image Super-resolution"

Audio pitch-shifting & re-sampling utility, based on the EMU SP-1200

Pianote - An application that helps musicians practice piano ear training

ianZiPu is a way to write notation for Guqin (古琴) music.

praudio provides audio preprocessing framework for Deep Learning audio applications

A Python wrapper for the high-quality vocoder "World"

DaisyXmusic ❤ A bot that can play music on Telegram Group and Channel Voice Chats

Convert complex chord names to midi notes

A Python library and tools AUCTUS A6 based radios.

digital audio workstation, instrument and effect plugins, wave editor

Make an audio file (really) long-winded

Scalable audio processing framework written in Python with a RESTful API

pyo is a Python module written in C to help digital signal processing script creation.

FPGA based USB 2.0 high speed audio interface featuring multiple optical ADAT inputs and outputs

Pythonic bindings for FFmpeg's libraries.