E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Last update: Dec 15, 2022

Overview

End-to-end Music Remastering System

This repository includes source code and pre-trained models of the work End-to-end Music Remastering System Using Self-supervised and Adversarial Training by Junghyun Koo, Seungryeol Paik, and Kyogu Lee.

We provide inference code of the proposed system, which targets to alter the mastering style of a song to desired reference track.

Pre-trained Models

Model	Number of Epochs Trained	Details
Music Effects Encoder	1000	Trained with MTG-Jamendo Dataset
Mastering Cloner	1000	Trained with the above pre-trained Music Effects Encoder and Projection Discriminator

Inference

To run the inference code,

Download pre-trained models above and place them under the folder named 'model_checkpoints' (default)
Prepare input and reference tracks under the folder named 'inference_samples' (default).
Target files should be organized as follow:

    "path_to_data_directory"/"song_name_#1"/input.wav
    "path_to_data_directory"/"song_name_#1"/reference.wav
    ...
    "path_to_data_directory"/"song_name_#n"/input.wav
    "path_to_data_directory"/"song_name_#n"/reference.wav

Run 'inference.py'

python inference.py \
    --ckpt_dir "path_to_checkpoint_directory" \
    --data_dir_test "path_to_directory_containing_inference_samples"

Outputs will be stored under the folder 'inference_samples' (default)

Note: The system accepts WAV files of stereo-channeled, 44.1kHZ, and 16-bit rate. Target files shold be named "input.wav" and "reference.wav".

Configurations of each sub-networks

A detailed configuration of each sub-networks can also be found at

Self_Supervised_Music_Remastering_System/configs.yaml

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Related tags

Overview

End-to-end Music Remastering System

Pre-trained Models

Inference

Configurations of each sub-networks

Owner

Junghyun (Tony) Koo

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation

Equivariant GNN for the prediction of atomic multipoles up to quadrupoles.

A time series processing library

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI

OntoProtein: Protein Pretraining With Ontology Embedding

Modular Probabilistic Programming on MXNet

Python utility to generate filesystem content for Obsidian.

Code for paper " AdderNet: Do We Really Need Multiplications in Deep Learning?"

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"

simple demo codes for Learning to Teach with Dynamic Loss Functions

MoveNet Single Pose on DepthAI

Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"

Justmagic - Use a function as a method with this mystic script, like in Nim

simple artificial intelligence utilities

FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision

[AAAI 2021] MVFNet: Multi-View Fusion Network for Efficient Video Recognition

[NeurIPS 2021]: Are Transformers More Robust Than CNNs? (Pytorch implementation & checkpoints)