Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Last update: Nov 17, 2022

Overview

Building Shazam from scratch

In this repository we tried to implement a simplified copy of the Shazam application able to tell you the name of a song listening to a short sample.

Overview

Converting the songs from mp3 to wav with Librosa and extraction of the peaks
MinHashing with permutations on the shingles matrix
Locality sensitive hashing to divide the songs in buckets
Shazam!

pickle is a folder that contains the songs peaks, the shingles array and the shingle matrix in pickle format.
ShazamLSH.ipynb is the main notebook that only contains the explanation of the steps and some comments
function.py contains all the implemented function needed to execute the notebook

Resources

This is the dataset we used and processed:

https://www.kaggle.com/dhrumil140396/mp3s32k

We also share some useful links can help to understand what is the process behind Min Hashing and LSH in order to recognise song:

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Related tags

Overview

Building Shazam from scratch

Overview

Contents

Resources

Owner

Arturo Ghinassi

PyTorch implementation of CVPR'18 - Perturbative Neural Networks

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Tf alloc - Simplication of GPU allocation for Tensorflow2

Hyperbolic Procrustes Analysis Using Riemannian Geometry

PyTorch implementation of the ACL, 2021 paper Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks.

CVPR 2021 - Official code repository for the paper: On Self-Contact and Human Pose.

A Pytorch implementation of MoveNet from Google. Include training code and pre-train model.

Roach: End-to-End Urban Driving by Imitating a Reinforcement Learning Coach

This is a tensorflow-based rotation detection benchmark, also called AlphaRotate.

CUAD

Yoloxkeypointsegment - An anchor-free version of YOLO, with a simpler design but better performance

This repository contains the code for designing risk bounded motion plans for car-like robot using Carla Simulator.

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting

Prometheus exporter for Cisco Unified Computing System (UCS) Manager

PyTorch implementation of our ICCV 2021 paper, Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents.

An implementation on "Curved-Voxel Clustering for Accurate Segmentation of 3D LiDAR Point Clouds with Real-Time Performance"

Generate indoor scenes with Transformers

TensorFlow implementation of Style Transfer Generative Adversarial Networks: Learning to Play Chess Differently.

COIN the currently largest dataset for comprehensive instruction video analysis.