harmony2vec

Audio embeddings for music similarity search

Requirements

It uses CoverHunterMPS, downloaded into the folder $COVERHUNTER_FOLDER, to extract embeddings from audio files.

git clone https://github.com/alanngnet/CoverHunterMPS
A pretrained model is required in $MODEL_FOLDER
ffmpeg is used to convert the audio files
Install dependencies with pip install -r requirements.txt
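
For reference, the ffmpeg conversion used by the dataset step (mono WAV at 16 kHz) can be sketched as below. This is a minimal illustration, not the code in src/generate_dataset.py: the function names build_ffmpeg_command and convert_to_wav are hypothetical, and only the standard ffmpeg options -y, -i, -ac and -ar are assumed.

```python
import subprocess
from pathlib import Path
from typing import List


def build_ffmpeg_command(src: Path, dst: Path, sample_rate: int = 16000) -> List[str]:
    """Return an ffmpeg command that converts src to a mono WAV at sample_rate Hz.

    Hypothetical helper for illustration; the repository may build its command differently.
    """
    return [
        "ffmpeg",
        "-y",                     # overwrite dst if it already exists
        "-i", str(src),           # input audio file (e.g. an MP3)
        "-ac", "1",               # one audio channel (mono)
        "-ar", str(sample_rate),  # output sample rate in Hz
        str(dst),                 # output path; the .wav extension selects the WAV format
    ]


def convert_to_wav(src: Path, dst: Path) -> None:
    """Run the conversion; raises subprocess.CalledProcessError if ffmpeg fails."""
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
```

Keeping the command construction separate from the subprocess call makes it possible to inspect or log the exact command without ffmpeg being installed.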

1. Run python src/generate_dataset.py to download the audio files from YouTube to $MP3_FOLDER, convert them to mono 16 kHz WAV files in $WAV_FOLDER, prepare the WAV folder for feature extraction with CoverHunterMPS, and copy the hparams.yaml file into the WAV folder
2. Run python src/get_embeddings_with_coverhunter.py to get embeddings from the WAV files after generating the features with CoverHunterMPS
3. Run python src/convert_embeddings_to_csv.py to convert the embeddings to a CSV file
4. Run python src/get_neighbors_to_estimate_precision.py to estimate how precisely the model retrieves nearest neighbors that share the same chords
5. Run python src/convert_csv_to_json.py to convert the CSV files to a JSON file compatible with the Milvus vector database
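
The precision estimate computed by src/get_neighbors_to_estimate_precision.py can be pictured as a precision-at-k measure: for each track, find its k nearest embeddings and count how many carry the same chord label. The sketch below is an assumption about the general idea, not the script's actual code. It uses cosine similarity as the distance measure, and the names cosine_similarity and precision_at_k, as well as the label format, are hypothetical.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def precision_at_k(
    embeddings: Sequence[Sequence[float]],
    labels: Sequence[str],
    k: int = 1,
) -> float:
    """Mean fraction of each item's k nearest neighbors that share its label.

    Assumption: neighbors are ranked by cosine similarity and an item is
    never counted as its own neighbor.
    """
    if len(embeddings) != len(labels):
        raise ValueError("embeddings and labels must have the same length")
    n = len(embeddings)
    if n < 2 or k < 1:
        raise ValueError("need at least two items and k >= 1")
    k = min(k, n - 1)  # an item has at most n - 1 neighbors

    total = 0.0
    for i in range(n):
        # Score every other item against item i, most similar first.
        scored: List[tuple] = [
            (cosine_similarity(embeddings[i], embeddings[j]), j)
            for j in range(n)
            if j != i
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        hits = sum(1 for _, j in scored[:k] if labels[j] == labels[i])
        total += hits / k
    return total / n
```

This brute-force version compares every pair of embeddings, which is fine for a small evaluation set; a vector database such as Milvus is the natural replacement for the neighbor search at larger scale.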
