Reproducibility repository

With this repository we aim to make the ISMIR 2018 paper Music detection in broadcast audio recordings: a non-binary approach with relative loudness annotations as reproducible as possible.

Getting started:

Clone this repository

git clone https://github.com/BlaiMelendezCatalan/ISMIR2018.git

Download audio (and optionally Lidy's features):

Object	File name	Description	Link
Audio	audio.zip	Audio used to train and test the algorithms of the comparative analysis	https://zenodo.org/record/1216054
Lidy's Features	features.npz	Features used for training in Lidy's algorithm	https://zenodo.org/record/1216062

Extract audio.zip, estimations.zip, annotations.zip and features.npz to the corresponding folder:

File name	Extract to
audio.zip	comparative_analysis/
estimations.zip	comparative_analysis/
annotations.zip	comparative_analysis/
features.npz	comparative_analysis/algorithms/lidy/features/

NOTE: estimations.zip and annotations.zip can be found in comparative_analysis/

Feature extraction and model training and testing

Follow the READMEs in the folder of each author (in comparative_analysis/algorithms/) to obtain their estimations for the testing dataset. The complete process including feature extraction, the training of the model and its testing can only be done for the Tsipas algorithm. For Lidy's and Marolt's algorithms only the testing part is available as the algorithms' code is not public. The result of the process for each author is already available in comparative_analysis/estimations/author/raw_estimations/.

Format estimations

Follow the README in comparative_analysis/format_estimations/ to format the estimations of each algorithm. The result of the formatting is already available in comparative_analysis/estimations/author/formatted_estimations/

Plot results

Follow the README in comparative_analysis/evaluation/ to obtain the plots in the paper.

ISMIR2018

Commits

Minor changes in READMEs

Partly commented format.py

Include annotations and estimations as zip files