GitXplorerGitXplorer
x

crepe_notes

public
24 stars
3 forks
8 issues

Commits

List of commits on branch main.
Unverified
17bb18de073e3d02d1d63e19431d3b10259cd9bc

Version bump and exclude tests from wheel

xxavriley committed a year ago
Unverified
693782d1525dc8b8883c2e7975c71ab4ba609b87

Try excluding tests from dist

xxavriley committed a year ago
Unverified
923a82ef8fa05b93a0e562ee054a171f3198f0d3

Bugfix release

xxavriley committed a year ago
Unverified
3dd3c4c0e31a825d58663c53d5dd2e1deceaebed

Fix bug with file paths

xxavriley committed a year ago
Unverified
b1fe2120799a490c3d9dd5d93eb36970af9fb7b7

Bump version

xxavriley committed 2 years ago
Unverified
b7975227ea91e6dba808bd665f1c87d781f129b5

Fix readme, warn about Python 3.10 +

xxavriley committed 2 years ago

README

The README file for this repository.

CREPE notes

Post-processing for CREPE to turn f0 pitch estimates into discrete notes (MIDI)

https://github.com/xavriley/crepe_notes/assets/369527/4cc895f5-9bfe-47af-809d-0152933dc4c9

Demo video for pypi users also available here

Features

  • outputs midi notes for monophonic audio
  • include MIDI velocity information
  • includes options to filter notes that are too quiet or too short

Installation

pip install tensorflow # if you haven't already
pip install crepe-notes

Warning Python 3.10 and above will fail to run, due to this long running issue with madmom.

As a workaround, please run !pip install -e git+https://github.com/CPJKU/madmom#egg=madmom after installation

Basic Usage

crepe_notes [path_to_original_audio]

A '.mid' file will be created in the location of the audio file with the name [audio_file_stem].transcription.mid.

For additional options check out crepe_notes --help.

Min duration, min velocity and sensitivity

These are the three params you may need to tweak to get optimal results.

  • --min-duration is specified in seconds (e.g. 0.03 is 30ms). For fast, virtuosic music this is a reasonable default but for things like vocals and double bass lines a longer min duration (50ms or higher) may reduce the number of errors in your transcription.

  • --min-velocity is expressed as in MIDI e.g. 0 - 127. The default is 6 which removes any notes with velocities at or below that value, but you may find recordings with a higher noise floor benefit from a higher threshold.

  • --sensitivity relates to the peak picking threshold used on the combined signal (see paper for details) and defaults to 0.001. If the source material has an unstable pitch profile which results in a lot of short notes either side of a longer target note, increasing the sensitivity to 0.002 may help.

Caching data files

If you are running crepe_notes over an entire dataset, we recommend using the --save-analysis-files flag. This will write the following results:

  • crepe to [audio_file_stem].f0.csv.
  • madmom onset activations to [audio_file_stem].onsets.npz
  • amplitude envelope calculations to [audio_file_stem].amp_envelope.npz

This will speed up run times at the expense of some disk space.

About

This repo is the code to accompany the following paper:

X. Riley and S. Dixon, “CREPE Notes: A new method for segmenting pitch contours into discrete notes,” in Proceedings of the 20th Sound and Music Computing Conference, Stockholm, Sweden, 2023, pp. 1–5.

In the paper we propose a method of combining two things:

a) the gradient of the pitch contour from CREPE b) the (inverse) confidence of the pitch estimate from CREPE

This gives us a new signal which is a reliable indicator of note onsets, which we can then use to segment the pitch contour into discrete notes. For more details please see the paper or the demo video above.

Results

How good is it? For the datasets we've tested so far it looks promising.

  • FiloSax (24 hrs solo saxophone audio) - 90% F-measure (no offsets)
  • ITM GT Flute 99 - (20mins Irish trad flute) - 74% F-measure (no offsets) +7% over Basic Pitch
  • FiloBass (4 hrs double bass source separated stems) - 72% F-measure (no offsets) +10% over Basic Pitch

Please open a Github issue if you get results for any other public datasets - we'll try to include them in this repo.

Caveats

CREPE only works for monophonic audio, which means CREPE Notes only works for monophonic audio too. If you need polyphonic transcription, check out Basic Pitch.

Due to the way the algorithm works, repeated notes at the same pitch are treated as a special case and have to fall back to using a standard onset detector (madmom). The results might vary depending on the type of music you want to transcribe. For example, in a jazz saxophone solo it's relatively uncommon to repeat the same note. In a rock bass line however the opposite is true.

The onset detection library we use (madmom) has a licence which restricts commercial use. This restrictions is conferred onto CREPE Notes as a result - if you have a commercial use case please contact the madmom authors to discuss this.

Roadmap

  • [ ] (Distant goal) Add UI to aid with picking thresholds for velocity and note length
  • [x] Experiment with edge preserving smoothing on the confidence thresholds to reduce spurious grace notes/glissandi

Credits

This package was created with Cookiecutter_ and the audreyr/cookiecutter-pypackage_ project template.