Using midi note numbers for pitch rather than Hz #192

blueyred · 2024-05-15T08:12:38Z

When generating losses for the pitch estimator, would there be a benefit to using the MIDI note number rather than Hz?
Measured in Hz, the same musical deviation (in cents) from a target note gives a different absolute error depending on the note range, because the mapping from Hz to cents is non-linear.
For example, if one singer aims for C3 and misses by 10% while another aims for C6 and also misses by 10%, both are off by the same musical interval, yet a pitch loss calculated in Hz would be higher for the C6 singer. It seems sensible to compute the pitch losses on MIDI note numbers instead (`librosa.hz_to_midi`).
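To make the C3 vs C6 comparison concrete, here is a minimal sketch using only the standard library. The helper names `hz_to_midi` and `midi_to_hz` are defined locally for illustration (they mirror the usual A4 = 440 Hz = MIDI 69 convention, the same one `librosa.hz_to_midi` follows), and C3 = 48, C6 = 84 assumes the C4 = 60 numbering.

```python
import math


def hz_to_midi(freq_hz: float) -> float:
    """Convert Hz to a fractional MIDI note number (A4 = 440 Hz = 69)."""
    return 69.0 + 12.0 * math.log2(freq_hz / 440.0)


def midi_to_hz(note: float) -> float:
    """Convert a (fractional) MIDI note number back to Hz."""
    return 440.0 * 2.0 ** ((note - 69.0) / 12.0)


def miss_errors(target_note: float, ratio: float = 1.10):
    """Return (error in Hz, error in semitones) for a miss by `ratio`."""
    target_hz = midi_to_hz(target_note)
    sung_hz = target_hz * ratio
    err_hz = sung_hz - target_hz
    err_semitones = hz_to_midi(sung_hz) - target_note
    return err_hz, err_semitones


# Assumed numbering: C4 = 60, so C3 = 48 and C6 = 84.
for name, note in [("C3", 48), ("C6", 84)]:
    err_hz, err_semitones = miss_errors(note)
    print(f"{name}: {err_hz:.2f} Hz off, {err_semitones:.2f} semitones off")
```

Both singers come out about 1.65 semitones (roughly 165 cents) sharp, while the error in Hz is eight times larger for C6, since C6 sits three octaves above C3.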

I was also wondering whether the model would benefit from estimating f0 in the more linear MIDI number domain, since it would then not have to implicitly learn the scaling between frequency and notes, e.g. $f = 440 \cdot 2^{(n - 69)/12}$. Keeping all the note data in the MIDI number domain would let it learn linear relationships between different pitches more easily.
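For reference, the note-to-frequency relation can be inverted to show why intervals become plain differences in the MIDI domain. Here $f$ is frequency in Hz and $n$ is the (possibly fractional) MIDI note number, assuming the usual A4 = 440 Hz = 69 tuning:

```latex
\begin{aligned}
f &= 440 \cdot 2^{(n - 69)/12} \\
n &= 69 + 12 \log_2\!\left(\frac{f}{440}\right) \\
n_2 - n_1 &= 12 \log_2\!\left(\frac{f_2}{f_1}\right)
\end{aligned}
```

So a fixed frequency ratio $f_2 / f_1$ always maps to the same difference in note number, regardless of register.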

yqzhishen · 2024-05-15T10:14:55Z

Maintainer

NN-based pitch estimators do not calculate losses on Hz. Their losses are computed on 2D probability maps, where the pitch is represented by Gaussian-blurred bins and the bins are equidistant in the log domain. For more details, see the CREPE paper: https://arxiv.org/abs/1802.06182
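As a rough illustration of what "Gaussian-blurred bins, equidistant in log domain" can look like for a single frame, here is a minimal standard-library sketch. The constants (360 bins, 20-cent spacing, a 25-cent blur, a lowest bin near C1, a 10 Hz cents reference) are assumptions chosen in the spirit of CREPE, and all function names are hypothetical. This is not the code of any particular estimator.

```python
import math

# Illustrative constants (assumptions, not taken from any codebase).
F_REF_HZ = 10.0         # reference frequency for the cents scale
F_MIN_HZ = 32.70        # centre of the lowest bin, roughly C1
N_BINS = 360            # number of pitch bins
BIN_STEP_CENTS = 20.0   # bins are equally spaced in cents (log domain)
SIGMA_CENTS = 25.0      # width of the Gaussian blur


def hz_to_cents(freq_hz: float) -> float:
    """Map Hz to cents relative to F_REF_HZ (a log-frequency scale)."""
    return 1200.0 * math.log2(freq_hz / F_REF_HZ)


# Bin centres: linear in cents, hence geometric in Hz.
BIN_CENTS = [hz_to_cents(F_MIN_HZ) + i * BIN_STEP_CENTS for i in range(N_BINS)]


def gaussian_target(freq_hz: float) -> list:
    """Soft target vector: a Gaussian bump centred on the true pitch."""
    c_true = hz_to_cents(freq_hz)
    return [
        math.exp(-((c - c_true) ** 2) / (2.0 * SIGMA_CENTS ** 2))
        for c in BIN_CENTS
    ]


def peak_bin(freq_hz: float) -> int:
    """Index of the bin with the highest target activation."""
    target = gaussian_target(freq_hz)
    return target.index(max(target))


def binary_cross_entropy(target, predicted, eps: float = 1e-7) -> float:
    """Mean per-bin binary cross-entropy between two activation vectors."""
    total = 0.0
    for y, p in zip(target, predicted):
        p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
        total -= y * math.log(p) + (1.0 - y) * math.log(1.0 - p)
    return total / len(target)


# A 10% miss moves the peak by the same number of bins at C3 and at C6.
for name, hz in [("C3", 130.81), ("C6", 1046.50)]:
    shift = peak_bin(hz * 1.10) - peak_bin(hz)
    loss = binary_cross_entropy(gaussian_target(hz), gaussian_target(hz * 1.10))
    print(f"{name}: peak shifts by {shift} bins, BCE = {loss:.4f}")
```

Because the bins are spaced in cents, the same relative miss displaces the target by the same number of bins in any register, which is the point about sensitivity made above.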

In DiffSinger, the pitch predictor processes pitch in the MIDI domain. The acoustic model maps all f0 values to their mel frequencies (something similar to the log domain) before sending them to the network. So sensitivity should not be a problem.
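To give a feel for how a mel mapping compresses the C3 vs C6 gap, here is a small sketch using a common mel formula, $2595 \log_{10}(1 + f/700)$. Whether DiffSinger uses exactly this variant and these constants is an assumption here; the function names are hypothetical and only serve the illustration.

```python
import math


def hz_to_mel(freq_hz: float) -> float:
    """A common Hz-to-mel formula (assumed variant, for illustration only)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def miss_sizes(target_hz: float, ratio: float = 1.10):
    """Return (miss in Hz, miss in mel) when singing `ratio` times the target."""
    sung_hz = target_hz * ratio
    return sung_hz - target_hz, hz_to_mel(sung_hz) - hz_to_mel(target_hz)


c3_hz_err, c3_mel_err = miss_sizes(130.81)    # C3
c6_hz_err, c6_mel_err = miss_sizes(1046.50)   # C6

print(f"Hz  error ratio C6/C3: {c6_hz_err / c3_hz_err:.2f}")
print(f"mel error ratio C6/C3: {c6_mel_err / c3_mel_err:.2f}")
```

With this formula the mel scale is close to linear well below 700 Hz and close to logarithmic well above it, so the same 10% miss is much less lopsided in mel than in Hz, though not perfectly equal across registers.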

