Dear community,
I have several recordings of a sequence of tones with increasing frequency. The sequence is repeated and sung at different tempos (from about 60 bpm up to 240 bpm). I am interested not only in measuring the fundamental frequency precisely, but also in the stability of each tone.
To measure the fundamental frequency precisely, I analyze each tone with as long a time window as possible, since the FFT bin width is the reciprocal of the window length.
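To illustrate that trade-off, here is a minimal sketch. The 44.1 kHz sample rate and the "one tone per beat" mapping from tempo to tone length are assumptions for illustration only, and `bin_width_hz` is just a helper name I made up:

```python
import numpy as np

def bin_width_hz(sample_rate, n_samples):
    """Spacing between adjacent FFT bins: fs / N, i.e. 1 / window duration."""
    return sample_rate / n_samples

fs = 44100  # assumed sample rate in Hz

for bpm in (60, 240):
    tone_seconds = 60.0 / bpm          # assumption: one tone per beat
    n = int(round(tone_seconds * fs))  # window covers the whole tone
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    # freqs[1] - freqs[0] is the bin spacing NumPy actually uses
    print(bpm, "bpm:", tone_seconds, "s window,",
          freqs[1] - freqs[0], "Hz per bin")
```

Under these assumptions a tone at 60 bpm gets 1 Hz bins, while a tone at 240 bpm only gets 4 Hz bins.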
Concerning the stability of the sung tones, I see two problems. First, since the tones have different lengths, the time windows and therefore the FFT bin widths differ from tone to tone. As a consequence, the bandwidth of, e.g., the fundamental's peak is not comparable between tones.
The second problem is the possible presence of vibrato, a frequency modulation (a periodic, roughly sinusoidal variation in frequency) that broadens the peak in the frequency domain, and whose influence I want to remove. In other words, I am only interested in the aperiodic changes in frequency within each sung tone. My only idea for removing the vibrato is to analyze the fundamental frequency with a really small window, look at the resulting sequence of fundamental-frequency estimates, and remove the vibrato from it with a lowpass filter (the vibrato rate can be, e.g., 8 Hz). But by analyzing with such small windows, I lose the precision of the overall fundamental-frequency estimate.
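To make the lowpass idea concrete, here is a rough sketch on a synthetic fundamental-frequency track rather than on my recordings. Everything in it is an assumption for illustration: the 200 Hz frame rate (one estimate every 5 ms), the 220 Hz tone, the 3 Hz vibrato depth, the slow drift, and the helper name `remove_vibrato`. As the lowpass it uses a plain moving average over exactly one vibrato period, which cancels a sinusoid at that rate:

```python
import numpy as np

def remove_vibrato(f0_track, frame_rate, vibrato_hz):
    """Average the f0 track over one vibrato period (a crude lowpass)."""
    period = int(round(frame_rate / vibrato_hz))  # frames per vibrato cycle
    kernel = np.ones(period) / period
    # "valid" keeps only positions where the full kernel overlaps the track
    return np.convolve(f0_track, kernel, mode="valid")

frame_rate = 200.0  # assumed: one f0 estimate every 5 ms
vibrato_hz = 8.0    # example vibrato rate
t = np.arange(200) / frame_rate  # one second of f0 estimates

drift = 2.0 * t  # hypothetical slow aperiodic drift (2 Hz over 1 s)
vibrato = 3.0 * np.sin(2 * np.pi * vibrato_hz * t)
f0_track = 220.0 + drift + vibrato

smoothed = remove_vibrato(f0_track, frame_rate, vibrato_hz)
print("std with vibrato:   ", np.std(f0_track))
print("std after smoothing:", np.std(smoothed))
```

This only works if the vibrato rate is known and roughly constant, and it also smooths away any aperiodic changes faster than the vibrato, which is part of my dilemma.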
I hope my basic idea and problem are clear, and I would be happy to hear some ideas for solving my dilemma.
EDIT: I have now added a sample file in which many of the tones contain vibrato.