Hi, I'm playing with librosa's STFT:
[data is a single channel of floats, sample rate 48 kHz]
import librosa
spec = librosa.stft(data, n_fft=4096, hop_length=512)
bf = dict(enumerate(librosa.fft_frequencies(sr=48000, n_fft=4096)))
...fft_frequencies returns a 2049-element array (n_fft/2 + 1) of bin frequencies, which bf maps from bin index to frequency.
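For reference, here is a small sketch of the frequency grid I believe fft_frequencies produces. It assumes the usual evenly spaced grid k * sr / n_fft for k = 0 .. n_fft/2; that formula is my assumption, and bin_frequencies is a hypothetical helper written in plain Python so it runs without librosa:

```python
def bin_frequencies(sr, n_fft):
    """Evenly spaced grid k * sr / n_fft for k = 0 .. n_fft // 2 (assumed formula)."""
    return [k * sr / n_fft for k in range(n_fft // 2 + 1)]

freqs = bin_frequencies(sr=48000, n_fft=4096)
bf = dict(enumerate(freqs))  # same index -> frequency mapping as in my snippet

print(len(bf))       # number of bins: n_fft // 2 + 1
print(bf[0], bf[1])  # first value, and the spacing between neighbours (sr / n_fft)
print(bf[2048])      # last value, which is sr / 2
```

Under that assumption the grid starts at 0 Hz and ends exactly at the Nyquist frequency (24000 Hz), with a spacing of 11.71875 Hz between neighbouring values.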
Do these values represent the boundaries of the bins or their center frequencies? E.g., is bf[23] the left edge or the center of bin index 23?
Given the length of the array, I would assume these are bin boundaries?
So, is spec[0][t] the DC component at frame t, and spec[1][t] the spectral value at frame t for the bin centered at frequency (bf[1]+bf[2])/2?
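One way I could check this empirically is to feed in pure tones and see which bin peaks. The sketch below is not librosa.stft (which applies a Hann window and centers frames by default); it is a direct, unwindowed DFT of a single 4096-sample frame in plain Python, on the assumption that the bin-to-frequency mapping is the same for a given n_fft. The helper names dft_magnitude and peak_bin are made up for illustration:

```python
import cmath
import math

SR = 48000
N_FFT = 4096

def dft_magnitude(frame, k):
    """Magnitude of DFT bin k of one frame (direct sum, no window)."""
    n = len(frame)
    return abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                   for i, x in enumerate(frame)))

def peak_bin(tone_hz, candidates):
    """Return the candidate bin index with the largest magnitude for a pure tone."""
    frame = [math.sin(2 * math.pi * tone_hz * i / SR) for i in range(N_FFT)]
    return max(candidates, key=lambda k: dft_magnitude(frame, k))

f23 = 23 * SR / N_FFT  # the value I get as bf[23]
f24 = 24 * SR / N_FFT  # the value I get as bf[24]

# Tone exactly at bf[23], and tone 1 Hz above the midpoint of bf[23] and bf[24].
on_grid = peak_bin(f23, range(20, 28))
above_mid = peak_bin((f23 + f24) / 2 + 1.0, range(20, 28))
print(on_grid, above_mid)
```

If the on-grid tone peaks in bin 23 while the tone just above the midpoint peaks in bin 24, that would suggest bf[k] is the center of bin k rather than its left edge, since with left edges both tones should land in bin 23.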
Thanks for any clues!