I downloaded the NSynth tfrecord file (either split works for me) from Magenta's website
, which contains the original features for each sound, e.g., note, instrument, pitch, velocity, sample_rate,
etc. I now want to employ it in the DDSP framework, probably using the class ddsp.training.data.TFRecordProvider (I don't want to use tensorflow_datasets for practical reasons).
So I need to create an adapted TFRecord file which contains DDSP features such as f0_hz, f0_confidence or loudness_db. How to do that from the original TFRecord file ?
The ddsp_prepare_tfrecord lib seems to be usable only from .wav files, and it seems a bit cumbersome to regenerate .wav files from waveforms in the original TFRecord.
I didn't find any useful insight in the tutorials or demos. Sorry if this is trivial, any help would be very useful :)