--
Magenta project: magenta.tensorflow.org
To post to this group, send email to magenta...@tensorflow.org
To unsubscribe from this group, send email to magenta-discu...@tensorflow.org
---
You received this message because you are subscribed to the Google Groups "Magenta Discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to magenta-discu...@tensorflow.org.
Hi Sean,Unfortunately, this is normal (or at least expected) for large datasets. Our initial data conversion pipeline does the onehot conversion before saving the record instead of doing that in the graph. Some of our later models do the correct thing and wait to expand to the onehot encoding until the data is actually being used. If you'd like to send a PR our way for updating the models that do the onehot conversion in the wrong place, we'd be happy to look at it.-Fjord
On Thu, Feb 22, 2018 at 6:06 PM Sean Farrell <se...@a52.com> wrote:
Hey all!--I am trying to make my own TF model from my own MIDI data. I can successfully create the initial dataset tfrecord (https://github.com/tensorflow/magenta/blob/master/magenta/scripts/README.md), but when I go to the next step to create SequenceExamples, the tfrecord balloons in size to well over 100GB.I figured that this is not normal. Could someone confirm that this is off, and perhaps offer some suggestions on what might be causing the issue.I also repeated these steps with the recommended data set (Lakh Midi Dataset) with the same results.Any help is appreciated!Thanks and y'all rock,Sean
Magenta project: magenta.tensorflow.org
To post to this group, send email to magenta...@tensorflow.org
To unsubscribe from this group, send email to magenta-discuss+unsubscribe@tensorflow.org
---
You received this message because you are subscribed to the Google Groups "Magenta Discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to magenta-discuss+unsubscribe@tensorflow.org.
Hey thanks for getting back to me CurtisI did mount a large external disk to hold that big ol tfrecord (almost 700 gb!), and the rest of the training went really smoothly!I am very pleased with the results so far. I am "sampling" the outputs I like to make new tracks, and its pretty awesome to have that performance feel straight from the model.Great work again,SeanOn Mon, Mar 12, 2018 at 5:52 PM, Curtis "Fjord" Hawthorne <fj...@google.com> wrote:Hi Sean,Unfortunately, this is normal (or at least expected) for large datasets. Our initial data conversion pipeline does the onehot conversion before saving the record instead of doing that in the graph. Some of our later models do the correct thing and wait to expand to the onehot encoding until the data is actually being used. If you'd like to send a PR our way for updating the models that do the onehot conversion in the wrong place, we'd be happy to look at it.-FjordOn Thu, Feb 22, 2018 at 6:06 PM Sean Farrell <se...@a52.com> wrote:Hey all!--I am trying to make my own TF model from my own MIDI data. I can successfully create the initial dataset tfrecord (https://github.com/tensorflow/magenta/blob/master/magenta/scripts/README.md), but when I go to the next step to create SequenceExamples, the tfrecord balloons in size to well over 100GB.I figured that this is not normal. Could someone confirm that this is off, and perhaps offer some suggestions on what might be causing the issue.I also repeated these steps with the recommended data set (Lakh Midi Dataset) with the same results.Any help is appreciated!Thanks and y'all rock,Sean
Magenta project: magenta.tensorflow.org
To post to this group, send email to magenta...@tensorflow.org
To unsubscribe from this group, send email to magenta-discu...@tensorflow.org
---
You received this message because you are subscribed to the Google Groups "Magenta Discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to magenta-discu...@tensorflow.org.