I assume you're using the parallel file reader, h2o.importFile,
and not h2o.uploadFile? If not, make that switch and that will
speed things up a lot. H2O can read a zipped CSV file but I am
not sure if that's going to speed anything up, it might just be
better for storage.
ARFF may be slightly faster than CSV because all the column types
are pre-defined by the user.
-Erin
--
You received this message because you are subscribed to the Google Groups "H2O Open Source Scalable Machine Learning - h2ostream" group.
To unsubscribe from this group and stop receiving emails from it, send an email to h2ostream+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
-- Erin LeDell Ph.D. Statistician & Machine Learning Scientist | H2O.ai
Aaron,
Good call, we will add this to the docs. cc-ing Angela Bartz, Head of Documentation.
-Erin
To unsubscribe from this group and stop receiving emails from it, send an email to h2ostream+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.