--
You received this message because you are subscribed to the Google Groups "xarray" group.
To unsubscribe from this group and stop receiving emails from it, send an email to xarray+un...@googlegroups.com.
To post to this group, send email to xar...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/xarray/CAJAZx5BpHFEkcr23qbTnU0n7_C3oZLv0FJ%2BLzi6s8cAkEGAteg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
Dask.array will happily store on-disk array data in chunks
Xarray should indeed write data out to netCDF chunk by chunk. This only works with the netcdf4 and h5netcdf backends, though; scipy does not support incremental writes.
On Thu, Jun 23, 2016 at 4:39 PM Ryan Abernathey <ryan.ab...@gmail.com> wrote:
On Thu, Jun 23, 2016 at 10:20 AM, Matthew Rocklin <mroc...@gmail.com> wrote:
Dask.array will happily store on-disk array data in chunks

Yes, I know this. What I don't know is whether xarray's .to_netcdf method will write each chunk one at a time or will instead read all the chunks into memory before writing.
On Thu, Jun 23, 2016 at 6:48 AM, Ryan Abernathey <ryan.ab...@gmail.com> wrote:
Hi xarrayers,

I want to use xarray to open several large netcdf files, concatenate them into one even bigger dataset, and write to disk using .to_netcdf. The final file size will be ~350 GB, larger than my RAM.

Will the entire concatenated dataset have to be read into memory (impossible), or can I make it write in chunks?

Thanks,
Ryan
A few other ideas:
- Can you do a computation that doesn't involve a write, e.g., calculating .mean(), in a streaming fashion?
- It might help to set chunks in the resulting netCDF file, using "chunksizes" in encoding.
- It would be interesting to see if writing multiple netCDF files helps, using save_mfdataset.