Blah. I had the wrong ‘From’ address on this, so Google Groups bounced it.
-Joe
> Begin forwarded message:
>
> From: Joe Hourcle <
one...@dcr.net>
> Subject: Re: {SunPy} Re: VSO Client skip already downloaded data
> Date: January 9, 2017 at 9:07:25 AM EST
> To:
su...@googlegroups.com
>
>
>> On Jan 8, 2017, at 6:08 PM, Joe Llama <
joe....@lowell.edu> wrote:
>>
>> Just wondering if anyone could help me out with this, I never managed to find a solution.
>
> I’m not sure how SunPy handles downloading from the VSO, but on the VSO side, it’s really, really messy.
>
> The archive at Stanford stores the data without scientific headers, so a process has to be run to add them. That process also sets the timestamp for the image, which affects what the file gets named.
>
> So you can’t do the normal HTTP process of passing back the Last-Modified header, so you can ask for only updates since that time — not only because you don’t know what the filename is for what you’re about to be requesting, but because process that we pass off to isn’t smart enough to check that. (it wasn’t written to be a CGI).
>
> The only way to get around it is what the IDL client is doing — send a HEAD first, which will get you the file size and filename. (it will *not* have an accurate Last-Modified time, however) … and then compare those.
>
> Of course, if only the header values are modified, or you’re asking for uncompressed data and the data has changed … the file size stays the same.
>
> (and the web server actually has to run the full processing to determine file size, so it can beat down the server if you’re asking for tarballs).
>
> Hopefully in the next month or two, this won’t be a problem for *most* of the AIA data. The SDAC has gotten some additional storage, and I’ve begun generating static files for AIA. (I just haven’t yet set up something to ‘watch’ for changes, and copying from Stanford is not that fast, so it’s going to take me a while to get back to the beginning of the mission).
>
> -Joe