Feb 6 2013 label

Charlie Clark

Feb 7, 2013, 3:55:45 AM
to httpa...@googlegroups.com
Hi,

Is there a problem with the Feb 6th crawl results? The data dump is called
httparchive_Feb_6_2013_prev and is only 992 rows long.

Charlie
--
Charlie Clark
Managing Director
Clark Consulting & Research
German Office
Kronenstr. 27a
Düsseldorf
D- 40217
Tel: +49-211-600-3657
Mobile: +49-178-782-6226

Patrick Meenan

Feb 7, 2013, 9:09:45 AM
to httpa...@googlegroups.com
Ignore the crawls after 2/1 and before 2/15. We are doing some
experiments to track down some issues and it looks like at least one
of them accidentally made it into the public data. We'll get it
cleaned up.


-----------------
Sent from my slab of glass with no keyboard so it will be a miracle if
you receive what I meant to type.

Charlie Clark

Feb 21, 2013, 10:02:34 AM
to httpa...@googlegroups.com
On 07.02.2013 at 15:09, Patrick Meenan <patm...@gmail.com> wrote:

> Ignore the crawls after 2/1 and before 2/15.

Just for absolute clarity: 2013-02-01 to 2013-02-15, right?

> We are doing some
> experiments to track down some issues and it looks like at least one
> of them accidentally made it into the public data. We'll get it
> cleaned up.

I notice that 2013-02-15 is now online, and that the site shows results for
2013-02-06 but not its dataset, which I assume is still under
investigation.

Pat Meenan

Feb 21, 2013, 11:35:35 AM
to httpa...@googlegroups.com
2013-02-01 and 2013-02-15 themselves are good; only the crawls between the
two should be ignored (2013-02-06, for example).
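
For anyone filtering the dumps by date, the rule above can be sketched as
follows. This is a minimal Python sketch, not HTTP Archive code: the constant
and function names are hypothetical, and only the three crawl dates come from
this thread.

```python
from datetime import date

# Boundary crawls named in the thread; both are themselves good.
# The constant names are hypothetical, chosen for illustration only.
LAST_GOOD_BEFORE = date(2013, 2, 1)
FIRST_GOOD_AFTER = date(2013, 2, 15)


def is_usable_crawl(crawl_date: date) -> bool:
    """Return False only for crawls strictly between the two good crawls."""
    return not (LAST_GOOD_BEFORE < crawl_date < FIRST_GOOD_AFTER)


crawls = [date(2013, 2, 1), date(2013, 2, 6), date(2013, 2, 15)]
usable = [d for d in crawls if is_usable_crawl(d)]
print([d.isoformat() for d in usable])  # ['2013-02-01', '2013-02-15']
```

The comparison is strict on both sides, so the two boundary crawls are kept
and only the experimental ones in between are dropped.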

Charlie Clark

Feb 21, 2013, 11:39:16 AM
to httpa...@googlegroups.com
On 21.02.2013 at 17:35, Pat Meenan <patm...@gmail.com> wrote:

> 2013-02-01 and 2013-02-15 themselves are good, just any between the two
> (2013-02-06 should be ignored for example).

Okay, it's just that 2013-02-01 is still missing from
http://www.archive.org/httparchive_downloads/

Is it just lost in transit?

Pat Meenan

Feb 21, 2013, 12:33:41 PM
to httpa...@googlegroups.com
Yep, looks like something didn't complete the cycle properly - I'll wait
for Steve to take a look though as I'd probably do more harm than good :-)

Steve Souders

Feb 24, 2013, 4:04:53 PM
to httpa...@googlegroups.com
I just noticed this email - sorry!

Feb 1 is missing for "IE".

Feb 15 has incorrect dumps for "requests" for "IE" and "iphone".

I'll regenerate all those now...

-Steve

Steve Souders

Feb 27, 2013, 11:49:29 PM
to httpa...@googlegroups.com
All the dumps for Feb 1 & Feb 15 are now complete and available on the downloads page: http://httparchive.org/downloads.php

-Steve

Charlie Clark

Feb 28, 2013, 10:28:52 AM
to httpa...@googlegroups.com
On 28.02.2013 at 05:49, Steve Souders <steveso...@gmail.com> wrote:

> All the dumps for Feb 1 & Feb 15 are now complete and available on the
> downloads page: http://httparchive.org/downloads.php

Are there any changes to the files that were available on Sunday?