Hi Peter,
The most obvious explanation would be that www.olympics.sk/index.php was not harvested, or was not harvested correctly.
It is probably best to look into the WARC files and extract the response records for both URLs. It is also worth checking what their CDX lines look like.
Assuming you're using gzipped WARCs, you can use the 'zcat' utility to look into the WARC. I find it best to pipe the output into 'less' and then use less's search capabilities.
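If paging through the raw output gets unwieldy, the same check can be scripted. Below is a rough sketch, not a tested tool: it assumes a standard gzipped WARC in which every record carries a Content-Length header, and the function name find_response_records is just a name I made up for illustration. It walks the file record by record and reports the HTTP status line of each response record whose WARC-Target-URI contains a given string.

```python
import gzip


def find_response_records(warc_path, url_fragment):
    """Yield (warc_headers, http_status_line) for matching response records.

    Hypothetical helper: assumes each record has a Content-Length header.
    """
    with gzip.open(warc_path, "rb") as stream:
        while True:
            line = stream.readline()
            if not line:
                break  # end of file
            if not line.startswith(b"WARC/"):
                continue  # blank separator lines between records

            # Read the WARC header block up to the first empty line.
            headers = {}
            while True:
                header_line = stream.readline()
                if header_line in (b"\r\n", b"\n", b""):
                    break
                name, _, value = header_line.decode("utf-8", "replace").partition(":")
                headers[name.strip().lower()] = value.strip()

            # Read the whole record block so payload bytes are never
            # mistaken for the start of a new record.
            block = stream.read(int(headers.get("content-length", "0")))

            if (headers.get("warc-type") == "response"
                    and url_fragment in headers.get("warc-target-uri", "")):
                # The first line of a response block is the HTTP status line.
                status_line = block.split(b"\r\n", 1)[0].decode("latin-1")
                yield headers, status_line
```

For example, looping over find_response_records("crawl.warc.gz", "olympics.sk/index.php") and printing each status line together with the record's "warc-date" header would show whether the page was captured and with which HTTP status. The file name here is a placeholder for one of your own WARCs.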
Best,
Kris
-------------------------------------------------------------------------
Landsbókasafn Íslands - Háskólabókasafn | Arngrímsgötu 3 - 107 Reykjavík
Sími/Tel: +354 5255600 | www.landsbokasafn.is
-------------------------------------------------------------------------
fyrirvari/disclaimer - http://fyrirvari.landsbokasafn.is