Hi List.
I have started to test FITS in the SCAPE project. Especially on web
archive content.
And I have run in to some problems with some HTML-files that make FITS
hang forever with a quite high CPU-load. It just hangs until I kill
it.
I have to run FITS on millions of files in batches of the entire
content of one ARC-file at a time (typically around 4000 objects) and
I have had to set up a cron job that automatically kills any FITS-
process that has not generated new output for 1 hour.
So far I have analyzed around 15 million objects (quite a lot - but
far from the 7 billion objects currently in Netarchive.dk) and my auto-
killing script killed around 15 processes in total so far.
The problem is 100 % reproducable and I have an example object that
makes FITS fail. Can I report a bug somewhere and attach the sample
object ?
best
Bjarne Andersen
State and University Library, Aarhus
Denmark