Thanks. For the first two --> scaffold00002:hit:1026:1.3.0.12
The value 1026 is held in a global iterator, so it cannot repeat the same
value during the life of the process. And 1.3.0.12 is generated from the
point in the code the ID is being generated. This means that two distinct
processses had to write to the same file at the same point in the code,
which should normally be impossible.
However, there are ways to make this happen. First if you turn file locks
off (-nolock) option and then run MAKER multiple times on the same dataset
you can get process collisions (because you disabled the locks that stop
this). If your NFS file system does not support hard links (FhGFS for
example) then you cannot lock the files (which is the same as setting
-nolock). Or you have other serious IO failures over NFS. Note that NFS
is your Network Mounted Storage.
The last example you give shows the preceding line being truncated. This
suggests that two processes are trying to write to the same file
simultaneously (inserting lines in between other lines), or serious IO
failures are occurring where writes are not completing but true is being
returned for the operations (can happen on unreliable NFS implementations).
So in summary either your NFS storage implementation is giving IO errors,
you have run MAKER with -nolock set and then started MAKER multiple times
in the same directory (process collisions), or your NFS implementation
doesn't support hardlinks and won't allow MAKER to lock files (process
collisions). If it is one of the latter two, you will have to make sure
you never start MAKER more than once simultaneously on the same dataset.
You can still run via MPI fro parallelization, but you won't be able to
start a second MPI process while the first one is still running.
Thanks,
Carson