No it does not use ROMIO.
The locking may be do to how your NFS is implemented. MAKER does a lot of
small writes. Some NFS implementations do not handle that well and only
like large infrequent writes and frequent reads?
MAKER also uses a variant of the File:::NFSLock module which uses
hardlinks to force a flush of the NFS IO cache when asyncrynous IO is
enabled (described here
http://www.time-travellers.org/shane/papers/NFS_considered_harmful.html).
I know that the FhGFS implementation of NFS has broken hard link
functionality.
Also make sure you do not set TMP= in the maker_opt.ctl file to an NFS
mounted location. It must be local (/tmp for example). This is because
certain types of operations are not always NFS safe and need a local
location to work with (anything involving berkley DB or SQLite for
example). Make sure you are not setting that to an NFS mounted scratch
location. The mpi**** files, are examples of some short lived files that
should not be in NFS. They hold chunks of data from threads that are
processing the genome and are very rapidly created and deleted. They will
be cleaned up automatically when maker finished or killed by standard
signals such as when you hit ^C or use kill 15.
Thanks,
Carson
I'm glad your getting better results.
With respect to environmental variables. One common error in MPI
execution is that the environment variables will not always be the same on
the other nodes since only the root node is attached to a terminal, so
variables in launch scripts (.bashrc etc.) may not be available on all
nodes. Many clusters that are part of the XSEDE network and use SGE for
example have scripts that wrap mpiexec to guarantee export of all
environmental variables when using MPI to avoid just this type of common
error. So like anything, you start with the most common cause of errors
and then work to the less common. Kernel bugs usually rank low on the
list :-) But I'm glad it's working for you now.
Thanks,
Carson
Could you run the following command for me and share the ouptut with me?
mpiexec -n 8 perl -e 'use File::Spec; print File::Spec->tmpdir()."\n"'
Date: Monday, 20 May, 2013 4:36 PM
Cc: "maker...@yandell-lab.org<mailto:maker...@yandell-lab.org>" <maker...@yandell-lab.org<mailto:maker...@yandell-lab.org>>, "Heywood, Todd" <hey...@cshl.edu<mailto:hey...@cshl.edu>>
Subject: Re: [maker-devel] MPI MAKER hanging NFS
Hi Carson,
><hey...@cshl.edu<mailto:hey...@cshl.edu><mailto:hey...@cshl.edu<mailto:hey...@cshl.edu>>> wrote:
>
>>We have been getting hung NFS mounts on some nodes when running MPI MAKER
>>(version 2.27). Processes go into a "D" state and cannot be killed. We
>>end up having to reboot nodes to recover them. We are running MPICH2
>>version 1.4.1p1
>>with RHEL 6.3. Questions:
>>
>>(1) Does MPI MAKER use MPI-IO (ROMIO)? The state "D" processes are hung
>>on a sync_page system call under NFS. That *might* imply some locking
>>issues.
>>
>>(2) Has anyone else seen this?
>>
>>(3) The root directory (parent of genome.maker.output directory) has lots
>>of mpi***** files, all of which have the first line
>>"pst0Process::MpiChunk". Is this expected?
>>
>>I'm able to reproducibly hang NFS on some nodes when using at least 4
>>32-core nodes and 128 running MPI tasks.
>>
>>Thanks,
>>
>>Todd Heywood
>>CSHL
>>
>>
>
>
>_______________________________________________
>maker-devel mailing list
>maker...@box290.bluehost.com<mailto:maker...@box290.bluehost.com><mailto:maker...@box290.bluehost.com<mailto:maker...@box290.bluehost.com>>
Hi Carson,
We are currently working on the annotation of Helicoverpa genome project. Maker has been chosen as the preliminary tool for the task. By checking the annotation results by using maker 2.10, we saw some loci have the fusion problem: two separate neighbour genes are likely to be fused together and regarded as a single candidate output by maker. If we go further by looking at the outputs from each individual de novo algorithm, e.g. augustus or snap, the prediction was correct. We are also using RNA-Seq assembly from cufflinks and some protein evidence data from closely related insects.
We noticed that the parameters “pred_flank” in maker v2.10 and “correct_est_fusion” in maker v2.27 might be useful for maker to decide when to merge models or not. If possible, can you please explain what these two parameters can do with the predicted genes, RNA-Seq and protein evidence?
Also, our current plan is to install maker 2.27, train the algorithms to predict UTRs, enlarge the protein evidence datasets and input our previous annotations as model_gff. We are facing with an critical question: in which way we could effectively improve the gene fusing problem? 1) setting the pred_flank lower than 100? 2) turn the correct_est_fusion on? 3) anything else?
Thank you.
With best regards,
Xi (Sean) Li, Ph. D.
Bioinformatics Analyst, Bioinformatics Core,
CSIRO Mathematics, Informatics and Statistics
Phone: +61 2 6216 7138
Address: GPO Box 664, Canberra, ACT 2601
_______________________________________________
maker-devel mailing list
maker...@box290.bluehost.com
http://box290.bluehost.com/mailman/listinfo/maker-devel_yandell-lab.org
Thanks Barry and Carson for your detailed explanation. Now I have a better understand of “pred_flank”.
1. To run maker, we use transcripts produced by tophat+cufflink approach instead of de novo trinity. Will it avoid the possible merging of RNA-Seq reads?
2. If my understanding is correct, the “correct_est_fusion” parameter needs to be turned off when we don’t ask Maker/prediction algorithms to predict UTRs? Also, it makes me wonder, in such case, when Maker turn off UTRs, but our RNA-Seq data has got long UTRs, will these UTRs been added into the maker gene model?
Regards,
Sean
Thank you Carson. It has been a very helpful conversation with you! I will pass these information back to our group.
Best regards,