Dear Gyozo,
many thanks for sharing your solution!
I just double-checked, and on our system (SLURM 18.08.4, CentOS 7.6.1810) both options - with and
without "srun -N 1 -n 1" - work just fine.
In any case, I'm currently working on implementing coarse-grained parallelization directly in
raxml-ng, so I hope these problems will become irrelevant soon :)
Best,
Alexey
On 28.05.19 15:08, Győző Kaján wrote:
> Dear Alexey,
>
> Found the solution. Are you sure this srun workaround is needed? It is working wonderfully for me
> without it (see below). And you don't have srun in the bootstrap
> <
https://github.com/amkozlov/raxml-ng/wiki/Parallelization> part either. And found the solution for
> the multiple nodes as well: the -p does the trick at least on our Slurm (v17.11.13) system. In the
> example below, the tree inferences are running on 2 nodes (40 cores altogether) parallelly.
>
> Best regards,
> Gyozo
>
>
> #! /bin/bash -l
> #SBATCH -A XXXXXXX
> #SBATCH -J PRRSV_ORF5
> #SBATCH -t 10:00:00
> #SBATCH --mail-type=ALL
> #SBATCH *-p node*
> #SBATCH -n 40
> #SBATCH -B 2:10:1
> #SBATCH --threads-per-core=1
> #SBATCH --cpus-per-task=1
> #SBATCH --mail-type=ALL
> #SBATCH --qos=short
> #SBATCH -M snowy
>
>
> module load bioinfo-tools
> module load RAxML-NG/0.7.0b
>
>
> cd /domus/h1/gyozo/Feri/PRRSV1_ORF5/
>
>
> for i in `seq 1 40`;
> do
> raxml-ng --msa alignment.phy --model GTR+G+I --force --prefix parallel_$i --threads 1 --seed $RANDOM &
> done
>
> wait
>
>
>
>
>
> On Monday, 27 May 2019 13:40:41 UTC+2, Győző Kaján wrote:
>
> Dear Alexey,
>
> Running the attached bash file on the same dataset
>
> * in January, the calculation was running on 20 cores *parallelly*.
> * now, it runs the 20 "units" of work *sequentially*, one after the other.
>
> What is your gut feeling, what system change might cause this strange behaviour?
>
> We have Slurm 17.11.13 now, no idea about the version No of January.
>
> Best regards,
> Gyozo
>
> On Tue, 12 Feb 2019 at 14:08, Alexey Kozlov <
alexei...@gmail.com
> <mailto:
alexei...@gmail.com>> wrote:
>
> Hi Gyozo,
>
> for multiple nodes, I recommend using our ParGenes pipeline:
>
>
https://github.com/BenoitMorel/ParGenes <
https://github.com/BenoitMorel/ParGenes>
>
> Or please feel free to dig into SLURM documentation and share your
> findings here :)
>
> Best,
> Alexey
>
> On 07.02.19 22:43, Kaján Győző wrote:
> > Dear Alexey,
> >
> > How could I go beyond one node? I have tried -N 2 or -n 32, but not all
> > cores were used.
> >
> > Thx,
> > Gyozo
> >
> > On Tue, 15 Jan 2019 at 02:12, Alexey Kozlov <
alexei...@gmail.com
> <mailto:
alexei...@gmail.com>
> > <mailto:
alexei...@gmail.com <mailto:
alexei...@gmail.com>>> wrote:
> >
> > Great, thank you for testing this, I will update the tutorial
> > accordingly!
> >
> > Alexey
> >
> > On 14.01.19 23:07, Kaján Győző wrote:
> > > Dear Alexey,
> > >
> > > The first was the winner one:
> > > srun -N 1 -n 1 --*exclusive *raxml-ng ...
> > >
> > > Complete runtime: 58 sec.
> > >
> > > Thank you very much!
> > > Gyozo
> > >
> > > On Mon, 14 Jan 2019 at 18:48, Alexey Kozlov
> > <
alexei...@gmail.com <mailto:
alexei...@gmail.com>
> > <mailto:
alexei...@gmail.com <mailto:
alexei...@gmail.com>>>> wrote:
> > >
> > > Hi Gyozo,
> > >
> > > very weird, apparently there are some differences in
> > configuration.
> > >
> > > Could you please try
> > >
> > > srun -N 1 -n 1 --exclusive raxml-ng ...
> > >
> > > and/ or
> > >
> > > srun -N 1 -n 1 --cpu-bind=cores raxml-ng ...
> > >
> > > According to SLURM documentation, this should help:
> > >
> > >
https://slurm.schedmd.com/srun.html#OPT_cpu-bind
> <
https://slurm.schedmd.com/srun.html#OPT_cpu-bind>
> > >
> > > Best,
> > > Alexey
> > >
> > > On 14.01.19 17:14, Kaján Győző wrote:
> > > > Dear Alexey,
> > > >
> > > > Tried it, but did no help. Strange, because it seems, it
> > starts the
> > > > trees in parallel but on a single core. See job statistics
> > > attached for
> > > > the prev. run too.
> > > > We have slurm 17.11.12.
> > > >
> > > > Thx,
> > > > Gyozo
> > > >
> > > >
> > > > On Mon, 7 Jan 2019 at 15:27, Alexey Kozlov
> > > <
alexei...@gmail.com <mailto:
alexei...@gmail.com>
> > > > <mailto:
alexei...@gmail.com <mailto:
alexei...@gmail.com>
> > <mailto:
alexei...@gmail.com <mailto:
alexei...@gmail.com>>>>> wrote:
> > > >
> > > > Dear Gyozo,
> > > >
> > > > please try to change submission script header as follows:
> > > >
> > > > #SBATCH -n 16
> > > > #SBATCH --cpus-per-task=1
> > > >
> > > > (i.e. 16 tasks x 1 core instead of 1 task x 16 cores).
> > > >
> > > > Interestingly, both configurations work fine on our
> > cluster,
> > > but this
> > > > might depend on SLURM version and/or configuration (we
> > have
> > > > slurm/
17.11.7.1 <
http://17.11.7.1> <
http://17.11.7.1> <
http://17.11.7.1>
> > <
http://17.11.7.1> here).
> > > >
> > > > Please let me know if the alternative solution above works
> > > for you - if
> > > > yes, I'll change the tutorial accordingly.
> > > >
> > > > Thanks in advance!
> > > >
> > > > Best,
> > > > Alexey
> > > >
> > > > On 29.12.18 01:41, Kaján Győző wrote:
> > > > > Dear Alexey,
> > > > >
> > > > > Hope you have nice holidays, and sorry to bother you!
> > > > > But I tried the recommended
> > > > >
> > > >
> > >
> >
> <
https://github.com/amkozlov/raxml-ng/wiki/Parallelization#coarse-grained-parallelization-for-short-alignments <
https://github.com/amkozlov/raxml-ng/wiki/Parallelization#coarse-grained-parallelization-for-short-alignments>>
> <
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail <
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail>>
> <
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail <
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail>>
> > > > >
> > > > >
> > > > >
> > > <#m_8355361735492220274_DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>
> > > > >
> > > > > On Sun, 9 Dec 2018 at 12:37, Alexandros Stamatakis
> > <mailto:
alexandros...@gmail.com <mailto:
alexandros...@gmail.com>>>>>>
> > > <mailto:
raxml%25252Bun...@googlegroups.com
> <mailto:
raxml%2525252Bu...@googlegroups.com>
> > <mailto:
raxml%2525252Bu...@googlegroups.com
> <mailto:
raxml%252525252B...@googlegroups.com>>>>>
> > > <mailto:
raxml%25252Bun...@googlegroups.com
> <mailto:
raxml%2525252Bu...@googlegroups.com>
> > <mailto:
raxml%2525252Bu...@googlegroups.com
> <mailto:
raxml%252525252B...@googlegroups.com>>>>>>.
> > > > > > For more options, visit
> > > >
https://groups.google.com/d/optout <
https://groups.google.com/d/optout>.
> <
https://groups.google.com/d/topic/raxml/PYCTUdWJJOA/unsubscribe>.
> > > > > To unsubscribe from this group and all its
> > topics,
> > > send
> > > > an email
> > > > > to
raxml+un...@googlegroups.com
> > > <mailto:
raxml%25252Bun...@googlegroups.com
> <mailto:
raxml%2525252Bu...@googlegroups.com>
> > <mailto:
raxml%2525252Bu...@googlegroups.com
> <mailto:
raxml%252525252B...@googlegroups.com>>>>>.
> > > > > For more options, visit
> > >
https://groups.google.com/d/optout <
https://groups.google.com/d/optout>.
> > > > >
> > > > >
> > > > >
> > > >
> > >
> >
> <
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail <
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail>>
> <
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail <
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail>>
> > > > >
> > > > >
> > > > >
> > > <#m_8355361735492220274_DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>
> > > > >
> > > > > --
> > > > > You received this message because you are subscribed to
> > > the Google
> > > > > Groups "raxml" group.
> > > > > To unsubscribe from this group and stop receiving
> > emails
> > > from it,
> > > > send
> > > > > an email to
raxml+un...@googlegroups.com
> <mailto:
raxml%2525252Bu...@googlegroups.com>>>>>.
> > > > > For more options, visit
> >
https://groups.google.com/d/optout <
https://groups.google.com/d/optout>.
> <
https://groups.google.com/d/topic/raxml/PYCTUdWJJOA/unsubscribe>.
> > > > To unsubscribe from this group and all its topics, send an
> > > email to
> <mailto:
raxml%2525252Bu...@googlegroups.com>>>>.
> > > > For more options, visit
> >
https://groups.google.com/d/optout <
https://groups.google.com/d/optout>.
> > > >
> > > > --
> > > > You received this message because you are subscribed to
> > the Google
> > > > Groups "raxml" group.
> > > > To unsubscribe from this group and stop receiving emails
> > from it,
> > > send
> <mailto:
raxml%25252Bun...@googlegroups.com>>>>.
> <
https://groups.google.com/d/optout>.
> <
https://groups.google.com/d/topic/raxml/PYCTUdWJJOA/unsubscribe>.
> > > To unsubscribe from this group and all its topics, send an
> > email to
> <mailto:
raxml%25252Bun...@googlegroups.com>>>.
> <
https://groups.google.com/d/optout>.
> > >
> > > --
> > > You received this message because you are subscribed to the Google
> > > Groups "raxml" group.
> > > To unsubscribe from this group and stop receiving emails from it,
> > send
> <mailto:
raxml%252Buns...@googlegroups.com>>>.
> <
https://groups.google.com/d/optout>.
> <
https://groups.google.com/d/topic/raxml/PYCTUdWJJOA/unsubscribe>.
> > To unsubscribe from this group and all its topics, send an email to
> <mailto:
raxml%252Buns...@googlegroups.com>>.
> <
https://groups.google.com/d/optout>.
> >
> > --
> > You received this message because you are subscribed to the Google
> > Groups "raxml" group.
> > To unsubscribe from this group and stop receiving emails from it, send
> > <mailto:
raxml+un...@googlegroups.com <mailto:
raxml%2Bunsu...@googlegroups.com>>.
> <
https://groups.google.com/d/optout>.
> <
https://groups.google.com/d/topic/raxml/PYCTUdWJJOA/unsubscribe>.
> To unsubscribe from this group and all its topics, send an email to
>
raxml+un...@googlegroups.com <mailto:
raxml%2Bunsu...@googlegroups.com>.
> For more options, visit
https://groups.google.com/d/optout <
https://groups.google.com/d/optout>.
>
> --
> You received this message because you are subscribed to the Google Groups "raxml" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to
>
raxml+un...@googlegroups.com <mailto:
raxml+un...@googlegroups.com>.
> To view this discussion on the web visit
>
https://groups.google.com/d/msgid/raxml/e5fa4040-035b-4059-9289-20bcfa59a626%40googlegroups.com
> <
https://groups.google.com/d/msgid/raxml/e5fa4040-035b-4059-9289-20bcfa59a626%40googlegroups.com?utm_medium=email&utm_source=footer>.