@M01271:62:000000000-A798C:1:1101:15420:1031 1:N:0:1
NCATCTATAGAAATAATAAGTCTAAGGTATAGGACTTCCGTTTTATAAAACTTGTATTTATCTATATCTAATTATAGGCTAGCTCTATATAGGGCTTATAGGACTAGTTTAACGTGTTTTTAGTATTTCGCTAGATTATTGCTATATATTAAGATATCGTTAATATANGTAGTATAGAAGATGTCTAGGTANGGGCGTAGGGTATTATTTATAAAGTATTAGNATAAACTTAGGGCGTTTATATCACGGAGTAAGTNTAAGCG
+
#8ACCGGGGGGGGGGGGGGGFDFGFFFFGGGGGGGGGGGGGGGGGGGGGGGGGGGGFGDFGGGGGGFGGGGGGGGGGGGGGGGDG9FCCGGGGGGFGDGFGGGGCGGGGGGGGGEFGGG?DGGGGGGGGGDGGGGGFEFG9CEFGGFFFGGGGGFGGGGBFGGFFFE#9AFFFGGGGGGGGGGGAFGFGGD#6@CD?CEGGGD8DFDGG9FGF,@A7:8EG,#6@E6EG8EGF?+?7=9;F9F+<<@D>DCDCFGA#3;94@4
It's paired end sequence is:
@M01271:62:000000000-A798C:1:1101:15420:1031 2:N:0:1
NGCTTATACTTACTCCGTGATATAAACGCCCTAAGTTTATTCTAATACTTTANAAATAATACCCTACGCCCGTACCTAGACATCTTCTANACTACTTATATTANNNNNNNNNTAATATATAGCAATAATCTAGCGAAATACTAAAAACACGTTAANNNNNNCCTATAAGCCCTATATAGAGCNAGCCTATAATTAGATATAGATNANNACNNNNTTTNTANAACGGAAGTCCTATACCTTAGACTTATTATTTCTATAGATGG
+
#8ACCGGGGGGGGGGGGGGGGFGGGFGGGGGGGGGGDGGGGGGGGGGGGGGG#:CFFGGDFGGGGGGGGGGGGGGEGGC9FGGGGGGGF#:BDGGGGGGGGGG#########::DFGGGGGGGGGGGEFGGGGGGGGEFGGGGGGGGGGGGFFGG######46@EEEGGGGGGGGFGGAFGD#6@@DG8EFACFA7FGGGGGG+#5##5*####303#33#3:@FFB>9FCFFAC=E<B<E=4:*7*/79@C<5@EFFF3EC8
The perl script also returns the following error:
Unable to build /scratch2/ludom/cortexVariants/binaries/uncleaned/31/B35.1.unclean.kmer31.q5.ctx at /home/ludom/CORTEX_release_v1.0.5.21/scripts/calling/run_calls.pl line 2097.
Can you tell me what is going wrong?
In addition to this particular problem, is there a reason the analysis starts with the sample B35.1 and not with the first sample mentioned in the index file (A14.1)?
Best regards,
Ludo.
I have ran Trimmomatic again to remove the adapters and merged the binaries sequentially as you suggested, starting with 30 jobs merging 2 binaries followed by 15 jobs merging the output from the previous jobs (so merging 4 binaries) etc. The run times were as follows: about 20 minutes for 2 binaries, 30 minutes for 4 binaries, 45 minutes for 8 binaries and 55 minutes for 16 binaries.
However, when I try to merge 32 binaries cortex_var seems to stall and the job is running now for more than 24 hours.
How many samples in total Ludo?
I have ran Trimmomatic again to remove the adapters and merged the binaries sequentially as you suggested, starting with 30 jobs merging 2 binaries followed by 15 jobs merging the output from the previous jobs (so merging 4 binaries) etc. The run times were as follows: about 20 minutes for 2 binaries, 30 minutes for 4 binaries, 45 minutes for 8 binaries and 55 minutes for 16 binaries.Wait, what does this mean? You first have a bunch of processes, each merging 2 binaries, and the average runtime for those was 20 mins?And then you wanted to merge 2 of these output files, which took 30 mins on avg?Or you merged 4 output files?
However, when I try to merge 32 binaries cortex_var seems to stall and the job is running now for more than 24 hours.Can you be more explicit about total number of samples and how you are collapsing the tree? Is this the final job or do you have a long way to go.My big merges for 1000 genomes took 3 days, but I only had to do this once