I have a diploid plant genome with expected size of 2GB. I have 50 cells pacbio sequenced and one run of illumina 25000.
I had an initial go with spades with and without pacbio reads. Number of contigs resulted were quite high.
Tried using illumina contigs, illumina reads, a closer reference and pacbio reads via redundans. First it was failing on fastq2sspace , no space left on device. We fixed it by writing not into temp but into current directory. Now it says:
Welcome to SNAP version 1.0beta.23.
Loading index from directory... Unable to open file 'IBL_redundans_ref_long_short_reads_scaffolds_noreductions__2/contigs.fa.snap/GenomeIndex' for read.
Index load failed, aborting.
Aligning.
seems index was not loaded from directory by SNAP, how big is this problem? How much memory should I allow and number of cpus? I am allowing at present 250GB ram and 64 CPUS.
Any help will be appreciated.
General log says:
##################################################
[Wed Sep 27 22:50:10 2017] Estimating parameters of libraries...
Aligning 279357551 mates per library...
Insert size statistics Mates orientation stats
FastQ files read length median mean stdev FF FR RF RR
/scratch/dragon/intel/cbrc-PrlMill_Babil/Data/Interm/Assemblies/IlluminHiseq2500/29A/S_17_690_Pm-IBL_AD12_L008.A5SpadesAssembly/S_17_690_Pm-IBL_AD12_L008.assembly.s2/S_17_690_Pm-IBL_AD12_L008.assembly/corrected/S_17_690_Pm-IBL_AD12_L008.assembly.ec_1.00.0_0.cor.fastq.gz /scratch/dragon/intel/cbrc-PrlMill_Babil/Data/Interm/Assemblies/IlluminHiseq2500/29A/S_17_690_Pm-IBL_AD12_L008.A5SpadesAssembly/S_17_690_Pm-IBL_AD12_L008.assembly.s2/S_17_690_Pm-IBL_AD12_L008.assembly/corrected/S_17_690_Pm-IBL_AD12_L008.assembly.ec_2.00.0_0.cor.fastq.gz 125 417 427.17 85.44
4 9720 273 3
##################################################
[Wed Sep 27 22:51:41 2017] Scaffolding...
iteration 1.1: IBL_redundans_ref_long_short_reads_scaffolds_noreductions__2/contigs.fa 2426611 1396787756 48.607 182615 795059649 1911 193 0 97707
Many Thanks for help and clues.
Best,
IA
--
--
Intikhab Alam, PhD
Research Scientist
Computational Bioscience Research Centre (CBRC), Building #3, Office #4328
4700 King Abdullah University of Science and Technology (KAUST)
Thuwal 23955-6900, KSA
W: http://www.kaust.edu.sa
T +966 (0) 2 808-2423 F +966 (2) 802 0127
This message and its contents including attachments are intended solely for the original recipient. If you are not the intended recipient or have received this message in error, please notify me immediately and delete this message from your computer system. Any unauthorized use or distribution is prohibited. Please consider the environment before printing this email.
--
You received this message because you are subscribed to the Google Groups "Redundans" group.
To unsubscribe from this group and stop receiving emails from it, send an email to redundans+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/redundans/CAAwYpuYdQyjxu9ke9OOwUsi2WD-ARcyKL%2BcVEYP7eTfABJ_6wg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.