Variant calling error

117 views
Skip to first unread message

Vanessa Guerra

unread,
Dec 7, 2017, 5:40:59 PM12/7/17
to trinityrnaseq-users
Dear all, 

I'm having problems with the variant calling pipeline. I'm copying below the error that I'm getting. I used the same script that I used to run the test files that worked well (instead of copying the output of the test file to the post, I am attaching it as a document to this post to reduce any confusion).

To minimize problems due to the variability of sequences between my two biological groups, I'm using the data of just one individual (A7773) at a time (assembly also built from just one individual (~90,00 transcripts)). 

I used paired and unpaired sequences to make my Trinity transcript. But for the variant calling step, I'm only using the files with paired sequences (no unpaired sequence). Could this be creating the problem? 

Best, 
Vanessa

===
command
===

WORKING_DIR=/home/vguerra/scratch/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5

if [[ ! -d $WORKING_DIR ]]; then mkdir -p $WORKING_DIR; fi

cd $WORKING_DIR


OUT_DIR="/home/vguerra/scratch/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5"

if [[ ! -d $OUT_DIR ]]; then mkdir -p $OUT_DIR; fi


/cvmfs/soft.computecanada.ca/easybuild/software/2017/avx2/Compiler/intel2016.4/trinity/2.5.0/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/run_variant_calling.py \

    --threads 32 \

  --st_fa trinity_genes.fasta\

               --st_gtf trinity_genes.gtf \

               -p A37773_1_combined_first-line-edited_paired-clean.fq.gz A37773_2_combined_first-line-edited_paired-clean.fq.gz \

      --maxram 69873294549 \

               -o test_varcalling 

===
slurm output
===

/var/spool/slurmd/job3170009/slurm_script: line 13: /home/vguerra/ENV_MiH5/bin/activate: No such file or directory

To execute picard run: java -jar $EBROOTPICARD/picard.jar

    To execute GATK run: java -jar $EBROOTGATK/GenomeAnalysisTK.jar

    Generating SuperTranscript index.

CMD: samtools faidx /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/trinity_genes.fasta already processed. Skipping.

Generating Picard dictionary.

Generating genome folder for STAR

CMD: mkdir star_genome_idx already processed. Skipping.

Running: STAR --runThreadN 32 --runMode genomeGenerate --genomeDir star_genome_idx --genomeFastaFiles /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/trinity_genes.fasta --sjdbGTFfile /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/trinity_genes.gtf --sjdbOverhang 150 --limitGenomeGenerateRAM 69873294549

Running: touch /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/test_varcalling/trinity_genes.fasta.gatk_chkpts/star_genome_generate.ok

Running STAR alignment.

Running: STAR --runThreadN 32 --genomeDir star_genome_idx --runMode alignReads --twopassMode Basic --alignSJDBoverhangMin 10 --outSAMtype BAM SortedByCoordinate --limitBAMsortRAM 69873294549 --readFilesIn /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/A37773_1_combined_first-line-edited_paired-clean.fq.gz /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/A37773_2_combined_first-line-edited_paired-clean.fq.gz --readFilesCommand gunzip -c

Running: touch /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/test_varcalling/trinity_genes.fasta.gatk_chkpts/star_aln.ok

Cleaning and Converting sam file with Picard-Tools.

Running: java -jar /cvmfs/soft.computecanada.ca/easybuild/software/2017/Core/picard/2.10.7/picard.jar AddOrReplaceReadGroups I=Aligned.sortedByCoord.out.bam O=rg_added_sorted.bam SO=coordinate RGID=id RGLB=library RGPL=platform RGPU=machine RGSM=sample

[Thu Dec 07 13:58:12 PST 2017] picard.sam.AddOrReplaceReadGroups INPUT=Aligned.sortedByCoord.out.bam OUTPUT=rg_added_sorted.bam SORT_ORDER=coordinate RGID=id RGLB=library RGPL=platform RGPU=machine RGSM=sample    VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false GA4GH_CLIENT_SECRETS=client_secrets.json USE_JDK_DEFLATER=false USE_JDK_INFLATER=false

[Thu Dec 07 13:58:12 PST 2017] Executing as vgu...@cdr465.int.cedar.computecanada.ca on Linux 3.10.0-693.5.2.el7.x86_64 amd64; Java HotSpot(TM) 64-Bit Server VM 1.8.0_121-b13; Deflater: Intel; Inflater: Intel; Picard version: 2.10.7-SNAPSHOT

INFO 2017-12-07 13:58:13 AddOrReplaceReadGroups Created read group ID=id PL=platform LB=library SM=sample


INFO 2017-12-07 13:58:22 AddOrReplaceReadGroups Processed     1,000,000 records.  Elapsed time: 00:00:09s.  Time for last 1,000,000:    9s.  Last read position: TRINITY_DN24378_c4_g2:413

INFO 2017-12-07 13:58:29 AddOrReplaceReadGroups Processed     2,000,000 records.  Elapsed time: 00:00:15s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27638_c1_g1:15

INFO 2017-12-07 13:58:35 AddOrReplaceReadGroups Processed     3,000,000 records.  Elapsed time: 00:00:22s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25455_c0_g1:1,580

INFO 2017-12-07 13:58:42 AddOrReplaceReadGroups Processed     4,000,000 records.  Elapsed time: 00:00:28s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28030_c1_g3:159

INFO 2017-12-07 13:58:48 AddOrReplaceReadGroups Processed     5,000,000 records.  Elapsed time: 00:00:34s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23475_c6_g1:687

INFO 2017-12-07 13:58:54 AddOrReplaceReadGroups Processed     6,000,000 records.  Elapsed time: 00:00:41s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23453_c1_g2:1,705

INFO 2017-12-07 13:59:01 AddOrReplaceReadGroups Processed     7,000,000 records.  Elapsed time: 00:00:47s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24233_c5_g3:2,593

INFO 2017-12-07 13:59:07 AddOrReplaceReadGroups Processed     8,000,000 records.  Elapsed time: 00:00:54s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27537_c2_g1:8,157

INFO 2017-12-07 13:59:13 AddOrReplaceReadGroups Processed     9,000,000 records.  Elapsed time: 00:01:00s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25587_c4_g2:2,516

INFO 2017-12-07 13:59:20 AddOrReplaceReadGroups Processed    10,000,000 records.  Elapsed time: 00:01:06s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23815_c3_g1:1,250

INFO 2017-12-07 13:59:26 AddOrReplaceReadGroups Processed    11,000,000 records.  Elapsed time: 00:01:12s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24444_c0_g1:874

INFO 2017-12-07 13:59:32 AddOrReplaceReadGroups Processed    12,000,000 records.  Elapsed time: 00:01:18s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25596_c2_g1:2,397

INFO 2017-12-07 13:59:38 AddOrReplaceReadGroups Processed    13,000,000 records.  Elapsed time: 00:01:25s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28518_c0_g1:7,981

INFO 2017-12-07 13:59:45 AddOrReplaceReadGroups Processed    14,000,000 records.  Elapsed time: 00:01:31s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25242_c7_g5:214

INFO 2017-12-07 13:59:51 AddOrReplaceReadGroups Processed    15,000,000 records.  Elapsed time: 00:01:38s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN29035_c6_g1:530

INFO 2017-12-07 13:59:58 AddOrReplaceReadGroups Processed    16,000,000 records.  Elapsed time: 00:01:44s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28834_c4_g1:352

INFO 2017-12-07 14:00:04 AddOrReplaceReadGroups Processed    17,000,000 records.  Elapsed time: 00:01:51s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25894_c2_g2:673

INFO 2017-12-07 14:00:11 AddOrReplaceReadGroups Processed    18,000,000 records.  Elapsed time: 00:01:57s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28789_c4_g2:324

INFO 2017-12-07 14:00:17 AddOrReplaceReadGroups Processed    19,000,000 records.  Elapsed time: 00:02:03s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28509_c3_g1:6,552

INFO 2017-12-07 14:00:23 AddOrReplaceReadGroups Processed    20,000,000 records.  Elapsed time: 00:02:10s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24477_c11_g1:1,624

INFO 2017-12-07 14:00:30 AddOrReplaceReadGroups Processed    21,000,000 records.  Elapsed time: 00:02:16s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27264_c4_g1:475

INFO 2017-12-07 14:00:36 AddOrReplaceReadGroups Processed    22,000,000 records.  Elapsed time: 00:02:22s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27220_c2_g2:1,000

INFO 2017-12-07 14:00:42 AddOrReplaceReadGroups Processed    23,000,000 records.  Elapsed time: 00:02:29s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN22929_c0_g1:1,602

INFO 2017-12-07 14:00:49 AddOrReplaceReadGroups Processed    24,000,000 records.  Elapsed time: 00:02:35s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN26734_c3_g2:2,810

INFO 2017-12-07 14:00:55 AddOrReplaceReadGroups Processed    25,000,000 records.  Elapsed time: 00:02:41s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN22909_c0_g1:825

[Thu Dec 07 14:00:58 PST 2017] picard.sam.AddOrReplaceReadGroups done. Elapsed time: 2.77 minutes.

Runtime.totalMemory()=9236381696

To get help, see http://broadinstitute.github.io/picard/index.html#GettingHelp

Exception in thread "main" htsjdk.samtools.SAMFormatException: SAM validation error: ERROR: Read name HS28_47:1:1114:2015:7790, Read CIGAR M operator maps off end of reference

at htsjdk.samtools.SAMUtils.processValidationErrors(SAMUtils.java:451)

at htsjdk.samtools.BAMRecord.getCigar(BAMRecord.java:253)

at htsjdk.samtools.SAMRecord.getAlignmentEnd(SAMRecord.java:603)

at htsjdk.samtools.SAMRecord.computeIndexingBin(SAMRecord.java:1547)

at htsjdk.samtools.SAMRecord.isValid(SAMRecord.java:2054)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.advance(BAMFileReader.java:811)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.next(BAMFileReader.java:797)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.next(BAMFileReader.java:765)

at htsjdk.samtools.SamReader$AssertingIterator.next(SamReader.java:576)

at htsjdk.samtools.SamReader$AssertingIterator.next(SamReader.java:548)

at picard.sam.AddOrReplaceReadGroups.doWork(AddOrReplaceReadGroups.java:140)

at picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:228)

at picard.cmdline.PicardCommandLine.instanceMain(PicardCommandLine.java:94)

at picard.cmdline.PicardCommandLine.main(PicardCommandLine.java:104)


Traceback (most recent call last):

  File "/cvmfs/soft.computecanada.ca/easybuild/software/2017/avx2/Compiler/intel2016.4/trinity/2.5.0/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/run_variant_calling.py", line 212, in <module>

    main()

  File "/cvmfs/soft.computecanada.ca/easybuild/software/2017/avx2/Compiler/intel2016.4/trinity/2.5.0/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/run_variant_calling.py", line 178, in main

    pipeliner.run()

  File "/cvmfs/soft.computecanada.ca/easybuild/software/2017/avx2/Compiler/intel2016.4/trinity/2.5.0/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/../../../PyLib/Pipeliner.py", line 58, in run

    run_cmd(cmd.get_cmd())

  File "/cvmfs/soft.computecanada.ca/easybuild/software/2017/avx2/Compiler/intel2016.4/trinity/2.5.0/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/../../../PyLib/Pipeliner.py", line 22, in run_cmd

    raise RuntimeError("Error while running command \"" + str(cmd) + "\":\n" + error)

RuntimeError: Error while running command "['java', '-jar', '/cvmfs/soft.computecanada.ca/easybuild/software/2017/Core/picard/2.10.7/picard.jar', 'AddOrReplaceReadGroups', 'I=Aligned.sortedByCoord.out.bam', 'O=rg_added_sorted.bam', 'SO=coordinate', 'RGID=id', 'RGLB=library', 'RGPL=platform', 'RGPU=machine', 'RGSM=sample']":

[Thu Dec 07 13:58:12 PST 2017] picard.sam.AddOrReplaceReadGroups INPUT=Aligned.sortedByCoord.out.bam OUTPUT=rg_added_sorted.bam SORT_ORDER=coordinate RGID=id RGLB=library RGPL=platform RGPU=machine RGSM=sample    VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false GA4GH_CLIENT_SECRETS=client_secrets.json USE_JDK_DEFLATER=false USE_JDK_INFLATER=false

[Thu Dec 07 13:58:12 PST 2017] Executing as vgu...@cdr465.int.cedar.computecanada.ca on Linux 3.10.0-693.5.2.el7.x86_64 amd64; Java HotSpot(TM) 64-Bit Server VM 1.8.0_121-b13; Deflater: Intel; Inflater: Intel; Picard version: 2.10.7-SNAPSHOT

INFO 2017-12-07 13:58:13 AddOrReplaceReadGroups Created read group ID=id PL=platform LB=library SM=sample


INFO 2017-12-07 13:58:22 AddOrReplaceReadGroups Processed     1,000,000 records.  Elapsed time: 00:00:09s.  Time for last 1,000,000:    9s.  Last read position: TRINITY_DN24378_c4_g2:413

INFO 2017-12-07 13:58:29 AddOrReplaceReadGroups Processed     2,000,000 records.  Elapsed time: 00:00:15s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27638_c1_g1:15

INFO 2017-12-07 13:58:35 AddOrReplaceReadGroups Processed     3,000,000 records.  Elapsed time: 00:00:22s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25455_c0_g1:1,580

INFO 2017-12-07 13:58:42 AddOrReplaceReadGroups Processed     4,000,000 records.  Elapsed time: 00:00:28s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28030_c1_g3:159

INFO 2017-12-07 13:58:48 AddOrReplaceReadGroups Processed     5,000,000 records.  Elapsed time: 00:00:34s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23475_c6_g1:687

INFO 2017-12-07 13:58:54 AddOrReplaceReadGroups Processed     6,000,000 records.  Elapsed time: 00:00:41s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23453_c1_g2:1,705

INFO 2017-12-07 13:59:01 AddOrReplaceReadGroups Processed     7,000,000 records.  Elapsed time: 00:00:47s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24233_c5_g3:2,593

INFO 2017-12-07 13:59:07 AddOrReplaceReadGroups Processed     8,000,000 records.  Elapsed time: 00:00:54s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27537_c2_g1:8,157

INFO 2017-12-07 13:59:13 AddOrReplaceReadGroups Processed     9,000,000 records.  Elapsed time: 00:01:00s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25587_c4_g2:2,516

INFO 2017-12-07 13:59:20 AddOrReplaceReadGroups Processed    10,000,000 records.  Elapsed time: 00:01:06s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23815_c3_g1:1,250

INFO 2017-12-07 13:59:26 AddOrReplaceReadGroups Processed    11,000,000 records.  Elapsed time: 00:01:12s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24444_c0_g1:874

INFO 2017-12-07 13:59:32 AddOrReplaceReadGroups Processed    12,000,000 records.  Elapsed time: 00:01:18s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25596_c2_g1:2,397

INFO 2017-12-07 13:59:38 AddOrReplaceReadGroups Processed    13,000,000 records.  Elapsed time: 00:01:25s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28518_c0_g1:7,981

INFO 2017-12-07 13:59:45 AddOrReplaceReadGroups Processed    14,000,000 records.  Elapsed time: 00:01:31s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25242_c7_g5:214

INFO 2017-12-07 13:59:51 AddOrReplaceReadGroups Processed    15,000,000 records.  Elapsed time: 00:01:38s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN29035_c6_g1:530

INFO 2017-12-07 13:59:58 AddOrReplaceReadGroups Processed    16,000,000 records.  Elapsed time: 00:01:44s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28834_c4_g1:352

INFO 2017-12-07 14:00:04 AddOrReplaceReadGroups Processed    17,000,000 records.  Elapsed time: 00:01:51s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25894_c2_g2:673

INFO 2017-12-07 14:00:11 AddOrReplaceReadGroups Processed    18,000,000 records.  Elapsed time: 00:01:57s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28789_c4_g2:324

INFO 2017-12-07 14:00:17 AddOrReplaceReadGroups Processed    19,000,000 records.  Elapsed time: 00:02:03s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28509_c3_g1:6,552

INFO 2017-12-07 14:00:23 AddOrReplaceReadGroups Processed    20,000,000 records.  Elapsed time: 00:02:10s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24477_c11_g1:1,624

INFO 2017-12-07 14:00:30 AddOrReplaceReadGroups Processed    21,000,000 records.  Elapsed time: 00:02:16s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27264_c4_g1:475

INFO 2017-12-07 14:00:36 AddOrReplaceReadGroups Processed    22,000,000 records.  Elapsed time: 00:02:22s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27220_c2_g2:1,000

INFO 2017-12-07 14:00:42 AddOrReplaceReadGroups Processed    23,000,000 records.  Elapsed time: 00:02:29s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN22929_c0_g1:1,602

INFO 2017-12-07 14:00:49 AddOrReplaceReadGroups Processed    24,000,000 records.  Elapsed time: 00:02:35s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN26734_c3_g2:2,810

INFO 2017-12-07 14:00:55 AddOrReplaceReadGroups Processed    25,000,000 records.  Elapsed time: 00:02:41s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN22909_c0_g1:825

[Thu Dec 07 14:00:58 PST 2017] picard.sam.AddOrReplaceReadGroups done. Elapsed time: 2.77 minutes.

Runtime.totalMemory()=9236381696

To get help, see http://broadinstitute.github.io/picard/index.html#GettingHelp

Exception in thread "main" htsjdk.samtools.SAMFormatException: SAM validation error: ERROR: Read name HS28_47:1:1114:2015:7790, Read CIGAR M operator maps off end of reference

at htsjdk.samtools.SAMUtils.processValidationErrors(SAMUtils.java:451)

at htsjdk.samtools.BAMRecord.getCigar(BAMRecord.java:253)

at htsjdk.samtools.SAMRecord.getAlignmentEnd(SAMRecord.java:603)

at htsjdk.samtools.SAMRecord.computeIndexingBin(SAMRecord.java:1547)

at htsjdk.samtools.SAMRecord.isValid(SAMRecord.java:2054)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.advance(BAMFileReader.java:811)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.next(BAMFileReader.java:797)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.next(BAMFileReader.java:765)

at htsjdk.samtools.SamReader$AssertingIterator.next(SamReader.java:576)

at htsjdk.samtools.SamReader$AssertingIterator.next(SamReader.java:548)

at picard.sam.AddOrReplaceReadGroups.doWork(AddOrReplaceReadGroups.java:140)

at picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:228)

at picard.cmdline.PicardCommandLine.instanceMain(PicardCommandLine.java:94)

at picard.cmdline.PicardCommandLine.main(PicardCommandLine.java:104)

Slurm_output_test.rtf

Brian Haas

unread,
Dec 7, 2017, 8:42:55 PM12/7/17
to Vanessa Guerra, trinityrnaseq-users
Hi Vanessa,

Here's an updated script that sets the Picard validation stringency to lenient.  Let's see if this helps.

best,

~brian

--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrnaseq-users@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.



--
--
Brian J. Haas
The Broad Institute
http://broadinstitute.org/~bhaas

 
run_variant_calling.py

Vanessa Guerra

unread,
Dec 8, 2017, 12:55:32 PM12/8/17
to trinityrnaseq-users
Hi Brian, 

No luck yet with the new script, I got a similar error (copied below). 

I have double check that the fasta files were made with the the same individual as noted below (the main difference being that I also used single reads for the assembly). 

One more note is that I have 2 biological groups. One group has less genetic variation (# of SNP) than the second group. The original script is working with the group that has low genetic variation. Neither the original script nor the edited script is working for the samples that I expect will have more genetic variation. 

Any thoughts on this problem would be appreciated. 

Best, 
Vanessa


===
command: 
===

python /home/vguerra/MyModules/Trinity/2.5.0vguerra/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/run_variant_calling.py \

    --threads 32 \

  --st_fa trinity_genes.fasta\

               --st_gtf trinity_genes.gtf \

               -p A37773_1_combined_first-line-edited_paired-clean.fq.gz A37773_2_combined_first-line-edited_paired-clean.fq.gz \

      --maxram 69873294549 \

               -o test_varcalling 

===
output
===

/var/spool/slurmd/job3198693/slurm_script: line 13: /home/vguerra/ENV/bin/activate: No such file or directory

To execute picard run: java -jar $EBROOTPICARD/picard.jar

    To execute GATK run: java -jar $EBROOTGATK/GenomeAnalysisTK.jar

    Generating SuperTranscript index.

CMD: samtools faidx /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/trinity_genes.fasta already processed. Skipping.

Generating Picard dictionary.

Generating genome folder for STAR

CMD: mkdir star_genome_idx already processed. Skipping.

CMD: STAR --runThreadN 32 --runMode genomeGenerate --genomeDir star_genome_idx  --genomeFastaFiles /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/trinity_genes.fasta  --sjdbGTFfile /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/trinity_genes.gtf  --sjdbOverhang 150  --limitGenomeGenerateRAM 69873294549 already processed. Skipping.

Running STAR alignment.

CMD: STAR --runThreadN 32 --genomeDir star_genome_idx  --runMode alignReads  --twopassMode Basic  --alignSJDBoverhangMin 10  --outSAMtype BAM SortedByCoordinate  --limitBAMsortRAM 69873294549  --readFilesIn /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/A37773_1_combined_first-line-edited_paired-clean.fq.gz /scratch/vguerra/analyses_Cryptasterina/SuperTranscripts/SuperTranscripts_MiH5/Individual_MiH5/A37773_2_combined_first-line-edited_paired-clean.fq.gz --readFilesCommand 'gunzip -c'  already processed. Skipping.

Cleaning and Converting sam file with Picard-Tools.

Running: java -jar /cvmfs/soft.computecanada.ca/easybuild/software/2017/Core/picard/2.10.7/picard.jar AddOrReplaceReadGroups I=Aligned.sortedByCoord.out.bam O=rg_added_sorted.bam SO=coordinate RGID=id RGLB=library RGPL=platform RGPU=machine RGSM=sample

[Fri Dec 08 09:45:31 PST 2017] picard.sam.AddOrReplaceReadGroups INPUT=Aligned.sortedByCoord.out.bam OUTPUT=rg_added_sorted.bam SORT_ORDER=coordinate RGID=id RGLB=library RGPL=platform RGPU=machine RGSM=sample    VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false GA4GH_CLIENT_SECRETS=client_secrets.json USE_JDK_DEFLATER=false USE_JDK_INFLATER=false

[Fri Dec 08 09:45:31 PST 2017] Executing as vgu...@cdr783.int.cedar.computecanada.ca on Linux 3.10.0-693.5.2.el7.x86_64 amd64; Java HotSpot(TM) 64-Bit Server VM 1.8.0_121-b13; Deflater: Intel; Inflater: Intel; Picard version: 2.10.7-SNAPSHOT

INFO 2017-12-08 09:45:32 AddOrReplaceReadGroups Created read group ID=id PL=platform LB=library SM=sample


INFO 2017-12-08 09:45:40 AddOrReplaceReadGroups Processed     1,000,000 records.  Elapsed time: 00:00:06s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24378_c4_g2:413

INFO 2017-12-08 09:45:46 AddOrReplaceReadGroups Processed     2,000,000 records.  Elapsed time: 00:00:13s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27638_c1_g1:15

INFO 2017-12-08 09:45:53 AddOrReplaceReadGroups Processed     3,000,000 records.  Elapsed time: 00:00:19s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25455_c0_g1:1,580

INFO 2017-12-08 09:45:59 AddOrReplaceReadGroups Processed     4,000,000 records.  Elapsed time: 00:00:26s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28030_c1_g3:159

INFO 2017-12-08 09:46:05 AddOrReplaceReadGroups Processed     5,000,000 records.  Elapsed time: 00:00:32s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23475_c6_g1:687

INFO 2017-12-08 09:46:11 AddOrReplaceReadGroups Processed     6,000,000 records.  Elapsed time: 00:00:38s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23453_c1_g2:1,705

INFO 2017-12-08 09:46:18 AddOrReplaceReadGroups Processed     7,000,000 records.  Elapsed time: 00:00:44s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24233_c5_g3:2,593

INFO 2017-12-08 09:46:24 AddOrReplaceReadGroups Processed     8,000,000 records.  Elapsed time: 00:00:51s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27537_c2_g1:8,157

INFO 2017-12-08 09:46:30 AddOrReplaceReadGroups Processed     9,000,000 records.  Elapsed time: 00:00:57s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25587_c4_g2:2,516

INFO 2017-12-08 09:46:36 AddOrReplaceReadGroups Processed    10,000,000 records.  Elapsed time: 00:01:03s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23815_c3_g1:1,250

INFO 2017-12-08 09:46:42 AddOrReplaceReadGroups Processed    11,000,000 records.  Elapsed time: 00:01:09s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24444_c0_g1:874

INFO 2017-12-08 09:46:49 AddOrReplaceReadGroups Processed    12,000,000 records.  Elapsed time: 00:01:15s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25596_c2_g1:2,397

INFO 2017-12-08 09:46:55 AddOrReplaceReadGroups Processed    13,000,000 records.  Elapsed time: 00:01:22s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28518_c0_g1:7,981

INFO 2017-12-08 09:47:01 AddOrReplaceReadGroups Processed    14,000,000 records.  Elapsed time: 00:01:28s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25242_c7_g5:214

INFO 2017-12-08 09:47:08 AddOrReplaceReadGroups Processed    15,000,000 records.  Elapsed time: 00:01:34s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN29035_c6_g1:530

INFO 2017-12-08 09:47:14 AddOrReplaceReadGroups Processed    16,000,000 records.  Elapsed time: 00:01:40s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28834_c4_g1:352

INFO 2017-12-08 09:47:20 AddOrReplaceReadGroups Processed    17,000,000 records.  Elapsed time: 00:01:47s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25894_c2_g2:673

INFO 2017-12-08 09:47:27 AddOrReplaceReadGroups Processed    18,000,000 records.  Elapsed time: 00:01:53s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28789_c4_g2:324

INFO 2017-12-08 09:47:33 AddOrReplaceReadGroups Processed    19,000,000 records.  Elapsed time: 00:02:00s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28509_c3_g1:6,552

INFO 2017-12-08 09:47:40 AddOrReplaceReadGroups Processed    20,000,000 records.  Elapsed time: 00:02:06s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24477_c11_g1:1,624

INFO 2017-12-08 09:47:46 AddOrReplaceReadGroups Processed    21,000,000 records.  Elapsed time: 00:02:13s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27264_c4_g1:475

INFO 2017-12-08 09:47:52 AddOrReplaceReadGroups Processed    22,000,000 records.  Elapsed time: 00:02:19s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27220_c2_g2:1,000

INFO 2017-12-08 09:47:59 AddOrReplaceReadGroups Processed    23,000,000 records.  Elapsed time: 00:02:25s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN22929_c0_g1:1,602

INFO 2017-12-08 09:48:05 AddOrReplaceReadGroups Processed    24,000,000 records.  Elapsed time: 00:02:32s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN26734_c3_g2:2,810

INFO 2017-12-08 09:48:12 AddOrReplaceReadGroups Processed    25,000,000 records.  Elapsed time: 00:02:38s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN22909_c0_g1:825

[Fri Dec 08 09:48:15 PST 2017] picard.sam.AddOrReplaceReadGroups done. Elapsed time: 2.74 minutes.

Runtime.totalMemory()=4781506560

To get help, see http://broadinstitute.github.io/picard/index.html#GettingHelp

Exception in thread "main" htsjdk.samtools.SAMFormatException: SAM validation error: ERROR: Read name HS28_47:1:1114:2015:7790, Read CIGAR M operator maps off end of reference

at htsjdk.samtools.SAMUtils.processValidationErrors(SAMUtils.java:451)

at htsjdk.samtools.BAMRecord.getCigar(BAMRecord.java:253)

at htsjdk.samtools.SAMRecord.getAlignmentEnd(SAMRecord.java:603)

at htsjdk.samtools.SAMRecord.computeIndexingBin(SAMRecord.java:1547)

at htsjdk.samtools.SAMRecord.isValid(SAMRecord.java:2054)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.advance(BAMFileReader.java:811)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.next(BAMFileReader.java:797)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.next(BAMFileReader.java:765)

at htsjdk.samtools.SamReader$AssertingIterator.next(SamReader.java:576)

at htsjdk.samtools.SamReader$AssertingIterator.next(SamReader.java:548)

at picard.sam.AddOrReplaceReadGroups.doWork(AddOrReplaceReadGroups.java:140)

at picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:228)

at picard.cmdline.PicardCommandLine.instanceMain(PicardCommandLine.java:94)

at picard.cmdline.PicardCommandLine.main(PicardCommandLine.java:104)


Traceback (most recent call last):

  File "/home/vguerra/MyModules/Trinity/2.5.0vguerra/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/run_variant_calling.py", line 213, in <module>

    main()

  File "/home/vguerra/MyModules/Trinity/2.5.0vguerra/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/run_variant_calling.py", line 179, in main

    pipeliner.run()

  File "/home/vguerra/MyModules/Trinity/2.5.0vguerra/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/../../../PyLib/Pipeliner.py", line 58, in run

    run_cmd(cmd.get_cmd())

  File "/home/vguerra/MyModules/Trinity/2.5.0vguerra/trinityrnaseq-Trinity-v2.5.0/Analysis/SuperTranscripts/AllelicVariants/../../../PyLib/Pipeliner.py", line 22, in run_cmd

    raise RuntimeError("Error while running command \"" + str(cmd) + "\":\n" + error)

RuntimeError: Error while running command "['java', '-jar', '/cvmfs/soft.computecanada.ca/easybuild/software/2017/Core/picard/2.10.7/picard.jar', 'AddOrReplaceReadGroups', 'I=Aligned.sortedByCoord.out.bam', 'O=rg_added_sorted.bam', 'SO=coordinate', 'RGID=id', 'RGLB=library', 'RGPL=platform', 'RGPU=machine', 'RGSM=sample']":

[Fri Dec 08 09:45:31 PST 2017] picard.sam.AddOrReplaceReadGroups INPUT=Aligned.sortedByCoord.out.bam OUTPUT=rg_added_sorted.bam SORT_ORDER=coordinate RGID=id RGLB=library RGPL=platform RGPU=machine RGSM=sample    VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false GA4GH_CLIENT_SECRETS=client_secrets.json USE_JDK_DEFLATER=false USE_JDK_INFLATER=false

[Fri Dec 08 09:45:31 PST 2017] Executing as vgu...@cdr783.int.cedar.computecanada.ca on Linux 3.10.0-693.5.2.el7.x86_64 amd64; Java HotSpot(TM) 64-Bit Server VM 1.8.0_121-b13; Deflater: Intel; Inflater: Intel; Picard version: 2.10.7-SNAPSHOT

INFO 2017-12-08 09:45:32 AddOrReplaceReadGroups Created read group ID=id PL=platform LB=library SM=sample


INFO 2017-12-08 09:45:40 AddOrReplaceReadGroups Processed     1,000,000 records.  Elapsed time: 00:00:06s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24378_c4_g2:413

INFO 2017-12-08 09:45:46 AddOrReplaceReadGroups Processed     2,000,000 records.  Elapsed time: 00:00:13s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27638_c1_g1:15

INFO 2017-12-08 09:45:53 AddOrReplaceReadGroups Processed     3,000,000 records.  Elapsed time: 00:00:19s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25455_c0_g1:1,580

INFO 2017-12-08 09:45:59 AddOrReplaceReadGroups Processed     4,000,000 records.  Elapsed time: 00:00:26s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28030_c1_g3:159

INFO 2017-12-08 09:46:05 AddOrReplaceReadGroups Processed     5,000,000 records.  Elapsed time: 00:00:32s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23475_c6_g1:687

INFO 2017-12-08 09:46:11 AddOrReplaceReadGroups Processed     6,000,000 records.  Elapsed time: 00:00:38s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23453_c1_g2:1,705

INFO 2017-12-08 09:46:18 AddOrReplaceReadGroups Processed     7,000,000 records.  Elapsed time: 00:00:44s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24233_c5_g3:2,593

INFO 2017-12-08 09:46:24 AddOrReplaceReadGroups Processed     8,000,000 records.  Elapsed time: 00:00:51s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27537_c2_g1:8,157

INFO 2017-12-08 09:46:30 AddOrReplaceReadGroups Processed     9,000,000 records.  Elapsed time: 00:00:57s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25587_c4_g2:2,516

INFO 2017-12-08 09:46:36 AddOrReplaceReadGroups Processed    10,000,000 records.  Elapsed time: 00:01:03s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN23815_c3_g1:1,250

INFO 2017-12-08 09:46:42 AddOrReplaceReadGroups Processed    11,000,000 records.  Elapsed time: 00:01:09s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24444_c0_g1:874

INFO 2017-12-08 09:46:49 AddOrReplaceReadGroups Processed    12,000,000 records.  Elapsed time: 00:01:15s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25596_c2_g1:2,397

INFO 2017-12-08 09:46:55 AddOrReplaceReadGroups Processed    13,000,000 records.  Elapsed time: 00:01:22s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28518_c0_g1:7,981

INFO 2017-12-08 09:47:01 AddOrReplaceReadGroups Processed    14,000,000 records.  Elapsed time: 00:01:28s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25242_c7_g5:214

INFO 2017-12-08 09:47:08 AddOrReplaceReadGroups Processed    15,000,000 records.  Elapsed time: 00:01:34s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN29035_c6_g1:530

INFO 2017-12-08 09:47:14 AddOrReplaceReadGroups Processed    16,000,000 records.  Elapsed time: 00:01:40s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28834_c4_g1:352

INFO 2017-12-08 09:47:20 AddOrReplaceReadGroups Processed    17,000,000 records.  Elapsed time: 00:01:47s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN25894_c2_g2:673

INFO 2017-12-08 09:47:27 AddOrReplaceReadGroups Processed    18,000,000 records.  Elapsed time: 00:01:53s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28789_c4_g2:324

INFO 2017-12-08 09:47:33 AddOrReplaceReadGroups Processed    19,000,000 records.  Elapsed time: 00:02:00s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN28509_c3_g1:6,552

INFO 2017-12-08 09:47:40 AddOrReplaceReadGroups Processed    20,000,000 records.  Elapsed time: 00:02:06s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN24477_c11_g1:1,624

INFO 2017-12-08 09:47:46 AddOrReplaceReadGroups Processed    21,000,000 records.  Elapsed time: 00:02:13s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27264_c4_g1:475

INFO 2017-12-08 09:47:52 AddOrReplaceReadGroups Processed    22,000,000 records.  Elapsed time: 00:02:19s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN27220_c2_g2:1,000

INFO 2017-12-08 09:47:59 AddOrReplaceReadGroups Processed    23,000,000 records.  Elapsed time: 00:02:25s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN22929_c0_g1:1,602

INFO 2017-12-08 09:48:05 AddOrReplaceReadGroups Processed    24,000,000 records.  Elapsed time: 00:02:32s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN26734_c3_g2:2,810

INFO 2017-12-08 09:48:12 AddOrReplaceReadGroups Processed    25,000,000 records.  Elapsed time: 00:02:38s.  Time for last 1,000,000:    6s.  Last read position: TRINITY_DN22909_c0_g1:825

[Fri Dec 08 09:48:15 PST 2017] picard.sam.AddOrReplaceReadGroups done. Elapsed time: 2.74 minutes.

Runtime.totalMemory()=4781506560

To get help, see http://broadinstitute.github.io/picard/index.html#GettingHelp

Exception in thread "main" htsjdk.samtools.SAMFormatException: SAM validation error: ERROR: Read name HS28_47:1:1114:2015:7790, Read CIGAR M operator maps off end of reference

at htsjdk.samtools.SAMUtils.processValidationErrors(SAMUtils.java:451)

at htsjdk.samtools.BAMRecord.getCigar(BAMRecord.java:253)

at htsjdk.samtools.SAMRecord.getAlignmentEnd(SAMRecord.java:603)

at htsjdk.samtools.SAMRecord.computeIndexingBin(SAMRecord.java:1547)

at htsjdk.samtools.SAMRecord.isValid(SAMRecord.java:2054)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.advance(BAMFileReader.java:811)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.next(BAMFileReader.java:797)

at htsjdk.samtools.BAMFileReader$BAMFileIterator.next(BAMFileReader.java:765)

at htsjdk.samtools.SamReader$AssertingIterator.next(SamReader.java:576)

at htsjdk.samtools.SamReader$AssertingIterator.next(SamReader.java:548)

at picard.sam.AddOrReplaceReadGroups.doWork(AddOrReplaceReadGroups.java:140)

at picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:228)

at picard.cmdline.PicardCommandLine.instanceMain(PicardCommandLine.java:94)

at picard.cmdline.PicardCommandLine.main(PicardCommandLine.java:104)


To post to this group, send email to trinityrn...@googlegroups.com.

Brian Haas

unread,
Dec 8, 2017, 1:04:51 PM12/8/17
to Vanessa Guerra, trinityrnaseq-users

Sorry, looks like I missed one of the picard commands w/ the update to the validation stringency setting.

Please try the attached and we'll take it from there.

best,

~b
run_variant_calling.py

Vanessa Guerra

unread,
Dec 9, 2017, 1:49:01 PM12/9/17
to trinityrnaseq-users
Hi Brian, 

Thank you for the fix, that worked. I'm posting a couple of the log.final.out files below for general knowledge. 

Best, 
Vanessa
===
output from file with edited run_variant_calling.py script (sample MiH9)
===

                                 Started job on | Dec 08 18:00:53

                             Started mapping on | Dec 08 18:05:47

                                    Finished on | Dec 08 18:11:28

       Mapping speed, Million of reads per hour | 711.25


                          Number of input reads | 67370875

                      Average input read length | 149

                                    UNIQUE READS:

                   Uniquely mapped reads number | 57511933

                        Uniquely mapped reads % | 85.37%

                          Average mapped length | 147.71

                       Number of splices: Total | 9806404

            Number of splices: Annotated (sjdb) | 9584887

                       Number of splices: GT/AG | 425374

                       Number of splices: GC/AG | 104414

                       Number of splices: AT/AC | 90720

               Number of splices: Non-canonical | 9185896

                      Mismatch rate per base, % | 0.39%

                         Deletion rate per base | 0.18%

                        Deletion average length | 8.85

                        Insertion rate per base | 0.02%

                       Insertion average length | 1.42

                             MULTI-MAPPING READS:

        Number of reads mapped to multiple loci | 774932

             % of reads mapped to multiple loci | 1.15%

        Number of reads mapped to too many loci | 1389

             % of reads mapped to too many loci | 0.00%

                                  UNMAPPED READS:

       % of reads unmapped: too many mismatches | 0.00%

                 % of reads unmapped: too short | 13.48%

                     % of reads unmapped: other | 0.00%

                                  CHIMERIC READS:

                       Number of chimeric reads | 0

                            % of chimeric reads | 0.00%


===
output from file with original run_variant_calling.py script (sample MiH1)
===

                                 Started job on | Dec 07 09:00:40

                             Started mapping on | Dec 07 09:05:56

                                    Finished on | Dec 07 09:11:31

       Mapping speed, Million of reads per hour | 657.93


                          Number of input reads | 61224019

                      Average input read length | 149

                                    UNIQUE READS:

                   Uniquely mapped reads number | 53307702

                        Uniquely mapped reads % | 87.07%

                          Average mapped length | 148.32

                       Number of splices: Total | 5686193

            Number of splices: Annotated (sjdb) | 5533908

                       Number of splices: GT/AG | 773838

                       Number of splices: GC/AG | 77793

                       Number of splices: AT/AC | 58222

               Number of splices: Non-canonical | 4776340

                      Mismatch rate per base, % | 0.23%

                         Deletion rate per base | 0.12%

                        Deletion average length | 8.87

                        Insertion rate per base | 0.01%

                       Insertion average length | 1.11

                             MULTI-MAPPING READS:

        Number of reads mapped to multiple loci | 653596

             % of reads mapped to multiple loci | 1.07%

        Number of reads mapped to too many loci | 5125

             % of reads mapped to too many loci | 0.01%

                                  UNMAPPED READS:

       % of reads unmapped: too many mismatches | 0.00%

                 % of reads unmapped: too short | 11.85%

                     % of reads unmapped: other | 0.00%

                                  CHIMERIC READS:

                       Number of chimeric reads | 0

                            % of chimeric reads | 0.00%

Reply all
Reply to author
Forward
0 new messages