chrX 20066072 20066072 0 0 1 360 0 7
This junction is bizarre since it has the same start and end coordinates.
I mapped three RNA-seq samples (ctrl+kd), and this junction appears in the output of all three. Is there an explanation for this?
FYI, my 2-pass STAR command is:
STAR --runThreadN 2 --genomeDir $genomeDIR --outFileNamePrefix $output_file --readFilesIn $readfiles1 $readfiles2 --outSJfilterReads Unique --quantMode TranscriptomeSAM GeneCounts --outSAMstrandField intronMotif --outFilterMultimapNmax 1
Thank you!
Qingqing
STAR --runThreadN 2 --runMode $run_mode --genomeDir $genomeDIR --genomeFastaFiles $genome_fasta --sjdbGTFfile $genome_annotation --sjdbOverhang 99
For $genome_annotation I used a genes.gtf file that was downloaded from the TopHat website: https://ccb.jhu.edu/software/tophat/igenomes.shtml
This annotation is the one listed there under Drosophila, UCSC dm3.
After that, I selected all the unannotated splice junctions from the SJ files (I have three samples: Index1, Index2, and Index3) by running:
awk '$6 == 0' Index1_SJ.out.tab > Index1_SJ.out.tab_unannotated
awk '$6 == 0' Index2_SJ.out.tab > Index2_SJ.out.tab_unannotated
awk '$6 == 0' Index3_SJ.out.tab > Index3_SJ.out.tab_unannotated
Then I combined all the unannotated junctions and formatted them for the 2nd-pass genome generation as follows:
cat Index1_SJ.out.tab_unannotated Index2_SJ.out.tab_unannotated Index3_SJ.out.tab_unannotated Index4_SJ.out.tab_unannotated > combined_SJ_out_tab_unannotated.txt
awk '{if($4==1) $4="+"; else if($4==2) $4="-"; print $1 "\t" $2 "\t" $3 "\t" $4}' combined_SJ_out_tab_unannotated.txt > combined_SJ_out_tab_unannotated_for_2nd_pass_genome_generation.txt
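For reference, here is a minimal sketch of the same combine-and-format step that also guards against the odd rows. It is not the command that was actually run. The function name format_sjdb and the output file name in the usage comment are made up for illustration. It assumes that rows whose start is not smaller than their end should be left out of the junction list, and it writes "." for the undefined strand (column 4 equal to 0), which is the notation the STAR manual gives for --sjdbFileChrStartEnd files. Duplicate junctions shared between samples are collapsed with sort -u.

```shell
# Hypothetical helper: convert SJ.out.tab rows (read from stdin) into the
# 4-column chr/start/end/strand layout, skipping rows where start >= end.
format_sjdb() {
  awk 'BEGIN { OFS = "\t" }
       $2 < $3 {
         strand = "."                      # column 4 == 0: undefined strand
         if ($4 == 1) strand = "+"
         else if ($4 == 2) strand = "-"
         print $1, $2, $3, strand
       }' | sort -u                        # drop junctions repeated across samples
}

# Example usage (output file name is an assumption):
# cat Index*_SJ.out.tab_unannotated | format_sjdb > sjdb_for_2nd_pass.txt
```

With this filter, a row such as chrX 20066072 20066072 would never reach the second genome generation, which may help to tell whether the junction comes from the inserted list or from the mapping itself.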
Now for the second step:
STAR --runThreadN 2 --runMode $run_mode --genomeDir $genomeDIR --genomeFastaFiles $genome_fasta --sjdbFileChrStartEnd $spliceDB --sjdbGTFfile $genome_annotation --sjdbOverhang 99
chr2L 21388502 21388502 1 3 1 321 0 5
chr2R 14309059 14309059 0 0 1 69 0 3
chr3R 24962423 24962423 0 0 1 117 0 3
chrX 8802354 8802354 0 0 1 5 0 4
chrX 8806978 8806978 1 1 1 377 0 3
chrX 13493447 13493447 0 0 1 1960 0 11
chrX 20066072 20066072 0 0 1 86 0 7
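Rows like the ones above can be pulled out of any SJ.out.tab file with a one-line awk filter. This is only a sketch: the function name zero_length_sj is made up, and the file name in the usage comment is one of the sample files mentioned earlier.

```shell
# Hypothetical helper: print SJ.out.tab rows (read from stdin) whose
# intron start (column 2) equals the intron end (column 3).
zero_length_sj() {
  awk '$2 == $3'
}

# Example usage:
# zero_length_sj < Index1_SJ.out.tab
```

Running it on each sample's file and comparing the results shows which of these junctions are shared across samples.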
I look forward to your reply! Thanks!
Qingqing