rsem-prepare-reference --gtf Pan_troglodytes.CHIMP2.1.4.82.chr.gtf --transcript-to-gene-map Pan_troglodytes.CHIMP2.1.4.82.gene_transcript_mapping.txt Pan_troglodytes.CHIMP2.1.4.chr.fa RSEMchimpmkdir STARchimp
STAR --runMode genomeGenerate --genomeDir STARchimp --genomeFastaFiles Pan_troglodytes.CHIMP2.1.4.chr.fa --sjdbGTFfile Pan_troglodytes.CHIMP2.1.4.82.chr.gtf --sjdbOverhang 199mkdir mapped_reads
STAR --genomeDir STARchimp --readFilesIn RNA-Seq_1.fastq RNA-Seq_2.fastq --outSAMunmapped Within --outFilterType BySJout --outSAMattributes NH HI AS NM MD --outFilterMultimapNmax 20 --outFilterMismatchNmax 999 --outFilterMismatchNoverLmax 0.04 --alignIntronMin 20 --alignIntronMax 1000000 --alignMatesGapMax 1000000 --alignSJoverhangMin 8 --alignSJDBoverhangMin 1 --outSAMtype BAM SortedByCoordinate --quantMode TranscriptomeSAM --outFileNamePrefix mapped_reads/rsem-calculate-expression --paired-end --bam --no-bam-output --forward-prob 1.0 mapped_reads/Aligned.toTranscriptome.out.bam RSEMchimp rnaseq_quantrsem-parse-alignments RSEMchimp rnaseq_quant.temp/rnaseq_quant rnaseq_quant.stat/rnaseq_quant b rnaseq_mapped/Aligned.toTranscriptome.out.bam -t 3 -tag XM
RSEM can not recognize reference sequence name ENSPTRT00000054489!
"rsem-parse-alignments RSEMchimp rnaseq_quant.temp/rnaseq_quant rnaseq_quant.stat/rnaseq_quant b rnaseq_mapped/Aligned.toTranscriptome.out.bam -t 3 -tag XM" failed! Plase check if you provide correct parameters/options for the pipeline!RSEM can not recognize reference sequence name ENST00000456328!
rsem.idx.fa:>ENST00000456328_DDX11L1-002
rsem.n2g.idx.fa:>ENST00000456328_DDX11L1-002
rsem.seq:ENST00000456328_DDX11L1-002
rsem.ti:ENST00000456328_DDX11L1-002
rsem.ti:gene_id "ENSG00000223972"; gene_version "5"; transcript_id "ENST00000456328"; transcript_version "2"; exon_number "1"; gene_name "DDX11L1"; gene_source "havana"; gene_biotype "transcribed_unprocessed_pseudogene"; havana_gene "OTTHUMG00000000961"; havana_gene_version "2"; transcript_name "DDX11L1-002"; transcript_source "havana"; transcript_biotype "processed_transcript"; havana_transcript "OTTHUMT00000362751"; havana_transcript_version "1"; exon_id "ENSE00002234944"; exon_version "1"; tag "basic"; transcript_support_level "1";
rsem.transcripts.fa:>ENST00000456328_DDX11L1-002rsem-prepare-reference --gtf NCBI_Felis8.0_annotation_1106151122.gtf -p 8 GCF_000181335.2_Felis_catus_8.0_genomic.fa Felis8.0_RSEM
rsem-extract-reference-transcripts Felis8.0_RSEM 0 NCBI_Felis8.0_annotation_1106151122.gtf 0 GCF_000181335.2_Felis_catus_8.0_genomic.fa
Parsed 200000 lines
Parsed 400000 lines
Parsed 600000 lines
Parsing gtf File is done!
GCF_000181335.2_Felis_catus_8.0_genomic.fa is processed!
40495 transcripts are extracted and 0 transcripts are omitted.
Extracting sequences is done!
Group File is generated!
Transcript Information File is generated!
Chromosome List File is generated!
Extracted Sequences File is generated!
rsem-preref Felis8.0_RSEM.transcripts.fa 1 Felis8.0_RSEM -l 125
Refs.makeRefs finished!
Refs.saveRefs finished!
Felis8.0_RSEM.idx.fa is generated!
Felis8.0_RSEM.n2g.idx.fa is generated!
Generate STAR reference (using genome and annotations)
STAR --runThreadN 6 --runMode genomeGenerate --genomeDir . --genomeFastaFiles GCF_000181335.2_Felis_catus_8.0_genomic.fa --limitGenomeGenerateRAM 19000000000 --genomeChrBinNbits 13 --sjdbGTFfile NCBI_Felis8.0_annotation_1106151122.gtf --sjdbOverhang 100 --genomeSAindexNbases 11
Mapping
STAR --genomeDir . --outSAMunmapped Within --outFilterType BySJout --outSAMattributes NH HI AS NM MD --outFilterMultimapNmax 20 --outFilterMismatchNmax 999 --outFilterMismatchNoverLmax 0.04 --alignIntronMin 20 --alignIntronMax 1000000 --alignMatesGapMax 1000000 --alignSJoverhangMin 8 --alignSJDBoverhangMin 1 --sjdbScore 1 --runThreadN 8 --genomeLoad NoSharedMemory --outSAMtype BAM Unsorted --quantMode TranscriptomeSAM --outSAMheaderHD \@HD VN:1.4 SO:unsorted --readFilesIn 541OD_R1.fq 541OD_R2.fq
rsem-calculate-expression --paired-end --bam -p 8 541Aligned.toTranscriptome.out.bam Felis8.0_RSEM 541RSEM
ERROR
rsem-parse-alignments Felis8.0_RSEM 541RSEM.temp/541RSEM 541RSEM.stat/541RSEM b 541Aligned.toTranscriptome.out.bam -t 3 -tag XM
RSEM can not recognize reference sequence name rna0!
"rsem-parse-alignments Felis8.0_RSEM 541RSEM.temp/541RSEM 541RSEM.stat/541RSEM b 541Aligned.toTranscriptome.out.bam -t 3 -tag XM" failed! Plase check if you provide correct parameters/options for the pipeline!
As follows to previous post, I used the same genome and annotation in STAR and RSEM. Also, input Aligned.toTranscriptome.out.bam, not genome.bam. In STAR Log.out,
Processing sjdbGTFfile=NCBI_Felis8.0_annotation_1106151122.gtf, found:
40495 transcripts
402177 exons (non-collapsed)
203665 collapsed junctions
..... Finished GTF processing
Loaded database junctions from the GTF file:
I couldn't find why it did't work. Please give me solution.
Thank you for your help!
Kazu