Hi,
I'm new to using RSEM and am a little confused by the results I'm getting (I'm using version 1.2.4). I've followed the instructions on the website and given a transcript fasta file to rsem-prepare-reference to create the necessary files. Next I ran rsem-calculate-expression on several samples using the prepared transcript file with paired-end reads - this all works fine and I get the output files. I then used the *.isoforms.results to calculate differential expression using DESeq (also fine). However I then used rsem-plot-transcript-wiggles to generate pdfs for my differentially expressed transcripts and I was a bit surprised to see that they differ wildly from the bam file output (generated using rsem-calculate-expression). I've attached screenshots of the plots of a particular transcript in two samples along with a genome browser plot of the same gene (using the bam (blue) & wig (black) files generated by RSEM).
The output from rsem-calculate-expression for the transcript in question is shown below - counts are highlighted:
NC_untreated:
gi|148234886|ref|NM_001090653| gi|148234886|ref|NM_001090653| 2487 2375.77 338.30 8.81 6.50 100.00
NC_treated:
gi|148234886|ref|NM_001090653| gi|148234886|ref|NM_001090653| 2487 2363.61 0.08 0.00 0.00 100.00
As you can see the coverage on the bam tracks (genome browser) are similar but the expected counts are very different for this transcript in the two samples. Can anyone explain to me why this is?
Thanks
Simon
NC_untreated
NC_treated
