Hi Floris,
RSEM only aligns to transcript sequences. It does not align to genomes.
If you provide RSEM with the genome sequence and a GTF file, RSEM will
extract the set of annotated transcripts for you. If you already have
the transcript sequences, you can directly use it.
The reasons that RSEM generate specific reference files for aligners is
two folds:
1) RSEM may add a 125bp poly(A) tail to the end of each mRNA;
2) Bowtie aligner cannot align reads against 'N' bases in the reference.
Thus RSEM first convert all 'N's in the reference to 'G's.
> - many transcripts are removed with the error message "cannot extract
> transcript XXX's sequence since the chromosome it locates, ZZZ, is
> absent". What does this mean?
It means in your genome FASTA file, there is no sequence for chromosome
ZZZ. Therefore RSEM cannot extract XXX since it locates on ZZZ.
Best,
Bo
> --
> RSEM website:
http://deweylab.biostat.wisc.edu/rsem/ [1]
> ---
> You received this message because you are subscribed to the Google
> Groups "RSEM Users" group.
> To unsubscribe from this group and stop receiving emails from it,
> send an email to
rsem-users+...@googlegroups.com.
> To post to this group, send email to
rsem-...@googlegroups.com.
> Visit this group at
http://groups.google.com/group/rsem-users [2].
>
>
> Links:
> ------
> [1]
http://deweylab.biostat.wisc.edu/rsem/
> [2]
http://groups.google.com/group/rsem-usersH