RSEM.stat

1,076 views
Skip to first unread message

Alicia R. Pérez-Porro

unread,
Apr 29, 2013, 6:22:19 AM4/29/13
to rsem-...@googlegroups.com
Hi,

I just got my first RSEM results and i'm trying to understand the output. Can someone tell me what exactly is inside RSEM.stat? I have 3 files: RSEM.cnt, RSEM.model and RSEM.theta and i have no idea what are they.

Thanks,
Alicia


---
Alicia R. Pérez-Porro
PhD candidate

Giribet lab
Department of Organismic and Evolutionary Biology
MCZ labs
Harvard University
26 Oxford St, Cambridge MA 02138

b...@cs.wisc.edu

unread,
Apr 29, 2013, 10:37:03 AM4/29/13
to rsem-...@googlegroups.com
Hi Alicia,

RSEM.cnt stores alignment statistics. RSEM.model is the model of RNA
sequencing process learned from the data. RSEM.theta is the estimated read
generating probability, which can be converted into TPM/FPKM.

These are internal outputs, which are used for simulation/generating
plots. They are designed as black box for users.

Best,
Bo

> Hi,
>
> I just got my first RSEM results and i'm trying to understand the output.
> Can someone tell me what exactly is inside RSEM.stat? I have 3 files:
> RSEM.cnt, RSEM.model and RSEM.theta and i have no idea what are they.
>
> Thanks,
> Alicia
>
>
> ---
> Alicia R. P�rez-Porro
> PhD candidate
>
> Giribet lab
> Department of Organismic and Evolutionary Biology
> MCZ labs
> Harvard University
> 26 Oxford St, Cambridge MA 02138
> phone: +1 617-496-5308
> fax: +1 617-495-5667
> www.oeb.harvard.edu/faculty/giribet/
>
> --
> RSEM website: http://deweylab.biostat.wisc.edu/rsem/
> ---
> You received this message because you are subscribed to the Google Groups
> "RSEM Users" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to rsem-users+...@googlegroups.com.
> To post to this group, send email to rsem-...@googlegroups.com.
> Visit this group at http://groups.google.com/group/rsem-users?hl=en-US.
>
>
>


Oleg Moskvin

unread,
Oct 29, 2014, 11:29:17 AM10/29/14
to rsem-...@googlegroups.com
Hi Bo,

Would you please shine some light into one of those "black boxes"?
In particular, I believe the alignment statistics (*.cnt files under *.STAT folder) may be useful for the end user.
I see 17 (undocumented) lines there.
Would it be possible to annotate them?

Thanks!

Oleg

Bo Li

unread,
Nov 3, 2014, 7:07:25 PM11/3/14
to rsem-...@googlegroups.com
Hi Oleg,

Sorry to reply you late.

Sure thing. Please see below for an annotation of the *.cnt file. I'll
include it in next RSEM release.

Best,
Bo


# '#' marks the start of comments (till the end of the line)
# *.cnt file contains alignment statistics based purely on the alignment
results obtained from aligners
N0 N1 N2 N_tot # N0, number of unalignable reads; N1, number of
alignable reads; N2, number of filtered reads due to too
# many alignments; N_tot = N0 + N1 + N2

nUnique nMulti nUncertain # nUnique, number of reads aligned
uniquely to a gene; nMulti, number of reads aligned to
# multiple genes; nUnique + nMulti = N1;
# nUncertain, number of reads aligned to
multiple locations in the given reference sequences,
# which include isoform-level multi-mapping
reads
nHits read_type # nHits, number of total alignments.
# read_type: 0, single-end read, no quality
score; 1, single-end read, with quality score;
# 2, paired-end read, no quality score; 3,
paired-end read, with quality score

# The next section counts reads by the number of alignments they have.
Each line contains two values separated by a TAB character. The first
value is number of alignments. 'Inf' refers to reads filtered due to too
many alignments. The second value is the number of reads that contain
such many alignments

0 N0
...
number_of_alignments number_of_reads_with_that_many_alignments
...
Inf N2
>>> www.oeb.harvard.edu/faculty/giribet/ [1]
>>>
>>> --
>>> RSEM website: http://deweylab.biostat.wisc.edu/rsem/ [2]
>>> ---
>>> You received this message because you are subscribed to the Google
>> Groups
>>> "RSEM Users" group.
>>> To unsubscribe from this group and stop receiving emails from it,
>> send an
>>> email to rsem-users+...@googlegroups.com.
>>> To post to this group, send email to rsem-...@googlegroups.com.
>>> Visit this group at
>> http://groups.google.com/group/rsem-users?hl=en-US [3].
>>>
>>>
>>>
>
> --
> RSEM website: http://deweylab.biostat.wisc.edu/rsem/ [4]
> ---
> You received this message because you are subscribed to the Google
> Groups "RSEM Users" group.
> To unsubscribe from this group and stop receiving emails from it,
> send an email to rsem-users+...@googlegroups.com.
> To post to this group, send email to rsem-...@googlegroups.com.
> Visit this group at http://groups.google.com/group/rsem-users [5].
>
>
> Links:
> ------
> [1]
> http://www.google.com/url?q75http%3A%2F%2Fwww.oeb.harvard.edu%2Ffaculty%2Fgiribet%2F46sa75D46sntz75146usg75AFQjCNHQW1tAuvm0H0iWRhwz_smCmiWNug
> [2]
> http://www.google.com/url?q75http%3A%2F%2Fdeweylab.biostat.wisc.edu%2Frsem%2F46sa75D46sntz75146usg75AFQjCNEU-wRL_aNCO7ziCWa-Y12BbbySXw
> [3] http://groups.google.com/group/rsem-users?hl75en-US
> [4] http://deweylab.biostat.wisc.edu/rsem/
> [5] http://groups.google.com/group/rsem-users

celine noirot

unread,
Aug 23, 2017, 11:54:11 AM8/23/17
to RSEM Users
Hi,
So why nUnique is not equal to the line number of alignment 1 ?

It's seems that number of unique read alignement is nUnique - nUncertain.
Could you confirm it ?
Best
Céline
Reply all
Reply to author
Forward
0 new messages