N0 = number of unalignable reads;
N1 = number of alignable reads;
N2 = number of filtered reads due
to too many
alignments;
N= N0 + N1 + N2
Giving an example of one of my samples, this is the first line of sample_name.cnt for that sample:
0 22910113 0 22910113
so N0 = 0, N1 = 22910113, N2 = 0 and Ntotal=22910113
I find this result odd because the total number of read pairs for this sample is 28,805,265 (from the fastq file). Assuming that the numbers inside .cnt file concern read pairs, I expect that N1 would be smaller than the total no. of input read pairs since I don't expect that all reads align. However, if there is some amount of reads that could not be mapped, shouldn't N0 be different from 0 and Nt equal to the number of input read pairs? Did the meaning of these 4 numbers changed with RSEM version?
Also, summing the expected_counts from the sample_name.gene.results, I get a library size of 11433722.93 (EdgeR outputs this number too, since it sums the expected counts column to produce the library size). This number is approximately half of N1. Now this is puzzling because it either means:
1) that N1 is actually the total number of reads mapped (no. R1 + no. R2). In that case there is a large amount of reads not mapping (22910113 / 28,805,265 * 2 ~ 0.40 --> 40 % mapping success) which is not good...
2) that N1 still represents read pairs, but the expected counts are wrong (?).
It is weird because I've done this math before to calculate mapping success and the N always matched the number of input read pairs, and N1 always matched library size.
Can I get some help solving this out?
Thank you in advance,
Mafalda
--
RSEM website: http://deweylab.biostat.wisc.edu/rsem/
---
You received this message because you are subscribed to the Google Groups "RSEM Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to rsem-users+...@googlegroups.com.
To post to this group, send email to rsem-...@googlegroups.com.
Visit this group at http://groups.google.com/group/rsem-users.
You received this message because you are subscribed to a topic in the Google Groups "RSEM Users" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/rsem-users/L4IQPTbNKos/unsubscribe.
To unsubscribe from this group and all its topics, send an email to rsem-users+...@googlegroups.com.
| Sample: | 3112_XAB_variable | |
| With rsem scripts | With trinity script | |
| N input read pairs | 10,064,999 | 10,064,999 |
| N0 | 6,028,072 | 0 |
| N1 | 4,036,927 | 6,946,945 |
| N2 | 0 | 0 |
| Nt | 10,064,999 | 6,946,945 |
| library size (sum expected counts) | 3,945,740 | 3,765,312 |
To unsubscribe from this topic, visit https://groups.google.com/d/topic/rsem-users/L4IQPTbNKos/unsubscribe.
To unsubscribe from this group and all its topics, send an email to rsem-users+...@googlegroups.com.
To post to this group, send email to rsem-...@googlegroups.com.
Visit this group at http://groups.google.com/group/rsem-users.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/rsem-users/L4IQPTbNKos/unsubscribe.
To unsubscribe from this group and all its topics, send an email to rsem-users+...@googlegroups.com.
To post to this group, send email to rsem-...@googlegroups.com.
Visit this group at http://groups.google.com/group/rsem-users.