Hi,
Thank you very much for developing and continuing to support Corset; it is very useful for my experiment. I have some questions, and I hope you don't mind helping me understand. I have 2 treatments (A and B) with 3 replicates each. After running Trinity, I used Salmon to quantify read abundance and followed the Corset instructions. I got 2 output files: clusters.txt and counts.txt.
I wanted to make sure the samples are listed in the correct order, so I looked at quant.sf from Salmon and tried to match each sample's counts back to the Corset output. However, the number of reads reported by Salmon in quant.sf and the counts in counts.txt are not the same (please see some of the results below), and I am not sure whether I did something wrong. Please correct me if I am wrong, but my understanding is that Corset sums the reads from all transcripts in each cluster, rounds them in some way, and reports them as raw counts, so they should be the same as (or very close to) the Salmon output. Or are the values reported in counts.txt normalised?
Thank you very much; I look forward to hearing from you.
Best regards
Nad
-----------------
Transcript 1
-----------------
(1) Result from clusters.txt
TRINITY_DN22615_c0_g1_i1 is the only transcript in Cluster-6962.59096
(2) Results from counts.txt
                     A1   A2   A3   B1   B2   B3
Cluster-6962.59096  140  130   83  219  233  195
(3) Results from quant.sf (Salmon)
Sample  Name                      Length  EffectiveLength  TPM        NumReads
A1      TRINITY_DN22615_c0_g1_i1  237     61.993           25.864129  154.199
A2      TRINITY_DN22615_c0_g1_i1  237     58.081           29.414225  140.224
A3      TRINITY_DN22615_c0_g1_i1  237     56.194           18.605301   88.449
B1      TRINITY_DN22615_c0_g1_i1  237     56.120           48.992792  219.375
B2      TRINITY_DN22615_c0_g1_i1  237     55.068           58.604391  250.620
B3      TRINITY_DN22615_c0_g1_i1  237     63.627           36.524794  199.230
-----------------
Transcript 2
-----------------
(1) Result from clusters.txt
TRINITY_DN77576_c0_g1_i1 is the only transcript in Cluster-43219.0
(2) Results from counts.txt
                 A1  A2  A3  B1  B2  B3
Cluster-43219.0  26   2   8   1  27   0
(3) Results from quant.sf (Salmon)
Sample  Name                      Length  EffectiveLength  TPM       NumReads
A1      TRINITY_DN77576_c0_g1_i1  793     565.197          0.433184  23.546
A2      TRINITY_DN77576_c0_g1_i1  793     550.086          0.044297   2.000
A3      TRINITY_DN77576_c0_g1_i1  793     546.198          0.173130   8.000
B1      TRINITY_DN77576_c0_g1_i1  793     548.196          0.022862   1.000
B2      TRINITY_DN77576_c0_g1_i1  793     541.401          0.642176  27.000
B3      TRINITY_DN77576_c0_g1_i1  793     563.757          0.000000   0.000
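
-----------------
Comparison sketch
-----------------
To make the comparison concrete, here is a minimal Python sketch of the check I did by hand: sum Salmon's NumReads over the transcripts in each cluster and subtract the Corset count for the same sample. The values are typed in from the results above rather than parsed from the files, the function name sum_salmon_by_cluster is only illustrative, and it assumes the columns of counts.txt are in the order A1 to B3.

```python
# Values copied from this post; in practice they would be read from
# clusters.txt, counts.txt and each sample's quant.sf.
samples = ["A1", "A2", "A3", "B1", "B2", "B3"]

# clusters.txt: transcript -> cluster
transcript_to_cluster = {
    "TRINITY_DN22615_c0_g1_i1": "Cluster-6962.59096",
    "TRINITY_DN77576_c0_g1_i1": "Cluster-43219.0",
}

# counts.txt: cluster -> counts per sample (assumed column order A1..B3)
corset_counts = {
    "Cluster-6962.59096": [140, 130, 83, 219, 233, 195],
    "Cluster-43219.0": [26, 2, 8, 1, 27, 0],
}

# quant.sf: transcript -> NumReads per sample
salmon_numreads = {
    "TRINITY_DN22615_c0_g1_i1": [154.199, 140.224, 88.449, 219.375, 250.620, 199.230],
    "TRINITY_DN77576_c0_g1_i1": [23.546, 2.000, 8.000, 1.000, 27.000, 0.000],
}


def sum_salmon_by_cluster(numreads, mapping, n_samples):
    """Sum NumReads over all transcripts assigned to each cluster (illustrative helper)."""
    totals = {}
    for transcript, reads in numreads.items():
        cluster = mapping[transcript]
        running = totals.setdefault(cluster, [0.0] * n_samples)
        for i, value in enumerate(reads):
            running[i] += value
    return totals


salmon_totals = sum_salmon_by_cluster(salmon_numreads, transcript_to_cluster, len(samples))

# Salmon total minus Corset count, per cluster and sample
differences = {
    cluster: [round(s - c, 3) for s, c in zip(salmon_totals[cluster], counts)]
    for cluster, counts in corset_counts.items()
}

for cluster, diffs in differences.items():
    for sample, diff in zip(samples, diffs):
        print(f"{cluster}\t{sample}\tSalmon - Corset = {diff}")
```

In these two clusters, the Corset count matches Salmon exactly wherever NumReads is a whole number and differs wherever NumReads is fractional.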