If quality trimming or normalization are desired, these processes should be performed prior to aligning the reads to the genome, as Trinity will only use the reads as they exist in the coordinate-sorted bam file provided to it.
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-u...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
Thanks so much for your instantaneous reply and your explanations. I will trim adapters and perform gentle quality trimming, as you suggested, before starting the normalization step you outlined in the manual. Then I think I am ready to align the reads and begin the assembly.
In addition, I would also like to follow the de novo assembly path. As I have more than a billion read pairs, memory is a constraint. I think I can start with the normalization and use the remaining read pairs for both de novo and alignment to the reference, correct?
Also, do you have any recommendations about error correcting the reads (or not)? There is some evidence in the literature that this might be useful, especially for high coverage data like mine, so any advice is appreciated.
Thanks again for making these great tools available to the research community!
Thomas