--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrnaseq-users@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
Hi Tevor,If your experience is going to be anything like mine was, it's a long painful back-and-forth process with NCBI to get a transcriptome assembly submitted. If NCBI would make their full pipeline available to users to run independently and resolve issues directly before submitting, then we could automate the clean-up process, but that wasn't the case for when I did this months ago.My guess is that if you redo the assembly after more aggressive trimming, it'll still find some adaptors or sequences that it thinks are adaptors and reject.I'd suggest asking for the full report of required corrections, including regions matching adaptors and obvious contaminants, and then remove those sequences directly (trim ends of offending contigs, mask internal regions with N characters if they'll allow that, or remove offending sequences altogether), then resubmit. If they come back with a bunch of new offending sequences and regions that weren't in the original report, you'll have grounds to complain.I'm still suffering PTSD from my experience here.best wishes,~b
On Wed, Jul 5, 2017 at 5:33 PM, <krabb...@gmail.com> wrote:
Trinity Users,I am trying to deposit Trinity assemblies in NCBI's TSA database, but there are significant hits against the UniVec database and the assemblies are not accepted by NCBI. I ran Trimmomatic as part of the Trinity pipeline, but apparently some Illumina vectors made it through the trimming step. NCBI's comment is to remove the adapters from raw reads and start over with the assembly. I am wondering if this is a common issue for Trinity pipeline users or perhaps I have missed an important step in the pipeline? Minimally, hopefully this message saves someone else from similar problems.Thank you,Trevor
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
Hi Brian,Thank you for your reply. To follow up, per NCBI's suggestion, I have run the following on raw reads to identify sequences that match the UniVec database:blastn -task blastn -reward 1 -penalty -3 -evalue 700 -searchsp 1750000000000 -dust yes -gapopen 3 -gapextend 3 -query myreads -db UniVecIndeed, many raw reads (and contigs) have adapter contamination that was missed by Trimmomatic. I have run CutAdapt/TrimGalore (with default parameter settings) and it seems to resolve this problem. I do not know if adapter contamination is a ubiquitous problem, but Trinity users might consider trying TrimGalore instead of Trimmomatic. The next step will be to see how adapter trimming affects assembly results.Hopefully someone else finds this useful.Best,Trevor
On Wednesday, July 5, 2017 at 8:35:55 PM UTC-4, Brian Haas wrote:
Hi Tevor,If your experience is going to be anything like mine was, it's a long painful back-and-forth process with NCBI to get a transcriptome assembly submitted. If NCBI would make their full pipeline available to users to run independently and resolve issues directly before submitting, then we could automate the clean-up process, but that wasn't the case for when I did this months ago.My guess is that if you redo the assembly after more aggressive trimming, it'll still find some adaptors or sequences that it thinks are adaptors and reject.I'd suggest asking for the full report of required corrections, including regions matching adaptors and obvious contaminants, and then remove those sequences directly (trim ends of offending contigs, mask internal regions with N characters if they'll allow that, or remove offending sequences altogether), then resubmit. If they come back with a bunch of new offending sequences and regions that weren't in the original report, you'll have grounds to complain.I'm still suffering PTSD from my experience here.best wishes,~b
On Wed, Jul 5, 2017 at 5:33 PM, <krabb...@gmail.com> wrote:
Trinity Users,I am trying to deposit Trinity assemblies in NCBI's TSA database, but there are significant hits against the UniVec database and the assemblies are not accepted by NCBI. I ran Trimmomatic as part of the Trinity pipeline, but apparently some Illumina vectors made it through the trimming step. NCBI's comment is to remove the adapters from raw reads and start over with the assembly. I am wondering if this is a common issue for Trinity pipeline users or perhaps I have missed an important step in the pipeline? Minimally, hopefully this message saves someone else from similar problems.Thank you,Trevor
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsubscribe...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
--
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrnaseq-users@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
To post to this group, send email to trinityrnaseq-users@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrnaseq-users@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
--
Dr. Mark A. Chapman------------------------------------Centre for Biological Sciences
University of SouthamptonLife Sciences Building 85
Highfield Campus
Southampton
SO17 1BJ
--