now, i use trinity2.0.6 to assembly my plant transcriptome, my data is PE-end, specific strand, read length:125bp, three repeat, the number of left reads is about 67,000,000 in each sample. total clean data is 48G (fastq). after Trinity assembler finished its assembly, i found too many genes and transcripts in the Trinity.fasta files, abot 300,000 genes and 450,000 trabscripts. who can tell me why and how to deal with? my cmd is "Trinity --seqType fq --max_memory 130G --SS_lib_type RF --left 1_1.clean.fq,2_1.clean.fq,3_1.clean.fq --right 1_2.clean.fq, 2_2.clean.fq,3_2.clean.fq --CPU 6 -output trinity_out/"thanks for all at first.
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-u...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
I think this might address the issue:best,~brian
On Wed, Apr 15, 2015 at 9:10 PM, Qiuzhong Zhou <cnzq...@gmail.com> wrote:
now, i use trinity2.0.6 to assembly my plant transcriptome, my data is PE-end, specific strand, read length:125bp, three repeat, the number of left reads is about 67,000,000 in each sample. total clean data is 48G (fastq). after Trinity assembler finished its assembly, i found too many genes and transcripts in the Trinity.fasta files, abot 300,000 genes and 450,000 trabscripts. who can tell me why and how to deal with? my cmd is "Trinity --seqType fq --max_memory 130G --SS_lib_type RF --left 1_1.clean.fq,2_1.clean.fq,3_1.clean.fq --right 1_2.clean.fq, 2_2.clean.fq,3_2.clean.fq --CPU 6 -output trinity_out/"thanks for all at first.
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-u...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
Hi Qiuzhong,I would agree with Dan's point that some of your isoforms might be very lowly expressed and/or poorly supported, so a little filtering is often useful, and getting an idea for what your data look like is often a good idea.Try modifying a few of the parameters in the trinity command line. They will probably not make a huge change, but may remove some of the more poorly supported 'genes' and isoforms. In my command line I often use:--min_kmer_cov 2 --max_internal_gap_same_path 15 --max_diffs_same_path 4or variants on this.Also what is it you are assembling? is it likely to be polyploid? is there a genome sequence available?Thanks, Mark
On 16 April 2015 at 08:22, Qiuzhong Zhou <cnzq...@gmail.com> wrote:
but the lowly expressed transcripts maybe also have an important role, i do not want to filter out them. do you have any other method do for this issue. thanks
On Thursday, April 16, 2015 at 2:09:35 PM UTC+8, Dan Browne wrote:
The point is that you can filter out lowly expressed transcripts essentially as noise in your assembly, thus leaving you with a smaller set of perhaps more significant transcripts, as they are more strongly supported by the reads than the transcripts that you filtered out.
On Wednesday, April 15, 2015 at 9:28:18 PM UTC-5, Qiuzhong Zhou wrote:
thanks very much, but i think the url "http://trinityrnaseq.github.io/analysis/abundance_estimation.html#how_many_expr" can not solve my issue. my issue is why i have so much genes and transcripts, is not the abundance estimation of the assembly result.
On Thursday, April 16, 2015 at 9:56:25 AM UTC+8, Brian Haas wrote:
I think this might address the issue:best,~brian
On Wed, Apr 15, 2015 at 9:10 PM, Qiuzhong Zhou <cnzq...@gmail.com> wrote:
now, i use trinity2.0.6 to assembly my plant transcriptome, my data is PE-end, specific strand, read length:125bp, three repeat, the number of left reads is about 67,000,000 in each sample. total clean data is 48G (fastq). after Trinity assembler finished its assembly, i found too many genes and transcripts in the Trinity.fasta files, abot 300,000 genes and 450,000 trabscripts. who can tell me why and how to deal with? my cmd is "Trinity --seqType fq --max_memory 130G --SS_lib_type RF --left 1_1.clean.fq,2_1.clean.fq,3_1.clean.fq --right 1_2.clean.fq, 2_2.clean.fq,3_2.clean.fq --CPU 6 -output trinity_out/"thanks for all at first.
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsubscribe...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
--
Day 03: 28,523,795 pairs
Day 05: 18,110,358 pairs
Day 08: 3,482,488 pairs
Day 13: 11,599,744 pairs
Day 22: 5,284,506 pairs
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-u...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.