>head infile
comp1
comp2
comp3
...
head Trinity.fasta
>comp1
ACTCCCTT
>comp2
ACACTGGGT
>compN
cat infile | awk '{print $1}' | grep -A1 -f - Trinity.fasta > good.genes.fa
Thank you!
Cheryl
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-u...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at http://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
$ python extract_unigene_isoforms -q gene_list.txt -f Trinity_assembly.fa -o output_file.fa
Cheryl
cat Trinity_FCF_FEF_DEgenes_mod1.txt | awk '{print $1}' | grep -A0 -f - Trinity_modif4.txt >Trinity_FCF_FEF_DEgenes_list
but it does not extract any sequences.
I have tested steps along the road and just running
cat Trinity_FCF_FEF_DEgenes_mod1.txt | awk '{print $1}'
produces the DE genes list so I suspect the problem lies in the grep. Running single gene entries with grep -A0 "entry" Trinity_modif4.txt works fine.
Can anyone see where I am going wrong.
Thanks
John
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-u...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
Are you trying to extract gene sequences from the trinity.fasta? Genes don't exist they are a combination of expression values from the transcripts. All that's in the fasta file will be transcript sequences.
Best wishes, Mark
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrnaseq-users@googlegroups.com.
Are you trying to extract gene sequences from the trinity.fasta? Genes don't exist they are a combination of expression values from the transcripts. All that's in the fasta file will be transcript sequences.
Best wishes, Mark
On 28 Sep 2016 19:56, <hex...@gmail.com> wrote:
Hi Brian,--Is it possible to use acc_list_to_fasta_entries.pl to extract isoform (or gene) sequences from trinity.fasta for differentially expressed genes? I tried acc_list_to_fasta_entries.pl, and it did not work.I get list of DE genes (e.g. TR#|c#_g#), and the names of sequences in trinity.fasta are TR#|c#_g#_i#.Dan Browne provided a way to do that, but I have problem to install pyfaidx in the server I use.Thanks
在 2016年4月20日星期三 UTC-4下午7:11:50,Brian Haas写道:Lots of ways to do this, of course.Note, there's a script in the latest Trinity release:trinityrnaseq/util/misc/acc_list_to_fasta_entries.plusage: acc_list_to_fasta_entries.pl acc.list.txt file.fastathat'll extract the list of transcripts from a fasta file based on a list of accessions.best,~b
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Is it possible to extract all transcripts of selected genes?
在 2016年9月28日星期三 UTC-4下午4:59:31,Mark Chapman写道:
Are you trying to extract gene sequences from the trinity.fasta? Genes don't exist they are a combination of expression values from the transcripts. All that's in the fasta file will be transcript sequences.
Best wishes, Mark
On 28 Sep 2016 19:56, <hex...@gmail.com> wrote:
Hi Brian,--Is it possible to use acc_list_to_fasta_entries.pl to extract isoform (or gene) sequences from trinity.fasta for differentially expressed genes? I tried acc_list_to_fasta_entries.pl, and it did not work.I get list of DE genes (e.g. TR#|c#_g#), and the names of sequences in trinity.fasta are TR#|c#_g#_i#.Dan Browne provided a way to do that, but I have problem to install pyfaidx in the server I use.Thanks
在 2016年4月20日星期三 UTC-4下午7:11:50,Brian Haas写道:Lots of ways to do this, of course.Note, there's a script in the latest Trinity release:trinityrnaseq/util/misc/acc_list_to_fasta_entries.plusage: acc_list_to_fasta_entries.pl acc.list.txt file.fastathat'll extract the list of transcripts from a fasta file based on a list of accessions.best,~b
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsubscribe...@googlegroups.com.
To post to this group, send email to trinityrn...@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "trinityrnaseq-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to trinityrnaseq-users+unsub...@googlegroups.com.
To post to this group, send email to trinityrnaseq-users@googlegroups.com.
Visit this group at https://groups.google.com/group/trinityrnaseq-users.
For more options, visit https://groups.google.com/d/optout.