sequences shorter than 598bp?

Quang Tran Ho

Jan 15, 2017, 12:10:49 AM
to TaxonDNA
Hi,

I am using SequenceMatrix 1.8 to concatenate pine sequences downloaded from NCBI. However, most of the sequences are shorter than the 598 bp that SequenceMatrix requires.

How can I change the parameters so that I can concatenate them?

Regards

Quang Tran
[Attachment: Screen Shot 2017-01-14 at 20.56.58.png]

Gaurav Vaidya

Jan 16, 2017, 2:31:18 PM
to TaxonDNA
Hi Quang Tran!
Unfortunately, SequenceMatrix won't let you concatenate unaligned sequences -- we didn't want someone spending all afternoon carefully concatenating all their sequences before realizing that they weren't aligned, making them useless for downstream analysis!

Of course, you don't have to align the sequences -- you could pad the end of each sequence with 'N's or '-'s (gaps) so that they're all the same length, but this will make downstream analysis harder, if not impossible. If you want a quick-and-dirty way of doing this, one technique I know is to import your files into Species Identifier (http://gaurav.github.io/taxondna) and then export them as Nexus from there, which will pad the ends of your sequences with gaps. You will have to do this with each input file individually, which might be okay if you don't have too many of them. If you do have a lot of files, you'll need to use some other tool -- I don't know whether SeqBuddy (https://github.com/biologyguy/BuddySuite/wiki/SeqBuddy) or one of the other tools in that suite can do this, but if not, you should get in touch with its creator and let him know that this is a feature you'd like!
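
If you have many files, the padding step itself could also be scripted. Here is a minimal sketch in plain Python, assuming your input files are FASTA; the function names (`read_fasta`, `pad_records`, `write_fasta`) are made up for illustration and are not part of SequenceMatrix, Species Identifier, or SeqBuddy. Note that it only pads the ends of the sequences with gaps: it does not align anything, so the caveat about downstream analysis still applies.

```python
def read_fasta(path):
    """Parse a FASTA file into a list of (name, sequence) tuples."""
    records = []
    name, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                # A new header closes the previous record, if there was one.
                if name is not None:
                    records.append((name, "".join(chunks)))
                name, chunks = line[1:], []
            elif name is not None:
                chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def pad_records(records, pad_char="-"):
    """Right-pad every sequence with pad_char to the longest length.

    This only equalizes lengths; it does NOT align the sequences.
    """
    longest = max((len(seq) for _, seq in records), default=0)
    return [(name, seq.ljust(longest, pad_char)) for name, seq in records]


def write_fasta(records, path, width=60):
    """Write (name, sequence) tuples as FASTA, wrapping at `width` columns."""
    with open(path, "w") as handle:
        for name, seq in records:
            handle.write(f">{name}\n")
            for start in range(0, len(seq), width):
                handle.write(seq[start:start + width] + "\n")


# Small in-memory demonstration with made-up sequences.
demo = pad_records([("seq1", "ACGTAC"), ("seq2", "ACG")])
print(demo)
```

For a real file you would chain the three calls, for example `write_fasta(pad_records(read_fasta("input.fasta")), "padded.fasta")`, where both file names are placeholders. Passing `pad_char="N"` pads with 'N's instead of gaps.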

Hope that helps!

cheers,
Gaurav
 