Maximum sequence length


Lisle Mose

Oct 10, 2014, 11:07:07 AM
to rna-...@googlegroups.com
Hello,

Is there a maximum input sequence length that STAR can handle?  I'm attempting to align assembled contigs (of varying length), but I get the following error message in stderr:

EXITING because of FATAL ERROR in input reads: unknown file format: the read ID should start with @ or > 

Oct 10 10:45:10 ...... FATAL ERROR, exiting


If I filter out contigs of length 10k or greater, the error goes away. I compiled STAR using make STARlong. Is there a way to map these longer contigs using STAR?

Thanks,
Lisle

Alexander Dobin

Oct 13, 2014, 12:44:41 PM
to rna-...@googlegroups.com
Hi Lisle,

When compiled as STARlong, STAR should be able to handle sequences up to 50kb.
STAR can handle "multi-line" FASTA (i.e. a sequence split across multiple lines), but not multi-line FASTQ.
Please send me a small portion of your file for which you can still see the error.

Cheers
Alex
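
For anyone who wants to check a contig file against the two points above before sending a sample, here is a minimal Python sketch. It is not part of STAR. The names read_fasta, long_records and max_len are hypothetical, and the 50,000 default assumes that "50kb" means 50,000 bases. It joins wrapped FASTA sequence lines and lists the records that are longer than the limit.

```python
import io


def read_fasta(handle):
    """Yield (header, sequence) pairs, joining sequence lines that are wrapped."""
    header, chunks = None, []
    for line in handle:
        line = line.rstrip("\r\n")
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is None:
            raise ValueError("sequence data found before the first '>' header")
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def long_records(handle, max_len=50_000):
    """Return (header, length) for every record longer than max_len.

    max_len is an assumed limit (50 kb read as 50,000 bases), not a value
    taken from the STAR source code.
    """
    return [(h, len(s)) for h, s in read_fasta(handle) if len(s) > max_len]


# Small in-memory example; a real file would be opened with open(path).
demo = io.StringIO(">contig1\nACGT\nAC\n>contig2\nGG\n")
print(long_records(demo, max_len=4))
```

With a real file, passing open("contigs.fa") (a placeholder path) instead of the StringIO object gives the list of contigs that exceed the limit, which may help narrow down a small portion of the file that still reproduces the error.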