ERROR_00201: unknown file format: the read ID should start with @ or >


Pepinos

Dec 21, 2014, 5:48:37 PM
to rna-...@googlegroups.com
Hello,

I'm having trouble aligning the attached test files. At first I thought the problem came from multiline FASTQ files, but after converting them to single-line files I still get the same error. The attached minimal test case reproduces it. Am I missing something obvious?

Thanks,
Petros
test1.R1.sline.fastq.gz
test1.R2.sline.fastq.gz
Log.progress.out
Log.out
Aligned.out.sam

Alexander Dobin

Jan 6, 2015, 4:56:48 PM
to rna-...@googlegroups.com
Hi Petros,

gzip -c compresses its input, but STAR needs the read files decompressed, i.e.
--readFilesCommand gunzip -c
OR, if it's installed on your system,
--readFilesCommand zcat
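
As a quick sanity check before aligning, the sketch below looks at what a read file starts with, both as raw bytes and after decompression. It assumes the error appears because the stream STAR receives is still gzip-compressed, so its first byte is 0x1f rather than @ or >. The function names (raw_first_bytes, first_read_char, looks_like_reads) are made up for illustration and are not part of STAR.

```python
import gzip

# Every gzip stream begins with these two bytes.
GZIP_MAGIC = b"\x1f\x8b"


def raw_first_bytes(path):
    """Return the first two bytes of the file exactly as stored on disk."""
    with open(path, "rb") as fh:
        return fh.read(2)


def first_read_char(path):
    """Return the first character of the reads, decompressing gzip input if needed."""
    opener = gzip.open if raw_first_bytes(path) == GZIP_MAGIC else open
    with opener(path, "rb") as fh:
        return fh.read(1).decode("latin-1")


def looks_like_reads(path):
    """True if the (decompressed) file starts with '@' (FASTQ) or '>' (FASTA)."""
    return first_read_char(path) in ("@", ">")
```

If raw_first_bytes returns the gzip magic bytes for a file, the file needs a decompressing --readFilesCommand; if looks_like_reads is still False after decompression, the problem lies in the file contents rather than the compression.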

Cheers
Alex

kenny...@gmail.com

Dec 2, 2017, 9:36:18 PM
to rna-star
Can you tell me how to convert multiline FASTQ files to single-line files?

On Monday, December 22, 2014 at 6:48:37 AM UTC+8, Pepinos wrote:

Alexander Dobin

Dec 3, 2017, 3:24:30 PM
to rna-star
Hi @kennyzhao94

Assuming that your quality scores (and sequences) do not contain @ or + characters, you need to check for lines that start with "@" (the read ID) and lines that start with "+" (the separator between the sequence line(s) and the quality score line(s)). The lines between these markers can then be joined into a single line.
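
A minimal Python sketch of this idea follows. It uses the same "@" and "+" markers, but to decide where the quality block ends it counts characters until the quality string is as long as the sequence, so it does not depend on quality lines never starting with "@" or "+". The function name unwrap_fastq is made up for illustration.

```python
def unwrap_fastq(lines):
    """Yield (read_id, sequence, separator, quality) tuples from FASTQ lines,
    with wrapped sequence and quality lines joined into single strings."""
    it = iter(lines)
    for line in it:
        header = line.rstrip("\r\n")
        if not header:
            continue  # tolerate blank lines between records
        if not header.startswith("@"):
            raise ValueError(f"expected read ID starting with '@', got {header!r}")

        # Sequence lines run until the '+' separator line.
        seq_parts = []
        plus = None
        for line in it:
            line = line.rstrip("\r\n")
            if line.startswith("+"):
                plus = line
                break
            seq_parts.append(line)
        if plus is None:
            raise ValueError(f"no '+' separator found for read {header!r}")
        seq = "".join(seq_parts)

        # Quality lines run until as many characters as the sequence are read.
        qual_parts = []
        qual_len = 0
        while qual_len < len(seq):
            line = next(it, None)
            if line is None:
                raise ValueError(f"truncated quality string for read {header!r}")
            line = line.rstrip("\r\n")
            qual_parts.append(line)
            qual_len += len(line)
        qual = "".join(qual_parts)
        if len(qual) != len(seq):
            raise ValueError(f"quality/sequence length mismatch for read {header!r}")

        yield header, seq, plus, qual
```

To write a single-line file, open the input in text mode (with gzip.open(path, "rt") for compressed files), pass the file object to unwrap_fastq, and write each tuple joined with newlines.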

Cheers
Alex