ERROR_00201: unknown file format: the read ID should start with @ or >

Pepinos

Dec 21, 2014, 5:48:37 PM

to rna-...@googlegroups.com

Hello,

I'm having trouble aligning the attached test files. At first I thought the problem came from multiline FASTQ files, but after converting them to single-line files I still get the same error. The attached minimal test case reproduces it. Am I missing something obvious?

Thanks,
Petros

test1.R1.sline.fastq.gz

test1.R2.sline.fastq.gz

Log.progress.out

Log.out

Aligned.out.sam

Alexander Dobin

Jan 6, 2015, 4:56:48 PM

to rna-...@googlegroups.com

Hi Petros,

gzip -c compresses the files, but STAR needs them decompressed as it reads them, i.e.

--readFilesCommand gunzip -c

OR, if it's installed on your system,

--readFilesCommand zcat
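To see why a still-compressed input triggers this error, here is a minimal Python sketch. It relies only on the wording of the error message (the first character of the input must be @ or >) and on the fact that gzip files begin with the bytes 0x1f 0x8b. The function names are hypothetical and not part of STAR.

```python
import gzip

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def first_raw_byte(path):
    """Return the first byte of the file exactly as stored on disk."""
    with open(path, "rb") as raw:
        return raw.read(1)


def first_decoded_byte(path):
    """Return the first byte of the read data, decompressing gzip input if needed."""
    with open(path, "rb") as raw:
        is_gzip = raw.read(2) == GZIP_MAGIC
    opener = gzip.open if is_gzip else open
    with opener(path, "rb") as handle:
        return handle.read(1)


def looks_like_reads(path):
    """True if the decoded data starts with '@' (FASTQ) or '>' (FASTA)."""
    return first_decoded_byte(path) in (b"@", b">")
```

For a gzipped FASTQ file, `first_raw_byte` returns `b"\x1f"`, which is neither @ nor >, while `first_decoded_byte` returns `b"@"`. That is consistent with the error going away once the files are decompressed on the fly.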

Cheers

Alex

kenny...@gmail.com

Dec 2, 2017, 9:36:18 PM

to rna-star

Can you tell me how to convert multiline FASTQ files to single-line files?

On Monday, December 22, 2014 at 6:48:37 AM UTC+8, Pepinos wrote:

Alexander Dobin

Dec 3, 2017, 3:24:30 PM

to rna-star

Hi @kennyzhao94

Assuming that your quality scores (and sequences) do not contain @ or + characters, you need to check for lines that start with "@" (the read ID) and lines that start with "+" (the separator between the sequence line(s) and the quality score line(s)).
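A minimal Python sketch of this conversion follows. It uses the "@" and "+" markers as described, with one deliberate difference: it does not look for the next "@" to end a record, but reads quality lines until their total length matches the sequence length. That way a quality line that happens to begin with @ or + is still handled. The function names are hypothetical, and the sketch assumes well-formed input in which each record's quality string is as long as its sequence.

```python
def unwrap_fastq(lines):
    """Yield (read_id, sequence, quality) from FASTQ lines whose sequence
    and quality strings may be wrapped across several lines."""
    it = (line.rstrip("\r\n") for line in lines)
    for line in it:
        if not line:
            continue  # tolerate blank lines between records
        if not line.startswith("@"):
            raise ValueError(f"expected a read ID starting with '@', got {line!r}")
        read_id = line

        # Sequence lines run until the '+' separator line.
        seq_parts = []
        for line in it:
            if line.startswith("+"):
                break
            seq_parts.append(line)
        else:
            raise ValueError(f"no '+' separator found for {read_id!r}")
        sequence = "".join(seq_parts)

        # Quality lines run until they cover the whole sequence.
        qual_parts, qual_len = [], 0
        while qual_len < len(sequence):
            line = next(it, None)
            if line is None:
                raise ValueError(f"truncated quality string for {read_id!r}")
            qual_parts.append(line)
            qual_len += len(line)
        quality = "".join(qual_parts)
        if len(quality) != len(sequence):
            raise ValueError(f"quality/sequence length mismatch for {read_id!r}")
        yield read_id, sequence, quality


def to_single_line(lines):
    """Yield each record as one four-line FASTQ block (ID, sequence, '+', quality)."""
    for read_id, sequence, quality in unwrap_fastq(lines):
        yield f"{read_id}\n{sequence}\n+\n{quality}\n"
```

To convert a file, open the input in text mode (for gzipped input, `gzip.open(path, "rt")`), pass the handle to `to_single_line`, and write each yielded block to the output file.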