extractSequences.sh on gzipped input files

18 views

Skip to first unread message

You Me

unread,

Apr 6, 2020, 2:57:02 PM4/6/20

to CLARK Users

Hello,

I am trying to pull out the reads from my input file that were classified to a specific taxon. My input file is a gzipped fastq file, and I was able to run classify_metagenome.sh on this successfully with the --gzipped option. However, when I tried to run extractSequences.sh using this same input file, I encountered the following error:

Failed to recognize the file format: It does not look like a fastq/fasta format.

Is there an option to run extractSequences.sh on gzipped inputs? I would prefer not to unzip my input file if possible, as the compressed file is already very large (several GB).

Thank you!

Rachid

unread,

Apr 8, 2020, 11:58:34 PM4/8/20

to CLARK Users

Hi,

As for today, this script accepts only fast/fastq files. Please go ahead and uncompress your file behind running it,

Thank you,

Best,

Rachid

Reply all

Reply to author

Forward

0 new messages