extractSequences.sh on gzipped input files

18 views
Skip to first unread message

You Me

unread,
Apr 6, 2020, 2:57:02 PM4/6/20
to CLARK Users
Hello,

I am trying to pull out the reads from my input file that were classified to a specific taxon. My input file is a gzipped fastq file, and I was able to run classify_metagenome.sh on this successfully with the --gzipped option. However, when I tried to run extractSequences.sh using this same input file, I encountered the following error:

Failed to recognize the file format: It does not look like a fastq/fasta format.

Is there an option to run extractSequences.sh on gzipped inputs? I would prefer not to unzip my input file if possible, as the compressed file is already very large (several GB). 

Thank you!

Rachid

unread,
Apr 8, 2020, 11:58:34 PM4/8/20
to CLARK Users
Hi,

As for today, this script accepts only fast/fastq files. Please go ahead and uncompress your file behind running it,
Thank you,

Best,
Rachid
Reply all
Reply to author
Forward
0 new messages