--
You received this message because you are subscribed to the Google Groups "CLARK Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to clarkusers+...@googlegroups.com.
Visit this group at https://groups.google.com/group/clarkusers.
To view this discussion on the web visit https://groups.google.com/d/msgid/clarkusers/f2c447ae-59ae-4f72-8fcd-5c7bbc2f86b2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
For instance, ">gi|158333233|ref|NC_009925.1|. It seems your sequences do not have it.
If your sequences do not have a GI number then I would recommend to work directly with the CLARK executable in the folder "exe/" (so without the two scripts, "set_targets.sh" and "classify_metagenome.sh").
See in the README file, the section "LOW-LEVEL DESCRIPTION & EXAMPLES".
What you would need to do is to create the two-column file, called "targets.txt". That's what set_targets.txt does for you when you work with bacteria/viruses sequences from RefSeq.
This targets.txt file, as described in the README file, defines the targets CLARK uses for the classification:
The first column contains addresses of your sequences (bacteria, viruses, eukaryotes, etc.) and the second column contains the related taxonomy id at the species level.
Best,
Rachid