vsearch --cluster_fast seqdump_1.txt --clusters --id 1
The input file is in FASTA format. However, when I run this command, I get the error:
Unrecognized string on command line (1)
These are the first few lines of the file:
>NP_001189784.1 cytochrome P450 3A4 isoform 2 [Homo sapiens]
MALIPDLAMETWLLLAVSLVLLYLYGTHSHGLFKKLGIPGPTPLPFLGNILSYHKGFCMFDMECHKKYGKVWGFYDGQQP
I would appreciate the help!
Thanks!