Hello,
I found this post, so I used cd-hit-est for sequence similarity clustering - 97% and extracted taxonomy information (7 levels)
So I got 400,000ish items ( grep ">" -c is 400,000ish) in the fasta file and id_to_taxonomy file.
I was just wondering this is the right way of converting the rdp fasta file for qiime database format.
Am I missing out on something or doing the wrong way?
Any piece of advice would be really appreciated.
Thank you
-jk Kim-
sorry for my English.