Dear everyone,
I want to assign taxonomy to my sequences using the Silva database. My command is:
assign_taxonomy.py -o rdp_silva/ -i rep_set.fna \
  -r /home/liuwei/database/Silva_111_post/rep_set/97_Silva_111_rep_set.fasta \
  -t /home/liuwei/database/Silva_111_post/taxonomy/97_Silva_111_taxa_map_RDP_6_levels.txt \
  -m rdp
But this only assigns taxonomy down to the genus level. In the
/home/liuwei/database/Silva_111_post/eukaryotes_only/taxonomy_euks/
directory I can find a level 7 (species) taxonomy file,
97_Silva_111_taxa_map_RDP_7_levels_euks.txt, but in the corresponding bacteria
directory I can only find a level 6 (genus) taxonomy file. How can I get a
level 7 (species) taxonomy file that I can use to annotate my sequences
to species level?
If I use the file 97_Silva_111_taxa_map.txt instead, the command is:
assign_taxonomy.py -o rdp_silva/ -i rep_set.fna \
  -r /home/liuwei/database/Silva_111_post/rep_set/97_Silva_111_rep_set.fasta \
  -t /home/liuwei/database/Silva_111_post/taxonomy/97_Silva_111_taxa_map.txt \
  -m rdp
I got this error:
ValueError: Because the RDP Classifier operates in a bottom-up manner, each taxonomy assignment in the id-to-taxonomy file must have the same number of ranks. Detected 6 ranks in the first item of the file, but detected 4 ranks later in the file. Offending taxonomy string: Bacteria; __Verrucomicrobia; __OPB35_soil_group; __uncultured_bacterium
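The error says every entry in the id-to-taxonomy file must have the same number of ranks. One possible workaround, sketched here and not tested against the RDP retraining step, is to pad the shorter taxonomy strings and truncate the longer ones to a fixed depth before passing the file to -t. The function names, the "unclassified" filler, and the assumed file layout (sequence ID, a tab, then a semicolon-separated taxonomy) are all assumptions for illustration, not part of QIIME.

```python
def pad_taxonomy(taxonomy, n_ranks, filler="unclassified"):
    """Return the taxonomy string with exactly n_ranks ranks.

    Shorter strings are padded with `filler` (a hypothetical placeholder
    name); longer strings are truncated, which discards the deeper ranks.
    """
    ranks = [r.strip() for r in taxonomy.split(";") if r.strip()]
    ranks = ranks[:n_ranks]
    ranks += [filler] * (n_ranks - len(ranks))
    return ";".join(ranks)


def normalize_lines(lines, n_ranks, filler="unclassified"):
    """Yield 'id<TAB>taxonomy' lines with a uniform rank depth.

    Assumes each input line is 'sequence_id<TAB>taxonomy'; blank lines
    are skipped.
    """
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            continue
        seq_id, taxonomy = line.split("\t", 1)
        yield f"{seq_id}\t{pad_taxonomy(taxonomy, n_ranks, filler)}"


if __name__ == "__main__":
    offending = ("Bacteria; __Verrucomicrobia; __OPB35_soil_group; "
                 "__uncultured_bacterium")
    print(pad_taxonomy(offending, 6))
```

Note that padding only makes the file structurally uniform. It does not add real species-level information, and reusing the same filler name under many different parents may itself cause problems when the classifier is retrained.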
Please give me some advice: how can I solve this error? I really appreciate your help. Thanks very much.
Sorry, I also have a question about the methods (rdp, blast) and
the databases (greengenes, rdp, silva) used for taxonomy
annotation: how much do the annotation results differ between
them? Has anyone, or any paper, assessed this? Also, if I want to
increase the proportion of my sequences that are assigned to known
species, what can I do? Please give me more advice. Thanks.
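One rough way to see how much two runs differ is to compare their assignment tables rank by rank. The sketch below assumes each table is tab-separated, with the sequence ID in the first column and a semicolon-separated taxonomy in the second. That layout and the function names are assumptions for illustration, so check them against the actual output files.

```python
def load_assignments(lines):
    """Map sequence ID -> list of ranks from tab-separated assignment lines."""
    table = {}
    for line in lines:
        if line.startswith("#"):
            continue  # skip comment/header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        table[fields[0]] = [r.strip() for r in fields[1].split(";") if r.strip()]
    return table


def agreement_by_rank(a, b, n_ranks):
    """Fraction of shared sequences whose lineage matches down to each depth.

    A sequence counts as agreeing at depth d only if both assignments
    reach depth d and their first d ranks are identical.
    """
    shared = sorted(set(a) & set(b))
    fractions = []
    for depth in range(1, n_ranks + 1):
        same = sum(
            1 for s in shared
            if len(a[s]) >= depth and len(b[s]) >= depth
            and a[s][:depth] == b[s][:depth]
        )
        fractions.append(same / len(shared) if shared else 0.0)
    return fractions


if __name__ == "__main__":
    rdp = load_assignments(["s1\tBacteria;Firmicutes;Bacilli\t0.9\n",
                            "s2\tBacteria;Proteobacteria\t0.8\n"])
    blast = load_assignments(["s1\tBacteria;Firmicutes;Clostridia\t1e-50\n",
                              "s2\tBacteria;Proteobacteria\t1e-30\n"])
    print(agreement_by_rank(rdp, blast, 3))
```

This compares names literally, so it is only meaningful between runs that use the same reference database. Comparing across databases (for example greengenes and silva) would first need the rank names mapped to a common naming scheme.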