Hi Rachid,
I was trying to delete a couple of files from the *taxonomy*/ directory post analysis, after the desired database (Bacteria in the example below) is created, and I was wondering if I could remove nucl_accss, nucl_gb.accession2taxid, and nucl_wgs.accession2taxid? These files take up a lot of the memory and it seems like the important files required for the downstream abundance estimation (names.dmp/nodes.dmp among others) are already saved and zipped at taxdump.tar.gz.
So, is it okay to remove everything else but the taxdump.tar.gz folder? I couldn't find that information in the readme and I tried looking at a couple of the bash scripts too and couldn't find any instances of any other files in taxonomy/ other than names.dmp and nodes.dmp, so I just wanted to make sure I inquired before deleting something that would be important for reproducing the results in the future.
-bash-4.1$ ls db
Bacteria bacteria_0 files_excluded.txt targets.txt taxonomy
-bash-4.1$ ls db/taxonomy/
citations.dmp delnodes.dmp division.dmp gc.prt gencode.dmp merged.dmp names.dmp nodes.dmp nucl_accss nucl_gb.accession2taxid nucl_wgs.accession2taxid readme.txt taxdump.tar.gz
Thanks,
Drishti