Hello everyone,
I have a rather simple goal that seems to be incredibly difficult using IMG's restrictions. I want to gather the available diversity of sequences from a specific archaeal gene in metagenome datasets; however, I am limited in BLAST to only 20 metagenomes at a time (despite it saying there is a max of 100 in the search). This is next to impossible to do, if only because it is so difficult to keep track of which datasets you have searched out of the >7000 available (very few subsets within the "Tree" view are less than 20). I'd be more than willing to increase the specificity or change parameters (such as excluding all sequences shorter than a specified length) if this would allow more capacity to search.
So far my best success has been to make a few Workspaces of metagenomes and go from there, but this is incredibly time-consuming, and wastes a lot of time because many of the datasets don't have any of my genes of interest to start with.
Due to difficulties and inaccuracies in annotation pipelines, I cannot simply search using a locus tag (which would only let me search 50 at a time). And unfortunately NCBI itself (which has no limits) does not have anywhere close to the breadth of ecosystem types as IMG has.
Any tips?
Thanks!
-Bradley