Hi Will,
You are awesome!
Thanks a lot for your reply!
I'm unsure what exactly caused it, but I have been playing a bit with the python code of pG1, and found that apparently some genes are not found when searched for with the parameter [Genes], for instance, try the search string:
Altolamprologus calvus[Organism] AND S18[Genes] (which doesn't give a hit)
Altolamprologus calvus[Organism[ AND S18[All Fields] (which does generate hits)
So indeed, it seems to be a GenBank side flaw, not pG related.
My goal is to index genes of the family of Lamprologini, with the intent of building a tree (I will do that separately from the pG suite). In order to do so, I am now tracking which genes cover a sufficient number of species, and hence am downloading a large number of FASTA files per gene, per species. In order to verify that the downloaded sequences are, in fact, the sequences they pretend to be, it really helps to read the description. Oftentimes, the name of the gene is given in the description. This is also how I spotted the error in the first place (pG1 would say it downloaded sequences for the gene dlx2a, but the descriptions of said genes described them as RH1 or ND2 or something entirely else).
Thanks a bunch!
Thijs
Op maandag 30 januari 2017 20:50:47 UTC+1 schreef Will Pearse: