How to get some statistical data on the contents of the database.

45 views
Skip to first unread message

LAUREN

unread,
Sep 8, 2016, 2:14:00 PM9/8/16
to IMG User Forum
I need to know how many unique *species* are present in IMG per each phylogenetic category (phylum/class/order/etc). The only data I can find is the number of genomes, which counts different strains of the same species as unique entries. 

I also do not understand the difference between "assembled/unassembled" and "finished/draft". Have these categories anything to do with each other? When I do a gene search I can select genomes based on whether or not they are finished, draft, or both. I searched "finished". When I try to look at statistics for the whole website (example https://img.jgi.doe.gov/cgi-bin/mer/main.cgi#taxontable=results%3D100%26startIndex%3D0%26sort%3DGenomeNameSampleName%26dir%3Dasc%26c%3DDomain%26f%3D%26t%3Dtext) I only see an option for assembled vs unassembled. 

I need to calculate what percentage of species in a given phylum/class/order etc has a certain pfam, and I need the total number of unique species per category to do that. How do I find that information? 

Thank you,

--Lauren Cook
Graduate student, UCLA
Reply all
Reply to author
Forward
0 new messages