Finding different species from Phytozome with biomaRt


Alistair Leverett

Oct 18, 2016, 1:57:11 PM
to biomart-users
Hello,
I have been struggling to use biomaRt in R to retrieve information for a variety of plant species. So far I have done the following:

> options(RCurlOptions = list(followlocation = TRUE, postredir = 2L))
> phytozome <- useMart("phytozome_mart", host = "phytozome.jgi.doe.gov", path = "/biomart/martservice/", dataset = "phytozome")
> t2f <- getBM(attributes = c("ortholog_group", "ortholog__dm_gene_name", "ortholog__dm_organism_name", "ortholog__dm_ortholog_gene_name"), filters = c("organism_id"), values = c(382), mart = phytozome)

However, I have a problem. This gives me data for K. laxiflora, which is selected by the argument values = c(382) in the third line. If I want to download data for another species, how would I do this? So far my only approach has been trial and error: changing the number passed to values and checking which species comes back. This is very laborious. Is there a way to view which value corresponds to which species? So far I cannot find one.

Any help will be much appreciated.

Best wishes,
Ali

jwca...@lbl.gov

Oct 18, 2016, 4:29:00 PM
to biomart-users
Hello Ali,

This is less an issue with BioMart itself than a detail of our implementation and data model.

Our BioMart database includes different versions of the genome and annotation for some organisms, so we cannot use the organism name as a unique key for the data set. Instead, we use a unique integer key to designate each particular annotation. This is represented in BioMart as the organism_id.

I see the source of your problem: we use the organism_id internally when making queries, but we do not expose it, nor do we have a simple organism name filter in place that you can use. I'll work on this for the next deployment of our BioMart. In the meantime, here is how you can get a lookup list of organism IDs and organism names in our BioMart:


(Yes, it's an InterMine query. I hope I will be forgiven for posting the link on this list.)
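
Once you have that lookup list, something along these lines might save you from hard-coding the numbers. This is only a rough sketch: organism_lookup, find_organism_id and get_orthologs are hypothetical names made up for illustration, and the only row in the table is the one ID mentioned in this thread (382 for K. laxiflora). The remaining rows would have to be filled in from the lookup list. The getBM() call reuses the attributes and filter from the original post.

```r
# Hypothetical lookup table. Only the 382 row is taken from this thread;
# add further rows from the organism id / organism name lookup list.
organism_lookup <- data.frame(
  organism_id   = c(382L),
  organism_name = c("K. laxiflora"),
  stringsAsFactors = FALSE
)

# Return the organism_id(s) whose name contains `name`
# (case-insensitive, literal substring match).
find_organism_id <- function(name, lookup = organism_lookup) {
  hit <- grepl(tolower(name), tolower(lookup$organism_name), fixed = TRUE)
  if (!any(hit)) {
    stop("No organism matching '", name, "' in the lookup table")
  }
  lookup$organism_id[hit]
}

# Assumed wrapper around the query from the original post: look up the
# id by name, then pass it to getBM() as the organism_id filter value.
get_orthologs <- function(name, mart, lookup = organism_lookup) {
  biomaRt::getBM(
    attributes = c("ortholog_group",
                   "ortholog__dm_gene_name",
                   "ortholog__dm_organism_name",
                   "ortholog__dm_ortholog_gene_name"),
    filters = "organism_id",
    values  = find_organism_id(name, lookup),
    mart    = mart
  )
}
```

With the phytozome mart object from the original post, get_orthologs("laxiflora", phytozome) should then be equivalent to the query with values = c(382). Because some organisms have more than one annotation version, a name can match several IDs, so it is worth checking what find_organism_id() returns before running the query.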