Hello!
I'm looking for some help stitching together a request to your API, accessed
here. I'm using P01106 here for demonstration. I'd like protein features with their descriptions ("Note" fields), positioned on the genome, with intron gaps.
1) This query gets aa coordinates for features with descriptions (e.g.: I can tell this is a "NR LBD" domain):
P10275 UniProtKB DOMAIN 669 900 . . . Note=NR LBD;Ontology_term=ECO:0000255;evidence=ECO:0000255|PROSITE-ProRule:PRU01189
2) This query gets genomic coordinates for features, but the Note field is lost:
P10275 UniProtKB domain 669 900 . + . \
GenomeLocStart=67711521;GenomeLocEnd=67723778;Chromosome=X;EnsemblTranslationId=ENSP00000363822;EnsemblGeneId=ENSG00000169083;EnsemblTranscriptId=ENST00000374690
Finally, if you go to those coordinates in the
UCSC browser:
You can clearly see this region contains four or five introns, which the genome loci returned by query (2) fail to reflect.
First - can I issue a single query that pulls the Note field for genomic coordinate queries? Second - do you have any amenities to deal with introns in your API?
Thanks for developing such a great resource,
Brendan