Hi Jerven,
Thank you for giving us extra time!
Takatomo and I reviewed the manuscript based on the PDF file sent to this group yesterday.
I'll keep working on the sections indicated by [TODO] sign in below.
Here are our comments:
First of all, cases would be better to be consistent: s/semantic web/Semantic Web/g
p.4 Implementation
"i.e. something with an end and start position." -> "i.e. something with a start and end position."
p.5 Figure 1
Do we really need this negative phrase "for semantically poor data" here?
p.7 Users
"(see figure 5)" → "(see Figure 5)"
p.8 Figure 5
[TODO]
I hope to replace the figure with an updated version (in preparation).
p.13 Figure 9
The following URIs are not yet in production (pseudo) ones but please update from
"ddbj" : "http://.../exampleDDBJrecordNamespace/",
"ddbj-record" : "http://.../exampleDDBJrecordNamespace/" ,
"ddbj-seq" : "http://.../exampleDDBJsequenceNamespace/" ,
to
"ddbj" : "
http://ddbj.nig.ac.jp/ontologies/sequence/",
"ddbj-record" : "
http://ddbj.nig.ac.jp/entry/nucleotide/",
"ddbj-aa" : "
http://ddbj.nig.ac.jp/entry/protein/",
"ddbj-seq" : "
http://ddbj.nig.ac.jp/entry/sequence/",
and need to change "ddbj:CDS" to "ddbj:feature", so the resulted JSON will be as follows (with spaces before ',' and '"' are carefully removed)
{
"@context" : {
"label" : "
http://www.w3.org/2000/01/rdf-schema#label",
"comment" : "
http://www.w3.org/2000/01/rdf-schema#comment",
"taxonomy" : "
http://purl.uniprot.org/taxonomy/",
"identifier" : "
http://purl.org/dc/terms/identifier",
"ddbj" : "
http://ddbj.nig.ac.jp/ontologies/sequence/",
"ddbj-record" : "
http://ddbj.nig.ac.jp/entry/nucleotide/",
"ddbj-protein" : "
http://ddbj.nig.ac.jp/entry/protein/",
"ddbj-seq" : "
http://ddbj.nig.ac.jp/entry/sequence/",
"faldo" : "
http://biohackathon.org/resource/faldo#",
"so" : "
http://purl.obolibrary.org/obo/so.owl#"
},
"@id" : "ddbj-record:J02448",
"identifier" : "J02448",
"comment" : "Bacteriophage f1, complete genome",
"ddbj:organism" : "taxonomy:10863",
"ddbj:feature" : {
"@id" : "ddbj-protein:AAA32209.1",
"@type" : "so:0000316",
"label" : "Protein II",
"faldo:location" : {
"@type" : "faldo:Region",
"faldo:begin" : {
"@type" : [ "faldo:Position", "faldo:ExactlyKnownPosition", "faldo:ForwardStrandPosition" ],
"faldo:position" : "6006",
"faldo:reference" : "ddbj-seq:J02448"
},
"faldo:end" : {
"@type" : [ "faldo:Position", "faldo:ExactlyKnownPosition", "faldo:ForwardStrandPosition" ],
"faldo:position" : "831" ,
"faldo:reference" : "ddbj-seq:J02448"
}
}
}
}
p.10 Figure 6
Probably "isoform:P05064-1" should be replaced with "isoform:Q6Q250-1" as the subject ID is "uniprot:Q6Q250".
p.12 Figure 8
In the figure, "refseq:NC_000913.2" must be replaced with "ddbj:U00096.3" as RefSeq is not an INSDC database and please use the following prefix.
@prefix ddbj: <
http://ddbj.nig.ac.jp/entry/nucleotide/> .
Also, the figure legend must be changed to
"Figure 8: Using FALDO in Turtle [27] syntax to describe the location of a gene feature cheY at complement(1965072..1965461) in a INSDC record U00096.3."
[TODO]
Do you have any specific reason to use "cheY" gene here? (or can I use other gene instead?)
p.15 Figure 10
How about to replace following URIs
":EnzymeRestrictionSite" -> ":enzymeRestrictionSite"
":EnzymeRestrictionRegion" -> ":enzymeRestrictionRegion"
as they are instances (not Classes)?
Also, I suspect assignment of locations of :53cleavageSite and :35cleaveageSite are wrongly inverted to each other maybe?
:enzymeRestrictionSite a example:EnzymeRestrictionSite ;
faldo:location :EnzymeRestrictionRegion .
:enzymeRestrictionRegion a faldo:Region ;
faldo:begin :53_1 ;
faldo:end :35_6 .
:53cleaveageSite a example:CleavageSite ;
faldo:location [a faldo:InBetweenPosition,
SO:0001689 ; # 5’ restriction enzyme junction
faldo:after :53_1 ;
faldo:before :53_2 ] .
:35cleaveageSite a example:CleavageSite ;
faldo:location [a faldo:InBetweenPosition,
SO:0001690 ; # 3’ restriction enzyme junction
#Note the before after reversion due to biological start
faldo:before :35_5 ;
faldo:after :35_6 ] .
p.14 Discussion
"This encoding means there is no need to know the length of the sequence to compare positions on the different strands."
This statement is not true for the case that "features overlap the chromosome's origin of replication" described in p.12 and Figure 9.
THAT'S IT!!
Regards,
Toshiaki
2014/01/30 7:03、Jerven Bolleman <
m...@jerven.eu> のメール: