Hi folks
I'm revisiting Gremlin, with the idea of trying it out on some SPARQL
stores like OWLIM.
I figured I'd first get back to where I was last - i.e.
http://danbri.org/words/2011/05/10/675 - so tried to re-run a script
that was fine a few months ago, using a freshly-built-today new build:
g = new LinkedDataSailGraph(new MemoryStoreSailGraph())
fry = g.v('
http://dbpedia.org/resource/Stephen_Fry')
g.addNamespace('wp', '
http://dbpedia.org/ontology/')
m = [:]
fry.in('wp:starring').out('wp:starring').groupCount(m).loop(3){it.loops <2}
m2 = m.sort{ a,b -> b.value <=> a.value }
It fails to find anything. In my ripple.log, I have the following 3
entries, which suggest that The WebClosure page fetch is failing hard
due to a suspect triple. Since the Web is always going to have poor
data in it, perhaps Gremlin/Ripple could be more tolerant of errors in
this mode, and maybe just issue a warning and try to continue? I don't
believe dbpedia have changed their design/architecture so I don't
think it so likely the mimetype is the issue, though it is also a
possibility. W3C's RDF validator seemed happy with the URL.
2012-11-05 19:56:41,865 [main] INFO WebClosure - Dereferencing URI
<
http://dbpedia.org/resource/Stephen_Fry>
2012-11-05 19:56:50,797 [main] ERROR RippleException -
'1981-01-01T00:00:00+02:00' is not a valid value for datatype
http://www.w3.org/2001/XMLSchema#gYear [line 251, column 153]
2012-11-05 19:56:50,798 [main] INFO WebClosure - Failed to dereference
URI <
http://dbpedia.org/resource/Stephen_Fry>: ParseError (perhaps
application/rdf+xml is not the correct media type for this data)
Thanks for any suggestions,
Dan