What is currently the best way to interact with Virtuoso Open Source Edition from RDFLib, for querying and/or writing?
Is going through the generic SPARQL endpoint the current way to do it, or are there optimized, tool-specific libraries (for Virtuoso, OWLIM, etc.)?
The information here seems outdated: http://pythonhosted.org/virtuoso/
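To make the "generic" option concrete, here is a minimal sketch of querying a SPARQL endpoint over plain HTTP (the SPARQL 1.1 Protocol), using only the Python standard library and no RDFLib store plugin. The endpoint URL is an assumption (a local Virtuoso instance on its usual port), and the helper names are hypothetical, not part of any library.

```python
import json
import urllib.parse
import urllib.request

# Assumption: a local Virtuoso instance exposing its SPARQL endpoint here.
ENDPOINT = "http://localhost:8890/sparql"


def build_select_request(endpoint, query):
    """Build a SPARQL 1.1 Protocol GET request asking for JSON results."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def run_select(endpoint, query, timeout=30):
    """Send the query and return the list of result bindings."""
    request = build_select_request(endpoint, query)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)["results"]["bindings"]
```

With a server running, something like `run_select(ENDPOINT, "SELECT * WHERE { ?s ?p ?o } LIMIT 5")` should return a list of dicts keyed by variable name. Every call is one HTTP round trip, which matters for the write timings mentioned next.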
We ran some tests adding triples one by one on different back-ends, and Virtuoso over SPARQL performed poorly compared to Berkeley DB, MySQL, and PostgreSQL.
We assume that for large uploads, the Virtuoso bulk loader would be the way to go.
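Between one-triple-at-a-time inserts and a full bulk load, there is a middle option: grouping many triples into a single SPARQL 1.1 Update request, so that the per-request overhead is paid once per batch rather than once per triple. The sketch below only builds the update strings; the function names and the default batch size are hypothetical, and how the updates are sent (endpoint, authentication) is left out because it depends on the server setup. We have not measured whether this closes the gap we observed.

```python
def _wrap(lines, graph_iri):
    """Wrap serialized triples in an INSERT DATA block for one named graph."""
    body = "\n".join(lines)
    return f"INSERT DATA {{ GRAPH <{graph_iri}> {{\n{body}\n}} }}"


def batched_insert_updates(triples, graph_iri, batch_size=1000):
    """Yield SPARQL 1.1 `INSERT DATA` strings with up to batch_size triples each.

    `triples` is an iterable of (s, p, o) strings that are already serialized
    as N-Triples terms, e.g. "<http://example.org/a>" or '"a literal"'.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    batch = []
    for s, p, o in triples:
        batch.append(f"{s} {p} {o} .")
        if len(batch) == batch_size:
            yield _wrap(batch, graph_iri)
            batch = []
    if batch:
        # Flush the final, possibly smaller, batch.
        yield _wrap(batch, graph_iri)
```

Each yielded string is one update request, so 10,000 triples with `batch_size=1000` means 10 requests instead of 10,000.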
As for query performance, we would expect SPARQL queries against a SPARQL endpoint (backed by a native triple store) to outperform queries on Berkeley DB, MySQL, or PostgreSQL. Is there any information available about that?
Thank you for any help or pointers.
Fabian
So what you are telling me is that a native triple store is faster than storing RDF in relational tables, but that an RDF view over real relational data is still faster for querying?
The recent Berlin SPARQL Benchmark shows some figures comparing Virtuoso SQL and SPARQL and SPARQL in front of relational representation. However, the test workload is heavily biased in favor of relational. See also BSBM: MySQL vs Virtuoso.
Of course, we then lose the flexibility of a triple store, but that is interesting, as I am currently looking at D2R (SPARQL-to-SQL rewriting), which so far does not compete with native SQL performance from what I can see.
If you run your own performance tests, would you be willing to let me know once they are done?
Thanks, merci
Fabian