Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Message from discussion DataCube Consuming Tools/Services
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Keith Alexander  
View profile  
 More options Oct 8 2012, 1:07 pm
From: Keith Alexander <k.j.w.alexan...@gmail.com>
Date: Mon, 8 Oct 2012 18:07:58 +0100
Local: Mon, Oct 8 2012 1:07 pm
Subject: Re: [publishing-statistical-data] DataCube Consuming Tools/Services
Hi Leigh,

I have created some linked data statistical datasets though, and tried
to think of useful approaches, but  I  don't think I've ever written
an app that consumed Data Cube data. So these are just my opinions
from that perspective:

> * If the data is available as Linked Data, then is there an expectation that
> a SPARQL endpoint will be provided too?

The verbosity of RDF for representing multi-dimensional data makes me
think that RDF stores might not be the most efficient way of storing
and querying large volumes of stats, so I think expectation of a
SPARQL endpoint might be lower (though it's possible something like
D2RQ might work better?).

When (at Talis) we had to produce a 'triplification' of the European
Central Bank stats for the LATC project for instance, we decided that
doing a complete conversion and putting it into a triple store would
be completely impractical, even with the resources we had available to
us. We hit upon a hybrid approach of making the metadata about the
datasets (and perhaps some geo data) available through SPARQL + Linked
Data API, and making  URIs of the datasets themselves deref to a
script that fetched the original data and converted it to RDF on the
fly.

> * Is a SPARQL endpoint more important than Linked Data (e.g. because tools
> need to perform arbitrary queries, rather than follow-your-nose traversal)
> * Might the Linked Data API offer a means for exposing richer access to
> data, without (direct) recourse to a SPARQL endpoint

basic Linked Data  - dereffing CBDs of all the observations for
instance, could be pretty tiresome.
SPARQL is more efficient in that regard, but it would be nice to
combine the fyn discovery of linked data, and SPARQL's ability to
slice in different ways and calculate aggregates.
Linked Data API can let you provide these different views, but, for
statistical data, it would be nice to be able to also have calculated
properties - the average, max, min values for a slice (for instance) -
the areas dimension with the highest/lowest value for a
measureproperty and time dimension - that kind of thing.

At root, statistics are interesting because they let you compare
things, so I think there's a lot of value in trying to surface and
facilitate those comparisons in their publication.

Best

Keith


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.