Example Data Sets?

367 views
Skip to first unread message

Curran Kelleher

unread,
Dec 14, 2011, 12:51:51 PM12/14/11
to publishing-st...@googlegroups.com
Hello,

I'm interested in developing tools for visualizing data cubes using the Data Cube Vocabulary. Can anyone point me to some example public data sets published using this vocabulary? Thank you very much.

Best regards,
Curran

BillRoberts

unread,
Dec 14, 2011, 2:41:25 PM12/14/11
to Publishing Statistical Data
Hi Curran

There are several data cube datasets at http://opendatacommunities.org/datasets
representing the English 'Indices of Multiple Deprivation'. They are
fairly simple in structure, with just two dimensions - refPeriod and
refArea, ie time and place.

They are available for download in Turtle format, or via a SPARQL
endpoint.

Hope that helps

Bill

On Dec 14, 5:51 pm, Curran Kelleher <curran.kelle...@gmail.com> wrote:
> Hello,
>
> I'm interested in developing tools for visualizing data cubes using the Data

> Cube Vocabulary<http://publishing-statistical-data.googlecode.com/svn/trunk/specs/src...>.

Richard Cyganiak

unread,
Dec 14, 2011, 3:24:52 PM12/14/11
to publishing-st...@googlegroups.com
Hi Curran, there's a pretty good list here:

http://wiki.planet-data.eu/web/Datasets

Best,
Richard

Dave Reynolds

unread,
Dec 14, 2011, 5:55:40 PM12/14/11
to publishing-st...@googlegroups.com
Hi,

To add to the other replies, then the Environment Agency Bathing Water
Quality data uses the cube vocabulary. For a starting set of links see:
http://www.epimorphics.com/web/projects/bathing-water-quality

Dave

Keith Alexander

unread,
Dec 15, 2011, 3:11:01 AM12/15/11
to publishing-st...@googlegroups.com

Rob Dymond-Green

unread,
Dec 15, 2011, 6:03:03 AM12/15/11
to publishing-st...@googlegroups.com

Curran

 

ESDS International has recently completed a project where they looked at using the Data Cube Vocabulary to describe world back data, here’s the link to that data.

 

http://www.esds.ac.uk/international/access/LDaccess.asp

 

 

I hope this is what you are looking for.

 

 

Thanks

 

Rob

Curran

unread,
Dec 15, 2011, 1:06:35 PM12/15/11
to Publishing Statistical Data
Hi all,

Thank you so much for your prompt replies! Clearly there are
tremendous resources available.

I'd like to figure out how to query these and somehow support
interactive OLAP operations (slice, drill down, etc.) for driving
visualizations. Does anyone know of existing tools that support
interactive OLAP operations for navigating through data published
using the Data Cube Vocabulary? Thanks again!

Best regards,
Curran

Our visualization tool: oicweave.org

Benedikt Kämpgen

unread,
Dec 15, 2011, 2:43:53 PM12/15/11
to publishing-st...@googlegroups.com
Hello Curran,

In [1], we have used QB with Mondrian and XML/A clients.

Momentarily, we are working on an OLAP4J implementation [2] to allow common
OLAP clients to access QB data. So far, for visualization, we have been
thinking of using Saiku [3], which connects to the OLAP4J API. As soon as I
have a demo, I can post it here.

Regards,

Benedikt

[1] <http://www.aifb.kit.edu/web/Inproceedings3211>
[2] <http://www.olap4j.org/>
[3] <http://analytical-labs.com/>

--
AIFB, Karlsruhe Institute of Technology (KIT)
Phone: +49 721 608-47946
Email: benedikt...@kit.edu
Web: http://www.aifb.kit.edu/web/Hauptseite/en

Tim rdf

unread,
Dec 16, 2011, 10:41:19 AM12/16/11
to publishing-st...@googlegroups.com
Hey, Curran!

A few months ago, we spent two days [1] to strap a widget onto VIVO
[2] to expose QB data of their publication social network analyses. A
zip file of the data cube is at [3] (Warning: modeling has not been
vetted).

Happy to find an excuse to take this further.

Regards,
Tim Lebo

[1] https://github.com/timrdf/csv2rdf4lod-automation/wiki/Example:-vivohack1
[2] http://vivoweb.org/
[3] http://bit.ly/vnewFI

Curran

unread,
Dec 19, 2011, 11:40:40 AM12/19/11
to Publishing Statistical Data
Thank you both for the pointers! I'll be looking into these. I tried
Saiku, what a beautiful tool!

Best,
Curran

> > Email: benedikt.kaemp...@kit.edu

Sarven Capadisli

unread,
May 17, 2012, 7:36:21 AM5/17/12
to publishing-st...@googlegroups.com
On 11-12-14 05:51 PM, Curran Kelleher wrote:
> Hello,
>
> I'm interested in developing tools for visualizing data cubes using the
> Data Cube Vocabulary
> <http://publishing-statistical-data.googlecode.com/svn/trunk/specs/src/main/html/cube.html>.
> Can anyone point me to some example public data sets published using
> this vocabulary? Thank you very much.
>
> Best regards,
> Curran

Hi Curran,

I've clearly overlooked this thread. If you or others are still interested:

I've worked on World Bank Linked Data [1] which uses the Data Cube
vocabulary. The dataset is composed of World Development Indicators,
World Bank Finances, World Bank Climate Change, World Bank Projects and
Operations.

Also started on some preliminary visualizations at [2].

Data dumps are at [3]. See VoID file [4] if you need to automate or get
more information about the datasets.

If you can share what you are up to publicly or off the list, I'd love
to take a look.

[1] http://worldbank.270a.info/
[2] http://worldbank.270a.info/view
[3] http://worldbank.270a.info/data/
[4] http://worldbank.270a.info/.well-known/void

-Sarven

Tim rdf

unread,
May 18, 2012, 9:37:34 AM5/18/12
to publishing-st...@googlegroups.com
Sarven,

Thanks for the pointers!

Your site looks very nice.

As a side interest, do you have any provenance available? I'm assuming
that you transformed original WB data; where did you get the original
data?

Regards,
Tim

Tim rdf

unread,
May 18, 2012, 9:48:18 AM5/18/12
to publishing-st...@googlegroups.com
Oh, and is your dataset on CKAN :-)

-Tim


On Thu, May 17, 2012 at 7:36 AM, Sarven Capadisli <in...@csarven.ca> wrote:

Sarven Capadisli

unread,
May 18, 2012, 10:58:21 AM5/18/12
to publishing-st...@googlegroups.com
Hi Tim,

Thanks for the compliments. I'm always fixing up my boo boos as I go.
Hence, I'd appreciate all feedback.

For provenance, where appropriate, I've added triples in this nature:

Defining source
Data license
Data collection date
Data update date
Location of the source data
Creator of the data

You might want to give the About page [1] a look. It probably contains
some other information you'd be interested.

I got the data in XML from WB's APIs. Links to the source datasets all
the way down to the individual observations (where possible) are
described part of each resource.

I'm in the process of putting together a more extensive document about
the whole effort; from scratch all the way up to the application. Stay
tuned =)

[1] http://worldbank.270a.info/about

-Sarven

Sarven Capadisli

unread,
May 18, 2012, 10:59:29 AM5/18/12
to publishing-st...@googlegroups.com

Tim rdf

unread,
May 23, 2012, 12:58:04 PM5/23/12
to publishing-st...@googlegroups.com
Sarven,

I see you have dcterms:source pointing to the XML files from WB. I
think that's what I was looking for.

What URIs are you using for:

> Defining source
> Data license
> Data collection date
> Data update date
> Location of the source data
> Creator of the data

Thanks!

Tim

Sarven Capadisli

unread,
Oct 9, 2012, 4:46:06 AM10/9/12
to publishing-st...@googlegroups.com
Hi Tim,

It appears to be that I did not send a reply out as it was sitting in my
Drafts. My apologies for the late response!

In case this information is still of some use to you, it is now
available on the site:

http://worldbank.270a.info/about#data-provenance_table

-Sarven
Reply all
Reply to author
Forward
0 new messages