I'm working experimentally for some time in importing a data warehouse metadata into Neo4j, and getting information from it.
If you want to share some thoughts on it, please net me know.
Péterson
ps: for more information, here is the paragraph I sent to the contest:
There are many ways to tackle the complexity of a system. For a data warehouse (DW) system in particular, one way to achieve this goal is by taking a closer look on it's metadata. Typically, a DW metadata describe the different system's objects, their properties and the relationships between them. Objects, in this context, are DW concepts, such as, for example, cube, operational data store and , source system.
In this visualization, we focus on the "data flows" relationship between DW objects. We show a real DW data flow network. The network is a directed graph, in which nodes represent DW objects, and edges represent the "data flows to" relationship between objects, indicating that data is transferred from one object to another. The nodes are colored according to the objects they represent. For example, source systems are colored in pink, operational data stores are colored in light blue, cubes are colored in dark blue and queries are colored in red.
Through this visualization we are able to view the full path that data takes in the data warehouse, from it's origin in the source systems until it is finally presented to the end user. We are also able to view the complexity of the various existing ETL processes, and how the existing data marts are related to each other.
To construct this visualization we collected the metadata of a real data warehouse system that runs under the SAP Netweaver Business Warehouse Platform (SAP BW). We then stored this information in the Neo4j graph database, which allowed us great flexibility to work with and query the data. For the visualization, we used Gephi.