Introducing an ETL tool (early stages)

José Juan Montes

Dec 31, 2018, 2:06:13 PM

Hi all,

I have been improving and polishing the tool I use for ETL processes. While it's still alpha I wanted to share it.

CubETL is a Python-based ETL framework: .

The main focus was to be able to model OLAP schemas. It is able to inspect SQL databases and generate a default OLAP schema for them, and then to generate a Cubes model and config for it. CubETL supports arbitrarily nested dimensions, which it then flattens when exporting to Cubes.
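To illustrate what flattening nested dimensions can look like, here is a minimal sketch in plain Python. The dictionary layout, the `flatten_dimension` helper, and the underscore naming scheme are assumptions made for this example; they are not CubETL's actual model classes or its export logic.

```python
# Hypothetical nested-dimension layout (not CubETL's real model objects):
# each dimension has a list of attributes and optional child dimensions.
def flatten_dimension(name, spec, prefix=""):
    """Return a flat list of (qualified_dimension_name, attribute) pairs."""
    path = f"{prefix}{name}"
    flat = [(path, attr) for attr in spec.get("attributes", [])]
    # Recurse into child dimensions, joining names with an underscore.
    for child_name, child_spec in spec.get("dimensions", {}).items():
        flat.extend(flatten_dimension(child_name, child_spec, prefix=f"{path}_"))
    return flat


customer = {
    "attributes": ["name"],
    "dimensions": {
        "country": {
            "attributes": ["code"],
            "dimensions": {"region": {"attributes": ["label"]}},
        }
    },
}

flat = flatten_dimension("customer", customer)
for dimension, attribute in flat:
    print(dimension, attribute)
```

Each nested level ends up as its own flat dimension name (for example `customer_country_region`), which is roughly the shape a flat dimension list needs.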

You can also define your OLAP schema and your own SQL mappings manually, or let CubETL generate the SQL mappings for you automatically.

It also has a variety of other facilities typical of ETL tools: it reads and writes files, CSV, JSON, and XML; handles in-memory tables and SQL tables; supports query lookups, table lookups, and caching; and reads the PcAxis and SDMX multidimensional data formats.
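For readers less familiar with these building blocks, the following is a rough sketch of a read / lookup / write step using only the Python standard library. It does not use CubETL's API; the table, the column names, and the `lookup_country` helper are made up for illustration.

```python
import csv
import io
import json
import sqlite3
from functools import lru_cache

# In-memory SQL table acting as the lookup source (illustrative data only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE country (code TEXT PRIMARY KEY, name TEXT)")
conn.executemany(
    "INSERT INTO country VALUES (?, ?)",
    [("ES", "Spain"), ("AU", "Australia")],
)


@lru_cache(maxsize=None)
def lookup_country(code):
    """Table lookup with caching: each distinct code hits the database once."""
    row = conn.execute(
        "SELECT name FROM country WHERE code = ?", (code,)
    ).fetchone()
    return row[0] if row else None


# Extract: read CSV records (a string buffer stands in for a file here).
source = io.StringIO("customer,country_code\nAna,ES\nBen,AU\nCarla,ES\n")

# Transform: enrich each record through the cached lookup.
rows = []
for record in csv.DictReader(source):
    record["country_name"] = lookup_country(record["country_code"])
    rows.append(record)

# Load: serialize the enriched records as JSON.
output = json.dumps(rows, indent=2)
print(output)
```

The cache matters when the same key appears many times in the input: here the third record reuses the result fetched for the first one instead of querying the table again.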

I have done a major refactor and some parts of the code have not yet been migrated. It is still in early stages, but it does work: I'm using it for a few ETL processes, and I wanted to share it and see if someone is interested.

Please let me know what you think! Have a look at the example processes: .

Note: when generating a schema from a database and serving it with Cubes, you may need to use the "alias-issue" branch of my cubes repository, as the current Cubes pip version has a bug related to table aliases. Clone my cubes repo, check out the alias-issue branch, and run 'python setup.py develop' from there.

Best regards!

Jose Juan Montes

Warwick Prince

Dec 31, 2018, 5:08:50 PM
Hi Jose

Very interesting! I’ll take a look if I get a free moment. 


Warwick A. Prince
Mushroom Systems International Pty. Ltd.