Hi,
How can I index different pre-aggregations of the same data? Should they go into the same datasource or into separate ones?
Let's say I import data into the datasource "hourly_dimensions_metrics", which is the primary aggregation of the actual events and the one from which all other aggregations are derived.
(My understanding is that I can either run a hadoop index task that scans the pre-joined event logs directly and outputs "hourly_dimensions_metrics", or have a spark/hadoop map-reduce job pre-compute it.)
Now I want to precompute a few aggregations on top of "hourly_dimensions_metrics", like daily, or with fewer dimensions.
How do I do this? With additional index tasks that use "hourly_dimensions_metrics" as both source and destination?
If I need to use different datasources (each with a fixed schema, which is what I would intuitively expect), then the query will no longer be agnostic of the indexing, meaning the query will have to name the specific datasource (the index name). Is this correct?
Thanks,
Nicu