Basically:
This scenario appears to be common in the ad-tech world, and druid originated out of the ad-tech world, so it's easy to understand why druid makes this distinction. In this setting, a table that contained only dimension columns or only metric columns would be useless.
However, druid is not actually this limiting. With a topN or groupBy query, we can calculate the distribution of our result set on our dimensions. So a table that contains only dimension columns could be useful --- consider this dashboard, which druid could power. Except for time, all of its columns are categorical dimensions. So at present, here's how you should actually think about druid's column types:
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/6a8347a0-71c6-41ae-94da-153e9b052973%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
It may be worth having min/max statistics on the columns or even on blocks of the columns to make range filtering faster. That might work out if the column is near the left hand side of the sort order. Otherwise range filters would be served by scanning the column, I guess? Is there anything better we could do?
Fwiw, there may still be some times where a discretized dimension is better than a full resolution numeric dimension. There are lots of data sets out there where discretizing an interesting metric will give you a good rollup ratio but storing the full resolution number will not. So the discretized version could be a huge space savings, and can be worth the loss of resolution.
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/dd97e118-5660-46cf-b79f-f9a86b92aaaa%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/CAPP9sfZMrMeTzLNTcp2bq5szN0pz5c3T-Y%3DuJ2kjkGt8e74%3DQg%40mail.gmail.com.
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/dd97e118-5660-46cf-b79f-f9a86b92aaaa%40googlegroups.com.
--
You received this message because you are subscribed to a topic in the Google Groups "Druid User" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/druid-user/Mk6omlC6Vbk/unsubscribe.
To unsubscribe from this group and all its topics, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/CAPOm%3DTOJR5U-WppDM2WzpTo-9z-%3DRc4aV84ncO-4nyYLgUdACg%40mail.gmail.com.
If configured to do so Plywood can treat Druid dimensions as bucketed strings or as raw numbers it then uses javascript tricks to make it work as expected.
Where I am going with this is that ideally you would be able to define a virtual attribute in Plywood where you say: "hey I have this thing called revenue, and it is composed of these metrics: min_revenue, max_revenue, sum_revenue and this dimension: revenue_buckets (which is bucketed to 0.05)". Then you should be able to filter / aggregate revenue as your heart desires while still preserving a reasonable level of roll up.
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/CAOzVU0XjbB%3D17W6kzF4NxZPpf%3DWC9q7yLPySBjLXvUdP-7hfdQ%40mail.gmail.com.
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+...@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/CAOzVU0XjbB%3D17W6kzF4NxZPpf%3DWC9q7yLPySBjLXvUdP-7hfdQ%40mail.gmail.com.
Hi
--
You received this message because you are subscribed to the Google Groups "Druid User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to druid-user+unsubscribe@googlegroups.com.
To post to this group, send email to druid...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/druid-user/772436a2-1bbd-48e5-8fdc-7ee0096fcef3%40googlegroups.com.
Hi,