Hello!
The following unfiltered search query takes ~2 seconds:
{
"queryType": "search",
"dataSource": "DATASOURCE",
"searchDimensions": [
"DIMENSION"
],
"query": {
"type": "insensitive_contains",
"value": "foo"
},
"granularity": "all",
"intervals": ["2016-11-12T05:00:00+00:00/2016-12-12T13:00:00+00:00"]
}
Now I add this filter:
"filter": {
"type": "selector",
"dimension": "DIMENSION2",
"value": "BAR"
}
Running the same query with this filter takes ~22 seconds.
Some info:
The cardinality of "DIMENSION" is somewhere between 100.000 and 1.000.000, and "DIMENSION2" is 15-20.
We are running Druid 0.9.1. DATASOURCE contains around 41GB of data over one month, with rollup hour and Concise bitmaps. Our timeseries and topN queries with filters are not experiencing the same drastic difference in speed.
Is this behavior expected? Is there anything we can do to speed up the filtered query? Let me know if there is any other information I can provide to help narrow down the issue.
Best,
John