Druid Query is very slow


K.Y.Xing

Jan 19, 2015, 5:42:50 AM
to druid-de...@googlegroups.com
Hi Druid team,

There are some problems with my Druid cluster.
We are using Druid version 0.6.169.

1. How do we enable the historicals to write to memcached and the brokers to read from memcached?
2. Some of our queries are very slow; the attachments contain the metrics for each kind of node.
    Could you give me some advice on tuning the Druid cluster?

Thanks.
broker.txt
historical.txt
real-time-phx.txt
real-time-slc.txt

Nishant Bangarwa

Jan 19, 2015, 7:32:37 AM
to druid-de...@googlegroups.com
Hi, 
see inline

On Mon, Jan 19, 2015 at 4:12 PM, K.Y.Xing <xingq...@gmail.com> wrote:
Hi Druid team,

There are some problems with my Druid cluster.
We are using Druid version 0.6.169.

1. How do we enable the historicals to write to memcached and the brokers to read from memcached?
To do so you need to set these in runtime.properties:
On the broker:
druid.broker.cache.useCache=true 
druid.broker.cache.populateCache=false

On the historical:
druid.historical.cache.useCache=true 
druid.historical.cache.populateCache=true 
 
2. Some of our queries are very slow; the attachments contain the metrics for each kind of node.
    Could you give me some advice on tuning the Druid cluster?
The query seems to be scanning ~700 segments on the historical nodes. Request time on the historicals looks normal, but the brokers are taking a large amount of time merging the results.
Do you see any GC issues on the broker when you run the query?
Can you also give more details on the hardware configs, broker runtime.properties, and JVM configs?
  

Thanks.


Fangjin Yang

Jan 20, 2015, 2:14:38 AM
to druid-de...@googlegroups.com
Xing, I see some major problems with the setup:

- Your historical nodes have 128G of RAM total, but you are setting a 200G heap. Can you try an 8G heap instead? I think you guys are misunderstanding how Druid uses memory. Memory for segments = total memory - (heap size + direct memory + JVM overhead).
- Your server.maxSize is 10TB and you have very little memory. Your queries will be paging a TREMENDOUS amount. This will be EXTREMELY slow.
- You have druid.processing.buffer.sizeBytes set to 100MB. This is too small. Please set it to 512MB.
- druid.processing.numThreads=30 --> change this to 39.

With this setup, you have about 100GB for segments in memory at a given time. I think you should choose a memory:disk ratio that is greater than your current value.

Fangjin Yang

Jan 20, 2015, 2:18:38 AM
to druid-de...@googlegroups.com
On the broker, do the following:

Remove these configs:
druid.broker.cache.unCacheable=["select"]
druid.paths.indexCache=/data01/druid/indexCache
druid.paths.segmentInfoCache=/data01/druid/segmentInfoCache
druid.segmentCache.dropSegmentDelayMillis=300000

You have set a 100GB heap. You will probably get murdered by full GC. Reduce this to 30GB. Also, try to turn on historical caching and do not do caching on the broker.

Fangjin Yang

Jan 20, 2015, 2:22:55 AM
to druid-de...@googlegroups.com
For the realtime nodes, the configs also need to change.

Heap: 15G is probably enough, 50G seems excessive. 

druid.processing.buffer.sizeBytes=100000000
druid.processing.numThreads=1

Try 512MB buffer.sizeBytes and more processing threads. You may also need to change your direct memory settings.

Fangjin Yang

Jan 20, 2015, 2:23:54 AM
to druid-de...@googlegroups.com
I should mention with changes to the historical configuration, you may need to adjust direct memory sizes as well.

K.Y.Xing

Jan 20, 2015, 2:58:48 AM
to druid-de...@googlegroups.com
[Delayed attachments] Thanks very much for your quick reply.

Yes, I have checked the GC metrics on the broker and found that it is suffering from heavy GC.
We should tune the JVM start parameters.

Attached are the runtime.properties files and start scripts.
Our cluster:
40 historical machines, each with 40 CPUs and 128G of memory
18 realtime machines, each with 40 CPUs and 128G of memory, but 2 realtime nodes are started on each machine
5 broker machines, each with 40 CPUs and 128G of memory

Any suggestions for the configurations?

Thanks

On Monday, January 19, 2015 at 8:32:37 PM UTC+8, Nishant Bangarwa wrote:

Fangjin Yang

Jan 20, 2015, 3:29:06 PM
to druid-de...@googlegroups.com
Hi Xing, I already gave you the numbers in the previous posts; I'll repeat them inline.


On Monday, January 19, 2015 at 11:58:48 PM UTC-8, K.Y.Xing wrote:
[Delayed attachments] Thanks very much for your quick reply.

Yes, I have checked the GC metrics on the broker and found that it is suffering from heavy GC.
We should tune the JVM start parameters.

Attached are the runtime.properties files and start scripts.
Our cluster:
40 historical machines, each with 40 CPUs and 128G of memory

Try 10GB heap, 39 processing threads, 512 MB intermediate computation buffer, 20GB direct memory, 100GB max server size
 

18 realtime machines, each with 40 CPUs and 128G of memory, but 2 realtime nodes are started on each machine

Try a 15G or 20G heap with more processing threads (30 for example), 512MB intermediate buffer size, and enough direct memory.
 
5 broker machines, each with 40 CPUs and 128G of memory

Try a 20G or 30G heap. 

bimo

Sep 15, 2015, 10:14:38 AM
to Druid Development

Why doesn't the broker dispatch search requests to the historical nodes in parallel when I submit a request to a broker? How do I configure it to run in parallel?

2015-09-10T19:03:22,880 INFO [qtp1781399452-77] com.metamx.http.client.pool.ChannelResourceFactory - Generating: http://broker185.kafka.game.dev.com:8083
2015-09-10T19:03:24,126 INFO [qtp1781399452-77] com.metamx.http.client.pool.ChannelResourceFactory - Generating: http://broker36.kafka.game.dev.com:8083
2015-09-10T19:03:24,714 INFO [qtp1781399452-77] com.metamx.http.client.pool.ChannelResourceFactory - Generating: http://broker20.kafka.game.dev.com:8083
2015-09-10T19:03:26,420 INFO [qtp1781399452-77] com.metamx.http.client.pool.ChannelResourceFactory - Generating: http://broker239.kafka.game.dev.com:8083


2015-09-10T19:03:26,952 INFO [qtp1781399452-77] com.metamx.emitter.core.LoggingEmitter - Event [{"feed":"metrics","timestamp":"2015-09-10T19:03:26.952+08:00","service":"broker","host":"broker36.kafka.game.dev.com:8082","metric":"query/node/time","value":2828,"dataSource":"user_game_daily","duration":"PT1296000S","hasFilters":"true","id":"d8145fcd-83dd-45be-bde5-8a19811af6a3","interval":["2015-05-03T00:00:00.000+08:00/2015-05-04T00:00:00.000+08:00","2015-05-05T00:00:00.000+08:00/2015-05-06T00:00:00.000+08:00","2015-05-20T00:00:00.000+08:00/2015-05-21T00:00:00.000+08:00","2015-05-23T00:00:00.000+08:00/2015-05-28T00:00:00.000+08:00","2015-05-29T00:00:00.000+08:00/2015-06-01T00:00:00.000+08:00","2015-06-02T00:00:00.000+08:00/2015-06-06T00:00:00.000+08:00"],"numComplexMetrics":"0","numDimensions":"3","numMetrics":"6","server":"broker36.kafka.game.dev.com:8083","type":"groupBy"}]
2015-09-10T19:03:26,955 INFO [qtp1781399452-77] com.metamx.emitter.core.LoggingEmitter - Event [{"feed":"metrics","timestamp":"2015-09-10T19:03:26.955+08:00","service":"broker","host":"broker36.kafka.game.dev.com:8082","metric":"query/node/time","value":2243,"dataSource":"user_game_daily","duration":"PT777600S","hasFilters":"true","id":"d8145fcd-83dd-45be-bde5-8a19811af6a3","interval":["2015-05-08T00:00:00.000+08:00/2015-05-10T00:00:00.000+08:00","2015-05-13T00:00:00.000+08:00/2015-05-18T00:00:00.000+08:00","2015-05-28T00:00:00.000+08:00/2015-05-29T00:00:00.000+08:00","2015-06-06T00:00:00.000+08:00/2015-06-07T00:00:00.000+08:00"],"numComplexMetrics":"0","numDimensions":"3","numMetrics":"6","server":"broker20.kafka.game.dev.com:8083","type":"groupBy"}]
2015-09-10T19:03:26,958 INFO [qtp1781399452-77] com.metamx.emitter.core.LoggingEmitter - Event [{"feed":"metrics","timestamp":"2015-09-10T19:03:26.958+08:00","service":"broker","host":"broker36.kafka.game.dev.com:8082","metric":"query/node/time","value":540,"dataSource":"user_game_daily","duration":"PT518400S","hasFilters":"true","id":"d8145fcd-83dd-45be-bde5-8a19811af6a3","interval":["2015-05-10T00:00:00.000+08:00/2015-05-11T00:00:00.000+08:00","2015-05-12T00:00:00.000+08:00/2015-05-13T00:00:00.000+08:00","2015-05-18T00:00:00.000+08:00/2015-05-19T00:00:00.000+08:00","2015-05-21T00:00:00.000+08:00/2015-05-23T00:00:00.000+08:00","2015-06-07T00:00:00.000+08:00/2015-06-08T00:00:00.000+08:00"],"numComplexMetrics":"0","numDimensions":"3","numMetrics":"6","server":"broker239.kafka.game.dev.com:8083","type":"groupBy"}]
2015-09-10T19:03:26,960 INFO [qtp1781399452-77] com.metamx.emitter.core.LoggingEmitter - Event [{"feed":"metrics","timestamp":"2015-09-10T19:03:26.960+08:00","service":"broker","host":"broker36.kafka.game.dev.com:8082","metric":"query/node/time","value":4082,"dataSource":"user_game_daily","duration":"PT777600S","hasFilters":"true","id":"d8145fcd-83dd-45be-bde5-8a19811af6a3","interval":["2015-05-01T00:00:00.000+08:00/2015-05-03T00:00:00.000+08:00","2015-05-04T00:00:00.000+08:00/2015-05-05T00:00:00.000+08:00","2015-05-06T00:00:00.000+08:00/2015-05-08T00:00:00.000+08:00","2015-05-11T00:00:00.000+08:00/2015-05-12T00:00:00.000+08:00","2015-05-19T00:00:00.000+08:00/2015-05-20T00:00:00.000+08:00","2015-06-01T00:00:00.000+08:00/2015-06-02T00:00:00.000+08:00","2015-06-08T00:00:00.000+08:00/2015-06-09T00:00:00.000+08:00"],"numComplexMetrics":"0","numDimensions":"3","numMetrics":"6","server":"broker185.kafka.game.dev.com:8083","type":"groupBy"}]
2015-09-10T19:03:27,137 INFO [qtp1781399452-77] com.metamx.emitter.core.LoggingEmitter - Event [{"feed":"metrics","timestamp":"2015-09-10T19:03:27.137+08:00","service":"broker","host":"broker36.kafka.game.dev.com:8082","metric":"query/time","value":4265,"context":"{\"queryId\":\"d8145fcd-83dd-45be-bde5-8a19811af6a3\",\"timeout\":300000}","dataSource":"user_game_daily","duration":"PT3369600S","hasFilters":"true","id":"d8145fcd-83dd-45be-bde5-8a19811af6a3","interval":["2015-05-01T00:00:00.000+08:00/2015-06-09T00:00:00.000+08:00"],"remoteAddress":"10.21.42.62","type":"groupBy"}]


My broker config:

#druid.host=localhost
druid.port=8082
druid.service=broker

# We enable using the local query cache here
druid.broker.cache.useCache=true
druid.broker.cache.populateCache=true

#default 5
druid.broker.http.numConnections=20
druid.broker.http.readTimeout=PT5M

# For prod: set numThreads = # cores - 1, and sizeBytes to 512mb
druid.processing.buffer.sizeBytes=512000000
druid.processing.numThreads=20

#Number of threads for HTTP requests. default 10
druid.server.http.numThreads=30



The historical config:

druid.port=8083
druid.service=historical
# Our intermediate buffer is also very small so longer topNs will be slow.
# In prod: set sizeBytes = 512mb
druid.processing.buffer.sizeBytes=512000000
# We can only scan 1 segment in parallel with these configs.
# In prod: set numThreads = # cores - 1
druid.processing.numThreads=20

druid.server.http.numThreads=30

# maxSize should reflect the performance you want.
# Druid memory maps segments.
# memory_for_segments = total_memory - heap_size - (processing.buffer.sizeBytes * (processing.numThreads+1)) - JVM overhead (~1G)
# The greater the memory/disk ratio, the better performance you should see
druid.segmentCache.locations=[{"path": "/data/druid/data/indexCache", "maxSize"\: 500000000000}]

druid.monitoring.monitors=["io.druid.server.metrics.HistoricalMetricsMonitor", "com.metamx.metrics.JvmMonitor"]
# 500G
druid.server.maxSize=500000000000



thanks

Fangjin Yang

unread,
Sep 16, 2015, 7:52:47 PM9/16/15
to Druid Development
Hi Bimo,

Not entirely sure what is happening there. I can ask around among folks that run large groupBys on historicals to see if they experience the same issue. One possibility is that some blocking occurs while looking up the name of the broker.

Also, as an aside, you may be interested in using multiple topNs instead of a single multi-dimensional groupBy.

For example, an open source UI we recently created can support pivot-table-like functionality with multiple topNs instead of a single groupBy: