Urgently need help getting my historical nodes up


Ben Vogan

Sep 9, 2016, 1:49:04 PM
to druid...@googlegroups.com
Hi all,

I have had another Event (capital E) with my Druid cluster.  I am not entirely sure what happened, but there is a problem with the communication between nodes.  It appears that the coordinator is having trouble with the historical nodes.  

The cluster is small: a single master node running the coordinator and overlord, 2 data nodes running middle managers and historical nodes, and 2 query nodes running brokers.  The middle managers are spawning indexing tasks and the overlord can see them fine.  However, there are many errors in the coordinator log: first about not having any servers at all, then about not having enough servers for the given replication factor (2).

I do have 2 historical nodes, but tailing the logs, one appears to do nothing (and this one is currently seen by the coordinator), while the other is in an endless loop: it loads all of its segment cache files, then announces all of the segments, then starts over.

So a bunch of these:
2016-09-09T17:39:02,221 INFO [main] io.druid.server.coordination.ZkCoordinator - Loading segment cache file [7552/64589][/services/druid/data/segment-cache/info_dir/sor_business_event_success_v1_2016-08-14T23:00:00.000Z_2016-08-15T00:00:00.000Z_2016-08-14T

Followed by a bunch of these:
2016-09-09T17:41:10,427 INFO [main] io.druid.server.coordination.BatchDataSegmentAnnouncer - Announcing segment[business_events_test_2016-09-04T01:00:00.000Z_2016-09-04T02:00:00.000Z_2016-09-03T01:54:31.884Z_11] at path[/druid/segments/druid-data001.internal.shopkick.com:8083/druid-data001.internal.shopkick.com:8083_historical__default_tier_2016-09-09T17:41:10.381Z_e7b20ac9d0e544c187cb1187da8ac68b0]

Then:
2016-09-09T17:45:08,939 INFO [main] io.druid.server.coordination.ZkCoordinator$BackgroundSegmentAnnouncer - Completed background segment announcing
2016-09-09T17:45:08,940 INFO [main] io.druid.server.coordination.ZkCoordinator - Cache load took 63,227 ms
2016-09-09T17:45:08,946 INFO [main] com.metamx.common.lifecycle.Lifecycle$AnnotationBasedHandler - Invoking start method[public void io.druid.query.lookup.LookupReferencesManager.start()] on object[io.druid.query.lookup.LookupReferencesManager@7135ce0a].
2016-09-09T17:45:08,946 INFO [main] io.druid.query.lookup.LookupReferencesManager - Started lookup factory references manager
2016-09-09T17:45:08,948 INFO [main] com.metamx.common.lifecycle.Lifecycle$AnnotationBasedHandler - Invoking start method[public void io.druid.server.listener.announcer.ListenerResourceAnnouncer.start()] on object[io.druid.query.lookup.LookupResourceListenerAnnouncer@2e590b].
2016-09-09T17:45:08,968 INFO [main] io.druid.server.listener.announcer.ListenerResourceAnnouncer - Announcing start time on [/druid/listeners/lookups/__default/druid-data001.internal.shopkick.com:8083]
2016-09-09T17:45:08,969 INFO [main] com.metamx.common.lifecycle.Lifecycle$AnnotationBasedHandler - Invoking start method[public void io.druid.server.coordination.AbstractDataSegmentAnnouncer.start()] on object[io.druid.server.coordination.BatchDataSegmentAnnouncer@2c18a3ea].
2016-09-09T17:45:08,969 INFO [main] io.druid.server.coordination.AbstractDataSegmentAnnouncer - Announcing self[DruidServerMetadata{name='druid-data001.internal.shopkick.com:8083', host='druid-data001.internal.shopkick.com:8083', maxSize=130000000000, tier='_default_tier', type='historical', priority='0'}] at [/druid/announcements/druid-data001.internal.shopkick.com:8083]
2016-09-09T17:45:09,018 INFO [ZkCoordinator-0] io.druid.server.coordination.ZkCoordinator - New request[LOAD: business_events_test_2016-09-11T02:00:00.000Z_2016-09-11T03:00:00.000Z_2016-09-07T02:08:30.019Z_8] with zNode[/druid/loadQueue/druid-data001.internal.shopkick.com:8083/business_events_test_2016-09-11T02:00:00.000Z_2016-09-11T03:00:00.000Z_2016-09-07T02:08:30.019Z_8].
2016-09-09T17:45:09,019 INFO [ZkCoordinator-0] io.druid.server.coordination.ZkCoordinator - Loading segment business_events_test_2016-09-11T02:00:00.000Z_2016-09-11T03:00:00.000Z_2016-09-07T02:08:30.019Z_8
2016-09-09T17:45:09,024 WARN [ZkCoordinator-0] io.druid.server.coordination.ServerManager - Told to load a adapter for a segment[business_events_test_2016-09-11T02:00:00.000Z_2016-09-11T03:00:00.000Z_2016-09-07T02:08:30.019Z_8] that already exists
2016-09-09T17:45:09,038 INFO [ZkCoordinator-0] io.druid.server.coordination.ZkCoordinator - Completed request [LOAD: business_events_test_2016-09-11T02:00:00.000Z_2016-09-11T03:00:00.000Z_2016-09-07T02:08:30.019Z_8]
2016-09-09T17:45:09,038 INFO [ZkCoordinator-0] io.druid.server.coordination.ZkCoordinator - zNode[/druid/loadQueue/druid-data001.internal.shopkick.com:8083/business_events_test_2016-09-11T02:00:00.000Z_2016-09-11T03:00:00.000Z_2016-09-07T02:08:30.019Z_8] was removed

This is followed by a bunch of metrics, and then it looks like the process restarts: it logs all the initialization output and starts the cycle over.

Help would be greatly appreciated.

--

Benjamin Vogan | Data Platform Team Lead
 
The indispensable app that rewards you for shopping.

Gian Merlino

Sep 9, 2016, 2:08:31 PM
to druid...@googlegroups.com
Hey Ben,

Any idea why the process dies and restarts? That seems like the real problem. Some stuff to check for:

1) Exceptions/errors in the historical jvm logs
2) Errors in the logs of whatever is launching the historical (if you're using Imply, this is bin/supervise)
3) Errors from your OS (if Linux, one big one to look for is the oom-killer; Linux may be torching your process unceremoniously – check dmesg / kernel logs)
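
For that last one, something along these lines usually works (a sketch; the exact kernel log location varies by distro):

dmesg -T | grep -i -E 'killed process|out of memory'
grep -i oom /var/log/kern.log    # or /var/log/messages on RHEL/CentOS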

Gian


Ben Vogan

Sep 9, 2016, 2:52:44 PM
to druid...@googlegroups.com
Hi Gian,

Thanks for your help.

The only error in the historical jvm log is this one, and it doesn't appear to trigger the shutdown as the process continues for a good while afterwards:

2016-09-09T17:06:50,714 ERROR [main] io.druid.server.coordination.ZkCoordinator - Failed to load segment from segmentInfo file: {class=io.druid.server.coordination.ZkCoordinator, exceptionType=class com.fasterxml.jackson.databind.JsonMappingException, exceptionMessage=No content to map due to end-of-input
 at [Source: /services/druid/data/segment-cache/info_dir/business_events_success_2016-09-09T14:00:00.000Z_2016-09-09T15:00:00.000Z_2016-09-09T13:30:28.843Z; line: 1, column: 1], file=/services/druid/data/segment-cache/info_dir/business_events_success_2016-09-09T14:00:00.000Z_2016-09-09T15:00:00.000Z_2016-09-09T13:30:28.843Z}
com.fasterxml.jackson.databind.JsonMappingException: No content to map due to end-of-input
 at [Source: /services/druid/data/segment-cache/info_dir/business_events_success_2016-09-09T14:00:00.000Z_2016-09-09T15:00:00.000Z_2016-09-09T13:30:28.843Z; line: 1, column: 1]
        at com.fasterxml.jackson.databind.JsonMappingException.from(JsonMappingException.java:148) ~[jackson-databind-2.4.6.jar:2.4.6]
        at com.fasterxml.jackson.databind.ObjectMapper._initForReading(ObjectMapper.java:3110) ~[jackson-databind-2.4.6.jar:2.4.6]
        at com.fasterxml.jackson.databind.ObjectMapper._readMapAndClose(ObjectMapper.java:3051) ~[jackson-databind-2.4.6.jar:2.4.6]
        at com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:2115) ~[jackson-databind-2.4.6.jar:2.4.6]
        at io.druid.server.coordination.ZkCoordinator.loadLocalCache(ZkCoordinator.java:258) [druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator.start(ZkCoordinator.java:133) [druid-server-0.9.1.1.jar:0.9.1.1]
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) ~[?:1.8.0_91]
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) ~[?:1.8.0_91]
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) ~[?:1.8.0_91]
        at java.lang.reflect.Method.invoke(Method.java:498) ~[?:1.8.0_91]
        at com.metamx.common.lifecycle.Lifecycle$AnnotationBasedHandler.start(Lifecycle.java:350) [java-util-0.27.9.jar:?]
        at com.metamx.common.lifecycle.Lifecycle.start(Lifecycle.java:259) [java-util-0.27.9.jar:?]
        at io.druid.guice.LifecycleModule$2.start(LifecycleModule.java:155) [druid-api-0.9.1.1.jar:0.9.1.1]
        at io.druid.cli.GuiceRunnable.initLifecycle(GuiceRunnable.java:91) [druid-services-0.9.1.1.jar:0.9.1.1]
        at io.druid.cli.ServerRunnable.run(ServerRunnable.java:40) [druid-services-0.9.1.1.jar:0.9.1.1]
        at io.druid.cli.Main.main(Main.java:105) [druid-services-0.9.1.1.jar:0.9.1.1]
2016-09-09T17:06:50,747 INFO [main] com.metamx.emitter.core.LoggingEmitter - Event [{"feed":"alerts","timestamp":"2016-09-09T17:06:50.725Z","service":"druid/historical","host":"druid-data002.internal.shopkick.com:8083","severity":"component-failure","description":"Failed to load segment from segmentInfo file","data":{"class":"io.druid.server.coordination.ZkCoordinator","exceptionType":"com.fasterxml.jackson.databind.JsonMappingException","exceptionMessage":"No content to map due to end-of-input\n at [Source: /services/druid/data/segment-cache/info_dir/business_events_success_2016-09-09T14:00:00.000Z_2016-09-09T15:00:00.000Z_2016-09-09T13:30:28.843Z; line: 1, column: 1]","exceptionStackTrace":"com.fasterxml.jackson.databind.JsonMappingException: No content to map due to end-of-input\n at [Source: /services/druid/data/segment-cache/info_dir/business_events_success_2016-09-09T14:00:00.000Z_2016-09-09T15:00:00.000Z_2016-09-09T13:30:28.843Z; line: 1, column: 1]\n\tat com.fasterxml.jackson.databind.JsonMappingException.from(JsonMappingException.java:148)\n\tat com.fasterxml.jackson.databind.ObjectMapper._initForReading(ObjectMapper.java:3110)\n\tat com.fasterxml.jackson.databind.ObjectMapper._readMapAndClose(ObjectMapper.java:3051)\n\tat com.fasterxml.jackson.databind.ObjectMapper.readValue(ObjectMapper.java:2115)\n\tat io.druid.server.coordination.ZkCoordinator.loadLocalCache(ZkCoordinator.java:258)\n\tat io.druid.server.coordination.ZkCoordinator.start(ZkCoordinator.java:133)\n\tat sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)\n\tat sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)\n\tat sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)\n\tat java.lang.reflect.Method.invoke(Method.java:498)\n\tat com.metamx.common.lifecycle.Lifecycle$AnnotationBasedHandler.start(Lifecycle.java:350)\n\tat com.metamx.common.lifecycle.Lifecycle.start(Lifecycle.java:259)\n\tat io.druid.guice.LifecycleModule$2.start(LifecycleModule.java:155)\n\tat io.druid.cli.GuiceRunnable.initLifecycle(GuiceRunnable.java:91)\n\tat io.druid.cli.ServerRunnable.run(ServerRunnable.java:40)\n\tat io.druid.cli.Main.main(Main.java:105)\n","file":"/services/druid/data/segment-cache/info_dir/business_events_success_2016-09-09T14:00:00.000Z_2016-09-09T15:00:00.000Z_2016-09-09T13:30:28.843Z"}}]
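
Since the exception complains about "No content to map due to end-of-input", that info file is presumably zero-length. A quick way to spot truncated cache files like this (a sketch, using our paths):

find /services/druid/data/segment-cache/info_dir -type f -size 0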

I am using supervise, but I'm not sure where to look for its logs.  Can you point me in the right direction?  In the sv directory I see only the historical and middle manager logs.

I see no evidence that an OOM killer is doing anything to the process.

--Ben



Gian Merlino

Sep 9, 2016, 3:30:29 PM
to druid...@googlegroups.com
supervise logs to stdout/stderr, so its logs depend on what runs *it*.
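
If you start it by hand, you can capture them yourself, e.g. (a sketch assuming the Imply layout; adjust for however you actually launch it):

nohup bin/supervise -c conf/supervise/data.conf > supervise.log 2>&1 &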

Gian

Gian Merlino

Sep 9, 2016, 4:51:36 PM
to druid...@googlegroups.com
That error doesn't look like anything that should kill the node, btw.

Gian

Ben Vogan

Sep 10, 2016, 8:07:18 PM
to druid...@googlegroups.com
Just to follow up: there was no OOM killer involved, and I still do not know why the process was dying, since it did not log an error.  However, I deleted the segment-cache on all of the historical nodes, restarted the processes, and they came back healthy.
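
For reference, the cleanup on each data node was roughly this (a sketch; paths per our layout, and I stopped each process first and restarted it via supervise afterwards):

rm -rf /services/druid/data/segment-cache/*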

--Ben



Ben Vogan

Sep 12, 2016, 11:31:54 AM
to druid...@googlegroups.com
It appears that fix was only temporary.  The supervise script is definitely showing historical node restarts:

[Sun Sep 11 19:36:48 2016] Running command[historical], logging to[/services/druid/druid-20160817.01/../data/sv/historical.log]: /etc/druid/current/run-druid historical /services/druid/conf
[Sun Sep 11 19:36:48 2016] Running command[middleManager], logging to[/services/druid/druid-20160817.01/../data/sv/middleManager.log]: /etc/druid/current/run-druid middleManager /services/druid/conf
[Sun Sep 11 21:25:41 2016] Command[historical] exited (pid = 47468, exited = 1)
[Sun Sep 11 21:25:41 2016] Command[historical] failed, see logfile for more details: /services/druid/druid-20160817.01/../data/sv/historical.log
[Sun Sep 11 21:25:44 2016] Running command[historical], logging to[/services/druid/druid-20160817.01/../data/sv/historical.log]: /etc/druid/current/run-druid historical /services/druid/conf
[Sun Sep 11 21:41:39 2016] Command[historical] exited (pid = 9191, exited = 1)
[Sun Sep 11 21:41:39 2016] Command[historical] failed, see logfile for more details: /services/druid/druid-20160817.01/../data/sv/historical.log
[Sun Sep 11 21:41:42 2016] Running command[historical], logging to[/services/druid/druid-20160817.01/../data/sv/historical.log]: /etc/druid/current/run-druid historical /services/druid/conf
[Sun Sep 11 22:00:55 2016] Command[historical] exited (pid = 24161, exited = 1)
[Sun Sep 11 22:00:55 2016] Command[historical] failed, see logfile for more details: /services/druid/druid-20160817.01/../data/sv/historical.log
[Sun Sep 11 22:00:58 2016] Running command[historical], logging to[/services/druid/druid-20160817.01/../data/sv/historical.log]: /etc/druid/current/run-druid historical /services/druid/conf
[Sun Sep 11 22:02:14 2016] Command[historical] exited (pid = 44688, exited = 1)
[Sun Sep 11 22:02:14 2016] Command[historical] failed, see logfile for more details: /services/druid/druid-20160817.01/../data/sv/historical.log
[Sun Sep 11 22:02:17 2016] Running command[historical], logging to[/services/druid/druid-20160817.01/../data/sv/historical.log]: /etc/druid/current/run-druid historical /services/druid/conf
[Sun Sep 11 22:03:33 2016] Command[historical] exited (pid = 47175, exited = 1)
[Sun Sep 11 22:03:33 2016] Command[historical] failed, see logfile for more details: /services/druid/druid-20160817.01/../data/sv/historical.log
[Sun Sep 11 22:03:36 2016] Running command[historical], logging to[/services/druid/druid-20160817.01/../data/sv/historical.log]: /etc/druid/current/run-druid historical /services/druid/conf

But the historical node log isn't very helpful.  The only exceptions I see are the ones below; they exist for different data sources and for different segments in each data source:

2016-09-12T01:05:51,134 ERROR [ZkCoordinator-0] io.druid.server.coordination.ZkCoordinator - Failed to load segment for dataSource: {class=io.druid.server.coordination.ZkCoordinator, exceptionType=class io.druid.segment.loading.SegmentLoadingException, exceptionMessage=Exception loading segment[business_events_test_2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z_2016-09-11T15:12:33.459Z], segment=DataSegment{size=17665, shardSpec=NumberedShardSpec{partitionNum=0, partitions=0}, metrics=[count, transaction_kicks, transaction_cost, transaction_revenue, users, user_devices, locations], dimensions=[user_gender, user_social_facebook, user_registration_age_bracket, user_registration_age_day, user_registration_age_week, event_type, event_status, event_is_offline, user_country, client_app_version, client_ip_address, client_locale, client_os_name, client_user_agent, device_platform_type, device_name, app_id, user_id, device_id, client_event_time, client_request_time, server_request_time, user_registration_time, event_coordinates], version='2016-09-11T15:12:33.459Z', loadSpec={type=hdfs, path=hdfs://hdfs001:8020/druid/segments/business_events_test/20161024T150000.000Z_20161024T160000.000Z/2016-09-11T15_12_33.459Z/0/index.zip}, interval=2016-10-24T15:00:00.000Z/2016-10-24T16:00:00.000Z, dataSource='business_events_test', binaryVersion='9'}}
io.druid.segment.loading.SegmentLoadingException: Exception loading segment[business_events_test_2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z_2016-09-11T15:12:33.459Z]
        at io.druid.server.coordination.ZkCoordinator.loadSegment(ZkCoordinator.java:309) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator.addSegment(ZkCoordinator.java:350) [druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.SegmentChangeRequestLoad.go(SegmentChangeRequestLoad.java:44) [druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator$1.childEvent(ZkCoordinator.java:152) [druid-server-0.9.1.1.jar:0.9.1.1]
        at org.apache.curator.framework.recipes.cache.PathChildrenCache$5.apply(PathChildrenCache.java:522) [curator-recipes-2.10.0.jar:?]
        at org.apache.curator.framework.recipes.cache.PathChildrenCache$5.apply(PathChildrenCache.java:516) [curator-recipes-2.10.0.jar:?]
        at org.apache.curator.framework.listen.ListenerContainer$1.run(ListenerContainer.java:93) [curator-framework-2.10.0.jar:?]
        at com.google.common.util.concurrent.MoreExecutors$SameThreadExecutorService.execute(MoreExecutors.java:297) [guava-16.0.1.jar:?]
        at org.apache.curator.framework.listen.ListenerContainer.forEach(ListenerContainer.java:85) [curator-framework-2.10.0.jar:?]
        at org.apache.curator.framework.recipes.cache.PathChildrenCache.callListeners(PathChildrenCache.java:514) [curator-recipes-2.10.0.jar:?]
        at org.apache.curator.framework.recipes.cache.EventOperation.invoke(EventOperation.java:35) [curator-recipes-2.10.0.jar:?]
        at org.apache.curator.framework.recipes.cache.PathChildrenCache$9.run(PathChildrenCache.java:772) [curator-recipes-2.10.0.jar:?]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [?:1.8.0_91]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_91]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [?:1.8.0_91]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_91]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_91]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_91]
        at java.lang.Thread.run(Thread.java:745) [?:1.8.0_91]
Caused by: io.druid.segment.loading.SegmentLoadingException: /services/druid/data/segment-cache/business_events_test/2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z/2016-09-11T15:12:33.459Z/0/index.drd (No such file or directory)
        at io.druid.segment.loading.MMappedQueryableIndexFactory.factorize(MMappedQueryableIndexFactory.java:52) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.segment.loading.SegmentLoaderLocalCacheManager.getSegment(SegmentLoaderLocalCacheManager.java:96) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ServerManager.loadSegment(ServerManager.java:152) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator.loadSegment(ZkCoordinator.java:305) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        ... 18 more
Caused by: java.io.FileNotFoundException: /services/druid/data/segment-cache/business_events_test/2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z/2016-09-11T15:12:33.459Z/0/index.drd (No such file or directory)
        at java.io.FileInputStream.open0(Native Method) ~[?:1.8.0_91]
        at java.io.FileInputStream.open(FileInputStream.java:195) ~[?:1.8.0_91]
        at java.io.FileInputStream.<init>(FileInputStream.java:138) ~[?:1.8.0_91]
        at io.druid.segment.SegmentUtils.getVersionFromDir(SegmentUtils.java:43) ~[druid-api-0.9.1.1.jar:0.9.1.1]
        at io.druid.segment.IndexIO.loadIndex(IndexIO.java:211) ~[druid-processing-0.9.1.1.jar:0.9.1.1]
        at io.druid.segment.loading.MMappedQueryableIndexFactory.factorize(MMappedQueryableIndexFactory.java:49) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.segment.loading.SegmentLoaderLocalCacheManager.getSegment(SegmentLoaderLocalCacheManager.java:96) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ServerManager.loadSegment(ServerManager.java:152) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator.loadSegment(ZkCoordinator.java:305) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        ... 18 more


2016-09-12T01:07:21,065 ERROR [ZkCoordinator-0] io.druid.server.coordination.ZkCoordinator - Failed to load segment for dataSource: {class=io.druid.server.coordination.ZkCoordinator, exceptionType=class io.druid.segment.loading.SegmentLoadingException, exceptionMessage=Exception loading segment[business_events_2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z_2016-09-11T15:12:32.883Z], segment=DataSegment{size=18044, shardSpec=NumberedShardSpec{partitionNum=0, partitions=0}, metrics=[count, transaction_kicks, transaction_cost, transaction_revenue, user_id_sketch, unique_user_id, unique_user_device_id, unique_location_id], dimensions=[event_is_offline, user_social_facebook, user_gender, user_registration_age_bracket, event_type, event_status, user_registration_age_day, user_registration_age_week, user_country, client_app_version, client_locale, client_os_name, client_user_agent, device_platform_type, device_name, app_id, client_ip_address, user_id, device_id, client_event_time, client_request_time, server_request_time, user_registration_time, event_coordinates], version='2016-09-11T15:12:32.883Z', loadSpec={type=hdfs, path=hdfs://hdfs001:8020/druid/segments/business_events/20161024T150000.000Z_20161024T160000.000Z/2016-09-11T15_12_32.883Z/0/index.zip}, interval=2016-10-24T15:00:00.000Z/2016-10-24T16:00:00.000Z, dataSource='business_events', binaryVersion='9'}}
io.druid.segment.loading.SegmentLoadingException: Exception loading segment[business_events_2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z_2016-09-11T15:12:32.883Z]
        at io.druid.server.coordination.ZkCoordinator.loadSegment(ZkCoordinator.java:309) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator.addSegment(ZkCoordinator.java:350) [druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.SegmentChangeRequestLoad.go(SegmentChangeRequestLoad.java:44) [druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator$1.childEvent(ZkCoordinator.java:152) [druid-server-0.9.1.1.jar:0.9.1.1]
        at org.apache.curator.framework.recipes.cache.PathChildrenCache$5.apply(PathChildrenCache.java:522) [curator-recipes-2.10.0.jar:?]
        at org.apache.curator.framework.recipes.cache.PathChildrenCache$5.apply(PathChildrenCache.java:516) [curator-recipes-2.10.0.jar:?]
        at org.apache.curator.framework.listen.ListenerContainer$1.run(ListenerContainer.java:93) [curator-framework-2.10.0.jar:?]
        at com.google.common.util.concurrent.MoreExecutors$SameThreadExecutorService.execute(MoreExecutors.java:297) [guava-16.0.1.jar:?]
        at org.apache.curator.framework.listen.ListenerContainer.forEach(ListenerContainer.java:85) [curator-framework-2.10.0.jar:?]
        at org.apache.curator.framework.recipes.cache.PathChildrenCache.callListeners(PathChildrenCache.java:514) [curator-recipes-2.10.0.jar:?]
        at org.apache.curator.framework.recipes.cache.EventOperation.invoke(EventOperation.java:35) [curator-recipes-2.10.0.jar:?]
        at org.apache.curator.framework.recipes.cache.PathChildrenCache$9.run(PathChildrenCache.java:772) [curator-recipes-2.10.0.jar:?]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [?:1.8.0_91]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_91]
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [?:1.8.0_91]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_91]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_91]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_91]
        at java.lang.Thread.run(Thread.java:745) [?:1.8.0_91]
Caused by: io.druid.segment.loading.SegmentLoadingException: /services/druid/data/segment-cache/business_events/2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z/2016-09-11T15:12:32.883Z/0/index.drd (No such file or directory)
        at io.druid.segment.loading.MMappedQueryableIndexFactory.factorize(MMappedQueryableIndexFactory.java:52) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.segment.loading.SegmentLoaderLocalCacheManager.getSegment(SegmentLoaderLocalCacheManager.java:96) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ServerManager.loadSegment(ServerManager.java:152) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator.loadSegment(ZkCoordinator.java:305) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        ... 18 more
Caused by: java.io.FileNotFoundException: /services/druid/data/segment-cache/business_events/2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z/2016-09-11T15:12:32.883Z/0/index.drd (No such file or directory)
        at java.io.FileInputStream.open0(Native Method) ~[?:1.8.0_91]
        at java.io.FileInputStream.open(FileInputStream.java:195) ~[?:1.8.0_91]
        at java.io.FileInputStream.<init>(FileInputStream.java:138) ~[?:1.8.0_91]
        at io.druid.segment.SegmentUtils.getVersionFromDir(SegmentUtils.java:43) ~[druid-api-0.9.1.1.jar:0.9.1.1]
        at io.druid.segment.IndexIO.loadIndex(IndexIO.java:211) ~[druid-processing-0.9.1.1.jar:0.9.1.1]
        at io.druid.segment.loading.MMappedQueryableIndexFactory.factorize(MMappedQueryableIndexFactory.java:49) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.segment.loading.SegmentLoaderLocalCacheManager.getSegment(SegmentLoaderLocalCacheManager.java:96) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ServerManager.loadSegment(ServerManager.java:152) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        at io.druid.server.coordination.ZkCoordinator.loadSegment(ZkCoordinator.java:305) ~[druid-server-0.9.1.1.jar:0.9.1.1]
        ... 18 more


I have plenty of disk space on the host, but memory is running low.

My Historical jvm config is:
[ben@druid-data001 druid]$ cat /services/druid/conf/historical/jvm.config 
-server
-Xms8g
-Xmx12g
-XX:MaxDirectMemorySize=4096m
-Duser.timezone=UTC
-Dfile.encoding=UTF-8
-Djava.io.tmpdir=/services/druid/data/tmp
-Djava.util.logging.manager=org.apache.logging.log4j.jul.LogManager
-Dservicename=historical

According to top, though, the historical Java process is using 17GB of RAM and 80GB of virtual memory.

Gian Merlino

Sep 12, 2016, 1:56:38 PM
to druid...@googlegroups.com
The historical process using a lot of virtual memory is typical, but most of that should be disk backed and not really problematic from an oom perspective.
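
If you want to see how much of that is actually resident versus merely mapped, something like this helps (a sketch; substitute the historical's pid):

pmap -x <pid> | tail -n 1                      # total mapped vs RSS
grep -E 'VmSize|VmRSS' /proc/<pid>/status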

This sort of "caused by" is pretty weird though: "Caused by: java.io.FileNotFoundException: /services/druid/data/segment-cache/business_events/2016-10-24T15:00:00.000Z_2016-10-24T16:00:00.000Z/2016-09-11T15:12:32.883Z/0/index.drd (No such file or directory)".

Is it possible that you have multiple historical node processes running on the same machine or filesystem and they are clobbering each other's files?

Or is it possible that one of your mount points is full even though the disk overall has free space?

Gian

Ben Vogan

Sep 12, 2016, 2:35:56 PM
to druid...@googlegroups.com
I checked that there is only a single historical process, and there is; or at least there was only one when I checked.  Since the process is being restarted frequently, I do not know whether there were two for a period of time.  I checked disk space and there is plenty.  I shut down some other things to free up 16GB more RAM, and the historical process grew to use all of it and then restarted again.  I have attached a chunk of the historical log.


Attachment: historical-09-12-2016-8.log.gz

Gian Merlino

Sep 12, 2016, 2:41:40 PM
to druid...@googlegroups.com
What you're describing sounds a lot like either the oom-killer or a malloc failure, both of which can happen in memory-constrained environments. How much RAM do you have in the box? With the config you have, -Xmx12g -XX:MaxDirectMemorySize=4096m, you need 16–20GB free for the historical process to be able to stay up.

If those are what's happening, I'm surprised you don't see evidence of either of them in the logs, but maybe they're happening and getting logged in strange places, or maybe they aren't being logged for some reason (?).

Gian

Gian Merlino

Sep 12, 2016, 2:48:31 PM
to druid...@googlegroups.com
I just realized from another thread you posted that you have "-XX:OnOutOfMemoryError=kill -9 %p" in your jvm opts. This would probably chomp logs about an OOME that occurred. So I think you are experiencing some form of OOME or malloc failure, and your historical is killing itself without logging that it's doing so.

I would suggest using -XX:+ExitOnOutOfMemoryError rather than "-XX:OnOutOfMemoryError=kill -9 %p". This will exit more cleanly and not chomp logs. It's only available in the last couple of JDK 8 releases, so you might have to update first. After that, the logs you get showing the OOME condition will help debug further.
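
In your jvm.config that change would look something like this (a sketch; the flag needs jdk8u92 or later):

-XX:+ExitOnOutOfMemoryError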

Gian

Ben Vogan

Sep 12, 2016, 3:31:23 PM
to druid...@googlegroups.com
I am running Java 1.8.0_91, so it appears I don't have the ExitOnOutOfMemoryError option.  However, I am not using OnOutOfMemoryError=kill -9 on the historical process, only for my peons.  The host has 64GB of RAM, but I am running the middle manager there as well; I was running 10 slots with -Xmx4g.  That should still have left ~24GB for the historical process, as the middle manager itself uses little memory.

I will upgrade my version of java and see if I can get some more useful logs.  I have also reduced the number of workers to 8 to see if the cluster stabilizes.



Gian Merlino

Sep 12, 2016, 3:36:19 PM
to druid...@googlegroups.com
If those peons are running realtime tasks, watch out for their processing buffers too. They are off-heap direct memory buffers, and can't be paged out.

Gian

pja...@yahoo-inc.com

Sep 12, 2016, 6:45:25 PM
to Druid User
Also, you might want to check whether swap space is set up on the machine. If it is not, the Java process may be killed by the OS if it memory-maps files that do not fit in memory.
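
To check (a sketch; the swapfile size below is just an example):

swapon -s && free -h    # confirm a non-zero swap line
# if there is none, e.g.:
#   dd if=/dev/zero of=/swapfile bs=1G count=16
#   chmod 600 /swapfile && mkswap /swapfile && swapon /swapfile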


Ben Vogan

Sep 12, 2016, 8:26:05 PM
to druid...@googlegroups.com
Hi Gian,

I upgraded Java and now I am getting crash dumps every 2 minutes (see attached).  I also reduced the RAM allocation for the historical process from 12g to 8g and reduced the number of middle manager workers to 8.  The host still has free memory and swap, but the historical process is constantly hitting OOM.  What is the expected behavior of the historical service?  Do I need enough memory to keep all of my data sources in RAM in their entirety?  My data sources are very small at the moment, but I have only 2 historical nodes in this cluster.  I was expecting that anything the historical node couldn't keep in memory would just be cached on disk.

Thanks for your help.

--Ben


Attachment: historical_pid_5091.log.gz

Gian Merlino

Sep 12, 2016, 8:33:46 PM
to druid...@googlegroups.com
You don't need memory to store your data – it's memory mapped and the OS can page it in and out as necessary. But you do need enough memory for your heaps + any offheap direct memory buffers for any data nodes. In total, for each process, that's going to be:

heap (all nodes) = Xmx
offheap buffers (historical, peons) = druid.processing.buffer.sizeBytes * (druid.processing.numThreads + 1)

It sounds like you do have enough memory for your heaps, but you might not have enough memory for your offheap buffers. Please double-check that.
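
For example, plugging in hypothetical values (yours may differ): with druid.processing.buffer.sizeBytes=536870912 (512MB) and druid.processing.numThreads=7, a historical at -Xmx8g needs about 8GB + 512MB * (7 + 1) = 12GB. Something like this pulls the actual values:

grep -E 'druid\.processing\.(buffer\.sizeBytes|numThreads)' /services/druid/conf/historical/runtime.properties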

Gian

Ben Vogan

Sep 12, 2016, 9:03:44 PM
to druid...@googlegroups.com
On a 64g host I have:

Historical xmx 8g + 4g = 12GB
Middle manager xmx 64mb
peons xmx 4g * 8 workers = 32 GB
peons buffer 512mb * 2 threads * 8 workers = 8GB
Total: 52GB

That matches what I see in top: 12GB free, but I am constantly seeing OOM errors.

--Ben




Ben Vogan

Sep 12, 2016, 9:06:38 PM
to druid...@googlegroups.com
Actually, I just re-read your email and see that it should be (num processing threads + 1) for peons, which would mean 12GB allocated to them.  That would still leave 8GB free if all the worker slots were occupied; they aren't right now, which is why I see 12GB free.

Ben Vogan

Sep 13, 2016, 12:52:31 PM
to druid...@googlegroups.com
Another data point: the coordinator process is going crazy CPU-wise, using 3000-4000% CPU.  It has also run into an OOM error at least once since I upgraded to Java 1.8.0_101 (even though the host had 25GB of RAM free; see attached log).  I presume it is because the historical nodes are bouncing up and down, so it is trying to rebalance segments.

From the coordinator log:
2016-09-13T10:01:13,106 INFO [Coordinator-Exec--0] io.druid.server.coordinator.LoadQueuePeon - Asking server peon[/druid/loadQueue/druid-data001.internal.shopkick.com:8083] to load segment[sor_business_event_success_v1_2016-08-30T01:00:00.000Z_2016-08-30T02:00:00.000Z_2016-08-30T00:53:27.087Z_14]
2016-09-13T10:01:13,108 INFO [Coordinator-Exec--0] io.druid.server.coordinator.LoadQueuePeon - Server[/druid/loadQueue/druid-data001.internal.shopkick.com:8083] skipping doNext() because something is currently loading[business_events_2016-09-13T05:00:00.000Z_2016-09-13T06:00:00.000Z_2016-09-13T02:03:21.100Z_1].
2016-09-13T10:01:31,450 INFO [Coordinator-Exec--0] io.druid.server.coordinator.LoadQueuePeon - Asking server peon[/druid/loadQueue/druid-data001.internal.shopkick.com:8083] to load segment[sor_business_event_success_v1_2016-08-30T01:00:00.000Z_2016-08-30T02:00:00.000Z_2016-08-30T00:53:27.087Z_12]
2016-09-13T10:01:31,450 INFO [Coordinator-Exec--0] io.druid.server.coordinator.LoadQueuePeon - Server[/druid/loadQueue/druid-data001.internal.shopkick.com:8083] skipping doNext() because something is currently loading[business_events_2016-09-13T05:00:00.000Z_2016-09-13T06:00:00.000Z_2016-09-13T02:03:21.100Z_1].
2016-09-13T10:01:41,377 INFO [Coordinator-Exec--0] io.druid.server.coordinator.LoadQueuePeon - Asking server peon[/druid/loadQueue/druid-data001.internal.shopkick.com:8083] to load segment[sor_business_event_success_v1_2016-08-30T01:00:00.000Z_2016-08-30T02:00:00.000Z_2016-08-30T00:53:27.087Z_11]
2016-09-13T10:01:41,377 INFO [Coordinator-Exec--0] io.druid.server.coordinator.LoadQueuePeon - Server[/druid/loadQueue/druid-data001.internal.shopkick.com:8083] skipping doNext() because something is currently loading[business_events_2016-09-13T05:00:00.000Z_2016-09-13T06:00:00.000Z_2016-09-13T02:03:21.100Z_1].
Aborting due to java.lang.OutOfMemoryError: GC overhead limit exceeded
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  Internal Error (debug.cpp:308), pid=41024, tid=0x00007f80d10d6700
#  fatal error: OutOfMemory encountered: GC overhead limit exceeded
#
# JRE version: Java(TM) SE Runtime Environment (8.0_101-b13) (build 1.8.0_101-b13)
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.101-b13 mixed mode linux-amd64 compressed oops)
# Failed to write core dump. Core dumps have been disabled. To enable core dumping, try "ulimit -c unlimited" before starting Java again
#
# An error report file with more information is saved as:
# /services/druid/data/sv/coordinator_pid_41024.log

From the historical log:
2016-09-13T03:24:48,820 INFO [ZkCoordinator-0] io.druid.server.coordination.ZkCoordinator - New request[LOAD: business_events_2016-10-06T00:00:00.000Z_2016-10-06T01:00:00.000Z_2016-09-13T00:06:31.656Z] with zNode[/druid/loadQueue/druid-data002.internal.shopkick.com:8083/business_events_2016-10-06T00:00:00.000Z_2016-10-06T01:00:00.000Z_2016-09-13T00:06:31.656Z].
2016-09-13T03:24:48,821 INFO [ZkCoordinator-0] io.druid.server.coordination.ZkCoordinator - Loading segment business_events_2016-10-06T00:00:00.000Z_2016-10-06T01:00:00.000Z_2016-09-13T00:06:31.656Z
#
# There is insufficient memory for the Java Runtime Environment to continue.
# Native memory allocation (malloc) failed to allocate 24 bytes for AllocateHeap
# An error report file with more information is saved as:
# /services/druid/data/sv/historical_pid_58328.log
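
Before the next restart I may also enable core dumps, per the hint in the error report (a sketch, run in the shell that launches supervise):

ulimit -c unlimited
cat /proc/sys/kernel/core_pattern    # confirm where cores will land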


--Ben
Attachments: coordinator_pid_41024.log.gz, historical_pid_58328.log.gz

Ben Vogan

Sep 14, 2016, 12:39:18 AM
to druid...@googlegroups.com, gi...@imply.io

Hi Gian,

Is there anything else I can send you to help diagnose this?  I tried shutting down the middle manager so that the only thing running was the historical node, and I upped the Xmx to 16g, with no change in behavior (still crashing with an OOM error).  My data source is small, only 90GB, and there is 130GB available on each historical node.  I find it hard to believe that this is stressing the cluster to this extent.

Thanks for your help,
Ben

Slim Bouguerra

Sep 14, 2016, 5:29:38 PM
to Druid User, gi...@imply.io
Hi, I didn't read the whole thread, but I have a small suggestion: have you tried increasing MaxDirectMemorySize, e.g. -XX:MaxDirectMemorySize=10240g?


Ben Vogan

Sep 14, 2016, 5:51:28 PM
to druid...@googlegroups.com, gi...@imply.io
Hi Slim,

I have not increased the direct memory size.  I have, once again, wiped my historical nodes and re-installed, and everything is working again.  I had done this on Friday as well, and things worked for a day before failing again; we will see if it continues to work this time.  Since the data I have now is more than what I had on Friday, if data size were the issue it should not work now either.  I do not know if it is related to query volume (which is really low, so unlikely) or perhaps one particular query, in which case increasing the direct memory size might help.  I will keep it in mind.

Thanks,
--Ben
