We are trying to load a TSV file into a Druid cluster set up with Ambari. Before doing so, we ran the same ingestion on a single node on a local machine and everything worked fine. On the Ambari cluster it fails with the error shown below:
2018-10-14T12:22:12,321 ERROR [task-runner-0-priority-0] io.druid.indexing.overlord.ThreadPoolTaskRunner - Exception while running task[IndexTask{id=index_etlranker_2018-10-14T12:22:08.103Z, type=index, dataSource=etlranker}]
java.lang.IllegalArgumentException: Parameter 'directory' is not a directory: /home/druid/DFP_SESReport/2018/04/02
at org.apache.commons.io.FileUtils.validateListFilesParameters(FileUtils.java:536) ~[commons-io-2.5.jar:2.5]
at org.apache.commons.io.FileUtils.listFiles(FileUtils.java:512) ~[commons-io-2.5.jar:2.5]
at io.druid.segment.realtime.firehose.LocalFirehoseFactory.initObjects(LocalFirehoseFactory.java:82) ~[druid-server-0.10.1.2.6.5.0-292.jar:0.10.1.2.6.5.0-292]
at io.druid.data.input.impl.AbstractTextFilesFirehoseFactory.connect(AbstractTextFilesFirehoseFactory.java:57) ~[druid-api-0.10.1.2.6.5.0-292.jar:0.10.1.2.6.5.0-292]
at io.druid.data.input.impl.AbstractTextFilesFirehoseFactory.connect(AbstractTextFilesFirehoseFactory.java:46) ~[druid-api-0.10.1.2.6.5.0-292.jar:0.10.1.2.6.5.0-292]
at io.druid.indexing.common.task.IndexTask.determineShardSpecs(IndexTask.java:268) ~[druid-indexing-service-0.10.1.2.6.5.0-292.jar:0.10.1.2.6.5.0-292]
at io.druid.indexing.common.task.IndexTask.run(IndexTask.java:188) ~[druid-indexing-service-0.10.1.2.6.5.0-292.jar:0.10.1.2.6.5.0-292]
at io.druid.indexing.overlord.ThreadPoolTaskRunner$ThreadPoolTaskRunnerCallable.call(ThreadPoolTaskRunner.java:436) [druid-indexing-service-0.10.1.2.6.5.0-292.jar:0.10.1.2.6.5.0-292]
at io.druid.indexing.overlord.ThreadPoolTaskRunner$ThreadPoolTaskRunnerCallable.call(ThreadPoolTaskRunner.java:408) [druid-indexing-service-0.10.1.2.6.5.0-292.jar:0.10.1.2.6.5.0-292]
at java.util.concurrent.FutureTask.run(FutureTask.java:266) [?:1.8.0_112]
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) [?:1.8.0_112]
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [?:1.8.0_112]
at java.lang.Thread.run(Thread.java:745) [?:1.8.0_112]
2018-10-14T12:22:12,323 INFO [task-runner-0-priority-0] io.druid.indexing.overlord.TaskRunnerUtils - Task [index_etlranker_2018-10-14T12:22:08.103Z] status changed to [FAILED].
2018-10-14T12:22:12,327 INFO [task-runner-0-priority-0] io.druid.indexing.worker.executor.ExecutorLifecycle - Task completed with status: {
"id" : "index_etlranker_2018-10-14T12:22:08.103Z",
"status" : "FAILED",
"duration" : 37
}
The JSON file and the full error logs are attached. We have spent hours trying to fix this with no luck.
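One check that may help narrow this down: the exception is raised when the path is not seen as a directory by the process, which can also happen when the path does not exist at all on that host, or when the user running the task cannot traverse it. The sketch below is only a diagnostic aid, not part of Druid; the function name `diagnose_directory` is made up for illustration, and it assumes the script is run on the same node, and as the same OS user, as the task that failed.

```python
import os


def diagnose_directory(path):
    """Describe whether `path` is usable as a directory by the current user."""
    path = os.path.abspath(path)
    if os.path.isdir(path):
        # Listing a directory needs both read and execute (search) permission.
        if os.access(path, os.R_OK | os.X_OK):
            return "ok"
        return "directory exists but is not readable by this user"
    if os.path.exists(path):
        return "path exists but is not a directory"
    # Path is missing (or hidden by permissions): find the deepest ancestor
    # that this user can see. The filesystem root always exists, so this ends.
    parent = os.path.dirname(path)
    while not os.path.exists(parent):
        parent = os.path.dirname(parent)
    return "path does not exist; deepest existing ancestor: " + parent


if __name__ == "__main__":
    # Directory taken from the exception message in the log above.
    print(diagnose_directory("/home/druid/DFP_SESReport/2018/04/02"))
```

If this reports that the path does not exist on one of the cluster nodes, the data is probably present only on the machine where it was copied and not on the node that actually ran the task.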
Any help will be appreciated.
Regards,
Chethan G Puttaswamy