I've been trying get the HadoopDruidIndexer to work with my cluster, but to no avail.
After digging around, it seems like it may be a version issue between my version of Hadoop and the one Druid expects.
java -Xmx256m -Duser.timezone=UTC -Dfile.encoding=UTF-8 -classpath services/target/*:services/target/classes:lib/*:/etc/hadoop/conf/ io.druid.cli.Main index hadoop macrosense/hadoopindexer.specFile
2014-06-13 06:17:00,871 WARN [main] org.apache.hadoop.util.NativeCodeLoader - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
2014-06-13 06:17:00,894 INFO [main] io.druid.indexer.path.StaticPathSpec - Adding paths[s3n://processing/input/]
2014-06-13 06:17:03,622 INFO [main] io.druid.indexer.JobHelper - Uploading jar to path[/tmp/classpath/druid-services-0.6.122-SNAPSHOT.jar]
2014-06-13 06:17:03,806 INFO [main] io.druid.indexer.path.StaticPathSpec - Adding paths[s3n://macrosense-rtb-processing/input/]
2014-06-13 06:17:03,818 INFO [main] org.apache.hadoop.conf.Configuration.deprecation -
session.id is deprecated. Instead, use dfs.metrics.session-id
2014-06-13 06:17:03,819 INFO [main] org.apache.hadoop.metrics.jvm.JvmMetrics - Initializing JVM Metrics with processName=JobTracker, sessionId=
2014-06-13 06:17:03,843 WARN [main] org.apache.hadoop.mapreduce.JobSubmitter - Hadoop command-line option parsing not performed. Implement the Tool interface and execute your application with ToolRunner to remedy this.
2014-06-13 06:17:03,850 INFO [main] org.apache.hadoop.mapreduce.JobSubmitter - Cleaning up the staging area file:/user/sense367716156/.staging/job_local367716156_0001
2014-06-13 06:17:03,850 WARN [main] org.apache.hadoop.security.UserGroupInformation - PriviledgedActionException as:sense (auth:SIMPLE) cause:org.apache.hadoop.util.Shell$ExitCodeException: chmod: cannot access ‘/user/sense367716156/.staging/job_local367716156_0001’: No such file or directory
2014-06-13 06:17:03,852 ERROR [main] io.druid.cli.CliHadoopIndexer - failure!!!!
java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at io.druid.cli.CliHadoopIndexer.run(CliHadoopIndexer.java:113)
at io.druid.cli.Main.main(Main.java:92)
Caused by: java.lang.RuntimeException: org.apache.hadoop.util.Shell$ExitCodeException: chmod: cannot access ‘/user/sense367716156/.staging/job_local367716156_0001’: No such file or directory
at com.google.common.base.Throwables.propagate(Throwables.java:160)
at io.druid.indexer.DetermineHashedPartitionsJob.run(DetermineHashedPartitionsJob.java:202)
at io.druid.indexer.JobHelper.runJobs(JobHelper.java:135)
at io.druid.indexer.HadoopDruidDetermineConfigurationJob.run(HadoopDruidDetermineConfigurationJob.java:86)
at io.druid.indexer.JobHelper.runJobs(JobHelper.java:135)
at io.druid.cli.CliInternalHadoopIndexer.run(CliInternalHadoopIndexer.java:57)
at io.druid.cli.Main.main(Main.java:92)
... 6 more
Caused by: org.apache.hadoop.util.Shell$ExitCodeException: chmod: cannot access ‘/user/sense367716156/.staging/job_local367716156_0001’: No such file or directory
at org.apache.hadoop.util.Shell.runCommand(Shell.java:505)
at org.apache.hadoop.util.Shell.run(Shell.java:418)
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:650)
at org.apache.hadoop.util.Shell.execCommand(Shell.java:739)
at org.apache.hadoop.util.Shell.execCommand(Shell.java:722)
at org.apache.hadoop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:631)
at org.apache.hadoop.fs.FilterFileSystem.setPermission(FilterFileSystem.java:468)
at org.apache.hadoop.fs.FileSystem.mkdirs(FileSystem.java:598)
at org.apache.hadoop.mapreduce.JobSubmitter.copyAndConfigureFiles(JobSubmitter.java:179)
at org.apache.hadoop.mapreduce.JobSubmitter.copyAndConfigureFiles(JobSubmitter.java:301)
at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:389)
at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1285)
at org.apache.hadoop.mapreduce.Job$10.run(Job.java:1282)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:1282)
at io.druid.indexer.DetermineHashedPartitionsJob.run(DetermineHashedPartitionsJob.java:120)
... 11 more