When I try this, I get the following exception. Any thoughts? It works just fine if I use the DistributedCache in a plain M/R job, and this job works if I remove the two properties you listed for the distributed cache.
13/08/08 16:21:48 INFO util.HadoopUtil: resolving application jar from found main method on: com.uhg.optum.ap.hadoop.cascadingtest.CascadingInput2
13/08/08 16:21:48 INFO planner.HadoopPlanner: using application jar: /mapr/datalake/optuminsight/ap_data/temp/john/cascading-test-0.0.1-SNAPSHOT.jar
13/08/08 16:21:48 INFO property.AppProps: using app.id: 99446B0842C7C55BC8AF507474A74E32
13/08/08 16:21:49 INFO util.NativeCodeLoader: Loaded the native-hadoop library
13/08/08 16:21:49 INFO security.JniBasedUnixGroupsMapping: Using JniBasedUnixGroupsMapping for Group resolution
13/08/08 16:21:49 INFO util.Version: Concurrent, Inc - Cascading 2.1.4
13/08/08 16:21:49 INFO flow.Flow: [] starting
13/08/08 16:21:49 INFO flow.Flow: [] source: GlobHfs[/datalake/optuminsight/ap_data/in/symmetry-input]
13/08/08 16:21:49 INFO flow.Flow: [] sink: Hfs["SequenceFile[['etgOutput']]"]["/datalake/optuminsight/ap_data/temp/john/cascading-test3"]
13/08/08 16:21:49 INFO flow.Flow: [] parallel execution is enabled: true
13/08/08 16:21:49 INFO flow.Flow: [] starting jobs: 1
13/08/08 16:21:49 INFO flow.Flow: [] allocating threads: 1
13/08/08 16:21:49 INFO flow.FlowStep: [] starting step: (1/1) ...temp/john/cascading-test3
13/08/08 16:21:49 INFO fs.JobTrackerWatcher: Current running JobTracker is: lablmapr2.uhclab.lab/10.112.255.17:9001
13/08/08 16:21:49 INFO mapred.JobClient: Cleaning up the staging area maprfs:/var/mapr/cluster/mapred/jobTracker/staging/jlavoi3/.staging/job_201308011057_0361
13/08/08 16:21:49 INFO flow.Flow: [] stopping all jobs
13/08/08 16:21:49 INFO flow.FlowStep: [] stopping: (1/1) ...temp/john/cascading-test3
13/08/08 16:21:49 INFO flow.Flow: [] stopped all jobs
Exception in thread "main" cascading.flow.FlowException: unhandled exception
at cascading.flow.BaseFlow.complete(BaseFlow.java:825)
at com.uhg.optum.ap.hadoop.cascadingtest.CascadingInput2.main(CascadingInput2.java:100)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:197)
Caused by: java.io.IOException: Cluster datalake has no entry in /opt/mapr//conf/mapr-clusters.conf
at com.mapr.fs.ClusterConf.getClusterByPath(ClusterConf.java:382)
at com.mapr.fs.MapRFileSystem.lookupClient(MapRFileSystem.java:257)
at com.mapr.fs.MapRFileSystem.getMapRFileStatus(MapRFileSystem.java:684)
at com.mapr.fs.MapRFileSystem.getFileStatus(MapRFileSystem.java:710)
at org.apache.hadoop.filecache.DistributedCache.getFileStatus(DistributedCache.java:185)
at org.apache.hadoop.filecache.TrackerDistributedCacheManager.determineTimestamps(TrackerDistributedCacheManager.java:846)
at org.apache.hadoop.mapred.JobClient.copyAndConfigureFiles(JobClient.java:798)
at org.apache.hadoop.mapred.JobClient.copyAndConfigureFiles(JobClient.java:689)
at org.apache.hadoop.mapred.JobClient.access$300(JobClient.java:173)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:902)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:885)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1127)
at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:885)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:859)
at cascading.flow.hadoop.planner.HadoopFlowStepJob.internalNonBlockingStart(HadoopFlowStepJob.java:104)
at cascading.flow.planner.FlowStepJob.blockOnJob(FlowStepJob.java:191)
at cascading.flow.planner.FlowStepJob.start(FlowStepJob.java:145)
at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:120)
at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:42)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)