Thank you so much for your help, Eric, really appreciated.
I think we are almost there, but the job fails whenever a mapper needs to spill. I think the job configuration is somehow not passed down after the spill: you can see that the first time the file is read from the cache it works, but after the spill it fails. Attached is the log from one such mapper. I also see that the 0.8.0-SNAPSHOT has been updated on Maven, but the CDH3 version has not.
2013-08-14 10:18:44,619 INFO scoobi.MapTask: Starting on myhost
2013-08-14 10:18:44,619 INFO scoobi.MapTask: Input is hdfs://myhost:8020/sys/edw/dw_lstg_item/snapshot/2013/08/13/00/part-r-00001:536870912+536870912 (on channel:11)
2013-08-14 10:18:44,671 INFO scoobi.DistCache: trying to pull an object from the cache at path: /tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/env/d26cc3b1-cc53-44da-a3b1-91c62cdab253
2013-08-14 10:18:44,673 INFO scoobi.DistCache: trying to open: file:/hadoop/1/scratch/taskTracker/distcache/-8498768754780428785_1337695511_2111141465/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/env/d26cc3b1-cc53-44da-a3b1-91c62cdab253
2013-08-14 10:18:44,674 INFO scoobi.DistCache: successfully opened: file:/hadoop/1/scratch/taskTracker/distcache/-8498768754780428785_1337695511_2111141465/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/env/d26cc3b1-cc53-44da-a3b1-91c62cdab253
2013-08-14 10:18:44,676 INFO scoobi.DistCache: trying to pull an object from the cache at path: /tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/env/086a56ca-7169-4964-8ff1-92180a37e23f
2013-08-14 10:18:44,677 INFO scoobi.DistCache: trying to open: file:/hadoop/2/scratch/taskTracker/distcache/-5392923895352759590_951072140_2111141469/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/env/086a56ca-7169-4964-8ff1-92180a37e23f
2013-08-14 10:18:44,679 INFO scoobi.DistCache: successfully opened: file:/hadoop/2/scratch/taskTracker/distcache/-5392923895352759590_951072140_2111141469/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/env/086a56ca-7169-4964-8ff1-92180a37e23f
2013-08-14 10:18:44,807 INFO scoobi.DistCache: trying to pull an object from the cache at path: /tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TP90
2013-08-14 10:18:44,808 INFO scoobi.DistCache: trying to open: file:/hadoop/7/scratch/taskTracker/distcache/-2772926115251135789_-2069565859_2111141904/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TP90
2013-08-14 10:18:44,810 INFO scoobi.DistCache: successfully opened: file:/hadoop/7/scratch/taskTracker/distcache/-2772926115251135789_-2069565859_2111141904/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TP90
2013-08-14 10:18:44,878 INFO scoobi.DistCache: trying to pull an object from the cache at path: /tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TK90
2013-08-14 10:18:44,879 INFO scoobi.DistCache: trying to open: file:/hadoop/5/scratch/taskTracker/distcache/7027002775065643939_-2069570664_2111141749/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TK90
2013-08-14 10:18:44,880 INFO scoobi.DistCache: successfully opened: file:/hadoop/5/scratch/taskTracker/distcache/7027002775065643939_-2069570664_2111141749/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TK90
2013-08-14 10:18:44,913 INFO scoobi.DistCache: trying to pull an object from the cache at path: /tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TV90
2013-08-14 10:18:44,914 INFO scoobi.DistCache: trying to open: file:/hadoop/6/scratch/taskTracker/distcache/3089233342640131732_-2069560093_2111141869/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TV90
2013-08-14 10:18:44,916 INFO scoobi.DistCache: successfully opened: file:/hadoop/6/scratch/taskTracker/distcache/3089233342640131732_-2069560093_2111141869/myhost/tmp/scoobi-username/scoobi-20130814-101748-GbxTranslation$-f1b6777c-0e09-441f-b80d-9b19c6bbb8b8/dist-objs/scoobi.metadata.TV90
2013-08-14 10:20:01,885 INFO org.apache.hadoop.mapred.MapTask: Spilling map output: record full = true
2013-08-14 10:20:01,885 INFO org.apache.hadoop.mapred.MapTask: bufstart = 0; bufend = 72477576; bufvoid = 456340272
2013-08-14 10:20:01,885 INFO org.apache.hadoop.mapred.MapTask: kvstart = 0; kvend = 4026532; length = 5033165
2013-08-14 10:20:01,930 INFO scoobi.DistCache: trying to pull an object from the cache at path: /tmp/scoobi-username/scoobi-20130814-102001-6a49f75c-0432-4183-bb01-ad2a7484aadb/dist-objs/scoobi.metadata.TK90
2013-08-14 10:20:01,932 INFO scoobi.DistCache: trying to open: /tmp/scoobi-username/scoobi-20130814-102001-6a49f75c-0432-4183-bb01-ad2a7484aadb/dist-objs/scoobi.metadata.TK90
2013-08-14 10:20:01,937 ERROR scoobi.DistCache: No successfully opened path. The cache files which were used are
/tmp/scoobi-username/scoobi-20130814-102001-6a49f75c-0432-4183-bb01-ad2a7484aadb/dist-objs/scoobi.metadata.TK90
2013-08-14 10:20:01,943 INFO org.apache.hadoop.mapred.MapTask: Starting flush of map output
2013-08-14 10:20:01,944 INFO org.apache.hadoop.mapred.MapTask: Ignoring exception during close for org.apache.hadoop.mapred.MapTask$NewOutputCollector@779a639b
java.io.IOException: Spill failed
at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.flush(MapTask.java:1296)
at org.apache.hadoop.mapred.MapTask$NewOutputCollector.close(MapTask.java:697)
at org.apache.hadoop.mapred.MapTask.closeQuietly(MapTask.java:1792)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:778)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:363)
at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1232)
at org.apache.hadoop.mapred.Child.main(Child.java:249)
Caused by: java.util.NoSuchElementException: None.get
at scala.None$.get(Option.scala:313)
at scala.None$.get(Option.scala:311)
at com.nicta.scoobi.impl.rtt.ScoobiMetadata$$anonfun$metadata$1$$anonfun$apply$1.apply(ScoobiMetadata.scala:45)
at com.nicta.scoobi.impl.rtt.ScoobiMetadata$$anonfun$metadata$1$$anonfun$apply$1.apply(ScoobiMetadata.scala:43)
at scalaz.MemoFunctions$$anonfun$mutableMapMemo$1$$anonfun$apply$2$$anonfun$apply$3.apply(Memo.scala:67)
at scala.collection.mutable.MapLike$class.getOrElseUpdate(MapLike.scala:189)
at scala.collection.mutable.AbstractMap.getOrElseUpdate(Map.scala:91)
at scalaz.MemoFunctions$$anonfun$mutableMapMemo$1$$anonfun$apply$2.apply(Memo.scala:67)
at com.nicta.scoobi.impl.rtt.ScoobiMetadata$$anonfun$metadata$1.apply(ScoobiMetadata.scala:46)
at com.nicta.scoobi.impl.rtt.ScoobiMetadata$$anonfun$metadata$1.apply(ScoobiMetadata.scala:43)
at com.nicta.scoobi.impl.rtt.TaggedMetadata$class.metaDatas(ScoobiMetadata.scala:54)
at com.nicta.scoobi.impl.rtt.MetadataTaggedKey.metaDatas$lzycompute(TaggedKey.scala:47)
at com.nicta.scoobi.impl.rtt.MetadataTaggedKey.metaDatas(TaggedKey.scala:47)
at com.nicta.scoobi.impl.rtt.TaggedMetadata$class.tags(ScoobiMetadata.scala:55)
at com.nicta.scoobi.impl.rtt.MetadataTaggedKey.tags$lzycompute(TaggedKey.scala:47)
at com.nicta.scoobi.impl.rtt.MetadataTaggedKey.tags(TaggedKey.scala:47)
at com.nicta.scoobi.impl.rtt.MetadataTaggedWritable$class.readFields(Tagged.scala:70)
at com.nicta.scoobi.impl.rtt.MetadataTaggedKey.readFields(TaggedKey.scala:47)
at org.apache.hadoop.io.WritableComparator.compare(WritableComparator.java:97)
at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.compare(MapTask.java:1115)
at org.apache.hadoop.util.QuickSort.sortInternal(QuickSort.java:95)
at org.apache.hadoop.util.QuickSort.sort(QuickSort.java:59)
at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.sortAndSpill(MapTask.java:1403)
at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.access$1800(MapTask.java:857)
at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$SpillThread.run(MapTask.java:1348)