Update: I got the job to run without any errors (after setting the kafka.host.url and kafka.host.port properties), but I don't see any output from the topic in the whitelist. Here is what I see:
13/10/28 15:33:00 INFO kafka.CamusJob: Dir Destination set to: hdfs://idh251-0/var/log/camus/data
13/10/28 15:33:00 INFO kafka.CamusJob: Previous execution: hdfs://idh251-0/var/log/camus/history/2013-10-28-22-30-03
13/10/28 15:33:00 INFO kafka.CamusJob: New execution temp location: hdfs://idh251-0/var/log/camus/2013-10-28-22-33-00
13/10/28 15:33:01 WARN kafka.CamusJob: The configuration properties kafka.host.url and kafka.host.port are deprecated. Please switch to using kafka.brokers
13/10/28 15:33:01 INFO mapred.EtlInputFormat: Fetching metadata from broker idh251-kafka:9095 with client id camus-abhi1 for 0 topic(s) []
13/10/28 15:33:02 INFO mapred.EtlInputFormat: Discrading topic : intel.test
13/10/28 15:33:02 INFO mapred.EtlInputFormat: Discarding topic (Decoder generation failed) : iot.test
13/10/28 15:33:02 INFO util.NativeCodeLoader: Trying to load the custom-built native-hadoop library...
13/10/28 15:33:02 INFO util.NativeCodeLoader: Loaded the native-hadoop library
13/10/28 15:33:02 INFO zlib.ZlibFactory: Successfully loaded & initialized native-zlib library
13/10/28 15:33:02 INFO compress.CodecPool: Got brand-new compressor
13/10/28 15:33:02 INFO mapred.EtlInputFormat: previous offset file:hdfs://idh251-0/var/log/camus/history/2013-10-28-22-30-03/offsets-previous
13/10/28 15:33:02 INFO compress.CodecPool: Got brand-new decompressor
13/10/28 15:33:02 INFO mapred.JobClient: Running job: job_201310251711_0005
13/10/28 15:33:03 INFO mapred.JobClient: map 0% reduce 0%
13/10/28 15:33:07 INFO mapred.JobClient: Job complete: job_201310251711_0005
13/10/28 15:33:07 INFO mapred.JobClient: Counters: 4
13/10/28 15:33:07 INFO mapred.JobClient: Job Counters
13/10/28 15:33:07 INFO mapred.JobClient: SLOTS_MILLIS_MAPS=3111
13/10/28 15:33:07 INFO mapred.JobClient: Total time spent by all reduces waiting after reserving slots (ms)=0
13/10/28 15:33:07 INFO mapred.JobClient: Total time spent by all maps waiting after reserving slots (ms)=0
13/10/28 15:33:07 INFO mapred.JobClient: SLOTS_MILLIS_REDUCES=0
13/10/28 15:33:07 INFO kafka.CamusJob: Group: Job Counters
13/10/28 15:33:07 INFO kafka.CamusJob: Total time spent by all maps waiting after reserving slots (ms): 0
13/10/28 15:33:07 INFO kafka.CamusJob: Total time spent by all reduces waiting after reserving slots (ms): 0
13/10/28 15:33:07 INFO kafka.CamusJob: SLOTS_MILLIS_MAPS: 3111
13/10/28 15:33:07 INFO kafka.CamusJob: SLOTS_MILLIS_REDUCES: 0
13/10/28 15:33:07 INFO kafka.CamusJob: Job finished
13/10/28 15:33:07 INFO kafka.CamusJob: ***********Timing Report*************
Job time (seconds):
pre setup 0.0 (0%)
get splits 0.0 (0%)
hadoop job 5.0 (71%)
commit 0.0 (0%)
Total: 0 minutes 7 seconds
Hadoop job task times (seconds):
min 9223372036854776.0
mean NaN
max 0.0
skew NaN/0.0 = NaN
Task wait time (seconds):
min 9223372036854776.0
mean NaN
max 0.0
Hadoop task breakdown:
kafka �
decode �
map output �
other �
Total MB read: 0
Do I need to write my own decoder? All I want is to see this work end to end, with my Kafka messages written to HDFS in Avro format.
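For reference, here is a rough sketch of the relevant Camus settings as they can be inferred from the log above. It is not a copy of the actual properties file. The broker, client name, and paths are taken from the log lines. The whitelist value is an assumption (iot.test is the topic that failed with "Decoder generation failed"), and the decoder class is a placeholder, since the log does not show which one is configured.

```properties
# Replaces the deprecated kafka.host.url / kafka.host.port pair (see the WARN line)
kafka.brokers=idh251-kafka:9095
kafka.client.name=camus-abhi1

# Assumption: the topic I expect to be pulled
kafka.whitelist.topics=iot.test

# Placeholder: the log does not show the configured decoder class
camus.message.decoder.class=<fully.qualified.DecoderClass>

# Paths as reported by CamusJob in the log
etl.destination.path=/var/log/camus/data
etl.execution.base.path=/var/log/camus
etl.execution.history.path=/var/log/camus/history
```

The decoder line is the one I suspect matters here, since the topic is dropped before the Hadoop job starts and nothing is read (Total MB read: 0).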
Thanks,
Abhi