Problems with Path to File batch source

43 views
Skip to first unread message

Lukasz

unread,
May 19, 2016, 6:15:27 AM5/19/16
to CDAP User
Hi All,
I am trying to run a simple pipeline (conf attached). Could you please advise what is the correct format for the Path property?
I tried with the following:
/tmp/cloud/Book1.csv and got the error 
" ERROR [MapReduceRunner-phase-1:c.c.c.i.a.r.b.MapReduceRuntimeService@299] - Exception when submitting MapReduce Job: job=phase-1,=namespaceId=U8004871, applicationId=hahaha, program=phase-1, runid=31b5d451-1da8-11e6-aa3c-fa163e8e3c1c
org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: file:/tmp/cloud/Book1.csv"

hdfs:///tmp/cloud/Book1.csv,  Incomplete HDFS URI, no host: hdfs:///tmp/cloud/Book1.csv
hdfs://tmp/cloud/Book1.csv,  java.net.UnknownHostException: tmp
hdfs:/tmp/cloud/Book1.csv, java.io.IOException: Incomplete HDFS URI, no host: hdfs:/tmp/cloud/Book1.csv

The file was put in HDFS on the same machine that CDAP resides.I changed the ownership of the directory and file to cdap default user "cdap"
I also tried to access the file from local unix files system, no luck.  

hahaha-cdap-data-pipeline.json

Bhooshan Mogal

unread,
May 19, 2016, 7:24:00 AM5/19/16
to Lukasz, CDAP User
Hi Lukasz,

If your file is on the HDFS, try hdfs://[namenode-hostname]:[namenode-port]/tmp/cloud/Book1.csv. The [namenode-hostname]:[namenode-port] will be defined in the core-site.xml of your Hadoop installation under the property "fs.defaultFS".


Thanks,
Bhooshan

--
You received this message because you are subscribed to the Google Groups "CDAP User" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cdap-user+...@googlegroups.com.
To post to this group, send email to cdap...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/cdap-user/86a8be37-54a8-4351-a3fd-7671d5734b94%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Lukasz

unread,
May 19, 2016, 8:58:55 AM5/19/16
to CDAP User, flor...@gmail.com
Thank you Bhooshan. It works. The problem was that defaultFS was on different node than CDAP.
Reply all
Reply to author
Forward
0 new messages