org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern:


Hariprasad Akkineni

Mar 19, 2016, 2:52:00 AM
to CDK Development
Hello All,

I am trying to use the Kite dataset API in my Scala project.

I get the exception below when trying to run my code:

Exception in thread "main" org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern: dataset:hive:default/test?location=hdfs://10.132.0.2:8020/apps/hive/warehouse/fintech.db/test
Check that JARs for hive datasets are on the classpath
        at org.kitesdk.data.spi.Registration.lookupDatasetUri(Registration.java:128)
        at org.kitesdk.data.Datasets.exists(Datasets.java:624)
        at org.kitesdk.data.Datasets.exists(Datasets.java:646)

I enabled debug logging for org.kitesdk.data.spi.Registration and found that the hive dataset URI pattern is the only one missing from the list of registered URIs.

2016-03-19 06:41:53 DEBUG Registration:136 - Loading: org.kitesdk.data.hbase.impl.Loader
2016-03-19 06:41:53 DEBUG Registration:147 - Registered repository URIs:
        URIPattern{pattern=file:/*path?absolute=true}
        URIPattern{pattern=file:*path}
        URIPattern{pattern=hdfs:/*path?absolute=true}
        URIPattern{pattern=hdfs:*path}
        URIPattern{pattern=webhdfs:/*path?absolute=true}
        URIPattern{pattern=s3n:/*path}
        URIPattern{pattern=s3a:/}
        URIPattern{pattern=hbase::zk}
2016-03-19 06:41:53 DEBUG Registration:149 - Registered dataset URIs:
        URIPattern{pattern=file:/*path/:namespace/:dataset?absolute=true}
        URIPattern{pattern=file:*path/:namespace/:dataset}
        URIPattern{pattern=hdfs:/*path/:namespace/:dataset?absolute=true}
        URIPattern{pattern=hdfs:*path/:namespace/:dataset}
        URIPattern{pattern=webhdfs:/*path/:namespace/:dataset?absolute=true}
        URIPattern{pattern=s3n:/*path/:namespace/:dataset}
        URIPattern{pattern=s3a:/:namespace/:dataset}
        URIPattern{pattern=hbase::zk/:dataset?namespace=default}
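
The "Loading: org.kitesdk.data.hbase.impl.Loader" line suggests that loaders are discovered at runtime. A minimal sketch for checking which jars advertise such a loader, assuming discovery goes through provider files under META-INF/services (the `java.util.ServiceLoader` convention) and assuming the service interface is named `org.kitesdk.data.spi.Loadable`; both are assumptions, not something the log confirms. The class name `ServiceFileScan` is made up for illustration.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.URL;
import java.util.Collections;
import java.util.List;

class ServiceFileScan {
    // Returns every classpath location that ships a provider file
    // for the given service interface name.
    static List<URL> providerFiles(String serviceInterface) {
        ClassLoader cl = Thread.currentThread().getContextClassLoader();
        if (cl == null) {
            cl = ClassLoader.getSystemClassLoader();
        }
        try {
            return Collections.list(cl.getResources("META-INF/services/" + serviceInterface));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Assumption: Kite registers its loaders under this interface name.
        String service = args.length > 0 ? args[0] : "org.kitesdk.data.spi.Loadable";
        List<URL> found = providerFiles(service);
        if (found.isEmpty()) {
            System.out.println("No provider files found for " + service);
        }
        for (URL url : found) {
            // Each URL names the jar (or directory) that contributes a loader.
            System.out.println(url);
        }
    }
}
```

Run it with the same classpath as the application. If no URL points at a Hive-related Kite jar, that would be consistent with the hive pattern never being registered.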

I have checked my classpath, and all the relevant JARs for Hive are present:

$APP_HOME/lib/hive-jdbc-1.1.1.jar:$APP_HOME/lib/hive-common-1.1.1.jar:$APP_HOME/lib/hive-service-1.1.1.jar:$APP_HOME/lib/hive-serde-1.1.1.jar:$APP_HOME/lib/hive-metastore-1.1.1.jar:$APP_HOME/lib/hive-shims-1.1.1.jar:$APP_HOME/lib/hive-shims-0.20S-1.1.1.jar:$APP_HOME/lib/hive-exec-1.1.1.jar:$APP_HOME/lib/hive-shims-common-1.1.1.jar:$APP_HOME/lib/hive-shims-scheduler-1.1.1.jar:
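
The entries quoted above are all Apache Hive jars. A small sketch for checking whether a given jar is on the classpath the JVM actually sees; the prefix `kite-data-hive` is an assumption about which Kite module provides the hive dataset URI, and `ClasspathJarCheck` is a made-up helper name.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

class ClasspathJarCheck {
    // Returns the file names of classpath entries that are jars
    // whose name starts with the given prefix.
    static List<String> jarsWithPrefix(String classpath, String separator, String prefix) {
        List<String> matches = new ArrayList<>();
        for (String entry : classpath.split(Pattern.quote(separator))) {
            String name = new File(entry).getName();
            if (name.startsWith(prefix) && name.endsWith(".jar")) {
                matches.add(name);
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Assumption: the Kite Hive module jar is named kite-data-hive-<version>.jar.
        String prefix = args.length > 0 ? args[0] : "kite-data-hive";
        String classpath = System.getProperty("java.class.path");
        List<String> found = jarsWithPrefix(classpath, File.pathSeparator, prefix);
        System.out.println(found.isEmpty()
                ? "No jar starting with '" + prefix + "' on the classpath"
                : "Found: " + found);
    }
}
```

Reading `java.class.path` at runtime, rather than inspecting the launch script, shows what the running JVM was given.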

Any help would be greatly appreciated.

Regards
Hari

Andrew Stevenson

Mar 19, 2016, 3:03:28 AM
to Hariprasad Akkineni, CDK Development
Add the Hive jars to your class path.

Regards

Andrew

From: Hariprasad Akkineni
Sent: 19/03/2016 07:52
To: CDK Development
Subject: org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI pattern:


Hariprasad Akkineni

Mar 19, 2016, 3:23:33 AM
to CDK Development, harip...@gmail.com
Hi Andrew, I have checked that all the Hive jars are on my classpath. I have tried setting HIVE_HOME and HIVE_CONF as well. It looks like Kite uses the classpath rather than either of these environment variables.

Jasper

Apr 1, 2016, 4:19:59 AM
to CDK Development, harip...@gmail.com
Hi Hari,

Have you gotten any further with this? I'm having the exact same problem: all the Hive jars are on the classpath, but Kite doesn't seem to be using them. The hive URIPattern doesn't show up in the list of registered repositories.

Thanks,
Jasper

Hariprasad Akkineni

Apr 1, 2016, 4:24:36 AM
to Jasper, CDK Development
Hi Jasper,

Unfortunately, I haven't progressed any further on it. I had to park the issue for now and use the Kite command line instead.

Regards
Hari

Damian Smith

Apr 14, 2016, 2:02:28 PM
to CDK Development
Having the same issue. Any progress?

Jasper

Apr 15, 2016, 5:30:40 AM
to CDK Development
No, I'm working around it at the moment by writing straight to HDFS.

I opened this issue in the Cloudera community:
https://community.cloudera.com/t5/Kite-SDK-includes-Morphlines/Hive-jars-not-found-on-classpath/m-p/39410#M390

Let me know if you get it to work...