Trouble with s3a

20 views
Skip to first unread message

Charlie Tanksley

unread,
Jun 23, 2016, 9:18:34 AM6/23/16
to CDK Development
Hi,
I'm having some trouble using Kite with s3a.  I'm pretty new to this, so there is every chance it is just a user error. :)

I've tried to load the dataset directly into hive (not specifying a location for an external table), loading it into hive with an external table specified, and just loading it into the external table (creating a `dataset` directly).  In each case I get a version of the same error: `Argument error: Unknown repository URI pattern: dataset:s3a://XXX/tmp/...`.  I'll include the terminal scrollback for three cases I described below.  Anyone know what I'm doing wrong?

Loading directly into hive:

```
$ ./kite-dataset -v create f1 -s s3a://XXX/facilities.avsc 
Created f1
$ ./kite-dataset json-import s3a://XXX/facilities/2016-06-02.json f1
Argument error: Unknown repository URI pattern: dataset:s3a://XXX/tmp/default/.temp/2169a099-285c-40c8-b177-ce13168090e8
```

When I try to load the data into a dataset with a `location` on s3a:

```
$ ./kite-dataset -v create facilities --schema s3a://XXX/facilities.avsc --location s3a://XXX/facilities
Created facilities
$ ./kite-dataset json-import s3a://XXX/facilities/2016-06-02.json facilities
Argument error: Unknown repository URI pattern: dataset:s3a://XXX/tmp/default/.temp/5ce8a5d1-7d56-42f0-9206-38e14bb33ca7
```

When I try to load the data into an s3a dataset (I think this means it isn't going into hive at this point?):

```
$ ./kite-dataset -v create dataset:s3a://XXX/NAMESPACE/facilities -s s3a://XXX/facilities.avsc 
Created dataset:s3a://XXX/NAMESPACE/facilities
$ ./kite-dataset json-import s3a://XXX/facilities/2016-06-02.json dataset:s3a://XXX/NAMESPACE/facilities
Argument error: Unknown repository URI pattern: dataset:s3a://XXX/tmp/NAMESPACE/.temp/dea3f036-613d-44c0-abac-fffa8a6110ba
```

Thanks!

Charlie

Micah Whitacre

unread,
Jun 23, 2016, 4:37:07 PM6/23/16
to CDK Development
You might try running with what is in master as I think this got fixed with:

Charlie Tanksley

unread,
Jun 24, 2016, 10:14:58 AM6/24/16
to Micah Whitacre, CDK Development
Great! I'll give that a shot. Thanks!

Charlie
--
You received this message because you are subscribed to the Google Groups "CDK Development" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cdk-dev+u...@cloudera.org.
For more options, visit https://groups.google.com/a/cloudera.org/d/optout.
Reply all
Reply to author
Forward
0 new messages