Hi,
I am trying to create partition table in hive through cascading. Everything is working fine but there is an issue associated with the value having space in partition key. Below is the example for this:
Input :
f1,f2,f3
1,2,mumbai pune
2,3,pune IND
partition directory : (Cascading Hive)
C:\user\hive\warehouse\partitiontest\f3=mumbai pune
C:\user\hive\warehouse\partitiontest\f3=mumbai%20pune
C:\user\hive\warehouse\partitiontest\f3=pune IND
C:\user\hive\warehouse\partitiontest\f3=pune%20US
Here, it creates partition directory twice for each partition key.
The above issue occurs only when I run any job on local (Windows) machine. This perfectly works fine on cluster.
I am attaching sample cascading source code, snapshot of created partition directory and input file.
I also found that when I am creating same partition table though hive cli then it creates only one directory for each partition(cluster).
Partition Directory: (Hive cli)
C:\user\hive\warehouse\partitiontest\f3=mumbai pune
C:\user\hive\warehouse\partitiontest\f3=pune IND
Can someone please help me in resolving this?
I have used below version of jar :
cascading-hadoop2-mr1-3.1.0.jar
cascading-hive-2.0.0.jar
cascading-local-3.1.0.jar
hive-exec-1.2.0.jar
hive-metastore-1.2.0.jar
hive-shims-1.2.0.jar
hive-serde-1.2.0.jar
hadoop-mapreduce-client-common-2.6.0.jar
Thanks
Santlal Gupta