Hadoop version clarification

27 views
Skip to first unread message

Tim Williams

unread,
Jun 27, 2012, 11:50:37 AM6/27/12
to blur...@googlegroups.com
The docs say[1],

"The normal 0.20.2 is not compatible with Cloudera's 0.20.2 CDH3u2 version. "

If the issue running incompatible versions or something specific to
CDH3? Or, how much of a headache would it be to run *all* CDH3? Are
we talking tweaks to the InputFormat's or major surgery?

Thanks,
--tim

[1] - https://github.com/nearinfinity/blur/blob/master/README.md#hdfs-options

Aaron McCurry

unread,
Jun 27, 2012, 1:46:46 PM6/27/12
to blur...@googlegroups.com
The issue is the native HDFS protocol is different between, 0.20.2 (the couple of year old version) and CDH3.  The newer versions of hadoop 0.20.20x have also changes in the protocol and they should match CDH3.  You would run into the same issues if you were using different HDFS clusters with different versions of hadoop and wanted to copy data between them using distcp (NOTE: You can make it work by using the http version of the FS).

So to make a long story short if you use the same version of Hadoop CDH or Apache across different clusters everything should work fine.

Aaron
Reply all
Reply to author
Forward
0 new messages