Hi folks,
i'm starting to integrate the HBaseWD in our project.
The unanswered question for us is still the amount of buckets, we want to use.
At the moment we have a very small cluster, there are just 3 region servers.
so, actually, there will be no value, if we set the max buckets to more than 3 (assume we have cpu's with only one core), is that correct?
the disadvantage of a too big number of buckets is a scan / mapreduce job, which will take longer, right?
now the question is, can we set the number for now to a small value and increase it later?
if we can change the number of buckets later. afaik, the increasing will work fine, but not the decreasing, since in that case the method "getAllPossiblePrefixes" will not work properly
please correct me, if i'm wrong
thanks in advance for any help
andre