On 08/14/2012 05:15 PM, markh wrote:> In order to minimize the impact of balancing on the cluster,
the balancer will not begin balancing
> until the distribution of chunks has reached certain thresholds. These thresholds apply to the
> difference in number of chunks between the shard with the greatest number of chunks and the shard
> with the least number of chunks. Once a balancing round starts, the balancer will not stop until
> the difference between the number of chunks on any two shards is less than two.
>
> Mark
Hi,
thanks for the clarifications.
I already "solved" my problem of uneven disk sizes between servers.
I use sharding mainly to distribute the data (as in Gibibytes) across servers, not the load. So
handling uneven disk sizes is more of an issue than uneven load distribution.
I just disabled the build in balancer and implemented my own (crude) balancer which support chunk
splitting and chunk movement according to the disk space characteristics of my servers.
This way I am able to better utilize my disk space without regarding load distribution in any way
(which is fairly unimportant in my case).
Exi