Hi, I work with Jing and this issue is still bugging us. We are still on kafka 0.11.0, zk 3.4.5 We are in the process of deploying approximately 30 five-broker clusters. We constantly encounter this issue on just-deployed clusters. More often than not, a cluster will be brought online, and the __consumer_offsets topic will be auto-created with a 20% broker skew. We then run a partition reassignment to bring it into balance.
If a cluster starts with this condition, then any new topic created with more than 1x replication -- our default is 2x for these clusters -- will also result in a 20% broker skew upon creation. If I watch the kafka server.log file, one of the brokers will throw errors like the one Jing mentioned.
Any ideas what could be causing this?