In that issue, the error occurs when a node is abruptly terminated. However, we've also seen the error occur when all Cassandra nodes appeared to be healthy.
There are a few possible explanations for why the errors only occur with the Datastax driver, but I'm not sure which is correct:
a) There is a problem with how we're using the Datastax Driver to compose batches of counter updates
b) There is a difference in the between the implementation of counter updates in the Native protocol from the Thrift protocol such that the error is reported to native clients, but not to Thrift clients.
c) There is a difference between the keyspace/column family definition of the production and testing keyspaces.
d) The Astyanax/Thrift version is getting the error but is ignoring it for some reason.
I doubt (c) is the reason; we've made an effort to ensure that the keyspace and CF configurations are the same. Also, (d) seems unlikely because we've seen other errors (such as unavailable exceptions) reported correctly. So, I'm betting that either (a) or (b) is the reason.
Would someone please suggest which of these explanations is likely to be correct, and what we might do to avoid the problem?