2017-09-07 21:27:51.591079 I | flags: recognized and used environment variable ETCD_DATA_DIR=/var/etcd/data
2017-09-07 21:27:51.591092 I | flags: recognized and used environment variable ETCD_DEBUG=true
2017-09-07 21:27:51.591160 I | flags: recognized and used environment variable ETCD_INITIAL_CLUSTER_STATE=existing
2017-09-07 21:27:51.591171 I | flags: recognized and used environment variable ETCD_INITIAL_CLUSTER_TOKEN=etcd-mycluster-token
2017-09-07 21:27:51.591185 I | flags: recognized and used environment variable ETCD_LISTEN_CLIENT_URLS=http://0.0.0.0:4001
2017-09-07 21:27:51.591201 I | flags: recognized and used environment variable ETCD_LISTEN_PEER_URLS=http://0.0.0.0:2380
2017-09-07 21:27:51.591218 I | flags: recognized and used environment variable ETCD_NAME=etcd-us-east-1b
2017-09-07 21:27:51.591352 I | etcdmain: etcd Version: 2.2.1
2017-09-07 21:27:51.591363 I | etcdmain: Git SHA: 75f8282
2017-09-07 21:27:51.591370 I | etcdmain: Go Version: go1.5.1
2017-09-07 21:27:51.591377 I | etcdmain: Go OS/Arch: linux/amd64
2017-09-07 21:27:51.591388 I | etcdmain: setting maximum number of CPUs to 4, total number of available CPUs is 4
2017-09-07 21:27:51.602034 I | etcdserver: name = etcd-us-east-1b
2017-09-07 21:27:51.602058 I | etcdserver: data dir = /var/etcd/data
2017-09-07 21:27:51.602074 I | etcdserver: member dir = /var/etcd/data/member
2017-09-07 21:27:51.602082 I | etcdserver: heartbeat = 100ms
2017-09-07 21:27:51.602090 I | etcdserver: election = 1000ms
2017-09-07 21:27:51.602096 I | etcdserver: snapshot count = 10000
2017-09-07 21:27:51.605161 I | etcdserver: starting member ad955e1348f4aa89 in cluster 58d26b168f47d0e7
2017-09-07 21:27:51.605202 I | raft: ad955e1348f4aa89 became follower at term 0
2017-09-07 21:27:51.605212 I | raft: newRaft ad955e1348f4aa89 [peers: [], term: 0, commit: 0, applied: 0, lastindex: 0, lastterm: 0]
2017-09-07 21:27:51.605217 I | raft: ad955e1348f4aa89 became follower at term 1
2017-09-07 21:27:51.612717 I | rafthttp: the connection with 9705a6cab44e97e8 became active
2017-09-07 21:27:51.617389 I | etcdserver: starting server... [version: 2.2.1, cluster version: to_be_decided]
2017-09-07 21:27:51.620916 I | rafthttp: the connection with c2233199ea681cfc became active
2017-09-07 21:27:51.689181 I | raft: ad955e1348f4aa89 [term: 1] received a MsgHeartbeat message with higher term from c2233199ea681cfc [term: 804]
2017-09-07 21:27:51.689200 I | raft: ad955e1348f4aa89 became follower at term 804
2017-09-07 21:27:51.689213 I | raft: raft.node: ad955e1348f4aa89 elected leader c2233199ea681cfc at term 804
2017-09-07 21:27:56.803734 E | rafthttp: failed to read raft message (unexpected EOF)
2017-09-07 21:27:58.617893 E | etcdserver: publish error: etcdserver: request timed out, possibly due to previous leader failure
2017-09-07 21:28:01.907742 E | rafthttp: failed to read raft message (unexpected EOF)
2017-09-07 21:28:05.618080 E | etcdserver: publish error: etcdserver: request timed out
2017-09-07 21:28:07.007743 E | rafthttp: failed to read raft message (unexpected EOF)
2017-09-07 21:28:12.135761 E | rafthttp: failed to read raft message (unexpected EOF)
2017-09-07 21:28:12.618271 E | etcdserver: publish error: etcdserver: request timed out
2017-09-07 21:28:17.207743 E | rafthttp: failed to read raft message (unexpected EOF)
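As a rough check on the cadence of the `rafthttp` errors in the log above, here is a minimal sketch that extracts their timestamps and prints the gaps between consecutive occurrences. The function name `error_intervals` and the idea of feeding it the saved log text are my own assumptions, not anything etcd provides. A fairly constant gap would be more consistent with a timeout or retry cycle than with random packet loss, but that is only a hint, not a diagnosis.

```python
from datetime import datetime


def error_intervals(log_text, needle="failed to read raft message"):
    """Return the gaps, in seconds, between consecutive log lines containing `needle`.

    Assumes each line starts with an etcd 2.x style timestamp,
    "YYYY-MM-DD HH:MM:SS.ffffff", which is exactly 26 characters long.
    """
    stamps = []
    for line in log_text.splitlines():
        if needle in line:
            stamps.append(datetime.strptime(line[:26], "%Y-%m-%d %H:%M:%S.%f"))
    # Pair each timestamp with the next one and take the difference.
    return [(later - earlier).total_seconds()
            for earlier, later in zip(stamps, stamps[1:])]


if __name__ == "__main__":
    # Two of the error lines from the log above, used as sample input.
    sample = (
        "2017-09-07 21:27:56.803734 E | rafthttp: failed to read raft message (unexpected EOF)\n"
        "2017-09-07 21:28:01.907742 E | rafthttp: failed to read raft message (unexpected EOF)\n"
    )
    print(error_intervals(sample))
```

Running it over the whole log, rather than the two sample lines, gives one gap per pair of consecutive errors.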
POST /raft HTTP/1.1
User-Agent: Go-http-client/1.1
Content-Length: 25384951
Content-Type: application/protobuf
X-Etcd-Cluster-Id: 58d26b168f47d0e7
X-Min-Cluster-Version: 2.1.0
X-Server-From: 9705a6cab44e97e8
X-Server-Version: 2.2.1
Accept-Encoding: gzip
........................ ..(.0.@.J....
....{"Root":{"Path":"/","CreatedIndex":0,"ModifiedIndex":0,"ExpireTime":"0001-01-01T00:00:00Z","Value":"","Children":{"0":{"Path":"/0","CreatedIndex":0,"ModifiedIndex":0,"ExpireTime":"0001-01-01T00:00:00Z","Value":"","Children":{"members":{"Path":"/0/members","CreatedIndex":1,"ModifiedIndex":1,"ExpireTime":"0001-01-01T00:00:00Z","Value":"","Children":{"9705a6cab44e97e8":
~2MB of data omitted
[\"secrets\",\"configmaps\"]},{\"verbs\":[\"get\"],\"attributeRestrictions\":null,\"apiGroups\":[\"\"],\"resources\":[\"persistentvolumeclaims\",\"persistentvolumes\"]},{\"verbs\":[\"get\
HTTP/1.1 400 Bad Request
Content-Type: text/plain; charset=utf-8
X-Content-Type-Options: nosniff
X-Etcd-Cluster-Id: 58d26b168f47d0e7
Date: Wed, 06 Sep 2017 23:07:44 GMT
Content-Length: 27
Connection: close
error reading raft message
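So I'm now trying to determine the likely causes of this truncation: the request declares a Content-Length of 25384951 bytes, but the captured body ends mid-JSON and the receiving member answers 400 Bad Request with "error reading raft message".
Some additional info: this etcd cluster is used as the data store for our Kubernetes cluster on AWS, and the etcd processes themselves run as Docker containers with host networking. It's the standard setup that the Kops provisioner creates.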
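To put a number on the truncation, a small sketch like the following can compare the declared `Content-Length` with the body bytes that actually appear in a capture. It assumes the raw request bytes of a single `POST /raft` have been reassembled and saved from the packet capture (for example by following the TCP stream), with the usual CRLF line endings intact. The function name `check_truncation` and the sample input are hypothetical and only for illustration.

```python
def check_truncation(raw):
    """Compare the declared Content-Length with the body bytes actually captured.

    `raw` is the bytes of one HTTP request as seen on the wire
    (request line, headers, blank line, body).
    """
    head, sep, body = raw.partition(b"\r\n\r\n")
    if not sep:
        raise ValueError("no header/body separator found in capture")

    declared = None
    for line in head.split(b"\r\n")[1:]:  # skip the request line
        name, _, value = line.partition(b":")
        if name.strip().lower() == b"content-length":
            declared = int(value.strip())
            break
    if declared is None:
        raise ValueError("no Content-Length header in capture")

    return {
        "declared": declared,
        "captured": len(body),
        "missing": max(declared - len(body), 0),
    }


if __name__ == "__main__":
    # Synthetic sample: headers modeled on the capture above, with a short fake body.
    sample = (
        b"POST /raft HTTP/1.1\r\n"
        b"Content-Length: 25384951\r\n"
        b"Content-Type: application/protobuf\r\n"
        b"\r\n" + b"x" * 1024
    )
    print(check_truncation(sample))
```

Running the same check on captures taken on both the sending and the receiving host would show whether the body is already short when it leaves the sender or is cut off in transit.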