I had to stop a node in my four node cluster.
I cannot for the life of me figgure out from the logs how to reconnect.
according to the logs, it seems it will not enable a Query cashe due to resize or similar command in progress, but there isn't on happening.
I did rename a table earlier, but that was a couple of hours ago and this node seemed to be fine.
i really don't mind having to initiate a new state transfer is there a way i can force that?
here is the output of the error log.
thanks for any suggestions.
150424 16:38:47 mysqld_safe Starting mysqld daemon with databases from /var/lib/mysql
150424 16:38:47 mysqld_safe WSREP: Running position recovery with --log_error='/var/lib/mysql/wsrep_recovery.YXcEpb' --pid-file='/var/lib/mysql/dot129-recover.pid'
nohup: ignoring input
/usr/sbin/mysqld: Query cache is disabled (resize or similar command in progress); repeat this command later
150424 16:39:00 mysqld_safe WSREP: Recovered position 4f93c528-ac04-11e3-ae2c-1eedce9e1c84:4984373331
150424 16:39:01 [Note] WSREP: wsrep_start_position var submitted: '4f93c528-ac04-11e3-ae2c-1eedce9e1c84:4984373331'
/usr/sbin/mysqld: Query cache is disabled (resize or similar command in progress); repeat this command later
150424 16:39:01 [Note] WSREP: Read nil XID from storage engines, skipping position init
150424 16:39:01 [Note] WSREP: wsrep_load(): loading provider library '/usr/lib/galera/libgalera_smm.so'
150424 16:39:01 [Note] WSREP: wsrep_load(): Galera 25.3.5-wheezy(rXXXX) by Codership Oy <
in...@codership.com> loaded successfully.
150424 16:39:01 [Note] WSREP: CRC-32C: using hardware acceleration.
150424 16:39:01 [Note] WSREP: Found saved state: 4f93c528-ac04-11e3-ae2c-1eedce9e1c84:4984373331
150424 16:39:01 [Note] WSREP: Passing config to GCS: base_host = 192.168.1.129; base_port = 4567; cert.log_conflicts = no; debug = no; evs.inactive_check_period = PT0.5S; evs.inactive_timeout = PT15S; evs.join_retrans_period = PT1S; evs.max_install_timeouts = 1; evs.send_window = 4; evs.stats_report_period = PT1M; evs.suspect_timeout = PT5S; evs.user_send_window = 2; evs.view_forget_timeout = PT24H; gcache.dir = /var/lib/mysql/; gcache.keep_pages_size = 0; gcache.mem_size = 0;
gcache.name = /var/lib/mysql//galera.cache; gcache.page_size = 128M; gcache.size = 128M; gcs.fc_debug = 0; gcs.fc_factor = 1.0; gcs.fc_limit = 16; gcs.fc_master_slave = no; gcs.max_packet_size = 64500; gcs.max_throttle = 0.25; gcs.recv_q_hard_limit = 9223372036854775807; gcs.recv_q_soft_limit = 0.25; gcs.sync_donor = no; gmcast.segment = 0; gmcast.version = 0; pc.announce_timeout = PT3S; pc.checksum = false; pc.ignore_quorum = false; pc.ignore_sb = false; pc.npvo = false; pc.version = 0; pc.wait_prim = true; pc.wait_prim_timeout = P30S; pc.weight = 1; proton
150424 16:39:01 [Note] WSREP: Service thread queue flushed.
150424 16:39:01 [Note] WSREP: Assign initial position for certification: 4984373331, protocol version: -1
150424 16:39:01 [Note] WSREP: wsrep_sst_grab()
150424 16:39:01 [Note] WSREP: Start replication
150424 16:39:01 [Note] WSREP: Setting initial position to 4f93c528-ac04-11e3-ae2c-1eedce9e1c84:4984373331
150424 16:39:01 [Note] WSREP: protonet asio version 0
150424 16:39:01 [Note] WSREP: Using CRC-32C (optimized) for message checksums.
150424 16:39:01 [Note] WSREP: backend: asio
150424 16:39:01 [Note] WSREP: GMCast version 0
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') listening at tcp://
0.0.0.0:4567150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') multicast: , ttl: 1
150424 16:39:01 [Note] WSREP: EVS version 0
150424 16:39:01 [Note] WSREP: PC version 0a
150424 16:39:01 [Warning] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' points to own listening address, blacklisting
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: (52251c1e-eaca-11e4-afaa-be5b30d718da, 'tcp://
0.0.0.0:4567') address 'tcp://
192.168.1.129:4567' pointing to uuid 52251c1e-eaca-11e4-afaa-be5b30d718da is blacklisted, skipping
150424 16:39:01 [Note] WSREP: declaring 0dfc66c2-acb3-11e4-b2e4-2ec7507bc014 stable
150424 16:39:01 [Note] WSREP: declaring 1a161bd2-c05d-11e4-95d0-2fe34f370ecf stable
150424 16:39:01 [Note] WSREP: declaring 45b5db39-a5d9-11e4-9c7e-86608fa5b128 stable
150424 16:39:01 [Note] WSREP: Node 0dfc66c2-acb3-11e4-b2e4-2ec7507bc014 state prim
150424 16:39:01 [Note] WSREP: view(view_id(PRIM,0dfc66c2-acb3-11e4-b2e4-2ec7507bc014,434) memb {
0dfc66c2-acb3-11e4-b2e4-2ec7507bc014,0
1a161bd2-c05d-11e4-95d0-2fe34f370ecf,0
45b5db39-a5d9-11e4-9c7e-86608fa5b128,0
52251c1e-eaca-11e4-afaa-be5b30d718da,0
} joined {
} left {
} partitioned {
})
150424 16:39:01 [Note] WSREP: gcomm: connected
150424 16:39:01 [Note] WSREP: Changing maximum packet size to 64500, resulting msg size: 32636
150424 16:39:01 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0)
150424 16:39:01 [Note] WSREP: Opened channel 'pivotrac_cluster'
150424 16:39:01 [Note] WSREP: New COMPONENT: primary = yes, bootstrap = no, my_idx = 3, memb_num = 4
150424 16:39:01 [Note] WSREP: Waiting for SST to complete.
150424 16:39:01 [Note] WSREP: STATE EXCHANGE: Waiting for state UUID.
150424 16:39:01 [Note] WSREP: STATE EXCHANGE: sent state msg: eec43f17-eaca-11e4-8ef0-06276ff16a39
150424 16:39:01 [Note] WSREP: STATE EXCHANGE: got state msg: eec43f17-eaca-11e4-8ef0-06276ff16a39 from 0 (dot113)
150424 16:39:01 [Note] WSREP: STATE EXCHANGE: got state msg: eec43f17-eaca-11e4-8ef0-06276ff16a39 from 1 ()
150424 16:39:01 [Note] WSREP: STATE EXCHANGE: got state msg: eec43f17-eaca-11e4-8ef0-06276ff16a39 from 2 (dot126)
150424 16:39:01 [Note] WSREP: STATE EXCHANGE: got state msg: eec43f17-eaca-11e4-8ef0-06276ff16a39 from 3 ()
150424 16:39:01 [Note] WSREP: Quorum results:
version   = 3,
component  = PRIMARY,
conf_id   = 282,
members   = 3/4 (joined/total),
act_id   = 4984816135,
last_appl. = -1,
protocols  = 0/5/2 (gcs/repl/appl),
group UUID = 4f93c528-ac04-11e3-ae2c-1eedce9e1c84
150424 16:39:01 [Note] WSREP: Flow-control interval: [32, 32]
150424 16:39:01 [Note] WSREP: Shifting OPEN -> PRIMARY (TO: 4984816135)
150424 16:39:01 [Note] WSREP: State transfer required:Â
Group state: 4f93c528-ac04-11e3-ae2c-1eedce9e1c84:4984816135
Local state: 4f93c528-ac04-11e3-ae2c-1eedce9e1c84:4984373331
150424 16:39:01 [Note] WSREP: New cluster view: global state: 4f93c528-ac04-11e3-ae2c-1eedce9e1c84:4984816135, view# 283: Primary, number of nodes: 4, my index: 3, protocol version 2
150424 16:39:01 [Note] WSREP: closing client connections for protocol change 3 -> 2
150424 16:39:03 [Warning] WSREP: Gap in state sequence. Need state transfer.
150424 16:39:05 [Note] WSREP: Running: 'wsrep_sst_rsync --role 'joiner' --address '192.168.1.129' --auth 'geek:snape99' --datadir '/var/lib/mysql/' --defaults-file '/etc/mysql/my.cnf' --parent '23362''
150424 16:39:05 [Note] WSREP: wsrep_notify_cmd is not defined, skipping notification.
150424 16:39:05 [Note] WSREP: REPL Protocols: 5 (3, 1)
150424 16:39:05 [Note] WSREP: Service thread queue flushed.
150424 16:39:05 [Note] WSREP: Assign initial position for certification: 4984816135, protocol version: 3
150424 16:39:05 [Note] WSREP: Service thread queue flushed.
150424 16:39:05 [Note] WSREP: Prepared IST receiver, listening at: tcp://
192.168.1.129:4568150424 16:39:05 [ERROR] WSREP: Requesting state transfer failed: -113(No route to host)
150424 16:39:05 [ERROR] WSREP: State transfer request failed unrecoverably: 113 (No route to host). Most likely it is due to inability to communicate with the cluster primary component. Restart required.
150424 16:39:05 [Note] WSREP: Closing send monitor...
150424 16:39:05 [Note] WSREP: Closed send monitor.
150424 16:39:05 [Note] WSREP: gcomm: terminating thread
150424 16:39:05 [Note] WSREP: gcomm: joining thread
150424 16:39:05 [Note] WSREP: view(view_id(NON_PRIM,0dfc66c2-acb3-11e4-b2e4-2ec7507bc014,434) memb {
52251c1e-eaca-11e4-afaa-be5b30d718da,0
} joined {
} left {
} partitioned {
0dfc66c2-acb3-11e4-b2e4-2ec7507bc014,0
1a161bd2-c05d-11e4-95d0-2fe34f370ecf,0
45b5db39-a5d9-11e4-9c7e-86608fa5b128,0
})
150424 16:39:05 [Note] WSREP: view((empty))
150424 16:39:05 [Note] WSREP: New COMPONENT: primary = no, bootstrap = no, my_idx = 0, memb_num = 1
150424 16:39:05 [Note] WSREP: gcomm: closed
150424 16:39:05 [Note] WSREP: Flow-control interval: [16, 16]
150424 16:39:05 [Note] WSREP: Received NON-PRIMARY.
150424 16:39:05 [Note] WSREP: Shifting PRIMARY -> OPEN (TO: 4984816529)
150424 16:39:05 [Note] WSREP: Received self-leave message.
150424 16:39:05 [Note] WSREP: Flow-control interval: [0, 0]
150424 16:39:05 [Note] WSREP: Received SELF-LEAVE. Closing connection.
150424 16:39:05 [Note] WSREP: Shifting OPEN -> CLOSED (TO: 4984816529)
150424 16:39:05 [Note] WSREP: RECV thread exiting 0: Success
150424 16:39:05 [Note] WSREP: recv_thread() joined.
150424 16:39:05 [Note] WSREP: Closing replication queue.
150424 16:39:05 [Note] WSREP: Closing slave action queue.
150424 16:39:05 [Note] WSREP: /usr/sbin/mysqld: Terminated.
150424 16:39:05 mysqld_safe mysqld from pid file /var/lib/mysql/dot129.pid ended
WSREP_SST: [ERROR] Parent mysqld process (PID:23362) terminated unexpectedly. (20150424 16:39:06.673)
WSREP_SST: [INFO] Joiner cleanup. (20150424 16:39:06.676)
WSREP_SST: [INFO] Joiner cleanup done. (20150424 16:39:07.185)
                                 Â