Alex,
With my proposed fix there are several scenarios, and in one of them
gives me very serious problem.
So following scenario:
we did clean shutdown (via mysqladmin shutdown).
Now we start with
"mysqld_safe &"
In this case the script works, but mysqld does not go to InnoDB
initialization and does not create socket.
There is log:
121029 16:28:08 mysqld_safe Starting mysqld daemon with databases from
/mnt/data/mysql
121029 16:28:08 mysqld_safe WSREP: Running position recovery
121029 16:28:15 mysqld_safe WSREP: Recovered position
2f8acf7d-1f93-11e2-0800-316fe0e276a3:62610891
121029 16:28:15 [Note] WSREP: wsrep_start_position var submitted:
'2f8acf7d-1f93-11e2-0800-316fe0e276a3:62610891'
121029 16:28:15 [Note] WSREP: Read nil XID from storage engines,
skipping position init
121029 16:28:15 [Note] WSREP: wsrep_load(): loading provider library
'/usr/local/mysql/lib/libgalera_smm.so'
121029 16:28:15 [Note] WSREP: wsrep_load(): Galera 2.2(r137) by
121029 16:28:15 [Note] WSREP: Found saved state:
2f8acf7d-1f93-11e2-0800-316fe0e276a3:62610891
121029 16:28:15 [Note] WSREP: Reusing existing '/mnt/data/mysql//galera.cache'.
121029 16:28:15 [Note] WSREP: Passing config to GCS: base_host =
10.7.77.252; base_port = 4567; cert.log_conflicts = no; gcache.dir =
/mnt/data/mysql/; gcache.keep_pages_size = 0; gcache.mem_size = 0;
gcache.name = /mnt/data/mysql//galera.cache; gcache.page_size = 128M;
gcache.size = 128M; gcs.fc_debug = 0; gcs.fc_factor = 1; gcs.fc_limit
= 16; gcs.fc_master_slave = NO; gcs.max_packet_size = 64500;
gcs.max_throttle = 0.25; gcs.recv_q_hard_limit = 9223372036854775807;
gcs.recv_q_soft_limit = 0.25; gcs.sync_donor = NO;
replicator.causal_read_timeout = PT30S; replicator.commit_order = 3
121029 16:28:15 [Note] WSREP: Assign initial position for
certification: 62610891, protocol version: -1
121029 16:28:15 [Note] WSREP: wsrep_sst_grab()
121029 16:28:15 [Note] WSREP: Start replication
121029 16:28:15 [Note] WSREP: Setting initial position to
2f8acf7d-1f93-11e2-0800-316fe0e276a3:62610891
121029 16:28:15 [Note] WSREP: protonet asio version 0
121029 16:28:15 [Note] WSREP: backend: asio
121029 16:28:15 [Note] WSREP: GMCast version 0
121029 16:28:15 [Note] WSREP: (50558e15-2220-11e2-0800-1792f80bd29d,
'tcp://
0.0.0.0:4567') listening at tcp://
0.0.0.0:4567
121029 16:28:15 [Note] WSREP: (50558e15-2220-11e2-0800-1792f80bd29d,
'tcp://
0.0.0.0:4567') multicast: , ttl: 1
121029 16:28:15 [Note] WSREP: EVS version 0
121029 16:28:15 [Note] WSREP: PC version 0
121029 16:28:15 [Note] WSREP: gcomm: connecting to group
'trimethylxanthine', peer '
10.7.75.174:'
121029 16:28:15 [Note] WSREP: (50558e15-2220-11e2-0800-1792f80bd29d,
'tcp://
0.0.0.0:4567') turning message relay requesting on, nonlive
peers: tcp://
10.7.74.195:4567 tcp://
10.7.76.10:4567
121029 16:28:15 [Note] WSREP: (50558e15-2220-11e2-0800-1792f80bd29d,
'tcp://
0.0.0.0:4567') cleaning up duplicate 0x14dd3f0 after
established 0x14d4d90
121029 16:28:15 [Note] WSREP: (50558e15-2220-11e2-0800-1792f80bd29d,
'tcp://
0.0.0.0:4567') turning message relay requesting off
121029 16:28:16 [Note] WSREP: declaring
2f8a1896-1f93-11e2-0800-206d6e27783b stable
121029 16:28:16 [Note] WSREP: declaring
6c47b8cf-1f95-11e2-0800-f4fdffe1cfa7 stable
121029 16:28:16 [Note] WSREP: declaring
f7a93de1-1f94-11e2-0800-fb06dc57f95a stable
121029 16:28:16 [Note] WSREP:
view(view_id(PRIM,2f8a1896-1f93-11e2-0800-206d6e27783b,60) memb {
2f8a1896-1f93-11e2-0800-206d6e27783b,
50558e15-2220-11e2-0800-1792f80bd29d,
6c47b8cf-1f95-11e2-0800-f4fdffe1cfa7,
f7a93de1-1f94-11e2-0800-fb06dc57f95a,
} joined {
} left {
} partitioned {
})
121029 16:28:16 [Note] WSREP: gcomm: connected
121029 16:28:16 [Note] WSREP: Changing maximum packet size to 64500,
resulting msg size: 32636
121029 16:28:16 [Note] WSREP: Shifting CLOSED -> OPEN (TO: 0)
121029 16:28:16 [Note] WSREP: Opened channel 'trimethylxanthine'
121029 16:28:16 [Note] WSREP: New COMPONENT: primary = yes, bootstrap
= no, my_idx = 1, memb_num = 4
121029 16:28:16 [Note] WSREP: STATE EXCHANGE: Waiting for state UUID.
121029 16:28:16 [Note] WSREP: Waiting for SST to complete.
121029 16:28:16 [Note] WSREP: STATE EXCHANGE: sent state msg:
508d6c4b-2220-11e2-0800-818688cdec35
121029 16:28:16 [Note] WSREP: STATE EXCHANGE: got state msg:
508d6c4b-2220-11e2-0800-818688cdec35 from 0 (node1)
121029 16:28:16 [Note] WSREP: STATE EXCHANGE: got state msg:
508d6c4b-2220-11e2-0800-818688cdec35 from 2 (node4)
121029 16:28:16 [Note] WSREP: STATE EXCHANGE: got state msg:
508d6c4b-2220-11e2-0800-818688cdec35 from 3 (node3)
121029 16:28:16 [Note] WSREP: STATE EXCHANGE: got state msg:
508d6c4b-2220-11e2-0800-818688cdec35 from 1 (node4)
121029 16:28:16 [Note] WSREP: Quorum results:
version = 2,
component = PRIMARY,
conf_id = 53,
members = 4/4 (joined/total),
act_id = 62610891,
last_appl. = -1,
protocols = 0/4/2 (gcs/repl/appl),
group UUID = 2f8acf7d-1f93-11e2-0800-316fe0e276a3
121029 16:28:16 [Note] WSREP: Flow-control interval: [32, 32]
121029 16:28:16 [Note] WSREP: Restored state OPEN -> JOINED (62610891)
121029 16:28:16 [Note] WSREP: New cluster view: global state:
2f8acf7d-1f93-11e2-0800-316fe0e276a3:62610891, view# 54: Primary,
number of nodes: 4, my index: 1, protocol version 2
121029 16:28:16 [Note] WSREP: wsrep_notify_cmd is not defined,
skipping notification.
121029 16:28:16 [Note] WSREP: Assign initial position for
certification: 62610891, protocol version: 2
121029 16:28:16 [Note] WSREP: Member 1 (node4) synced with group.
121029 16:28:16 [Note] WSREP: Shifting JOINED -> SYNCED (TO: 62610891)
121029 16:28:16 [Note] WSREP: Synchronized with group, ready for connections
121029 16:28:16 [Note] WSREP: wsrep_notify_cmd is not defined,
skipping notification.
On this stage nothing happens and as you see from the log, InnoDB and
socket were not initialized.
Thanks,
Vadim