Hi guys, we are having some weird issue.
We got a 4 master cluster schema with galera/mysql 5.5.23 ( 3 r/w
nodes and one node of reference )
We create a new database and filled it with tables and data.
After that, i ran a "mysqldump" from the node of reference, and the
dump is ok, THEN! i stop and start the node of reference, the SST is
done, but when i try to read one of the tables, i get this error :
- - - - - - - - - - - - - - - - - - - - -
May 28 11:57:55 i-00000092-asm mysqld: 120528 11:57:55 [ERROR] Cannot
find or open table nicira1/projects from
May 28 11:57:55 i-00000092-asm mysqld: the internal data dictionary of
InnoDB though the .frm file for the
May 28 11:57:55 i-00000092-asm mysqld: table exists. Maybe you have
deleted and recreated InnoDB data
May 28 11:57:55 i-00000092-asm mysqld: files but have forgotten to
delete the corresponding .frm files
May 28 11:57:55 i-00000092-asm mysqld: of InnoDB tables, or you have
moved .frm files to another database?
May 28 11:57:55 i-00000092-asm mysqld: or, the table contains indexes
that this version of the engine
May 28 11:57:55 i-00000092-asm mysqld: doesn't support.
May 28 11:57:55 i-00000092-asm mysqld: See
http://dev.mysql.com/doc/refman/5.5/en/innodb-troubleshooting.html
May 28 11:57:55 i-00000092-asm mysqld: how you can resolve the
problem.
- - - - - - - - - - - - - - - - - - - - -
And from the client perspective, it says that the table doesnt exist.
This is the log from the Node of Reference in the SST time.
- - - - - - - - - - - - - - - - - - - - -
May 28 11:56:31 i-00000092-asm mysqld: version = 2,
May 28 11:56:31 i-00000092-asm mysqld: component = PRIMARY,
May 28 11:56:31 i-00000092-asm mysqld: conf_id = 64,
May 28 11:56:31 i-00000092-asm mysqld: members = 3/4 (joined/
total),
May 28 11:56:31 i-00000092-asm mysqld: act_id = 210837192,
May 28 11:56:31 i-00000092-asm mysqld: last_appl. = -1,
May 28 11:56:31 i-00000092-asm mysqld: protocols = 0/3/1 (gcs/repl/
appl),
May 28 11:56:31 i-00000092-asm mysqld: group UUID =
3108998a-67d4-11e1-0800-84a08b380d82
May 28 11:56:31 i-00000092-asm mysqld: 120528 11:56:31 [Note] WSREP:
Flow-control interval: [16, 32]
May 28 11:56:31 i-00000092-asm mysqld: 120528 11:56:31 [Note] WSREP:
Shifting OPEN -> PRIMARY (TO: 210837192)
May 28 11:56:31 i-00000092-asm mysqld: 120528 11:56:31 [Note] WSREP:
State transfer required:
May 28 11:56:31 i-00000092-asm mysqld: Group state:
3108998a-67d4-11e1-0800-84a08b380d82:210837192
May 28 11:56:31 i-00000092-asm mysqld: Local state:
3108998a-67d4-11e1-0800-84a08b380d82:210835601
May 28 11:56:31 i-00000092-asm mysqld: 120528 11:56:31 [Note] WSREP:
New cluster view: global state:
3108998a-67d4-11e1-0800-84a08b380d82:210192, view# 65: Primary, number
of nodes: 4, my index: 3, protocol version 1
May 28 11:56:31 i-00000092-asm mysqld: 120528 11:56:31 [Warning]
WSREP: Gap in state sequence. Need state transfer.
May 28 11:56:31 i-00000092-asm mysqld: 120528 11:56:31 [Note] WSREP:
[debug]: closing client connections for PRIM
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
waiting for client connections to close: 2
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
Running: 'wsrep_sst_rsync 'joiner' '172.16.159.121' 'wsrep_sst:wspass'
'r/lib/mysql/' '/etc/mysql/conf.d/wsrep.cnf' '28132' 2>sst.err'
May 28 11:56:33 i-00000092-asm rsyncd[28175]: rsyncd version 3.0.7
starting, listening on port 4444
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
Prepared SST request: rsync|
172.16.159.121:4444/rsync_sst
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
wsrep_notify_cmd is not defined, skipping notification.
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
Assign initial position for certification: 210837192, protocol
version:
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
Prepared IST receiver, listening at: tcp://
172.16.159.121:4568
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
Node 3 (i-00000092-asm) requested state transfer from '*any*'.
Selected i-0000015c-psm)(SYNCED) as donor.
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
Shifting PRIMARY -> JOINER (TO: 210837263)
May 28 11:56:33 i-00000092-asm mysqld: 120528 11:56:33 [Note] WSREP:
Requesting state transfer: success, donor: 0
May 28 11:56:33 i-00000092-asm rsyncd[28186]: name lookup failed for
172.16.148.95: Temporary failure in name resolution
May 28 11:56:33 i-00000092-asm rsyncd[28186]: connect from UNKNOWN
(172.16.148.95)
May 28 11:56:33 i-00000092-asm rsyncd[28186]: rsync to rsync_sst/ from
UNKNOWN (172.16.148.95)
May 28 11:56:33 i-00000092-asm rsyncd[28186]: receiving file list
May 28 11:56:33 i-00000092-asm rsyncd[28186]: sent 54 bytes received
182 bytes total size 47
May 28 11:56:35 i-00000092-asm rsyncd[28175]: rsync error: received
SIGINT, SIGTERM, or SIGHUP (code 20) at rsync.c(541) [Receiver=3.0.7]
May 28 11:56:35 i-00000092-asm mysqld: 120528 11:56:35 [Note] WSREP:
SST complete, seqno: 210835601
May 28 11:56:35 i-00000092-asm mysqld: 120528 11:56:35 InnoDB: The
InnoDB memory heap is disabled
May 28 11:56:35 i-00000092-asm mysqld: 120528 11:56:35 InnoDB: Mutexes
and rw_locks use GCC atomic builtins
May 28 11:56:35 i-00000092-asm mysqld: 120528 11:56:35 InnoDB:
Compressed tables use zlib 1.2.3.3
May 28 11:56:35 i-00000092-asm mysqld: 120528 11:56:35 InnoDB: Using
Linux native AIO
May 28 11:56:35 i-00000092-asm mysqld: 120528 11:56:35 InnoDB:
Initializing buffer pool, size = 4.0G
May 28 11:56:35 i-00000092-asm mysqld: 120528 11:56:35 InnoDB:
Completed initialization of buffer pool
May 28 11:56:35 i-00000092-asm mysqld: 120528 11:56:35 InnoDB: highest
supported file format is Barracuda.
May 28 11:56:37 i-00000092-asm mysqld: 120528 11:56:37 InnoDB:
Waiting for the background threads to start
May 28 11:56:38 i-00000092-asm mysqld: 120528 11:56:38 InnoDB: 1.1.8
started; log sequence number 100441910215
May 28 11:56:38 i-00000092-asm mysqld: 120528 11:56:38 [Note] WSREP:
Initial TC log open: dummy
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] Event
Scheduler: Loaded 0 events
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
Signalling provider to continue.
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
Received SST: 3108998a-67d4-11e1-0800-84a08b380d82:210835601
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
SST received: 3108998a-67d4-11e1-0800-84a08b380d82:210835601
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
Receiving IST: 1591 writesets, seqnos 210835601-210837192
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] /usr/
sbin/mysqld: ready for connections.
May 28 11:56:39 i-00000092-asm mysqld: Version: '5.5.23' socket: '/
var/run/mysqld/mysqld.sock' port: 3306 Source distribution,
wsrep_23.543
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
IST received: 3108998a-67d4-11e1-0800-84a08b380d82:210837192
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP: 0
(i-0000015c-psm): State transfer to 3 (i-00000092-asm) complete.
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP: 3
(i-00000092-asm): State transfer from 0 (i-0000015c-psm) complete.
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
Shifting JOINER -> JOINED (TO: 210837519)
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
Member 0 (i-0000015c-psm) synced with group.
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
Member 3 (i-00000092-asm) synced with group.
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
Shifting JOINED -> SYNCED (TO: 210837520)
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
Synchronized with group, ready for connections
May 28 11:56:39 i-00000092-asm mysqld: 120528 11:56:39 [Note] WSREP:
wsrep_notify_cmd is not defined, skipping notification.
- - - - - - - - - - - - - - - - - - - - -
What might be happening in the SST time ? ( if the problem is in the
SST )