After a Schema-Change on which I have missed out to take care about replication, I now have one Server not synching anymore, as it tries to alter the table which already has been taken care of (Server 1&2 have the additional column and are fine, Server 3 tries to insert it again - errors and therewith is in wsrep_local_state: 2
The error log is quite clear with that:
150728 15:19:29 [ERROR] Slave SQL: Error 'Duplicate column name 'somecolumn'' on query. Default database: 'my_db'. Query: 'ALTER TABLE tablename ADD somecolumn DATETIME NULL', Internal MariaDB error code: 1060
150728 15:19:29 [Warning] WSREP: RBR event 1 Query apply warning: 1, 195651391
150728 15:19:29 [Warning] WSREP: Ignoring error for TO isolated action: source: f7715036-20ca-11e5-a90f-f23b6463dd16 version: 3 local: 0 state: APPLYING flags: 65 conn_id: 2348812 trx_id: -1 seqnos (l: 14350890, g: 195651391, s: 195651390, d: 195651390, ts: 20916266579392088)
As it is quite obvious this transaction can be skipped, I would like to do so - but can't really find anywhere how to (only for a master-slave setup with a "SQL_SLAVE_SKIP_COUNTER=1" - which won't work on galera multi-master).
Thanks in advance,
Regards Chris
Server 3 does not try to do that - it has no brains for that. A cluster
client tries to do that. Or replication event from async master (which
can be skipped as usual). So look elsewhere.
Also, errors don't send the server to state 2. Either joiner or client
request does that.
> The error log is quite clear with that:
>
> 150728 15:19:29 [ERROR] Slave SQL: Error 'Duplicate column name
> 'somecolumn'' on query. Default database: 'my_db'. Query: 'ALTER TABLE
> tablename ADD somecolumn DATETIME NULL', Internal MariaDB error code:
> 1060
This error clearly states that this node already has this column. Is
this message from Server 3?