5.5.28 node 2 crashes with following error in production

117 views
Skip to first unread message

Amol

unread,
Feb 5, 2013, 6:06:06 PM2/5/13
to percona-d...@googlegroups.com
my production server is running on 5.5.28 on ubuntu 10.04 LTS and the node2 crashed and this is what i see in the node 2 log

$ sudo dpkg --list | grep percona*

ii  percona-xtrabackup                2.0.3-470.lucid                   Open source backup tool for InnoDB and XtraD
ii  percona-xtradb-cluster-client-5.5 5.5.28-23.7-369.lucid             Percona Server database client binaries
ii  percona-xtradb-cluster-common-5.5 5.5.28-23.7-369.lucid             Percona Server database common files (e.g. /
ii  percona-xtradb-cluster-galera-2.x 117.lucid                         Galera components of Percona XtraDB Cluster
ii  percona-xtradb-cluster-server-5.5 5.5.28-23.7-369.lucid             Percona Server database server binaries



can anyone please explain this? its a production environment so i am worried

error from mysql/error.log

130205  6:32:26 [ERROR] Slave SQL: Error 'Can't DROP 'as_a_valued_customer_we_would_like_to_keep_you_informed_with_inf'; check that column/key exists' on query. Default database: 'mobile'. Query: 'ALTER TABLE _data11279_clf_test DROP as_a_valued_customer_we_would_like_to_keep_you_informed_with_inf', Error_code: 1091
130205  6:32:26 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4271369
130205  6:32:26 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 2955309 trx_id: -1 seqnos (l: 624437, g: 4271369, s: 4271367, d: 4271368, ts: 1360063946151320234)
130205  6:42:10 [ERROR] Slave SQL: Error 'Can't DROP 'my_element1'; check that column/key exists' on query. Default database: 'mobile'. Query: 'ALTER TABLE _data11279_clf_test DROP my_element1', Error_code: 1091
130205  6:42:10 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4272415
130205  6:42:10 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 2966052 trx_id: -1 seqnos (l: 625489, g: 4272415, s: 4272414, d: 4272414, ts: 1360064530206809819)
130205 10:40:17 [ERROR] Slave SQL: Error 'Unknown column 'water_s' in '_data283200_water_track_sheet'' on query. Default database: 'mobile'. Query: 'ALTER TABLE _data283200_water_track_sheet CHANGE water_s water_supply_off TEXT', Error_code: 1054
130205 10:40:17 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4290960
130205 10:40:17 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 3140133 trx_id: -1 seqnos (l: 644090, g: 4290960, s: 4290959, d: 4290959, ts: 1360078817756048805)
130205 10:42:40 [ERROR] Slave SQL: Error 'Unknown column 'water_s' in '_data283200_water_track_sheet'' on query. Default database: 'mobile'. Query: 'ALTER TABLE _data283200_water_track_sheet CHANGE water_s water_supply_info TEXT', Error_code: 1054
130205 10:42:40 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4291068
130205 10:42:40 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 3141270 trx_id: -1 seqnos (l: 644198, g: 4291068, s: 4291067, d: 4291067, ts: 1360078960292961657)
130205 13:30:18 [ERROR] Slave SQL: Error 'Can't DROP 'div1'; check that column/key exists' on query. Default database: 'mobile'. Query: 'ALTER TABLE _data283206_fleet_maintenance_subscription DROP div1', Error_code: 1091
130205 13:30:18 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4312212
130205 13:30:18 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 3324335 trx_id: -1 seqnos (l: 665387, g: 4312212, s: 4312209, d: 4312211, ts: 1360089018748799843)
130205 15:32:12 [ERROR] Slave SQL: Error 'There is no such grant defined for user 'smilesuser' on host 'localhost'' on query. Default database: 'mysql'. Query: 'REVOKE ALL PRIVILEGES ON `exzactmobileZCdb`.* FROM 'smilesuser'@'localhost'', Error_code: 1141
130205 15:32:12 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4330311
130205 15:32:12 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 3474375 trx_id: -1 seqnos (l: 683543, g: 4330311, s: 4330310, d: 4330310, ts: 1360096332712438748)
130205 15:32:12 [ERROR] Slave SQL: Error 'There is no such grant defined for user 'smilesuser' on host 'localhost'' on query. Default database: 'mysql'. Query: 'REVOKE GRANT OPTION ON `exzactmobileZCdb`.* FROM 'smilesuser'@'localhost'', Error_code: 1141
130205 15:32:12 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4330312
130205 15:32:12 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 3474375 trx_id: -1 seqnos (l: 683544, g: 4330312, s: 4330311, d: 4330311, ts: 1360096332733166590)
130205 16:34:06 [Warning] WSREP: last inactive check more than PT1.5S ago, skipping check
21:51:12 UTC - mysqld got signal 7 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona Server better by reporting any

key_buffer_size=33554432
read_buffer_size=131072
max_used_connections=19
max_threads=300
thread_count=4
connection_count=4
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 689362 K  bytes of memory
Hope that's ok; if not, decrease some variables in the equation.

Thread pointer: 0x51ca080
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 7fe5db9c5a68 thread_stack 0x40000
/usr/sbin/mysqld(my_print_stacktrace+0x35)[0x7f2dd5]
/usr/sbin/mysqld(handle_fatal_signal+0x4a4)[0x6bc344]
/lib/libpthread.so.0(+0xf8f0)[0x7fe63fbe58f0]
/usr/sbin/mysqld[0x8ec8e0]
/usr/sbin/mysqld[0x8ed2da]
/usr/sbin/mysqld[0x8ee01c]
/usr/sbin/mysqld[0x8ee60c]
/usr/sbin/mysqld[0x8e1d27]
/usr/sbin/mysqld[0x8f0434]
/usr/sbin/mysqld[0x8f0fcf]
/usr/sbin/mysqld[0x8e08cb]
/usr/sbin/mysqld[0x924f45]
/usr/sbin/mysqld[0x9285d7]
/usr/sbin/mysqld[0x8c0d7d]
/usr/sbin/mysqld[0x8c7611]
/usr/sbin/mysqld[0x97f361]
/usr/sbin/mysqld[0x981799]
/usr/sbin/mysqld[0x872b26]
/usr/sbin/mysqld[0x855b40]
/usr/sbin/mysqld(_ZN7handler12ha_write_rowEPh+0x5e)[0x6c106e]                                                                                                                                                                                                                                                              /usr/sbin/mysqld[0x5a3466]
/usr/sbin/mysqld(_Z14wsrep_apply_cbPvPKvml+0x8e)[0x5a3ace]
/usr/lib64/libgalera_smm.so(+0x1a9b32)[0x7fe63c6b9b32]
/usr/lib64/libgalera_smm.so(_ZN6galera13ReplicatorSMM9apply_trxEPvPNS_9TrxHandleE+0x222)[0x7fe63c6c3d72]
/usr/lib64/libgalera_smm.so(_ZN6galera13ReplicatorSMM11process_trxEPvPNS_9TrxHandleE+0x45)[0x7fe63c6c4905]
/usr/lib64/libgalera_smm.so(_ZN6galera15GcsActionSource8dispatchEPvRK10gcs_action+0x305)[0x7fe63c698945]
/usr/lib64/libgalera_smm.so(_ZN6galera15GcsActionSource7processEPv+0x58)[0x7fe63c698ee8]
/usr/lib64/libgalera_smm.so(_ZN6galera13ReplicatorSMM10async_recvEPv+0x7d)[0x7fe63c6baf5d]
/usr/lib64/libgalera_smm.so(galera_recv+0x23)[0x7fe63c6d6a13]
/usr/sbin/mysqld(_Z25wsrep_replication_processP3THD+0x52)[0x5a2fd2]
/usr/sbin/mysqld(start_wsrep_THD+0x41b)[0x524e3b]
/lib/libpthread.so.0(+0x69ca)[0x7fe63fbdc9ca]
/lib/libc.so.6(clone+0x6d)[0x7fe63ee6f21d]

Trying to get some variables.
Some pointers may be invalid and cause the dump to abort.
Query (0): is an invalid pointer
Connection ID (thread ID): 3
Status: NOT_KILLED

You may download the Percona Server operations manual by visiting
in the manual which will help you identify the cause of the crash.
130205 16:51:14 mysqld_safe mysqld from pid file /var/lib/mysql/iform-db-clusternode2.pid ended
130205 17:01:21 mysqld_safe Starting mysqld daemon with databases from /var/lib/mysql
130205 17:01:21 mysqld_safe WSREP: 'wsrep_urls' is DEPRECATED! Use wsrep_cluster_address to specify multiple addresses instead.
130205 17:01:21 mysqld_safe WSREP: Running position recovery with --log_error=/tmp/tmp.wdArKbXjGS

Amol

unread,
Feb 10, 2013, 4:06:20 PM2/10/13
to percona-d...@googlegroups.com
i see another crash today on the same node here are the errors from the mysql/error.log


130210 12:00:19 [ERROR] Slave SQL: Error 'Can't DROP 'section_f2'; check that column/key exists' on query. Default database: 'mobiledb'. Query: 'ALTER TABLE _data280433_nfirs2_sectionf DROP section_f2', Error_code: 1091
130210 12:00:19 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4775290
130210 12:00:19 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 8868654 trx_id: -1 seqnos (l: 157331, g: 4775290, s: 4775289, d: 4775289, ts: 1360515619033508849)
130210 13:49:39 [ERROR] Slave SQL: Error 'Can't DROP 'documentation'; check that column/key exists' on query. Default database: 'mobiledb'. Query: 'ALTER TABLE _data13641_greek_life_commons_parent DROP documentation', Error_code: 1091
130210 13:49:39 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4779118
130210 13:49:39 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 8942544 trx_id: -1 seqnos (l: 161168, g: 4779118, s: 4779117, d: 4779117, ts: 1360522179726553449)
130210 13:50:10 [ERROR] Slave SQL: Error 'Can't DROP 'site_audit'; check that column/key exists' on query. Default database: 'mobiledb'. Query: 'ALTER TABLE _data13641_greek_life_commons_parent DROP site_audit', Error_code: 1091
130210 13:50:10 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4779131
130210 13:50:10 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 8942823 trx_id: -1 seqnos (l: 161181, g: 4779131, s: 4779129, d: 4779130, ts: 1360522210866430462)
130210 14:01:34 [ERROR] Slave SQL: Error 'Can't DROP 'note_this_audit_form_is_not_inclusive_of_all_2010_fire_code_of'; check that column/key exists' on query. Default database: 'mobiledb'. Query: 'ALTER TABLE _data13641_greek_life_commons_parent DROP note_this_audit_form_is_not_inclusive_of_all_2010_fire_code_of', Error_code: 1091
130210 14:01:34 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4779439
130210 14:01:34 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 8950420 trx_id: -1 seqnos (l: 161490, g: 4779439, s: 4779438, d: 4779438, ts: 1360522894432074470)
130210 14:28:08 [ERROR] Slave SQL: Error 'Unknown column 'no_electrical' in '_data13641_fire_resistance_rated_construction_sub'' on query. Default database: 'mobiledb'. Query: 'ALTER TABLE _data13641_fire_resistance_rated_construction_sub CHANGE no_electrical doors_requiring VARCHAR(100)', Error_code: 1054
130210 14:28:08 [Warning] WSREP: RBR event 1 Query apply warning: 1, 4780963
130210 14:28:08 [Warning] WSREP: Ignoring error for TO isolated action: source: ed2330e5-6b01-11e2-0800-1d4b82ab22d6 version: 2 local: 0 state: APPLYING flags: 65 conn_id: 8969855 trx_id: -1 seqnos (l: 163017, g: 4780963, s: 4780962, d: 4780962, ts: 1360524488670861699)
19:55:44 UTC - mysqld got signal 11 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona Server better by reporting any

key_buffer_size=33554432
read_buffer_size=131072
max_used_connections=9
max_threads=300
thread_count=3
connection_count=3
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_size)*max_threads = 689362 K  bytes of memory
Hope that's ok; if not, decrease some variables in the equation.

Thread pointer: 0x0
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 0 thread_stack 0x40000
/usr/sbin/mysqld(my_print_stacktrace+0x35)[0x7f2dd5]
/usr/sbin/mysqld(handle_fatal_signal+0x4a4)[0x6bc344]
/lib/libpthread.so.0(+0xf8f0)[0x7fd6278758f0]
/usr/lib64/libgalera_smm.so(_ZN6gcache10RingBuffer14get_new_bufferEl+0x167)[0x7fd62423c677]
/usr/lib64/libgalera_smm.so(_ZN6gcache10RingBuffer6mallocEl+0x39)[0x7fd62423c899]
/usr/lib64/libgalera_smm.so(_ZN6gcache6GCache6mallocEl+0x97)[0x7fd62423e227]
/usr/lib64/libgalera_smm.so(gcs_defrag_handle_frag+0x92)[0x7fd6242f9822]
/usr/lib64/libgalera_smm.so(gcs_core_recv+0x4d1)[0x7fd6242ff281]
/usr/lib64/libgalera_smm.so(+0x165e60)[0x7fd624305e60]
/lib/libpthread.so.0(+0x69ca)[0x7fd62786c9ca]
/lib/libc.so.6(clone+0x6d)[0x7fd626aff21d]
You may download the Percona Server operations manual by visiting
in the manual which will help you identify the cause of the crash.
130210 14:55:45 mysqld_safe Number of processes running now: 0
130210 14:55:45 mysqld_safe WSREP: not restarting wsrep node automatically

kasi viswanadh jaladi

unread,
May 22, 2016, 3:21:21 PM5/22/16
to Percona Discussion
it seems the data is inconsistent on the node2. So better made the entire data directory empty and start the server. SST should happen and data should be in sync.
Reply all
Reply to author
Forward
0 new messages