The issue is not with "corrupt queues" but with non-existent bindings that are still
left recorded in some tables.
On 19 August 2016 at 16:20:29, Mike Roest (
mike....@replicon.com) wrote:
> So interesting piece of data. The corrupt queue seems to happen solely
> from attempting to delete the queue via the 3.2.3 admin UI. We've now had
> 2 other clusters get into this scenario with a broken queue when I try to
> work around the non working bindings by deleteing the queue through the UI
> and have our software rebuild the queue from a single process before the
> rest of processes come online.
>
> I'm still working on reproducing the actual issue where we get queue
> bindings that show in the Admin but don't actually capture messages.
>
>
>
> On Wed, Aug 17, 2016 at 1:06 PM, Mike Roest wrote:
>
> > Any other possibility's you can think of for a non-destructive recovery?
> > We're planning for a downtime on the weekend to rebuild the cluster but if
> > I can get away without it that would be helpful.
> >
> > Thanks again for all your help. I'm going to start an additional thread
> > about the bindings that seem to not be working once I can reproduce it in a
> > staging environment.
> >
> > --Mike
> >
> >
> > On Tue, Aug 16, 2016 at 3:36 PM, Mike Roest
> > wrote:
> >
> >> But deleting the vhost I would have to do on the running node correct?
> >> If I delete that vhost that's effectively the same as shutting down
> >> everything since it's the only vhost we have.
> >>
> >>
> >> On Tue, Aug 16, 2016 at 3:33 PM, Michael Klishin
> >> wrote:
> >>
> >>> You are deleting node data correctly but then the node syncs from another
> >>> node.
> >>>
> >>> Vhosts are deleted with rabbitmqctl, see its man page.
> >>>
> >>> On Wed, Aug 17, 2016 at 12:10 AM, Mike Roest
> >>>> wrote:
> >>>>
> >>>>> You can try deleting a single vhost the problematic exchange and
> >>>>> bindings is/are in,
> >>>>> not everything.
> >>>>>
> >>>>> On Wed, Aug 17, 2016 at 12:03 AM, Michael Klishin > >>>>> > wrote:
> >>>>>
> >>>>>> I find it very hard to believe that a node can fail to start with
> >>>>>> this very exception in the log with a blank
> >>>>>> node data directory: there simply aren't any exchanges or bindings to
> >>>>>> load, or peers to sync from.
> >>>>>>
> >>>>>> On Tue, Aug 16, 2016 at 11:56 PM, Mike Roest > >>>>>> > wrote:
> >>>>>>
> >>>>>>> Well that eval worked (on the up server):
> >>>>>>> ok
> >>>>>>> ...done.
> >>>>>>>
> >>>>>>> But down server still fails to startup with or without removing the
> >>>>>>> mnesia dir.
> >>>>>>>
> >>>>>>> RabbitMQ 3.2.4. Copyright (C) 2007-2013 GoPivotal, Inc.
> >>>>>>> ## ## Licensed under the MPL. See
http://www.rabbitmq.com/
> >>>>>>> ## ##
> >>>>>>> ########## Logs: /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>> ###### ## /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>> ##########
> >>>>>>> Starting broker...
> >>>>>>>
> >>>>>>> BOOT FAILED
> >>>>>>> ===========
> >>>>>>>
> >>>>>>> Error description:
> >>>>>>> {error,{{badmatch,{error,[{<<188,169,169,0,103,98,111,64,60
> >>>>>>> ,75,49,4,225,243,
> >>>>>>> 167,138>>,
> >>>>>>> "user"},
> >>>>>>> {<<149,242,33,30,66,247,121,1
> >>>>>>> 51,21,73,228,180,181,
> >>>>>>> 31,120,103>>,
> >>>>>>> "replicon"},
> >>>>>>> {<<139,12,224,72,32,237,89,12
> >>>>>>> 1,247,232,1,240,160,
> >>>>>>> 86,156,211>>,
> >>>>>>> "*"},
> >>>>>>> {<<135,223,75,10,43,90,124,15
> >>>>>>> 5,128,187,197,170,87,
> >>>>>>> 190,157,35>>,
> >>>>>>> "tenant"},
> >>>>>>> {root,none}],
> >>>>>>> ["policy-set-assignment","modified"]}},
> >>>>>>> [{rabbit_exchange_type_topic,follow_down_get_path,2},
> >>>>>>> {rabbit_exchange_type_topic,'-
> >>>>>>> remove_bindings/3-lc$^1/1-1-',1},
> >>>>>>> {rabbit_exchange_type_topic,remove_bindings,3},
> >>>>>>> {rabbit_binding,x_callback,4},
> >>>>>>> {rabbit_binding,'-process_deletions/1-fun-0-',2},
> >>>>>>> {dict,map_bucket,2},
> >>>>>>> {dict,map_bkt_list,2},
> >>>>>>> {dict,map_bkt_list,2}]}}
> >>>>>>>
> >>>>>>> Log files (may contain more information):
> >>>>>>> /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>> /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>>
> >>>>>>> Stack trace:
> >>>>>>> [{rabbit_misc,execute_mnesia_transaction,1},
> >>>>>>> {rabbit_misc,execute_mnesia_tx_with_tail,1},
> >>>>>>> {rabbit_amqqueue,recover,0},
> >>>>>>> {rabbit,recover,0},
> >>>>>>> {rabbit,'-run_boot_step/1-lc$^1/1-1-',1},
> >>>>>>> {rabbit,run_boot_step,1},
> >>>>>>> {rabbit,'-start/2-lc$^0/1-0-',1},
> >>>>>>> {rabbit,start,2}]
> >>>>>>>
> >>>>>>>
> >>>>>>>
> >>>>>>> BOOT FAILED
> >>>>>>> ===========
> >>>>>>>
> >>>>>>> Error description:
> >>>>>>> {could_not_start,rabbit,
> >>>>>>> {bad_return,
> >>>>>>> {{rabbit,start,[normal,[]]},
> >>>>>>> {'EXIT',
> >>>>>>> {rabbit,failure_during_boot,
> >>>>>>> {error,
> >>>>>>> {{badmatch,
> >>>>>>> {error,
> >>>>>>> [{<<188,169,169,0,103,98,111,
> >>>>>>> 64,60,75,49,4,
> >>>>>>> 225,243,167,138>>,
> >>>>>>> "user"},
> >>>>>>> {<<149,242,33,30,66,247,121,15
> >>>>>>> 1,21,73,228,
> >>>>>>> 180,181,31,120,103>>,
> >>>>>>> "replicon"},
> >>>>>>> {<<139,12,224,72,32,237,89,121
> >>>>>>> ,247,232,1,240,
> >>>>>>> 160,86,156,211>>,
> >>>>>>> "*"},
> >>>>>>> {<<135,223,75,10,43,90,124,155
> >>>>>>> ,128,187,197,
> >>>>>>> 170,87,190,157,35>>,
> >>>>>>> "tenant"},
> >>>>>>> {root,none}],
> >>>>>>> ["policy-set-assignment","mod
> >>>>>>> ified"]}},
> >>>>>>> [{rabbit_exchange_type_topic,
> >>>>>>> follow_down_get_path,2},
> >>>>>>> {rabbit_exchange_type_topic,
> >>>>>>> '-remove_bindings/3-lc$^1/1-1-',1},
> >>>>>>> {rabbit_exchange_type_topic,re
> >>>>>>> move_bindings,3},
> >>>>>>> {rabbit_binding,x_callback,4},
> >>>>>>> {rabbit_binding,'-process_dele
> >>>>>>> tions/1-fun-0-',2},
> >>>>>>> {dict,map_bucket,2},
> >>>>>>> {dict,map_bkt_list,2},
> >>>>>>> {dict,map_bkt_list,2}]}}}}}}}
> >>>>>>>
> >>>>>>> Log files (may contain more information):
> >>>>>>> /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>> /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>>
> >>>>>>> {"init terminating in do_boot",{rabbit,failure_durin
> >>>>>>> g_boot,{could_not_start,rabbit,{bad_return,{{rabbit,start,[n
> >>>>>>> ormal,[]]},{'EXIT',{rabbit,failure_during_boot,{error,{{badmatch,{error,[{<<16
> >>>>>>> bytes>>,"user"},{<<16 bytes>>,"replicon"},{<<16 bytes>>,"*"},{<<16
> >>>>>>> bytes>>,"tena
> >>>>>>> nt"},{root,none}],["policy-set-assignment","modified"]}},[{r
> >>>>>>> abbit_exchange_type_topic,follow_down_get_path,2},{rabbit_ex
> >>>>>>> change_type_topic,'-remove_bindings/3-lc$^1/1-1-',1},{rabbit
> >>>>>>> _exchange_type_topic,remove_bindings,3},{rabbit_binding,x_ca
> >>>>>>>> > ########## Logs: /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>>> > ###### ## /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>>> > /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>>> > /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>>> > /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>>> > /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>>> >
> >>>>>>>> > {"init terminating in
> >>>>>>>> > do_boot",{rabbit,failure_during_boot,{could_not_start,rabbit
> >>>>>>>> ,{bad_return,{{rabbit,start,[normal,[]]},{'EXIT',{rabbit,fai
> >>>>>>>> lure_during_boot,{error,{{badmatch,{error,[{<<16
> >>>>>>>> > bytes>>,"user"},{<<16 bytes>>,"replicon"},{<<16 bytes>>,"*"},{<<16
> >>>>>>>> > bytes>>,"tena
> >>>>>>>> > nt"},{root,none}],["policy-set-assignment","modified"]}},[{r
> >>>>>>>> abbit_exchange_type_topic,follow_down_get_path,2},{rabbit_ex
> >>>>>>>> change_type_topic,'-remove_bindings/3-lc$^1/1-1-',1},{rabbit
> >>>>>>>> _exchange_type_topic,remove_bindings,3},{rabbit_binding,x_ca
> >>>>>>>> llback,4},{rabbit_binding,'-pro
> >>>>>>>> > cess_deletions/1-fun-0-',2},{dict,map_bucket,2},{dict,map_bk
> >>>>>>>> t_list,2},{dict,map_bkt_list,2}]}}}}}}}}}
> >>>>>>>> > [FAILED]
> >>>>>>>> >
> >>>>>>>> >
> >>>>>>>> > >>>>>>>>> ########## Logs: /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>>> > >>>>>>>>> ###### ## /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>>> > >>>>>>>>> ##########
> >>>>>>>> > >>>>>>>>> Starting broker...
> >>>>>>>> > >>>>>>>>>
> >>>>>>>> > >>>>>>>>> BOOT FAILED
> >>>>>>>> > >>>>>>>>> ===========
> >>>>>>>> > >>>>>>>>>
> >>>>>>>> > >>>>>>>>> Error description:
> >>>>>>>> > >>>>>>>>> {error,{{badmatch,{error,[{<<1
> >>>>>>>> 49,242,33,30,66,247,121,151,2
> >>>>>>>> > >>>>>>>>> 1,73,228,180,181,
> >>>>>>>> > >>>>>>>>> 31,120,103>>,
> >>>>>>>> > >>>>>>>>> "replicon"},
> >>>>>>>> > >>>>>>>>> {<<139,12,224,72,32,237,89,12
> >>>>>>>> > >>>>>>>>> 1,247,232,1,240,160,
> >>>>>>>> > >>>>>>>>> 86,156,211>>,
> >>>>>>>> > >>>>>>>>> "*"},
> >>>>>>>> > >>>>>>>>> {<<135,223,75,10,43,90,124,15
> >>>>>>>> > >>>>>>>>> 5,128,187,197,170,87,
> >>>>>>>> > >>>>>>>>> 190,157,35>>,
> >>>>>>>> > >>>>>>>>> "tenant"},
> >>>>>>>> > >>>>>>>>> {root,none}],
> >>>>>>>> > >>>>>>>>> ["user","policy-set-assignment
> >>>>>>>> > >>>>>>>>> ","modified"]}},
> >>>>>>>> > >>>>>>>>> [{rabbit_exchange_type_topic,follow_down_get_path,2},
> >>>>>>>> > >>>>>>>>> {rabbit_exchange_type_topic,'-
> >>>>>>>> > >>>>>>>>> remove_bindings/3-lc$^1/1-1-',1},
> >>>>>>>> > >>>>>>>>> {rabbit_exchange_type_topic,remove_bindings,3},
> >>>>>>>> > >>>>>>>>> {rabbit_binding,x_callback,4},
> >>>>>>>> > >>>>>>>>> {rabbit_binding,'-process_deletions/1-fun-0-',2},
> >>>>>>>> > >>>>>>>>> {dict,map_bucket,2},
> >>>>>>>> > >>>>>>>>> {dict,map_bkt_list,2},
> >>>>>>>> > >>>>>>>>> {dict,map_bkt_list,2}]}}
> >>>>>>>> > >>>>>>>>>
> >>>>>>>> > >>>>>>>>> Log files (may contain more information):
> >>>>>>>> > >>>>>>>>> /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>>> > >>>>>>>>> /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>>> > >>>>>>>>> /var/log/rabbitmq/rab...@rabbitmq1.log
> >>>>>>>> > >>>>>>>>> /var/log/rabbitmq/rab...@rabbitmq1-sasl.log
> >>>>>>>> > >>>>>>>>>
> >>>>>>>> > >>>>>>>>> {"init terminating in do_boot",{rabbit,failure_durin
> >>>>>>>> > >>>>>>>>> g_boot,{could_not_start,rabbit
> >>>>>>>> ,{bad_return,{{rabbit,start,[n
> >>>>>>>> > >>>>>>>>> ormal,[]]},{'EXIT',{rabbit,fai
> >>>>>>>> lure_during_boot,{error,{{badmatch,{error,[{<<16
> >>>>>>>> > >>>>>>>>> bytes>>,"replicon"},{<<16 bytes>>,"*"},{<<16
> >>>>>>>> bytes>>,"tenant"},{root,none}]
> >>>>>>>> > >>>>>>>>> ,["us
> >>>>>>>> > >>>>>>>>> er","policy-set-assignment","m
> >>>>>>>> odified"]}},[{rabbit_exchange_
> >>>>>>>> > >>>>>>>>> type_topic,follow_down_get_pat
> >>>>>>>> h,2},{rabbit_exchange_type_top
> >>>>>>>> > >>>>>>>>> ic,'-remove_bindings/3-lc$^1/1
> >>>>>>>> -1-',1},{rabbit_exchange_type_
> >>>>>>>> > >>>>>>>>> topic,remove_bindings,3},{rabb
> >>>>>>>> > >>>> an email to
rabbitmq-user...@googlegroups.com.
> >>>>>>>> > >>>> To post to this group, send email to
> >>>>>>>>
rabbitm...@googlegroups.com.
> >>>>>>>> > >>>> For more options, visit
https://groups.google.com/d/optout.
> >>>>>>>> > >>>>
> >>>>>>>> > >>>
> >>>>>>>> > >>>
> >>>>>>>> > >>>
> >>>>>>>> > >>> --
> >>>>>>>> > >>> MK
> >>>>>>>> > >>>
> >>>>>>>> > >>> Staff Software Engineer, Pivotal/RabbitMQ
> >>>>>>>> > >>>
> >>>>>>>> > >>> --
> >>>>>>>> > >>> You received this message because you are subscribed to the
> >>>>>>>> Google
> >>>>>>>> > >>> Groups "rabbitmq-users" group.
> >>>>>>>> > >>> To unsubscribe from this group and stop receiving emails from
> >>>>>>>> it, send
> >>>>>>>> > >>> an email to
rabbitmq-user...@googlegroups.com.
> >>>>>>>> > >>> To post to this group, send email to
> >>>>>>>>
rabbitm...@googlegroups.com.
> >>>>>>>> > >>> For more options, visit
https://groups.google.com/d/optout.
> >>>>>>>> > >>>
> >>>>>>>> > >>
> >>>>>>>> > >>
> >>>>>>>> > >>
> >>>>>>>> > >> --
> >>>>>>>> > >> Data's inconvienient when people have opinions.
> >>>>>>>> > >>
> >>>>>>>> > >> --
> >>>>>>>> > >> You received this message because you are subscribed to the
> >>>>>>>> Google Groups
> >>>>>>>> > >> "rabbitmq-users" group.
> >>>>>>>> > >> To unsubscribe from this group and stop receiving emails from
> >>>>>>>> it, send an
> >>>>>>>> > >> email to
rabbitmq-user...@googlegroups.com.
> >>>>>>>> > >> To post to this group, send email to
> >>>>>>>>
rabbitm...@googlegroups.com.
> >>>>>>>> > >> For more options, visit
https://groups.google.com/d/optout.
> >>>>>>>> > >>
> >>>>>>>> > >
> >>>>>>>> > >
> >>>>>>>> > >
> >>>>>>>> > > --
> >>>>>>>> > > MK
> >>>>>>>> > >
> >>>>>>>> > > Staff Software Engineer, Pivotal/RabbitMQ
> >>>>>>>> > >
> >>>>>>>> > > --
> >>>>>>>> > > You received this message because you are subscribed to the
> >>>>>>>> Google Groups
> >>>>>>>> > > "rabbitmq-users" group.
> >>>>>>>> > > To unsubscribe from this group and stop receiving emails from
> >>>>>>>> it, send an
> >>>>>>>> > > email to
rabbitmq-user...@googlegroups.com.
> >>>>>>>> > > To post to this group, send email to
> >>>>>>>>
rabbitm...@googlegroups.com.
> >>>>>>>> > > For more options, visit
https://groups.google.com/d/optout.
> >>>>>>>> > >
> >>>>>>>> >
> >>>>>>>> >
> >>>>>>>> >
> >>>>>>>> > --
> >>>>>>>> > Data's inconvienient when people have opinions.
> >>>>>>>> >
> >>>>>>>> > --
> >>>>>>>> > You received this message because you are subscribed to the
> >>>>>>>> Google Groups "rabbitmq-users"
> >>>>>>>> > group.
> >>>>>>>> > To unsubscribe from this group and stop receiving emails from it,
> >>>>>>>> send an email to
rabbitmq-user...@googlegroups.com.