But doing stop_app on all cluster nodes, and trying to get other nodes to join the first doesnt work every well.
1. Stop_all all the cluster_nodes )rabbitmqctl stop_app)
2. on the first node, restored the messages - tried to start the node. rabbitmqctl start_app
3. the first node fails to start because, the other nodes are stopped. And doesn't assume to be down.
At this point, i am running out of ideas.. any help with this would help taking this to production! thanks in advance...!!
Log files (may contain more information):
/var/log/rabbitmq/rab...@ec2-54-88-219-40.log
/var/log/rabbitmq/rab...@ec2-54-88-219-40-sasl.log
=INFO REPORT==== 13-Oct-2014::15:17:23 ===
Starting RabbitMQ 3.3.1 on Erlang R14B04
Copyright (C) 2007-2014 GoPivotal, Inc.
=INFO REPORT==== 13-Oct-2014::15:17:23 ===
node : rabbit@ec2-54-88-219-40
home dir : /var/lib/rabbitmq
config file(s) : /etc/rabbitmq/rabbitmq.config
cookie hash : AXRbPCxp7gfZ+OAqKZijEA==
log : /var/log/rabbitmq/rab...@ec2-54-88-219-40.log
sasl log : /var/log/rabbitmq/rab...@ec2-54-88-219-40-sasl.log
database dir : /var/lib/rabbitmq/mnesia/rabbit@ec2-54-88-219-40
=INFO REPORT==== 13-Oct-2014::15:17:24 ===
Limiting to approx 924 file handles (829 sockets)
=INFO REPORT==== 13-Oct-2014::15:17:24 ===
Error description:
{badmatch,{error,mnesia_not_running}}
Log files (may contain more information):
/var/log/rabbitmq/rab...@ec2-54-88-219-40.log
/var/log/rabbitmq/rab...@ec2-54-88-219-40-sasl.log
Stack trace:
[{rabbit_mnesia,init_from_config,0},
{rabbit_mnesia,init,0},
{rabbit,'-run_boot_step/1-lc$^1/1-1-',1},
{rabbit,run_boot_step,1},
{rabbit,'-start/2-lc$^0/1-0-',1},
{rabbit,start,2},
{application_master,start_it_old,4}]
=INFO REPORT==== 13-Oct-2014::15:17:25 ===
Error description:
{could_not_start,rabbit,
{bad_return,
{{rabbit,start,[normal,[]]},
{'EXIT',
{rabbit,failure_during_boot,
{badmatch,{error,mnesia_not_running}}}}}}}
Log files (may contain more information):
/var/log/rabbitmq/rab...@ec2-54-88-219-40.log
/var/log/rabbitmq/rab...@ec2-54-88-219-40-sasl.log