Cluster not starting anymore

Fabrice Le Dorze

unread,

May 21, 2024, 4:46:16 AM5/21/24

to Wazuh | Mailing List

Hi all

We have a Wazuh infra with one dashboard and a cluster of 2 nodes indexed/manager (on Prem, one in cloud)

By mistake, we broke nodes by installing wazuh-agent.
We could restore OmPrem node.
But we had to re-install manager and indexer with install script on Cloud node.
Since , cluster does not start anymore and on Dashboard side, it says " Wazuh dashboard server is not ready yet "

on OnPrem node :

wazuh-indexer 4.7.3-1
wazuh-manager 4.7.3-1

/var/ossec/bin/cluster_control -l
NAME TYPE VERSION ADDRESS
wazuh-node-1-onprem master 4.7.3 10.15.100.131

In /var/log/wazuh-indexer/wazuh-indexer-cluster.log,

[wazuh-node-1-vic] cluster-manager not discovered or elected yet, an election requires a node with id [q1epkbDbRFaBx2RbwSm4Rw], have discovered [{wazuh-node-1-vic}{-PnONKBfSWamy3zqDkpedw}{XYjqeHKST5qd5z5jnQEGYg}{10.15.100.131}{10.15.100.131:9300}{dimr}{shard_indexing_pressure_enabled=true}, {wazuh-node-1-az}{bU5Q1VnXQ9-xBLAUFWlSvA}{XUBfnlAdR5CS7DJ2wmDCNA}{10.205.5.131}{10.205.5.131:9300}{dimr}{shard_indexing_pressure_enabled=true}] which is not a quorum; discovery will continue using [10.205.5.131:9300] from hosts providers and [{wazuh-node-1-vic}{-PnONKBfSWamy3zqDkpedw}{XYjqeHKST5qd5z5jnQEGYg}{10.15.100.131}{10.15.100.131:9300}{dimr}{shard_indexing_pressure_enabled=true}] from last-known cluster state; node term 17, last-accepted version 114202 in term 17

on Cloud node :
wazuh-indexer 4.7.2-1
wazuh-manager 4.7.2-1

/var/ossec/bin/cluster_control -l
ERROR: Error 3012 - Cluster is not running

In /var/log/wazuh-indexer/wazuh-indexer-cluster.log,
[wazuh-node-1-az] cluster-manager not discovered or elected yet, an election requires two nodes with ids [-PnONKBfSWamy3zqDkpedw, bU5Q1VnXQ9-xBLAUFWlSvA], have discovered [{wazuh-node-1-az}{bU5Q1VnXQ9-xBLAUFWlSvA}{XUBfnlAdR5CS7DJ2wmDCNA}{10.205.5.131}{10.205.5.131:9300}{dimr}{shard_indexing_pressure_enabled=true}, {wazuh-node-1-vic}{-PnONKBfSWamy3zqDkpedw}{XYjqeHKST5qd5z5jnQEGYg}{10.15.100.131}{10.15.100.131:9300}{dimr}{shard_indexing_pressure_enabled=true}] which is a quorum; discovery will continue using [10.15.100.131:9300] from hosts providers and [{wazuh-node-1-az}{bU5Q1VnXQ9-xBLAUFWlSvA}{XUBfnlAdR5CS7DJ2wmDCNA}{10.205.5.131}{10.205.5.131:9300}{dimr}{shard_indexing_pressure_enabled=true}] from last-known cluster state; node term 0, last-accepted version 0 in term 0

I understand well, on OnPrem side, an ID q1epkbDbRFaBx2RbwSm4Rw is required but it does fit the current nodes ID ( OnPrem : -PnONKBfSWamy3zqDkpedw , Cloud : bU5Q1VnXQ9-xBLAUFWlSvA )

Is it the reason why the cluster does not start anymore ?
How to fix it ?

Or the root cause is something else ?
Thx

Rolly Davany Mougoue Kakanou

unread,

May 21, 2024, 7:52:40 PM5/21/24

to Wazuh | Mailing List

Hello Fabrice,

If you had to reinstall the cloud node using the install script, you will need to regenerate certificates for the new installation using the pre-existing root CA. This is the root-ca.pem and root-ca.key created during the initial install. You can follow the steps provided in the following sections of the Upscaling documentation to help you.

https://documentation.wazuh.com/current/user-manual/upscaling/adding-indexer-node.html#distributed-deployment

https://documentation.wazuh.com/current/user-manual/upscaling/adding-server-node.html#distributed-deployment

Best Regards,

Rolly Mougoue

Rolly Davany Mougoue Kakanou

unread,

May 21, 2024, 7:56:21 PM5/21/24

to Wazuh | Mailing List

Also I can see your Wazuh versions for On Prem installation and on Cloud node are different. Please note that the Wazuh central components must share the same version numbers down to the patch category on both nodes.

https://documentation.wazuh.com/current/upgrade-guide/compatibility-matrix/index.html

Reply all

Reply to author

Forward