Thanks a lot for your quick response. I will try this in the live environment, but why does it work perfectly in my test environment? It's a 2-node cluster; below are the traces from both nodes when I stop and start the second node:
First node trace when second node stops
-----------------------------------------------------------------
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Shutdown request of Member [10.1.0.4]:9975 - 2a0ba087-ec28-4025-b621-fac431ac3d00 is handled
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Repartitioning cluster data. Migration tasks count: 233
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] All migration tasks have been completed. (repartitionTime=Sat Jun 17 10:26:18 UTC 2023, plannedMigrations=233, completedMigrations=233, remainingMigrations=0, totalCompletedMigrations=700)
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Connection[id=1, /10.1.0.5:9975->/10.1.0.4:37933, qualifier=null, endpoint=[10.1.0.4]:9975, alive=false, connectionType=MEMBER, planeIndex=0] closed. Reason: Connection closed by the other side
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Connecting to /10.1.0.4:9975, timeout: 10000, bind-any: true
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Could not connect to: /10.1.0.4:9975. Reason: IOException[Connection refused to address /10.1.0.4:9975]
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Connecting to /10.1.0.4:9975, timeout: 10000, bind-any: true
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Could not connect to: /10.1.0.4:9975. Reason: IOException[Connection refused to address /10.1.0.4:9975]
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Connecting to /10.1.0.4:9975, timeout: 10000, bind-any: true
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Could not connect to: /10.1.0.4:9975. Reason: IOException[Connection refused to address /10.1.0.4:9975]
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Connecting to /10.1.0.4:9975, timeout: 10000, bind-any: true
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Could not connect to: /10.1.0.4:9975. Reason: IOException[Connection refused to address /10.1.0.4:9975]
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Removing connection to endpoint [10.1.0.4]:9975 Cause => java.io.IOException {Connection refused to address /10.1.0.4:9975}, Error-Count: 5
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Removing Member [10.1.0.4]:9975 - 2a0ba087-ec28-4025-b621-fac431ac3d00
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Partition balance is ok, no need to repartition.
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6]
Members {size:1, ver:3} [
Member [10.1.0.5]:9975 - 4f3acadc-2e05-4198-bd4f-035c99e8b967 this
]
2023-06-17 10:26:18 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Committing/rolling-back live transactions of [10.1.0.4]:9975, UUID: 2a0ba087-ec28-4025-b621-fac431ac3d00
First node trace when second node starts
-------------------------------------------------------------------
2023-06-17 10:27:29 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Initialized new cluster connection between /10.1.0.5:9975 and /10.1.0.4:52409
2023-06-17 10:27:35 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6]
Members {size:2, ver:4} [
Member [10.1.0.5]:9975 - 4f3acadc-2e05-4198-bd4f-035c99e8b967 this
Member [10.1.0.4]:9975 - 27f888d8-72f8-4f7e-9005-cee7b8814448
]
2023-06-17 10:27:36 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] Repartitioning cluster data. Migration tasks count: 467
2023-06-17 10:27:36 [10.1.0.5]:9975 [ventusproxyCluster] [4.2.6] All migration tasks have been completed. (repartitionTime=Sat Jun 17 10:27:36 UTC 2023, plannedMigrations=467, completedMigrations=467, remainingMigrations=0, totalCompletedMigrations=1167)
Second node trace (starting)
----------------------------------------------
2023-06-17 10:27:27 [LOCAL] [ventusproxyCluster] [4.2.6] Interfaces is enabled, trying to pick one address matching to one of: [10.1.0.5, 10.1.0.5, 10.1.0.4, 10.1.0.4]
2023-06-17 10:27:27 [10.1.0.4]:9975 [ventusproxyCluster] [4.2.6] Hazelcast 4.2.6 (20221125 - 622d299) starting at [10.1.0.4]:9975
2023-06-17 10:27:29 [10.1.0.4]:9975 [ventusproxyCluster] [4.2.6] Using TCP/IP discovery
2023-06-17 10:27:29 [10.1.0.4]:9975 [ventusproxyCluster] [4.2.6] CP Subsystem is not enabled. CP data structures will operate in UNSAFE mode! Please note that UNSAFE mode will not provide strong consistency guarantees.
2023-06-17 10:27:29 [10.1.0.4]:9975 [ventusproxyCluster] [4.2.6] Diagnostics disabled. To enable add -Dhazelcast.diagnostics.enabled=true to the JVM arguments.
2023-06-17 10:27:29 [10.1.0.4]:9975 [ventusproxyCluster] [4.2.6] [10.1.0.4]:9975 is STARTING
2023-06-17 10:27:29 [10.1.0.4]:9975 [ventusproxyCluster] [4.2.6] Initialized new cluster connection between /10.1.0.4:52409 and /10.1.0.5:9975
2023-06-17 10:27:35 [10.1.0.4]:9975 [ventusproxyCluster] [4.2.6]
Members {size:2, ver:4} [
Member [10.1.0.5]:9975 - 4f3acadc-2e05-4198-bd4f-035c99e8b967
Member [10.1.0.4]:9975 - 27f888d8-72f8-4f7e-9005-cee7b8814448 this
]
2023-06-17 10:27:36 [10.1.0.4]:9975 [ventusproxyCluster] [4.2.6] [10.1.0.4]:9975 is STARTED
Thanks,
Joan.