The cluster is set up in Unicast mode, configured for a
single, unique IP address with no affinity in the port
rules.
Three times now, I have had a server just "fall off" the
public (10.0.0.x) network. Most recently it was the
server with cluster Host ID 2. In the event logs of the
servers, they all have a entry for WLBS event ID #65
stating "Initiating convergence on host 'x'. Reason:
Host 'y' is converging for an unknown reason." where 'x'
always refers to the ID of the server whose log I'm
reading and 'y' is one of the other two (but not
consistent). So Host 1 says it's converging on itself
because 3 is converging for an unknown reason, but host 2
say it's converging on itself because 1 in converging for
an unknown reason.
In the end 1 and 3 are online and functioning, and 2 won't
respond over its 10.0.0.x interface (though it will ping
on it's 192.168.1.x interface). NLB manager on 2 shows 2
as the only host participating in the cluster and its
state is "Converging", from which it never changes even
after stopping and starting NLB on that host. NLB manager
on 1 and 3 both show 1 and 3 in the cluster,
both "Converged".
Microsoft's TechNet says that if NLB Event ID 65 is
followed by NLB 28, 29, 71 or 72, then everything's fine.
If not (which is my case) I should "use Network Load
Balancing (NLB) Manager to determine the specific
configuration problem and then correct as appropriate."
Does anyone know what I can do?
Second you issues sounds like there may be a problem with whatever the
10.0.0.x interface is pluggged into (switch?). Try plugging the load
balanced interfaces of all three servers into a dumb hub and uplink the hub
to your switch or router. If you still have a problem post your wlbs
displays from all 3 servers.
--
Thanks,
Marc Reynolds
Microsoft Technical Support
This posting is provided "AS IS" with no warranties, and confers no rights.
"Paul Barber" <anon...@discussions.microsoft.com> wrote in message
news:2d3b01c3e1f4$693b9c50$a301...@phx.gbl...
Thanks for the reply.
>.
>