What is your smartconnect setup? Connections should be balanced across all of the nodes if smartconnect and your DNS is setup correctly.
On Wed, Apr 10, 2013 at 10:22 AM, GAZ <greg...@yahoo.com> wrote:
We had a node failure yesterday that took down our Isilon cluster. Every connection and 99.5% of the data traffic comes in Node 1. EMC came and installed this several months ago, but I suspect it isn't working as it should. Shouldn't these passive nodes come into play when a node becomes unresponsive? Support restarted the node and it is working now, but again every connection is on Node 1 and it is taking all the traffic. I guess this three node cluster has two passive nodes. It seems like a waste of money. We're on OneFS v6.5.5.16.
--
You received this message because you are subscribed to the Google Groups "Isilon Technical User Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to isilon-user-gr...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
What is your smartconnect setup? Connections should be balanced across all of the nodes if smartconnect and your DNS is setup correctly.
You should have a delegation record (NS) in DNS that points to the smartconnect service IP address. If this is setup correctly the cluster will hand out up addresses of the nodes in your cluster. A simple NSLookup should show you if this is working correctly. If you get different IP addresses each time you issue the lookup it's setup correctly.
In your ip addess pools what is your Connection policy set as and do you have SmartConnect Advanced licensed?
The connection policy should help distribute the clients out to all the nodes. SmartConnect Advanced I believe is needed to rebalance the IP's if a single node goes down.
Also make sure you are not connecting to the smart connect service ip. You need to use the DNS name.
That is your problem right there. You need to troubleshoot the delegation and get that working to evenly balance across the nodes in your cluster. If all of your end users are connecting to a node up address than you most definitely will have 2 passive nodes from a connectivity perspective but all of the nodes are participating in reading and writing of data.You should get the DNS piece squared away so that the cluster responds to dns requests.
Like ketih said if the IP you are querying isn't responding to DNS that's a problem. You might be trying to query the wrong IP. I believe only the service IP will respond to DNS requests all the other IP's will probably just time out if you try to query them. On a 3 node cluster you should have a minimum of 4 IP's. One for the smartconnect name then at least one for each node.
--