It sounds like from this thread, rackspace incorrectly reports accessible servers as pending state.
The only solutions I can think of are:
* remove the server status check and instead only rely on TCP port accessibility
* doublecheck status on PENDING
Any other thoughts?
-A
--
We should talk about it, as right now, rackspace is the only one with this bug.
Even if the socket test is tuned to something we consider just long enough to be valid, it will cause an unnecessary delay on pending or terminated nodes. If someone has hundreds of nodes, this could add up quite a bit.
What I'd suggest is that we:
* raise this bug to rackspace, hopefully in a way they can see for themselves the glitch
* make the health check pluggable and default to current logic
* make a temporary change to rackspace (or module in rackspace) to double-check pending or skip state check
I suppose we could alternatively introduce a check strategy flag..
Wdyt?
-A