Hi Prvn,
We just face this problem, problem is 2 parts: (by the way we are using selenium grid 2 and our hub is linux, we have 2 type of nodes for chrome, 1 linux and 2 windows 2016 server )
1. In linux nodes, when we terminate the instance directly using boto 3, the java process is not able to closing gracefully, so it is not saying goodbye to hub. And hub will try to communicate during "timeout" parameter period. If you use lifecycle shut down for the node before terminating it, it is closing gracefully and saying goodbye to hub (you can see this log in hub "marking the node...." when you call lifecycle servlet shutdown) and hub is lefting this node after some node polling, another parameter group..
2. In windows nodes the same thing is happening. But we are facing a problem with the node shutdown via lifecycle servlet. When we do that we are not able to see any log in the hub. we are waiting "marking the node..." in the hub side. But we observe that the java process in windows node suddenly killed by the lifecycle servlet call and also we are getting http 200 from the node when we call shutdown end point. So we thought that possible problems in windows side can be:
a) It can not be related with security groups settings. Because linux ones are also using same security groups and using same hub.
b) It can be related a firewall rule in windows that is denying to go hub. But it is connecting to hub using again 4444 and it is running while testing. We are not observing any problem while test execution..
c) ...
We didn't find a way to solve the problem in windows nodes.. Any help would be appreciated.
20 Temmuz 2018 Cuma 06:39:41 UTC+3 tarihinde Prvn yazdı: