When I try ucx with a high number of workers, it throws port bind error as below.
[1600075627.538112] [v-012:210693:0] sock.c:376 UCX ERROR bind(fd=203 addr=
172.29.200.212:36367) failed: Address already in use
[1600075627.538172] [v-012:210693:0] listener.cc:53 UCX ERROR JUCX: Input/output error
[2020-09-14 05:27:07 -0400] [SEVERE] [worker-130] [Twister2MPIWorker-130] edu.iu.dsc.tws.rsched.schedulers.standalone.MPIWorkerStarter: Uncaught exception in thread Thread[Twister2MPIWorker-130,5,main]. Finalizing this worker...
edu.iu.dsc.tws.api.exceptions.Twister2RuntimeException: Couldn't initialize TWSChannel
at edu.iu.dsc.tws.api.resource.Network.initializeChannel(Network.java:73)
at edu.iu.dsc.tws.api.resource.WorkerEnvironment.<init>(WorkerEnvironment.java:136)
at edu.iu.dsc.tws.api.resource.WorkerEnvironment.init(WorkerEnvironment.java:251)
at edu.iu.dsc.tws.rsched.worker.Twister2WorkerStarter.execute(Twister2WorkerStarter.java:54)
at edu.iu.dsc.tws.rsched.worker.MPIWorkerManager.execute(MPIWorkerManager.java:66)
at edu.iu.dsc.tws.rsched.schedulers.standalone.MPIWorkerStarter.startWorker(MPIWorkerStarter.java:310)
at edu.iu.dsc.tws.rsched.schedulers.standalone.MPIWorkerStarter.startWorkerWithJM(MPIWorkerStarter.java:253)
at edu.iu.dsc.tws.rsched.schedulers.standalone.MPIWorkerStarter.<init>(MPIWorkerStarter.java:161)
at edu.iu.dsc.tws.rsched.schedulers.standalone.MPIWorkerStarter.main(MPIWorkerStarter.java:120)
Caused by: java.lang.reflect.InvocationTargetException
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
at edu.iu.dsc.tws.api.resource.Network.initializeChannel(Network.java:67)
... 8 more
Caused by: org.openucx.jucx.UcxException: Input/output error
at org.openucx.jucx.ucp.UcpListener.createUcpListener(Native Method)
at org.openucx.jucx.ucp.UcpListener.<init>(UcpListener.java:25)
at org.openucx.jucx.ucp.UcpWorker.newListener(UcpWorker.java:49)
at edu.iu.dsc.tws.comms.ucx.TWSUCXChannel.createUXCWorker(TWSUCXChannel.java:100)
at edu.iu.dsc.tws.comms.ucx.TWSUCXChannel.<init>(TWSUCXChannel.java:86)
... 13 more
[2020-09-14 05:27:07 -0400] [WARNING] [-] [JobMaster] edu.iu.dsc.tws.master.server.JMWorkerHandler: Worker [130] Failed.
[2020-09-14 05:27:07 -0400] [SEVERE] [-] [JobMaster] edu.iu.dsc.tws.master.server.WorkerMonitor: Worker: 130 FULLY_FAILED.