I recently tried to run storm-yarn on my cluster, and noticed that the supervisor jobs were asking YARN for a very large amount of memory. Looking at the code, it seems that the supervisor is asking for the MaxResourceCapability defined by the ResourceManager. Is this correct? For some reason, my YARN RM kept saying that it could not satisfy a container of this size. I changed the code for a quick test to use the master.container-size-mb property that is really only for the ApplicationMaster memory size, and it worked well. The supervisors came up without any trouble.
Also, where do the worker logs go to? I can't seem to find it on disk. (I found the supervisor.log, but not the worker-*.log)
One more question :) the worker jobs come up as separate applications, not as YARN resources. Does that mean if I set the worker to -Xmx10G, I need to make sure that the container running the supervisor is set to have 40G+ resource capability in YARN? (10G x possible 4 workers)
Thanks again! This is a very cool project.
Roshan