I'm running a master on Solaris (version details below), with about 370 minions, a mix of Linux and Solaris. I know this isn't a huge number, which is why I'm stumped as to the huge CPU load (% utlilized as well as loadavg) I'm seeing when doing highstate runs. Other state runs and one-off execution calls run fine.
The master is on a Solaris 11 logical domain, with 192 VCPUs and 64GB RAM.
/tmp/cpu_test:
file.exists
Running state.apply causes the CPU% to spike to 100, loadavg peaks around 135, and it takes about a minute and a half to complete.
Running state.sls with that same cpu_test SLS, it barely uses any CPU, and completes in about 12 seconds.
There's obviously some serious overhead caused by the state.apply run somewhere.
I know there are sites out there running thousands of minions, so something must be wrong with my setup, or something.
Can anyone point me in a direction to look for the cause? I'll be happy to provide any info needed.
-----------------------------------------------------------------------------------
Salt Version:
Salt: 2016.11.6
Dependency Versions:
cffi: 1.10.0
cherrypy: 11.0.0
dateutil: 2.6.1
docker-py: Not Installed
gitdb: 2.0.2
gitpython: 2.1.5
ioflo: Not Installed
Jinja2: 2.9.6
libgit2: Not Installed
libnacl: 1.5.1
M2Crypto: Not Installed
Mako: Not Installed
msgpack-pure: Not Installed
msgpack-python: 0.4.8
mysql-python: Not Installed
pycparser: 2.18
pycrypto: 2.6.1
pycryptodome: Not Installed
pygit2: Not Installed
Python: 2.7.11 (default, Jul 18 2017, 12:56:12)
python-gnupg: Not Installed
PyYAML: 3.12
PyZMQ: 16.0.2
RAET: Not Installed
smmap: 2.0.3
timelib: 0.2.4
Tornado: 4.5.1
ZMQ: 4.1.6
System Versions:
dist:
machine: sun4v
release: 5.11
system: SunOS
version: Not Installed