OK.
Maybe if I walk you through my setup process you can find the error since I'm getting the following error in my log:
2015-08-03 15:17:26.695 [info] <0.7.0> Application disco started on node disco_8989_master@discomaster
2015-08-03 15:17:26.695 [info] <0.97.0>@node_mon:slave_start:109 Starting node "disco_8989_slave" on "discoslave2" ("discoslave2")
2015-08-03 15:17:26.696 [info] <0.96.0>@node_mon:slave_start:109 Starting node "disco_8989_slave" on "discoslave1" ("discoslave1")
2015-08-03 15:17:26.696 [info] <0.95.0>@node_mon:slave_start:109 Starting node "disco_8989_slave" on "discoslave0" ("discoslave0")
2015-08-03 15:17:27.217 [error] <0.134.0> ** Connection attempt from disallowed node disco_8989_slave@discoslave1 **
2015-08-03 15:17:27.243 [error] <0.136.0> ** Connection attempt from disallowed node disco_8989_slave@discoslave2 **
2015-08-03 15:17:27.248 [error] <0.138.0> ** Connection attempt from disallowed node disco_8989_slave@discoslave0 **
The Master (clean Ubuntu trusty64 install)
=========
1. Create 'admin' user group with sudoers, etc
2. Configure sshd
3. Add all disco nodes to host file
4. Install Git and Python 2.7.x
5. Install Erlang
6. Create a 'disco' user in the 'admin' group
7. Upload disco user ssh key and configure ssh properties
8. Upload the common .erlang-cookie to disco user home
9. Set the DISCO_HOME location
10. Checkout disco to DISCO_HOME
11. Do 'make install'
12. Do 'python setup.py install'
13. Make sure disco user has correct permisions for directories (e.g. /usr/var/disco)
14. Insert the slave hostnames and worker counts in to /usr/var/disco/disco_8989.config
15. start disco master
Each Slave (clean Ubuntu trusty64 install)
=========
1. Create 'admin' user group with sudoers, etc
2. Configure sshd
3. Add all disco nodes to host file
4. Install Git and Python 2.7.x
5. Install Erlang
6. Create a 'disco' user in the 'admin' group
7. Upload disco user ssh key and configure ssh properties
8. Upload the common .erlang-cookie to disco user home
9. Set the DISCO_HOME location
10. Checkout disco to DISCO_HOME
11. Do 'make install-node'
12. Do 'python setup.py install'
13. Make sure disco user has correct permisions for directories (e.g. /usr/var/disco)
15. start disco slave
I can see the nodes listed in the Disco Master UI but they have a red bar not a black one.
From the troubleshooting guid I've tried the following:
* Can ssh without password from any disco node to any other
* Disco installed in same path on all machines
* Same .erlang-cookie on all machines with permissions 0400
* slave:start(localhost, "testnode") and 'net_adm:ping(testnode@localhost)' work as expected
* hostnames are resolved using hosts file not DNS - this is tricky to change
* ssh localhost "python DISCO_HOME/lib/disco/worker/classic/worker.py" - returns the expected response.
To me it looks like there's still some issue with the master talking to the slaves but I'm struggling to see what it might be - any suggestions?
Thanks
James