Ad-Hoc distributed issues

Shriphani Palakodety

Mar 22, 2021, 3:53:34 PM
to tlaplus
Hello folks,

I'm trying to get the distributed TLC setup running from the official tutorial here: https://tla.msr-inria.inria.fr/tlatoolbox/doc/model/distributed-mode.html

On my master node, I started TLAPlus, chose the FourQueens spec, and picked Ad-hoc mode; it now sits at the "waiting for workers" stage.

On the slave node, I followed the instructions. The output suggests it reads a few .tla files from /tmp, does semantic processing for the modules, and then fails with "Connection refused to host: 192.168.1.122", even though everything is open (i.e., master allows incoming TCP from everywhere). I've posted the full trace below.

Any help would be appreciated.

onu@squidroost3:~$ java -Djava.rmi.server.hostname=192.168.1.122 -cp tla2tools.jar tlc2.tool.distributed.TLCWorker 192.168.1.122
TLC Worker Version 2.16 of 31 December 2020
Parsing file /tmp/1616442606603/MC.tla
Parsing file /tmp/1616442606603/Queens.tla
Parsing file /tmp/1616442606603/TLC.tla
Parsing file /tmp/1616442606603/Naturals.tla
Parsing file /tmp/1616442606603/Sequences.tla
Parsing file /tmp/1616442606603/FiniteSets.tla
Semantic processing of module Naturals
Semantic processing of module Sequences
Semantic processing of module Queens
Semantic processing of module FiniteSets
Semantic processing of module TLC
Semantic processing of module MC
Starting... (2021-03-22 12:50:07)
TLC worker with 4 threads ready at: Mon Mar 22 12:50:07 PDT 2021
WARNING: An illegal reflective access operation has occurred
WARNING: Illegal reflective access by tlc2.tool.distributed.TLCWorker (file:/home/onu/tla2tools.jar) to method sun.rmi.server.UnicastRef.getLiveRef()
WARNING: Please consider reporting this to the maintainers of tlc2.tool.distributed.TLCWorker
WARNING: Use --illegal-access=warn to enable warnings of further illegal reflective access operations
WARNING: All illegal access operations will be denied in a future release
Exception in thread "TLCWorkerThread-000" java.lang.RuntimeException: java.rmi.ServerException: RemoteException occurred in server thread; nested exception is:
    java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at tlc2.tool.distributed.TLCWorker$TLCWorkerRunnable.run(TLCWorker.java:473)
    at java.base/java.lang.Thread.run(Thread.java:834)
Caused by: java.rmi.ServerException: RemoteException occurred in server thread; nested exception is:
    java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at java.rmi/sun.rmi.server.UnicastServerRef.dispatch(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport.serviceCall(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport.handleMessages(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$0(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
    at java.rmi/sun.rmi.transport.StreamRemoteCall.exceptionReceivedFromServer(StreamRemoteCall.java:303)
    at java.rmi/sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:279)
    at java.rmi/sun.rmi.server.UnicastRef.invoke(UnicastRef.java:164)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:217)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:162)
    at com.sun.proxy.$Proxy0.registerWorker(Unknown Source)
    at tlc2.tool.distributed.TLCWorker$TLCWorkerRunnable.run(TLCWorker.java:471)
    ... 1 more
Caused by: java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at java.rmi/sun.rmi.transport.tcp.TCPEndpoint.newSocket(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPChannel.createConnection(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPChannel.newConnection(Unknown Source)
    at java.rmi/sun.rmi.server.UnicastRef.invoke(Unknown Source)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(Unknown Source)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invoke(Unknown Source)
    at com.sun.proxy.$Proxy21.getURI(Unknown Source)
    at tlc2.tool.distributed.TLCServer.registerWorker(TLCServer.java:257)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
    at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
    at java.base/java.lang.reflect.Method.invoke(Unknown Source)
    at java.rmi/sun.rmi.server.UnicastServerRef.dispatch(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport.serviceCall(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport.handleMessages(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$0(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
Caused by: java.net.ConnectException: Connection refused
    at java.base/sun.nio.ch.Net.connect0(Native Method)
    at java.base/sun.nio.ch.Net.connect(Unknown Source)
    at java.base/sun.nio.ch.Net.connect(Unknown Source)
    at java.base/sun.nio.ch.NioSocketImpl.connect(Unknown Source)
    at java.base/java.net.SocksSocketImpl.connect(Unknown Source)
    at java.base/java.net.Socket.connect(Unknown Source)
    at java.base/java.net.Socket.connect(Unknown Source)
    at java.base/java.net.Socket.<init>(Unknown Source)
    at java.base/java.net.Socket.<init>(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPDirectSocketFactory.createSocket(Unknown Source)
    ... 25 more
Exception in thread "TLCWorkerThread-002" java.lang.RuntimeException: java.rmi.ServerException: RemoteException occurred in server thread; nested exception is:
    java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at tlc2.tool.distributed.TLCWorker$TLCWorkerRunnable.run(TLCWorker.java:473)
    at java.base/java.lang.Thread.run(Thread.java:834)
Caused by: java.rmi.ServerException: RemoteException occurred in server thread; nested exception is:
    java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at java.rmi/sun.rmi.server.UnicastServerRef.dispatch(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport.serviceCall(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport.handleMessages(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$0(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
    at java.rmi/sun.rmi.transport.StreamRemoteCall.exceptionReceivedFromServer(StreamRemoteCall.java:303)
    at java.rmi/sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:279)
    at java.rmi/sun.rmi.server.UnicastRef.invoke(UnicastRef.java:164)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:217)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:162)
    at com.sun.proxy.$Proxy0.registerWorker(Unknown Source)
    at tlc2.tool.distributed.TLCWorker$TLCWorkerRunnable.run(TLCWorker.java:471)
    ... 1 more
Caused by: java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at java.rmi/sun.rmi.transport.tcp.TCPEndpoint.newSocket(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPChannel.createConnection(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPChannel.newConnection(Unknown Source)
    at java.rmi/sun.rmi.server.UnicastRef.invoke(Unknown Source)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(Unknown Source)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invoke(Unknown Source)
    at com.sun.proxy.$Proxy21.getURI(Unknown Source)
    at tlc2.tool.distributed.TLCServer.registerWorker(TLCServer.java:257)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
    at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
    at java.base/java.lang.reflect.Method.invoke(Unknown Source)
    at java.rmi/sun.rmi.server.UnicastServerRef.dispatch(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport.serviceCall(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport.handleMessages(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$0(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
Caused by: java.net.ConnectException: Connection refused
    at java.base/sun.nio.ch.Net.connect0(Native Method)
    at java.base/sun.nio.ch.Net.connect(Unknown Source)
    at java.base/sun.nio.ch.Net.connect(Unknown Source)
    at java.base/sun.nio.ch.NioSocketImpl.connect(Unknown Source)
    at java.base/java.net.SocksSocketImpl.connect(Unknown Source)
    at java.base/java.net.Socket.connect(Unknown Source)
    at java.base/java.net.Socket.connect(Unknown Source)
    at java.base/java.net.Socket.<init>(Unknown Source)
    at java.base/java.net.Socket.<init>(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPDirectSocketFactory.createSocket(Unknown Source)
    ... 25 more
Exception in thread "TLCWorkerThread-001" java.lang.RuntimeException: java.rmi.ServerException: RemoteException occurred in server thread; nested exception is:
    java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at tlc2.tool.distributed.TLCWorker$TLCWorkerRunnable.run(TLCWorker.java:473)
    at java.base/java.lang.Thread.run(Thread.java:834)
Caused by: java.rmi.ServerException: RemoteException occurred in server thread; nested exception is:
    java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at java.rmi/sun.rmi.server.UnicastServerRef.dispatch(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport.serviceCall(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport.handleMessages(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$0(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
    at java.rmi/sun.rmi.transport.StreamRemoteCall.exceptionReceivedFromServer(StreamRemoteCall.java:303)
    at java.rmi/sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:279)
    at java.rmi/sun.rmi.server.UnicastRef.invoke(UnicastRef.java:164)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:217)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:162)
    at com.sun.proxy.$Proxy0.registerWorker(Unknown Source)
    at tlc2.tool.distributed.TLCWorker$TLCWorkerRunnable.run(TLCWorker.java:471)
    ... 1 more
Caused by: java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at java.rmi/sun.rmi.transport.tcp.TCPEndpoint.newSocket(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPChannel.createConnection(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPChannel.newConnection(Unknown Source)
    at java.rmi/sun.rmi.server.UnicastRef.invoke(Unknown Source)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(Unknown Source)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invoke(Unknown Source)
    at com.sun.proxy.$Proxy21.getURI(Unknown Source)
    at tlc2.tool.distributed.TLCServer.registerWorker(TLCServer.java:257)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
    at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
    at java.base/java.lang.reflect.Method.invoke(Unknown Source)
    at java.rmi/sun.rmi.server.UnicastServerRef.dispatch(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport.serviceCall(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport.handleMessages(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$0(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
Caused by: java.net.ConnectException: Connection refused
    at java.base/sun.nio.ch.Net.connect0(Native Method)
    at java.base/sun.nio.ch.Net.connect(Unknown Source)
    at java.base/sun.nio.ch.Net.connect(Unknown Source)
    at java.base/sun.nio.ch.NioSocketImpl.connect(Unknown Source)
    at java.base/java.net.SocksSocketImpl.connect(Unknown Source)
    at java.base/java.net.Socket.connect(Unknown Source)
    at java.base/java.net.Socket.connect(Unknown Source)
    at java.base/java.net.Socket.<init>(Unknown Source)
    at java.base/java.net.Socket.<init>(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPDirectSocketFactory.createSocket(Unknown Source)
    ... 25 more
Exception in thread "TLCWorkerThread-003" java.lang.RuntimeException: java.rmi.ServerException: RemoteException occurred in server thread; nested exception is:
    java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at tlc2.tool.distributed.TLCWorker$TLCWorkerRunnable.run(TLCWorker.java:473)
    at java.base/java.lang.Thread.run(Thread.java:834)
Caused by: java.rmi.ServerException: RemoteException occurred in server thread; nested exception is:
    java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at java.rmi/sun.rmi.server.UnicastServerRef.dispatch(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport.serviceCall(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport.handleMessages(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$0(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
    at java.rmi/sun.rmi.transport.StreamRemoteCall.exceptionReceivedFromServer(StreamRemoteCall.java:303)
    at java.rmi/sun.rmi.transport.StreamRemoteCall.executeCall(StreamRemoteCall.java:279)
    at java.rmi/sun.rmi.server.UnicastRef.invoke(UnicastRef.java:164)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(RemoteObjectInvocationHandler.java:217)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invoke(RemoteObjectInvocationHandler.java:162)
    at com.sun.proxy.$Proxy0.registerWorker(Unknown Source)
    at tlc2.tool.distributed.TLCWorker$TLCWorkerRunnable.run(TLCWorker.java:471)
    ... 1 more
Caused by: java.rmi.ConnectException: Connection refused to host: 192.168.1.122; nested exception is:
    java.net.ConnectException: Connection refused
    at java.rmi/sun.rmi.transport.tcp.TCPEndpoint.newSocket(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPChannel.createConnection(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPChannel.newConnection(Unknown Source)
    at java.rmi/sun.rmi.server.UnicastRef.invoke(Unknown Source)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invokeRemoteMethod(Unknown Source)
    at java.rmi/java.rmi.server.RemoteObjectInvocationHandler.invoke(Unknown Source)
    at com.sun.proxy.$Proxy21.getURI(Unknown Source)
    at tlc2.tool.distributed.TLCServer.registerWorker(TLCServer.java:257)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at java.base/jdk.internal.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
    at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
    at java.base/java.lang.reflect.Method.invoke(Unknown Source)
    at java.rmi/sun.rmi.server.UnicastServerRef.dispatch(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport$1.run(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.Transport.serviceCall(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport.handleMessages(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.lambda$run$0(Unknown Source)
    at java.base/java.security.AccessController.doPrivileged(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
    at java.base/java.lang.Thread.run(Unknown Source)
Caused by: java.net.ConnectException: Connection refused
    at java.base/sun.nio.ch.Net.connect0(Native Method)
    at java.base/sun.nio.ch.Net.connect(Unknown Source)
    at java.base/sun.nio.ch.Net.connect(Unknown Source)
    at java.base/sun.nio.ch.NioSocketImpl.connect(Unknown Source)
    at java.base/java.net.SocksSocketImpl.connect(Unknown Source)
    at java.base/java.net.Socket.connect(Unknown Source)
    at java.base/java.net.Socket.connect(Unknown Source)
    at java.base/java.net.Socket.<init>(Unknown Source)
    at java.base/java.net.Socket.<init>(Unknown Source)
    at java.rmi/sun.rmi.transport.tcp.TCPDirectSocketFactory.createSocket(Unknown Source)
    ... 25 more



Markus Kuppe

Mar 22, 2021, 4:16:24 PM
to tla...@googlegroups.com
On 22.03.21 12:52, Shriphani Palakodety wrote:
>
> I'm trying to get the distributed TLC setup running from the official
> tutorial here:
> https://tla.msr-inria.inria.fr/tlatoolbox/doc/model/distributed-mode.html
>
> On my master node, I started TLAPlus, chose the FourQueens spec, picked
> Ad-hoc mode and it enters the waiting for workers stage.
>
> On the slave node, I ran through the instructions. The output suggests
> it reads a few .tla files from /tmp, does semantic processing for the
> modules, and then fails because it can't connect to host - everything is
> open (i.e. master allows incoming tcp from everywhere). I've posted a
> full trace below.
>
> Any help would be appreciated.

Hi,

Communication runs in all directions, i.e., the master node will also
connect to workers, and workers connect to each other. Please make sure
that your network setup and firewall rules allow incoming connections on
all nodes.
If the above doesn't address your issue, please include the mapping from
nodes to IP addresses in your follow-up message.
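
One way to test reachability in each direction is a plain TCP connect. The following is a minimal Python sketch, not part of TLC; `can_connect` is a hypothetical helper name, and the host and port passed to it are whatever your own setup uses.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers "connection refused", timeouts, and unreachable hosts.
        return False
```

Run it on each worker against the master and on the master against each worker, for example `can_connect("192.168.1.122", 10997)` from a worker. The port 10997 is an assumption here (it should be TLC's default master port, but check your configuration). A worker's RMI port may be assigned dynamically, so a check against a worker only makes sense for a port you know it is listening on.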

Markus

Shriphani Palakodety

Mar 22, 2021, 4:42:30 PM
to tlaplus
No luck. Here's the firewall setup on master:

onu@squidroost1:~/toolbox$ sudo ufw status verbose
Status: active
Logging: on (low)
Default: allow (incoming), allow (outgoing), disabled (routed)
New profiles: skip

To                         Action      From
--                         ------      ----
22                         ALLOW IN    Anywhere                  
Anywhere                   ALLOW IN    192.168.1.0/24            
Anywhere                   ALLOW IN    0.0.0.0                   
22 (v6)                    ALLOW IN    Anywhere (v6)             

And on the slave node:

onu@squidroost3:~$ sudo ufw status verbose
Status: active
Logging: on (low)
Default: allow (incoming), allow (outgoing), disabled (routed)
New profiles: skip

To                         Action      From
--                         ------      ----
Anywhere                   ALLOW IN    0.0.0.0                   

So all directions should be ok. But I still get this error.

Markus Kuppe

Mar 22, 2021, 11:23:28 PM
to tla...@googlegroups.com

On 22.03.21 13:41, Shriphani Palakodety wrote:
> No luck here's the firewall setup on master:
>
> onu@squidroost1:~/toolbox$ sudo ufw status verbose
> Status: active
> Logging: on (low)
> Default: allow (incoming), allow (outgoing), disabled (routed)
> New profiles: skip
>
> To                         Action      From
> --                         ------      ----
> 22                         ALLOW IN    Anywhere
> Anywhere                   ALLOW IN    192.168.1.0/24
> Anywhere                   ALLOW IN    0.0.0.0
> 22 (v6)                    ALLOW IN    Anywhere (v6)
>
> And on the slave node:
>
> onu@squidroost3:~sudo ufw status verbose
> Status: active
> Logging: on (low)
> Default: allow (incoming), allow (outgoing), disabled (routed)
> New profiles: skip
>
> To                         Action      From
> --                         ------      ----
> Anywhere                   ALLOW IN    0.0.0.0
>
> So all directions should be ok. But I still get this error.


The next thing to check is hostname resolution. Sometimes a node
advertises itself as, e.g., 127.0.1.1, which then causes connections to
fail. Try setting 'java.rmi.server.hostname' to each node's public IP
address, as mentioned in [1].
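
As a rough illustration (a Python sketch, not TLC code; `is_loopback`, `resolved_own_address`, and `worker_command` are hypothetical helper names), the snippet below checks whether an address is a loopback address such as 127.0.1.1 and builds the worker command from the first post with the worker's own address in 'java.rmi.server.hostname'. In that first post the same address, 192.168.1.122, appears both as the property value and as the master argument, so it may be worth confirming that it really is the worker's address. The Python hostname lookup only approximates what the JVM would resolve.

```python
import ipaddress
import socket
from typing import List


def is_loopback(address: str) -> bool:
    """True for addresses in 127.0.0.0/8 (or ::1), e.g. 127.0.1.1."""
    return ipaddress.ip_address(address).is_loopback


def resolved_own_address() -> str:
    """Address the local hostname resolves to (approximation of the JVM's view)."""
    return socket.gethostbyname(socket.gethostname())


def worker_command(own_ip: str, master_ip: str) -> List[str]:
    """Build the TLCWorker command line with the worker's own address advertised."""
    if is_loopback(own_ip):
        raise ValueError(f"{own_ip} is a loopback address; other nodes cannot reach it")
    return [
        "java",
        f"-Djava.rmi.server.hostname={own_ip}",  # address this node advertises
        "-cp", "tla2tools.jar",
        "tlc2.tool.distributed.TLCWorker",
        master_ip,  # address of the master node
    ]
```

For example, `worker_command("192.168.1.50", "192.168.1.122")` yields the command for a worker at 192.168.1.50, where 192.168.1.50 is a made-up address used only for illustration.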

Markus

[1] https://tla.msr-inria.inria.fr/tlatoolbox/doc/model/distributed-mode.html#limitations