Sep:26,18:06:59,255 INFO (:Incoming-11,ee-interserver,172.25.14.2:) [org.infinispan.remoting.transport.jgroups.JGroupsTransport] ISPN000094: Received new cluster view for channel interserver: [172.25.14.2|13] (6) [172.25.14.2, 172.25.11.2, 172.25.5.2, 172.25.3.2, 172.25.22.2, 172.25.1.2]Sep:26,18:12:27,087 FINER (:TransferQueueBundler,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: sending 1 msgs (124 bytes (0.40 of max_bundle_size) to 1 dests(s): [ee-interserver:172.25.22.2]
Sep:26,18:12:27,112 FINER (:OOB-10,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: received [dst: 172.25.14.2, src: 172.25.22.2 (3 headers), size=0 bytes, flags=OOB], headers are GMS: GmsHeader[LEAVE_REQ]: mbr=172.25.22.2, UNICAST3: DATA, seqno=10, conn_id=11, UDP: [cluster_name=ee-interserver]
Sep:26,18:12:27,112 FINER (:OOB-10,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UNICAST3] 172.25.14.2 <-- DATA(172.25.22.2: #10, conn_id=11)
Sep:26,18:12:27,112 FINER (:OOB-10,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UNICAST3] 172.25.14.2: delivering 172.25.22.2#10
Sep:26,18:12:27,162 FINER (:ViewHandler,ee-interserver,172.25.14.2:) [org.jgroups.protocols.pbcast.GMS] 172.25.14.2: joiners=[], suspected=[], leaving=[172.25.22.2], new view: [172.25.14.2|14] (5) [172.25.14.2, 172.25.11.2, 172.25.5.2, 172.25.3.2, 172.25.1.2]
Sep:26,18:12:27,162 FINER (:ViewHandler,ee-interserver,172.25.14.2:) [org.jgroups.protocols.pbcast.GMS] 172.25.14.2: sending LEAVE response to 172.25.22.2
Sep:26,18:12:27,162 FINER (:ViewHandler,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: sending msg to 172.25.22.2, src=172.25.14.2, headers are GMS: GmsHeader[LEAVE_RSP], UDP: [cluster_name=ee-interserver]
Sep:26,18:12:27,162 FINER (:ViewHandler,ee-interserver,172.25.14.2:) [org.jgroups.protocols.pbcast.GMS] 172.25.14.2: mcasting view [172.25.14.2|14] (5) [172.25.14.2, 172.25.11.2, 172.25.5.2, 172.25.3.2, 172.25.1.2] (5 mbrs)
Sep:26,18:12:27,164 FINER (:Incoming-18,ee-interserver,172.25.14.2:) [org.jgroups.protocols.pbcast.GMS] 172.25.14.2: received delta view [172.25.14.2|14], ref-view=[172.25.14.2|13], left=[172.25.22.2]
Sep:26,18:12:27,164 FINE (:Incoming-18,ee-interserver,172.25.14.2:) [org.jgroups.protocols.pbcast.GMS] 172.25.14.2: installing view [172.25.14.2|14] (5) [172.25.14.2, 172.25.11.2, 172.25.5.2, 172.25.3.2, 172.25.1.2]
Sep:26,18:12:27,164 FINER (:Incoming-18,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UNICAST3] 172.25.14.2: closing connections of non members [172.25.22.2]
Sep:26,18:12:27,164 FINE (:Incoming-18,ee-interserver,172.25.14.2:) [org.jgroups.protocols.pbcast.NAKACK2] 172.25.14.2: removed 172.25.22.2 from xmit_table (not member anymore)
Sep:26,18:12:27,165 FINER (:Incoming-18,ee-interserver,172.25.14.2:) [org.jgroups.protocols.FRAG2] 172.25.14.2: removed 172.25.22.2 from fragmentation table
Sep:26,18:12:27,180 FINER (:Timer-2,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: sending msg to 172.25.22.2, src=172.25.14.2, headers are UNICAST3: ACK, seqno=10, conn_id=11, ts=22, UDP: [cluster_name=ee-interserver]
Sep:26,18:12:27,180 FINER (:TransferQueueBundler,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: sending 1 msgs (70 bytes (0.23 of max_bundle_size) to 1 dests(s): [ee-interserver:172.25.22.2]
(...)
Sep:26,18:12:31,691 FINER (:Timer-2,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UNICAST3] 172.25.14.2 --> XMIT(172.25.22.2: #3)
Sep:26,18:12:31,691 FINER (:Timer-2,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: sending msg to 172.25.22.2, src=172.25.14.2, headers are RequestCorrelator: id=200, type=RSP, id=12, rsp_expected=false, FORK: ee-interserver:interserver, UNICAST3: DATA, seqno=3, conn_id=6, UDP: [cluster_name=ee-interserver]
Sep:26,18:12:32,132 FINER (:INT-2,ee-interserver,172.25.14.2:) [org.jgroups.protocols.pbcast.STABLE] 172.25.14.2: discarded STABILITY message with different view-id [172.25.16.2|0] (my view-id=[172.25.14.2|14] (5) [172.25.14.2, 172.25.11.2, 172.25.5.2, 172.25.3.2, 172.25.1.2])Sep:26,18:12:32,192 FINER (:Timer-2,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UNICAST3] 172.25.14.2 --> XMIT(172.25.22.2: #3)Sep:26,18:12:32,192 FINER (:Timer-2,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: sending msg to 172.25.22.2, src=172.25.14.2, headers are RequestCorrelator: id=200, type=RSP, id=12, rsp_expected=false, FORK: ee-interserver:interserver, UNICAST3: DATA, seqno=3, conn_id=6, UDP: [cluster_name=ee-interserver]Sep:26,18:12:32,192 FINER (:TransferQueueBundler,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: sending 1 msgs (124 bytes (0.40 of max_bundle_size) to 1 dests(s): [ee-interserver:172.25.22.2]Sep:26,18:12:32,275 FINER (:INT-1,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: received [dst: <null>, src: 172.25.1.2 (2 headers), size=0 bytes, flags=INTERNAL], headers are MERGE3: INFO: view_id=[172.25.14.2|14], logical_name=172.25.1.2, physical_addr=172.25.1.2:55201, UDP: [cluster_name=ee-interserver]Sep:26,18:12:32,571 FINER (:INT-2,ee-interserver,172.25.14.2:) [org.jgroups.protocols.UDP] 172.25.14.2: received [dst: 172.25.14.2, src: 172.25.3.2 (2 headers), size=103 bytes, flags=OOB|NO_RELIABILITY|INTERNAL], headers are STABLE: [STABLE_GOSSIP] view-id= [172.25.14.2|14], UDP: [cluster_name=ee-interserver]Sep:26,18:12:32,571 FINER (:INT-2,ee-interserver,172.25.14.2:) [org.jgroups.protocols.pbcast.STABLE] 172.25.14.2: handling digest from 172.25.3.2:
Sep:26,18:12:48,707 FINER (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.STABLE] 172.25.22.2: stable task started
Sep:26,18:12:48,756 FINE (:MSC service thread 1-5:) [org.jgroups.protocols.UDP] sockets will use interface 172.25.22.2
Sep:26,18:12:48,762 FINE (:MSC service thread 1-5:) [org.jgroups.protocols.UDP] socket information:
mcast_addr=239.249.0.82:45691, bind_addr=/172.25.22.2, ttl=24
sock: bound to 172.25.22.2:55201, receive buffer size=20000000, send buffer size=640000
mcast_sock: bound to 172.25.22.2:45691, send buffer size=640000, receive buffer size=25000000
Sep:26,18:12:48,775 FINER (:MSC service thread 1-5:) [org.jgroups.protocols.UDP] 172.25.22.2: sending msg to null, src=172.25.22.2, headers are PING: [type=GET_MBRS_REQ, cluster=ee-interserver], UDP: [cluster_name=ee-interserver]
Sep:26,18:12:48,937 INFO (:ServerService Thread Pool -- 8:) [org.jboss.as.jpa] WFLYJPA0010: Starting Persistence Unit (phase 2 of 2) Service 'com.barco.nms.server.ear#firebird'
Sep:26,18:12:48,978 FINER (:TransferQueueBundler,ee-interserver,172.25.22.2:) [org.jgroups.protocols.UDP] 172.25.22.2: sending 1 msgs (111 bytes (0.36 of max_bundle_size) to 1 dests(s): [ee-interserver]
(...)
Sep:26,18:12:51,421 FINER (:INT-1,ee-interserver,172.25.22.2:) [org.jgroups.protocols.UDP] 172.25.22.2: received [dst: <null>, src: 172.25.14.2 (2 headers), size=0 bytes, flags=INTERNAL], headers are FD_ALL: heartbeat, UDP: [cluster_name=ee-interserver]
Sep:26,18:12:51,425 FINER (:INT-2,ee-interserver,172.25.22.2:) [org.jgroups.protocols.UDP] 172.25.22.2: received [dst: <null>, src: 0591b4e3-74b3-7655-1d43-74e949444c2a (2 headers), size=0 bytes, flags=INTERNAL], headers are FD_ALL: heartbeat, UDP: [cluster_name=ee-interserver]
Sep:26,18:12:51,568 FINER (:INT-1,ee-interserver,172.25.22.2:) [org.jgroups.protocols.UDP] 172.25.22.2: received [dst: <null>, src: eb6746f9-ad4f-d89f-7522-95e2e9a90070 (2 headers), size=0 bytes, flags=INTERNAL], headers are FD_ALL: heartbeat, UDP: [cluster_name=ee-interserver]
Sep:26,18:12:51,777 FINER (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.GMS] 172.25.22.2: no members discovered after 3007 ms: creating cluster as first member
Sep:26,18:12:51,781 FINE (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.NAKACK2]
[172.25.22.2 setDigest()]
existing digest: []
new digest: 172.25.22.2: [0 (0)]
resulting digest: 172.25.22.2: [0 (0)]
Sep:26,18:12:51,781 FINE (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.GMS] 172.25.22.2: installing view [172.25.22.2|0] (1) [172.25.22.2]
Sep:26,18:12:51,783 FINE (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.STABLE] resuming message garbage collection
Sep:26,18:12:51,784 FINE (:MSC service thread 1-5:) [org.jgroups.protocols.FD_SOCK] 172.25.22.2: VIEW_CHANGE received: [172.25.22.2]
Sep:26,18:12:51,788 FINER (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.STABLE] 172.25.22.2: reset digest to 172.25.22.2: [-1]
Sep:26,18:12:51,789 FINER (:MSC service thread 1-5:) [org.jgroups.protocols.UFC] new membership: [172.25.22.2]
Sep:26,18:12:51,790 FINER (:MSC service thread 1-5:) [org.jgroups.protocols.MFC] new membership: [172.25.22.2]
Sep:26,18:12:51,791 FINER (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.STABLE] 172.25.22.2: reset digest to 172.25.22.2: [-1]
Sep:26,18:12:51,791 FINE (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.STABLE] resuming message garbage collection
Sep:26,18:12:51,792 FINE (:MSC service thread 1-5:) [org.jgroups.protocols.pbcast.GMS] 172.25.22.2: created cluster (first member). My view is [172.25.22.2|0], impl is org.jgroups.protocols.pbcast.CoordGmsImpl
Sep:26,18:12:54,422 FINER (:INT-1,ee-interserver,172.25.22.2:) [org.jgroups.protocols.UDP] 172.25.22.2: received [dst: <null>, src: 172.25.14.2 (2 headers), size=0 bytes, flags=INTERNAL], headers are FD_ALL: heartbeat, UDP: [cluster_name=ee-interserver]
Sep:26,18:12:55,093 FINER (:INT-2,ee-interserver,172.25.22.2:) [org.jgroups.protocols.UDP] 172.25.22.2: received [dst: <null>, src: 172.25.14.2 (2 headers), size=103 bytes, flags=OOB|NO_RELIABILITY|INTERNAL], headers are STABLE: [STABILITY] view-id= [172.25.14.2|14], UDP: [cluster_name=ee-interserver]
Sep:26,18:12:55,094 FINER (:INT-2,ee-interserver,172.25.22.2:) [org.jgroups.protocols.pbcast.STABLE] 172.25.22.2: discarded STABILITY message with different view-id [172.25.14.2|14] (my view-id=[172.25.22.2|0] (1) [172.25.22.2])
<stack name="udp-interserver">
<transport type="UDP" socket-binding="jgroups-interserver-udp">
<property name="ip_ttl">24</property>
<property name="log_discard_msgs">false</property>
</transport>
<protocol type="PING"/>
<protocol type="MERGE3"/>
<protocol type="FD_SOCK" socket-binding="jgroups-udp-fd"/>
<protocol type="FD_ALL"/>
<protocol type="VERIFY_SUSPECT"/>
<protocol type="pbcast.NAKACK2"/>
<protocol type="UNICAST3"/>
<protocol type="pbcast.STABLE"/>
<protocol type="pbcast.GMS"/>
<protocol type="UFC"/>
<protocol type="MFC"/>
<protocol type="FRAG2"/>
<protocol type="RSVP"/>
</stack>
// we won't handle the stable_digest, if its members don't match the membership in my own digest,
// this is part of the fix for the NAKACK problem (bugs #943480 and #938584)
if(!view_id.equals(this.view.getViewId())) {
log.trace("%s: discarded STABILITY message with different view-id %s (my view-id=%s)",
local_addr, view_id, view);
// resetDigest();
return;
}
log.trace("%s: received stability msg from %s: %s", local_addr, sender, printDigest(stable_digest));
num_stability_msgs_received++;
resetDigest();> <mailto:jgroups-dev+unsub...@googlegroups.com>.
FINER: 10.0.75.1: discarded STABILITY message with different view-id [10.202.86.37|0] (my view-id=MergeView::[10.0.75.1|5] (2) [10.0.75.1, 10.202.86.33], 1 subgroups: [10.202.86.33|1] (2) [10.202.86.33, 10.0.75.1])”
public TestAppJgroups() throws Exception{
channel =new JChannel("udp.xml");
}
public static void main(String[] args) {
System.setProperty("java.net.preferIPv4Stack" , "true");
if (args == null || args.length == 0) {
throw new RuntimeException("please specify the port for the webserver");
}
logger.log(Level.WARNING, "starting ");
int port = Integer.parseInt(args[0]);
try {
TestAppJgroups app = new TestAppJgroups();
HttpServer server = HttpServer.create(new InetSocketAddress(port), 0);
//I add a webserver here so that I can manually stop the app (system.exit) and request the view of the channel
//
server.start();
app.start();
}catch (Exception e) {
e.printStackTrace();
}
}
public void start() throws Exception{
channel.setReceiver(this);
channel.setName(Inet4Address.getLocalHost().getHostAddress());
channel.connect("jgroups-test-cluster");
eventLoop(); //as in http://www.jgroups.org/tutorial/
channel.close();
}
<config xmlns="urn:org:jgroups" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="urn:org:jgroups http://www.jgroups.org/schema/jgroups.xsd">
<UDP mcast_addr="${jboss.partition.udpGroup:239.249.0.99}"
mcast_port="${jboss.hapartition.mcast_port:45566}"
tos="8"
ucast_recv_buf_size="20000000"
ucast_send_buf_size="640000"
mcast_recv_buf_size="25000000"
mcast_send_buf_size="640000"
loopback="false"
discard_incompatible_packets="true"
enable_bundling="false"
max_bundle_size="64000"
max_bundle_timeout="30"
ip_ttl="${jgroups.udp.ip_ttl:10}"/>
<PING/>
<MERGE3/>
<FD_SOCK/>
<FD_ALL/>
<VERIFY_SUSPECT/>
<pbcast.NAKACK2/>
<UNICAST3/>
<pbcast.STABLE/>
<pbcast.GMS/>
<UFC/>
<MFC/>
<FRAG2/>
<RSVP/>
</config>Oct 02, 2018 2:26:43 PM org.jgroups.protocols.pbcast.STABLE handleStabilityMessage
FINER: 10.202.86.33: PATCH: NOT discarded STABILITY message with different view-id [10.202.86.37|0] (my view-id=[10.202.86.33|0])
Oct 02, 2018 2:26:43 PM org.jgroups.protocols.pbcast.STABLE handleStabilityMessage
FINER: 10.202.86.33: received stability msg from 10.202.86.37:
Oct 02, 2018 2:26:43 PM org.jgroups.protocols.pbcast.STABLE resetDigest
FINER: 10.202.86.33: reset digest to 10.202.86.33: [-1]
Oct 02, 2018 2:26:43 PM org.jgroups.protocols.pbcast.NAKACK2 stable
FINER: 10.202.86.33: received stable digest 10.202.86.37: [0 (0)]
Oct 02, 2018 2:26:47 PM org.jgroups.protocols.TP passMessageUp
FINER: 10.202.86.33: received [dst: <null>, src: 10.202.86.37 (2 headers), size=0 bytes, flags=INTERNAL], headers are MERGE3: INFO: view_id=[10.202.86.37|0], logical_name=10.202.86.37, physical_addr=172.25.8.2:62444, UDP: [cluster_name=interOR-test-cluster]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.TP passMessageUp
FINER: 10.202.86.33: received [dst: <null>, src: 10.202.87.31 (2 headers), size=0 bytes, flags=INTERNAL], headers are MERGE3: INFO: view_id=[10.202.87.31|22], logical_name=10.202.87.31, physical_addr=172.25.20.2:64271, UDP: [cluster_name=interOR-test-cluster]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.pbcast.STABLE sendStableMessage
FINER: 10.202.86.33: sending stable msg to 10.202.86.33: 10.202.86.33: [0]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.TP down
FINER: 10.202.86.33: sending msg to 10.202.86.33, src=10.202.86.33, headers are STABLE: [STABLE_GOSSIP] view-id= [10.202.86.33|0], UDP: [cluster_name=interOR-test-cluster]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.TP loopback
FINER: 10.202.86.33: looping back message [dst: 10.202.86.33, src: 10.202.86.33 (2 headers), size=21 bytes, flags=OOB|NO_RELIABILITY|INTERNAL]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.TP passMessageUp
FINER: 10.202.86.33: received [dst: 10.202.86.33, src: 10.202.86.33 (2 headers), size=21 bytes, flags=OOB|NO_RELIABILITY|INTERNAL], headers are STABLE: [STABLE_GOSSIP] view-id= [10.202.86.33|0], UDP: [cluster_name=interOR-test-cluster]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.pbcast.STABLE updateLocalDigest
FINER: 10.202.86.33: handling digest from 10.202.86.33:
mine: 10.202.86.33: [-1]
other: 10.202.86.33: [0]
result: 10.202.86.33: [0]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.pbcast.STABLE resetDigest
FINER: 10.202.86.33: reset digest to 10.202.86.33: [-1]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.pbcast.NAKACK2 stable
FINER: 10.202.86.33: received stable digest 10.202.86.33: [0 (0)]
Oct 02, 2018 2:26:52 PM org.jgroups.protocols.pbcast.NAKACK2 stable
FINER: 10.202.86.33: deleting msgs <= 0 from 10.202.86.33// we won't handle the stable_digest, if its members don't match the membership in my own digest,
// this is part of the fix for the NAKACK problem (bugs #943480 and #938584)I'm going to be on PTO next week, so I won't be able to look into this
> <mailto:jgroups-dev+unsub...@googlegroups.com>.
Caused by: java.lang.VerifyError: Bad return type
Exception Details:
Location:
org/jboss/as/clustering/jgroups/JChannelFactory.createChannel(Ljava/lang/String;)Lorg/jgroups/Channel; @541: areturn
Reason:
Type 'org/jgroups/JChannel' (current frame, stack[0]) is not assignable to 'org/jgroups/Channel' (from method signature)
<UDP mcast_addr="${jboss.partition.udpGroup:239.249.0.99}"
mcast_port="${jboss.hapartition.mcast_port:45566}"
tos="8"
ucast_recv_buf_size="50M"
ucast_send_buf_size="50M"
mcast_recv_buf_size="50M"
mcast_send_buf_size="50M"
max_bundle_size="64K"
ip_ttl="${jgroups.udp.ip_ttl:10}"/>
<PING/>
<MERGE3 max_interval="30000"
min_interval="10000"/>
<FD_SOCK/>
<FD_ALL/>
<VERIFY_SUSPECT timeout="5000" />
<pbcast.NAKACK2 xmit_interval="500"
xmit_table_num_rows="100"
xmit_table_msgs_per_row="2000"
xmit_table_max_compaction_time="30000"
use_mcast_xmit="false"
discard_delivered_msgs="true"/>
<UNICAST3 xmit_interval="500"
xmit_table_num_rows="100"
xmit_table_msgs_per_row="2000"
xmit_table_max_compaction_time="60000"
conn_expiry_timeout="0"/>
<pbcast.STABLE desired_avg_gossip="50000"
max_bytes="4M"/>
<pbcast.GMS print_local_addr="true" join_timeout="2000"/>
<UFC max_credits="2M"
min_threshold="0.4"/>
<MFC max_credits="2M"
min_threshold="0.4"/>
<FRAG2 frag_size="60K" />
<RSVP resend_interval="2000" timeout="10000"/>
</config>
> > <mailto:jgroups-dev+unsub...@googlegroups.com>.
> > To post to this group, send email to
> jgrou...@googlegroups.com
> > <mailto:jgrou...@googlegroups.com>.
> > To view this discussion on the web visit
> >
> https://groups.google.com/d/msgid/jgroups-dev/56a93b4d-85e7-451e-813b-66f0e303b30a%40googlegroups.com
> <https://groups.google.com/d/msgid/jgroups-dev/56a93b4d-85e7-451e-813b-66f0e303b30a%40googlegroups.com>
>
> >
> <https://groups.google.com/d/msgid/jgroups-dev/56a93b4d-85e7-451e-813b-66f0e303b30a%40googlegroups.com?utm_medium=email&utm_source=footer
> <https://groups.google.com/d/msgid/jgroups-dev/56a93b4d-85e7-451e-813b-66f0e303b30a%40googlegroups.com?utm_medium=email&utm_source=footer>>.
>
> > For more options, visit
> https://groups.google.com/d/optout
> <https://groups.google.com/d/optout>.
>
> --
> Bela Ban | http://www.jgroups.org
>
> --
> You received this message because you are subscribed to the Google
> Groups "jgroups-dev" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to jgroups-dev...@googlegroups.com
> <mailto:jgroups-dev+unsub...@googlegroups.com>.
On 27/09/18 12:38 PM, Thomas Houtekier wrote:
> Maybe interesting to add is that this happens on one particular setup.
So what's the difference between the 2 setups? Same code base, same
config, same network?
You're saying that this works on one setup, but not on another one?
> We have been trying to reproduce this situation on another setup but
> have never seen this problem, even with far more nodes.
If everything is the same, but the network/OS(?) is different, I'm
inclined to believe that this is a network issue...
> On a setup with 35 nodes we ran a test that constantly randomly
> restarts nodes and verifies if the group is complete again (after the
> restart). It ran fine for multiple days.
Was this with 3.6.4, or with 4.0.x?
> <mailto:jgroups-dev+unsub...@googlegroups.com>.
> > <mailto:jgroups-dev+unsub...@googlegroups.com <javascript:>>.
> > To post to this group, send email to jgrou...@googlegroups.com
> <javascript:>
> > <mailto:jgrou...@googlegroups.com <javascript:>>.
> > To view this discussion on the web visit
> >
> https://groups.google.com/d/msgid/jgroups-dev/56a93b4d-85e7-451e-813b-66f0e303b30a%40googlegroups.com
> <https://groups.google.com/d/msgid/jgroups-dev/56a93b4d-85e7-451e-813b-66f0e303b30a%40googlegroups.com>
>
> >
> <https://groups.google.com/d/msgid/jgroups-dev/56a93b4d-85e7-451e-813b-66f0e303b30a%40googlegroups.com?utm_medium=email&utm_source=footer
> <https://groups.google.com/d/msgid/jgroups-dev/56a93b4d-85e7-451e-813b-66f0e303b30a%40googlegroups.com?utm_medium=email&utm_source=footer>>.
>
> > For more options, visit https://groups.google.com/d/optout
> <https://groups.google.com/d/optout>.
>
> --
> Bela Ban | http://www.jgroups.org
>
> --
> You received this message because you are subscribed to the Google
> Groups "jgroups-dev" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to jgroups-dev...@googlegroups.com
> <mailto:jgroups-dev+unsub...@googlegroups.com>.
Line 1: Oct 05, 2018 12:13:30 PM testinteror.jgroups.TestAppJgroups mainWARNING: starting
Line 2160: Oct 05, 2018 12:32:06 PM testinteror.jgroups.TestAppJgroups mainWARNING: starting Line 1: Oct 05, 2018 12:17:36 PM testinteror.jgroups.TestAppJgroups mainWARNING: starting
Line 1333: Oct 05, 2018 12:26:12 PM testinteror.jgroups.TestAppJgroups mainWARNING: starting
Line 1608: Oct 05, 2018 12:28:38 PM testinteror.jgroups.TestAppJgroups mainWARNING: starting Line 1: Oct 05, 2018 12:22:29 PM testinteror.jgroups.TestAppJgroups mainWARNING: starting
Line 685: Oct 05, 2018 12:30:45 PM testinteror.jgroups.TestAppJgroups mainWARNING: starting
Line 1813: Oct 05, 2018 12:34:01 PM testinteror.jgroups.TestAppJgroups mainWARNING: starting Oct 05, 2018 12:18:06 PM org.jgroups.protocols.FlowControl handleViewChange
FINER: new membership: [172.25.8.2, 172.25.13.2]Oct 05, 2018 12:18:27 PM org.jgroups.protocols.pbcast.Merger$MergeTask consolidateMergeData
FINER: 172.25.8.2: consolidated view=MergeView::[172.25.8.2|1] (2) [172.25.8.2, 172.25.13.2], 2 subgroups: [172.25.8.2|0] (1) [172.25.8.2], [172.25.13.2|0] (1) [172.25.13.2] FINER: 172.25.13.2: received GET_MBRS_REQ from 172.25.14.2, sending response 172.25.13.2, name=172.25.13.2, addr=172.25.13.2:65413, server
Oct 05, 2018 12:22:30 PM org.jgroups.protocols.TP down
FINER: 172.25.13.2: sending msg to 172.25.14.2, src=172.25.13.2, headers are PING: [type=GET_MBRS_RSP], UDP: [cluster_name=interOR-test-cluster]
Oct 05, 2018 12:22:30 PM org.jgroups.protocols.TP$BaseBundler sendBundledMessages
Oct 05, 2018 12:22:51 PM org.jgroups.protocols.Discovery sendDiscoveryResponse
FINER: 172.25.8.2: received GET_MBRS_REQ from 172.25.14.2, sending response 172.25.8.2, name=172.25.8.2, addr=172.25.8.2:60314, coord
Oct 05, 2018 12:22:51 PM org.jgroups.protocols.TP down
FINER: 172.25.8.2: sending msg to 172.25.14.2, src=172.25.8.2, headers are PING: [type=GET_MBRS_RSP], UDP: [cluster_name=interOR-test-cluster]
Oct 05, 2018 12:23:20 PM org.jgroups.protocols.TP passMessageUp
FINER: 172.25.8.2: received [dst: <null>, src: 172.25.14.2 (2 headers), size=0 bytes, flags=INTERNAL], headers are MERGE3: INFO: view_id=[172.25.14.2|0], logical_name=172.25.14.2, physical_addr=172.25.14.2:53184, UDP: [cluster_name=interOR-test-cluster]
Oct 05, 2018 12:23:20 PM org.jgroups.protocols.TP passMessageUp
FINER: 172.25.8.2: received [dst: <null>, src: 172.25.14.2 (2 headers), size=0 bytes, flags=INTERNAL], headers are MERGE3: INFO: view_id=[172.25.14.2|0], logical_name=172.25.14.2, physical_addr=172.25.14.2:53184, UDP: [cluster_name=interOR-test-cluster]
Oct 05, 2018 12:23:23 PM org.jgroups.protocols.TP down
Oct 05, 2018 12:24:03 PM org.jgroups.protocols.MERGE3$ViewConsistencyChecker _run
FINE: I (172.25.8.2) will be the merge leader
Oct 05, 2018 12:24:03 PM org.jgroups.protocols.MERGE3$ViewConsistencyChecker _run
FINER: merge participants are [172.25.8.2, 172.25.14.2]Oct 05, 2018 12:34:52 PM org.jgroups.protocols.pbcast.GMS up
FINER: 172.25.13.2: received full view: MergeView::[172.25.14.2|6] (3) [172.25.14.2, 172.25.8.2, 172.25.13.2], 3 subgroups: [172.25.13.2|0] (1) [172.25.13.2], [172.25.8.2|5] (1) [172.25.8.2], [172.25.14.2|0] (1) [172.25.14.2]
Oct 05, 2018 12:34:52 PM org.jgroups.protocols.pbcast.GMS up
FINER: 172.25.14.2: received full view: MergeView::[172.25.14.2|6] (3) [172.25.14.2, 172.25.8.2, 172.25.13.2], 3 subgroups: [172.25.13.2|0] (1) [172.25.13.2], [172.25.8.2|5] (1) [172.25.8.2], [172.25.14.2|0] (1) [172.25.14.2]
Oct 05, 2018 12:35:13 PM org.jgroups.protocols.pbcast.GMS up
FINER: 172.25.8.2: received full view: MergeView::[172.25.14.2|6] (3) [172.25.14.2, 172.25.8.2, 172.25.13.2], 3 subgroups: [172.25.13.2|0] (1) [172.25.13.2], [172.25.8.2|5] (1) [172.25.8.2], [172.25.14.2|0] (1) [172.25.14.2]
> <mailto:jgroups-dev+unsub...@googlegroups.com>.
We found out that a firewall-rule was wrongly configured which caused that the server never received the GET_MBRS_RSP message: there was a firewall-rule, but on the wrong UDP port.
Then it was still not working. We found a similar problem related to the FD_SOCK protocol.
For that protocol, only a server-port was configured (and enabled in the firewall), but the client_bind_port was not configured. It isn't configured in the default config of wildfly either. The result was that it uses a random TCP port to connect to its neighbor, which was not allowed by the firewall. The result was (quite similar as the UDP-config problem) that a connection (for FD_SOCK) could be created, but a packet sent from the server to the client was never received by jgroups because of the firewall.