Thanks,
Rajiv
We are running a simple remote actor system (Akka 2.2.1) and can reproduce events that cause addresses to be quarantined:

15:39:25,672 INFO (AppServer-akka.actor.default-dispatcher-4) [WARN] [09/11/2013 15:39:25.672] [AppServer-akka.actor.default-dispatcher-20] [Remoting] Tried to associate with unreachable remote address [akka.tcp://Publ...@10.14.10.13:2552]. Address is now quarantined, all messages to this address will be delivered to dead letters.
My question is not about why the quarantine was initiated but rather how to control the amount of time that the address is kept in quarantine. I observe that after ~1 minute the address is removed from quarantine and the connection is re-established. Is there a way to reduce the wait?
I see the following configuration property exists:

    # This settings controls how long a system will be quarantined after
    # catastrophic communication failures that result in the loss of system
    # messages. Quarantining prevents communication with the remote system
    # of a given UID. This function can be disabled by setting the value
    # to "off".
    quarantine-systems-for = 60s

We are running with the following settings (in addition to "off", we have tried quarantine-systems-for = 5s), but they seem to have no impact:

    akka.remote.retry-window = 5s
    akka.remote.gate-invalid-addresses-for = 5s
    akka.remote.quarantine-systems-for = off
    akka.remote.retry-gate-closed-for = 0 s

Is there some other setting we are missing here?

Thanks
Patrik Nordwall
Typesafe - Reactive apps on the JVM
Twitter: @patriknw
Hi Endre,

Here is my JBoss 7.1.1.Final server log. You will see that I've started the server and let it run for a few seconds. Everything appears to be running normally, and then the following events occur:

At 10:01:59,296 I set a breakpoint within my subscriber and it gets hit. You see the heartbeat messages stop there. The first message logged after that is when I kill the debugger and let things flow normally again; it tells us the publisher is unreachable.

At 10:02:32,077 you see the message that the subscriber has now quarantined the address of the publisher.

The subscriber then attempts to reconnect to the publisher every 5 seconds, but we see "tried to associate with unreachable address - Address is now quarantined...".

At 10:03:07,238 the quarantine is finally lifted and the subscriber reconnects with the publisher.

So, I was wrong: it looks like the quarantine is lasting 30 seconds and not 60 (but when you are under fire and waiting for a reconnect, 30 seconds can feel like an eternity!). Still, 30s is much more than the 5 seconds we have it set to in the conf file (I also tried the value "off" and still got the quarantine logged here).

Here's a copy of our conf:

    appServer {
      akka {
        daemonic = on
        actor.provider = "akka.remote.RemoteActorRefProvider"
        jvm-exit-on-fatal-error = off
        loglevel = "DEBUG"
        remote {
          netty.tcp {
            hostname = "" // we set this programmatically
            port = 2554
            maximum-frame-size = 1048576b
          }
          retry-window = 5s
          gate-invalid-addresses-for = 5s
          quarantine-systems-for = 5s
        }
      }
    }

Thanks for your assistance.
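For reference, the intervals between the three timestamps quoted in the post above can be worked out with a few lines of plain Java. This is only a sketch of the arithmetic: the class name `QuarantineTimeline` and the helper `between` are made up for illustration, and the timestamp pattern `HH:mm:ss,SSS` is assumed from the way the JBoss log lines are quoted in this thread.

```java
import java.time.Duration;
import java.time.LocalTime;
import java.time.format.DateTimeFormatter;

public class QuarantineTimeline {
    // Assumed log timestamp layout: hours:minutes:seconds,milliseconds (e.g. 10:02:32,077)
    private static final DateTimeFormatter LOG_TIME =
            DateTimeFormatter.ofPattern("HH:mm:ss,SSS");

    // Elapsed time between two same-day log timestamps
    static Duration between(String from, String to) {
        return Duration.between(LocalTime.parse(from, LOG_TIME),
                                LocalTime.parse(to, LOG_TIME));
    }

    public static void main(String[] args) {
        String breakpointHit = "10:01:59,296"; // heartbeats stop
        String quarantined   = "10:02:32,077"; // publisher address quarantined
        String reconnected   = "10:03:07,238"; // subscriber reconnects

        System.out.println("breakpoint -> quarantine: "
                + between(breakpointHit, quarantined).toMillis() + " ms"); // 32781 ms
        System.out.println("quarantine -> reconnect:  "
                + between(quarantined, reconnected).toMillis() + " ms");   // 35161 ms
        System.out.println("breakpoint -> reconnect:  "
                + between(breakpointHit, reconnected).toMillis() + " ms"); // 67942 ms
    }
}
```

On these numbers, the gap from the quarantine message to the reconnect is about 35 seconds, and the gap from the breakpoint to the reconnect is about 68 seconds, which may account for the earlier "~1 minute" impression. Since reconnection is only attempted every 5 seconds, the moment the quarantine is actually lifted can only be observed to within roughly one retry interval.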
That aside, the initial question remains...why did the observed quarantine period - 30s - differ from what we thought we had configured with the akka.remote.quarantine-systems-for = off/5s property?
Is this property used for something different than remote death watch quarantine described here?
With clarification on that, I think we can put this one to bed.
Hi Derek,
That aside, the initial question remains...why did the observed quarantine period - 30s - differ from what we thought we had configured with the akka.remote.quarantine-systems-for = off/5s property?
The question is: is there any traffic that can trigger a reconnect after the quarantine has elapsed? Do you have some internal timer that fires in the 30s range? If there is traffic, and the reconnect does not happen after the quarantine is lifted, then that might be a bug. I am asking because immediately before the successful association I saw this line in your logs:
10:03:07,221 WARN [com.ft.messaging.MarketDataSubscriber] (AppServer-akka.actor.default-dispatcher-12) MarketDataSubscriber attempting to connect to akka.tcp://MarketDataPublisherSys...@192.168.212.184:2552/user/marketDataPublisher...