CentOS 7
24.01-20260204150225
Single server, no HA
Since a recent update, sipxbridge has been unstable. Overnight or at some point each day, I lose registration with the carrier (VOIP.ms) and sipxbridge.log (logging at debug level) stops recording. The last lines are:
"2026-03-01T21:37:09.467000Z":77323:JAVA:DEBUG:<sip server>:PipelineThread-0:00000000:sipxbridge:"Starting to parse."
"2026-03-01T21:37:09.467000Z":77324:sipXbridge:DEBUG:<sip server>:PipelineThread-0:00000000:sipxbridge:"Starting to parse."
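To pin down exactly when logging stops, something like the following could be run from cron. It is only a minimal sketch: it assumes every entry starts with a quoted UTC timestamp in the same format as the two lines above, and the function names are made up for illustration. The path to sipxbridge.log is passed on the command line because it depends on the install.

```python
import re
import sys
from datetime import datetime, timezone

# Leading quoted timestamp, as seen in the sipxbridge.log lines quoted above.
TS_RE = re.compile(r'^"(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d{1,6})Z"')


def parse_log_time(line):
    """Return the UTC timestamp at the start of a log line, or None."""
    m = TS_RE.match(line)
    if not m:
        return None
    ts = datetime.strptime(m.group(1), "%Y-%m-%dT%H:%M:%S.%f")
    return ts.replace(tzinfo=timezone.utc)


def seconds_since_last_entry(lines, now):
    """Age in seconds of the newest timestamped line, or None if none parse."""
    times = [t for t in map(parse_log_time, lines) if t is not None]
    if not times:
        return None
    return (now - max(times)).total_seconds()


if __name__ == "__main__" and len(sys.argv) > 1:
    # sys.argv[1] is the path to sipxbridge.log (install-specific).
    with open(sys.argv[1], errors="replace") as fh:
        tail = fh.readlines()[-200:]
    age = seconds_since_last_entry(tail, datetime.now(timezone.utc))
    if age is None:
        print("no timestamped lines found")
    else:
        print(f"last log entry was {age:.0f}s ago")
```

With debug logging on, a gap of more than a few minutes would suggest the process has stalled, and the time of the last entry can then be compared against when VOIP.ms reports the registration as lost.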
Scrolling back through the log doesn't reveal any significant issues, except for one I caught that points to DNS, but...
VOIP.ms shows loss of registration. In sipxcom, if I select the SIP trunk statistics, it shows registration status as AUTHENTICATED, but when I click Refresh, I get the error screen. The sipxconfig.log file captures an OutOfMemory exception when this happens.
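Since the kind of OutOfMemory error matters (heap exhaustion and running out of native threads point in different directions), a quick tally of sipxconfig.log might help narrow it down. This is a rough sketch, not a definitive tool: the marker strings are the standard HotSpot JVM messages, the function names are hypothetical, and the log path is passed in rather than assumed.

```python
import sys
from collections import Counter

# Standard HotSpot OutOfMemoryError detail messages, matched as substrings.
KINDS = [
    "Java heap space",
    "GC overhead limit exceeded",
    "Metaspace",
    "PermGen space",
    "native thread",
]


def classify_oom(line):
    """Return the OOM kind mentioned on this line, or None if not an OOM line."""
    if "OutOfMemory" not in line:
        return None
    for kind in KINDS:
        if kind in line:
            return kind
    return "other"


def summarize_oom(lines):
    """Count OutOfMemory occurrences by kind."""
    return Counter(k for k in map(classify_oom, lines) if k is not None)


if __name__ == "__main__" and len(sys.argv) > 1:
    # sys.argv[1] is the path to sipxconfig.log (install-specific).
    with open(sys.argv[1], errors="replace") as fh:
        for kind, count in summarize_oom(fh).most_common():
            print(f"{count:6d}  {kind}")
```

If most hits land under "other", the detail message is probably on a following line of the stack trace, so the surrounding lines would need to be read by hand.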
The sipxbridge.log file recently captured a Java exception saying it was unable to resolve the domain (that is, the sip domain, which I set up to be the same as the network domain).
DNS is enabled on this instance, but I also have a network DNS server running on the router. I have the router IP configured as a forwarder/primary external DNS server in sipxcom. From my workstation (which uses DNS on the router), I can ping/nslookup the sip domain and it resolves to the IP of the sip server. From the sip server itself, using the built-in DNS, I cannot ping the sip domain. If sipxbridge is attempting to find itself, how can it?
/etc/hosts does not contain an entry for the sip domain.
The router's IP does exist as a forwarder in the named.conf DNS configuration, but will DNS requests be forwarded when the name being resolved is in the server's own domain?
I added a custom A-record in sipxcom to add the sip domain and resolve to the sip server, but it still crashes.
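To check what the sip server itself sees after a change like this, a small script along these lines could be run on the server. It is a sketch under assumptions: it only uses the system resolver (so /etc/hosts and /etc/resolv.conf are consulted in whatever order the host is configured for), it does not query a specific DNS server directly, and the function names are invented for illustration.

```python
import socket
import sys


def nameservers(resolv_conf_text):
    """Extract nameserver addresses from the text of /etc/resolv.conf."""
    found = []
    for raw in resolv_conf_text.splitlines():
        line = raw.strip()
        if not line or line.startswith(("#", ";")):
            continue  # skip blanks and comment lines
        parts = line.split()
        if len(parts) >= 2 and parts[0] == "nameserver":
            found.append(parts[1])
    return found


def resolve_ipv4(name):
    """Resolve a name through the system resolver; empty list on failure."""
    try:
        infos = socket.getaddrinfo(name, None, socket.AF_INET, socket.SOCK_STREAM)
    except socket.gaierror:
        return []
    return sorted({info[4][0] for info in infos})


if __name__ == "__main__" and len(sys.argv) > 1:
    # sys.argv[1] is the sip domain to test.
    with open("/etc/resolv.conf") as fh:
        print("nameservers in use:", nameservers(fh.read()))
    addrs = resolve_ipv4(sys.argv[1])
    print(sys.argv[1], "->", addrs if addrs else "did not resolve")
```

If the first nameserver listed is 127.0.0.1 and the name does not resolve, the lookup is being answered (or refused) by the local named rather than by the router.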
Prior to this version, it ran flawlessly. Could something committed in the two February releases be misbehaving? I am also seeing this behavior on another sipxcom instance I manage at another location. It could very well be a misconfiguration on my part, as I set both up nearly identically.
I don't think this DNS issue is the root cause, as calls were still being made and received after the errors appeared in the log. But nothing in the log at the time it freezes/crashes points to a problem.
Performing a 'service sipxbridge restart' or 'service sipxecs stop'/'service sipxecs start' fixes the immediate issue temporarily.
Any suggestions where I might look? When my sip instance experiences this, I can keep it in that state to assist with troubleshooting.
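If it would help, while the instance is in the stuck state I could capture a thread dump of the sipxbridge JVM (for example with the JDK's jstack tool, assuming a JDK is installed and it is run as the user that owns the process) and summarize it. The sketch below only parses a saved dump in the usual HotSpot text format; the function name and the dump file argument are hypothetical.

```python
import re
import sys
from collections import Counter

# Thread header lines start with the quoted thread name.
NAME_RE = re.compile(r'^"([^"]+)"')
# State lines look like: java.lang.Thread.State: BLOCKED (on object monitor)
STATE_RE = re.compile(r"^\s*java\.lang\.Thread\.State:\s+(\w+)")


def thread_states(dump_text):
    """Return (thread name, state) pairs found in a jstack-style dump."""
    pairs = []
    current = None
    for line in dump_text.splitlines():
        m = NAME_RE.match(line)
        if m:
            current = m.group(1)
            continue
        m = STATE_RE.match(line)
        if m and current is not None:
            pairs.append((current, m.group(1)))
            current = None
    return pairs


if __name__ == "__main__" and len(sys.argv) > 1:
    # sys.argv[1] is a file holding the saved thread dump.
    with open(sys.argv[1], errors="replace") as fh:
        pairs = thread_states(fh.read())
    for state, count in Counter(s for _, s in pairs).most_common():
        print(f"{count:4d}  {state}")
    for name, state in pairs:
        if state == "BLOCKED":
            print("blocked:", name)
```

Since the last log entry comes from PipelineThread-0, the state of that thread in the dump would be the first thing to look at.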
Thanks in advance.