Hello,
I'm working for IServ GmbH (https://iserv.eu/) and we've integrated BBB into our IServ Schoolserver
as a direct response to the SARS-CoV-2 pandemic. We're currently
hosting about 260 active BBB instances (the exact number varies with
demand) plus a couple of spares and development instances.
First
of all let me be clear that we use BBB in an unsupported scenario. We
run BBB inside of a docker container. This allows us much easier
deployment within our infrastructure and ensures consistency across many
instances. Also, we don't need to update our tool chains to work with
Ubuntu 16.04 as all of our software is designed to run on Debian
stable/testing. Running inside a docker container requires a couple of
workarounds I'm not going to go into here and I don't think it should
matter much for the problem described below. We also don't use scalelite or Greenlight, but instead just BBB's API.
The problem: Sometimes, essentially at random, it looks like FreeSWITCH seems to stop accepting new connections. To us the problem is visible through nginx's log file (all times in UTC):
2020/05/04 09:24:59 [error] 5448#5448: *734320 connect() failed (110: Connection timed out) while connecting to upstream, client: 198.18.1.1, server: bbbhag06.videoconference.iserv.eu, request: "GET /ws HTTP/1.1", upstream: "https://116.202.241.247:7443/ws", host: "bbbhag06.videoconference.iserv.eu"root@bbbhag06:/# systemctl status freeswitch
● freeswitch.service - freeswitch
Loaded: loaded (/lib/systemd/system/freeswitch.service; disabled; vendor preset: enabled)
Drop-In: /etc/systemd/system/freeswitch.service.d
└─iserv.conf, override.conf
Active: active (running) since Wed 2020-04-29 00:11:17 UTC; 5 days ago
Main PID: 5205 (freeswitch)
CGroup: /docker/d6ec8a55ae21fbb69816a0e1867b54d8939c0321330574fa5b6fb974cec1e8ee/system.slice/freeswitch.service
└─5205 /opt/freeswitch/bin/freeswitch -u freeswitch -g daemon -ncwait -nonat -nf
‣ 5205 /opt/freeswitch/bin/freeswitch -u freeswitch -g daemon -ncwait -nonat -nf
Warning: Journal has been rotated since unit was started. Log output is incomplete or unavailable.55a95f4e-60ef-4c27-9042-62fa6a792c6f 2020-05-04 09:25:09.397541 [ERR] switch_rtp.c:6027 RTCP UNPROTECT ERR
55a95f4e-60ef-4c27-9042-62fa6a792c6f 2020-05-04 09:25:09.397541 [WARNING] switch_rtp.c:6952 INVALID RTCP PACKET TYPE 51 VER 3 LEN 17388
6a510796-d384-496e-8a4b-aba5732f4b44 2020-05-04 09:33:56.577544 [WARNING] switch_rtp.c:6257 SRTP audio unprotect failed with code 9 (replay check failed) 74 bytes 10 errors
7c1ca9b3-90ca-4982-b812-810e7c1afadc 2020-05-04 09:38:57.077548 [ERR] switch_rtp.c:6027 RTCP UNPROTECT ERR
7c1ca9b3-90ca-4982-b812-810e7c1afadc 2020-05-04 09:38:57.077548 [WARNING] switch_rtp.c:6952 INVALID RTCP PACKET TYPE 0 VER 2 LEN 144
3c96fe2d-4526-4206-9ce7-8c7b6385a615 2020-05-04 09:41:03.257541 [ERR] switch_rtp.c:3266 audio Handshake failure 1. This may happen when you use legacy DTLS v1.0 (legacyDTLS channel var is set) but endpoint requires DTLS v1.2.Kind regards,
Kim-A. Brodowski
--Kind regards,
Kim-A. Brodowski
You received this message because you are subscribed to the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bigbluebutton-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/5162e20d-eefd-4c13-b7e3-311d00ff69ac%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/CANTERJFg34eoRPBwaVk9AtUb_Mu%2BpnUzJEtON7xWJfTDjhavNw%40mail.gmail.com.
Good morning,
it looks like this did not fix the problems for us. To give you an idea of the scope: Of our about 260 BigBlueButton instances 6 were affected by the problem this morning. One of them had BigBlueButton 2.2.11 with bbb-freeswitch-core version 2:2.2.0-109 installed.
Kind regards,
Kim-Alexander Brodowski
--
You received this message because you are subscribed to a topic in the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/bigbluebutton-dev/u7C3o0q3IJo/unsubscribe.
To unsubscribe from this group and all its topics, send an email to bigbluebutton-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/431f7f99-c15d-4b3a-a1f5-61071cceab47%40googlegroups.com.
Upon further inspection the server running 2.2.11 might just have experienced a FreeSWITCH crash, which has the same symptoms, but most likely a different underlying cause:
May 18 09:48:19 bbbitg01.videoconference.iserv.eu systemd[1]: freeswitch.service: Main process exited, code=killed, status=6/ABRT
May 18 09:48:19 bbbitg01.videoconference.iserv.eu systemd[1]: freeswitch.service: Unit entered failed state.
May 18 09:48:19 bbbitg01.videoconference.iserv.eu systemd[1]: freeswitch.service: Failed with result 'signal'.
May 18 09:48:20 bbbitg01.videoconference.iserv.eu systemd[1]: freeswitch.service: Service hold-off time over, scheduling restart.
Kind regards,
Kim-Alexander Brodowski
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/c44329272f2a7de441e4fc3a2df9ab1b%40brodowski.cc.
Upon further inspection the server running 2.2.11 might just have experienced a FreeSWITCH crash, which has the same symptoms, but most likely a different underlying cause:
May 18 09:48:19 bbbitg01.videoconference.iserv.eu systemd[1]: freeswitch.service: Main process exited, code=killed, status=6/ABRT
May 18 09:48:19 bbbitg01.videoconference.iserv.eu systemd[1]: freeswitch.service: Unit entered failed state.
May 18 09:48:19 bbbitg01.videoconference.iserv.eu systemd[1]: freeswitch.service: Failed with result 'signal'.
May 18 09:48:20 bbbitg01.videoconference.iserv.eu systemd[1]: freeswitch.service: Service hold-off time over, scheduling restart.Kind regards,
Kim-Alexander Brodowski
Am 18.05.2020 10:21, schrieb Kim-Alexander Brodowski:
Good morning,
it looks like this did not fix the problems for us. To give you an idea of the scope: Of our about 260 BigBlueButton instances 6 were affected by the problem this morning. One of them had BigBlueButton 2.2.11 with bbb-freeswitch-core version 2:2.2.0-109 installed.
---
Kind regards,
Kim-Alexander Brodowski
Am 15.05.2020 19:10, schrieb Kim-Alexander Brodowski:
Hello,I'm currently working with my colleagues on our deployment plan. We're probably going to rollout these changes on our testing branch to about 10 servers later this weekend. Thank you so much for your support.Kind regards,Kim-A. Brodowski--
You received this message because you are subscribed to a topic in the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/bigbluebutton-dev/u7C3o0q3IJo/unsubscribe.
To unsubscribe from this group and all its topics, send an email to bigbluebutton-dev+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/431f7f99-c15d-4b3a-a1f5-61071cceab47%40googlegroups.com.
--
You received this message because you are subscribed to a topic in the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/bigbluebutton-dev/u7C3o0q3IJo/unsubscribe.
To unsubscribe from this group and all its topics, send an email to bigbluebutton-dev+unsubscribe@googlegroups.com.
To unsubscribe from this group and all its topics, send an email to bigbluebutton-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/431f7f99-c15d-4b3a-a1f5-61071cceab47%40googlegroups.com.
--
You received this message because you are subscribed to a topic in the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/bigbluebutton-dev/u7C3o0q3IJo/unsubscribe.
To unsubscribe from this group and all its topics, send an email to bigbluebutton-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/c44329272f2a7de441e4fc3a2df9ab1b%40brodowski.cc.
--
You received this message because you are subscribed to the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bigbluebutton-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/5818d9c2-4bdb-4d24-9730-cb3960f8f556%40googlegroups.com.
Also, Kim: can you please get the output of /opt/freeswitch/bin/fs_cli -x version from a 2.2.11 server that got the WS stack stuck?
Thanks.
--
You received this message because you are subscribed to the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bigbluebutton-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/2d97f14a-f82c-43a6-b3b9-095ea42500b3%40googlegroups.com.
Also, Kim: can you please get the output of /opt/freeswitch/bin/fs_cli -x version from a 2.2.11 server that got the WS stack stuck?
Thanks.
--
You received this message because you are subscribed to the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bigbluebutton-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/dc37ed50-902c-4562-8143-879db34cc62c%40googlegroups.com.
--
You received this message because you are subscribed to the Google Groups "BigBlueButton-dev" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bigbluebutton-...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bigbluebutton-dev/2d4f8194-bcca-4141-9e41-bd714c85d22a%40googlegroups.com.