[NEED HELP] Kafka connector stops producing whenever there is a blip in RDS connection

11 views
Skip to first unread message

safi junaid

unread,
Aug 18, 2025, 10:10:38 AMAug 18
to debezium

HI team,

We have multiple kafka connect pods, hosting around 10 debezium MYSQL connectors connected to RDS. These produces messages to MSK brokers and from there are being consumed by respective services.

Our connectors stop producing messages randomly every now and then, exactly for 14 minutes whenever we see below message:

INFO: Keepalive: Trying to restore lost connection to aurora-prod-cluster.cluster-asdasdasd.us-east-1.rds.amazonaws.com:3306

And auto-recovers in 14mins exactly. During this 14 mins, If i restart the connect pod on which this connector is hosted, the connector recovers in ~3-5 mins.

I tried tweaking lot of configurations with my kafka, tried adding below as well:
database.additional.properties: "socketTimeout=20000;connectTimeout=10000;tcpKeepAlive=true"

But nothing helped.

But I can not afford the delay of 15mins for few of my very important tables as it is extremely critical and breaches our SLA with clients.

Anyone faced this before and what can be the issue here?

Any help will be greatly appreciated.


Chris Cranford

unread,
Aug 18, 2025, 10:13:05 AMAug 18
to debe...@googlegroups.com
Hi -

Just for completeness in case others stumble onto this, the thread was opened also on Zulip -
#community-sqlserver > ✔ Kafka connectors stops producing due to a blip in db conn @ 💬. The solution was to add `use.nongraceful.disconnect`.

-cc
--
You received this message because you are subscribed to the Google Groups "debezium" group.
To unsubscribe from this group and stop receiving emails from it, send an email to debezium+u...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/debezium/330bc647-9043-4177-bef1-fcc695c8685cn%40googlegroups.com.

Reply all
Reply to author
Forward
0 new messages