Hi,I understand that an embedded Debezium does not have the sameguarantees as in Kafka Connect deployment. However, as I see it,if I run an embedded Debezium on Kubernetes with offset storageinto an EBS volume and a very tight healthcheck, it seems that inthe worst case, there will be a few moments of downtime, and someduplicated messages. Is it correct?
Does the above configuration can handle the throughput and speedof a medium-traffic MySQL databases?
If it's not, is it possible to scale-out Debezium and have severalworkers reading from the same binlog?
Does embedded Debezium expose a replication lag metric forPostgreSQL and MySQL? This is the most essential metric as Isee it.
Thanks!
I'd start with a single instance and see whether you run into any issues.
There's monitoring (via JMX) for the MySQL connector only atm: https://debezium.io/docs/connectors/mysql/#monitoring. It exposes a "SecondsBehindMaster" parameter which represents the event lag. It's up high on the agenda to provide equivalent monitoring for the other connectors, too (in fact it should be one of the next things we get to once the two new connectors, Oracle and SQL Server have stabilized a bit).
Thanks very much for the detailed response :)I'd start with a single instance and see whether you run into any issues.
The real question is, if multiple consumers can read different offsets simultaneously, not to read the same events. (something like one reads the odd offsets and the other the even offsets. Sorry, I'm not familiar with MySQL binlog)
There's monitoring (via JMX) for the MySQL connector only atm: https://debezium.io/docs/connectors/mysql/#monitoring. It exposes a "SecondsBehindMaster" parameter which represents the event lag. It's up high on the agenda to provide equivalent monitoring for the other connectors, too (in fact it should be one of the next things we get to once the two new connectors, Oracle and SQL Server have stabilized a bit).I think there's a problem with the SecondsBehindMaster parameters: