Hi,
I am new to Debezium and doing a POC right now. I am using an old version (1.9) and plan to upgrade soon.
I am running Debezium with SQL Server on AWS MSK. I have around 140 tables that I need to replicate and currently, I have 1 connector running around 20 tables. I decided to add the remaining 120 tables and I see that the performance is very slow. All tables included, the size is less than 40 GB and the tables I added are less than 10 GB combined. Due to limitations with Glue, I am currently using only JSON with schema turned on but given the size of my dataset, I thought it shouldn't be a huge problem.
Metrics:
1. I see the processing is mostly about 1 table at a time. Max, I see about 3-4 topics in parallel.
2. The throughput however appears to be very low. I am seeing on average less than 4 MB per second received from SQL server.
3. Incoming messages per second is on average 220.
I am sure I am missing things here that I should consider for better performance. I appreciate any feedback. Thanks!
Regards,
Bhaski