Hi,
We are using Debezium with Oracle XStream and I would like to ask for help to optimize or scale Debezium.
Debezium replicates 6 tables from a database. One of these tables generates a lot of changes: every night the application updates all the records in it, which amounts to around 20 million changes per night. (The application generates a lot of small transactions; it doesn’t update all the data in one large bulk operation.)
Unfortunately, our configuration can handle only 600-700 events per second, which means that Debezium’s processing time is around 8-10 hours. Our goal is around 5000-7000 events per second, so we are very far from the required performance.
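For reference, the arithmetic behind these figures can be checked with a minimal sketch like the one below. It assumes roughly 20 million events per night and a constant processing rate; the function name is only illustrative and not part of Debezium.

```python
# Rough estimate: hours needed to process the nightly batch at a constant rate.
# Assumption: about 20 million change events per night, processed at a steady pace.
EVENTS_PER_NIGHT = 20_000_000


def processing_hours(events: int, events_per_second: float) -> float:
    """Return the hours needed to process `events` at a steady events/second rate."""
    return events / events_per_second / 3600


if __name__ == "__main__":
    # Current observed rates (600-700) and target rates (5000-7000).
    for rate in (600, 700, 5000, 7000):
        hours = processing_hours(EVENTS_PER_NIGHT, rate)
        print(f"{rate:>5} events/s -> {hours:.1f} h")
```

At the target rates the same nightly workload would take roughly 0.8 to 1.1 hours instead of 8-10 hours.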
This is an image which shows CPU and memory usage while we are processing the data. It uses only one core at 100%.

My questions are:
· How can we optimize Debezium or XStream? I tried setting poll.interval.ms, max.batch.size, and max.queue.size higher, but it had no effect on performance.
· How can we scale Debezium? Is it a good idea to create more outbound servers for different tables on the Oracle side? From what I’ve read, it isn’t a good solution, because in that case more than one capture process will read the same file.
(Please note that Debezium processing ran for only 4.5 hours. This happened because the capture process stopped after RMAN deleted the archive log file. It isn’t a problem: we use this configuration only in the dev environment, because we don’t have enough space for archive log files and we don’t want to stop our database. So it’s intentional, not a bug or misconfiguration.)
Thanks, Marcel
This is our configuration:
{
"name": "xstream-connector-2",
"config": {
"connector.class": "io.debezium.connector.oracle.OracleConnector",
"transforms.TABLE4.replacement": "spli-debezium_table4",
"transforms.TABLE1.type": "org.apache.kafka.connect.transforms.RegexRouter",
"transforms.TABLE5.type": "org.apache.kafka.connect.transforms.RegexRouter",
"transforms.TABLE5.replacement": "spli-debezium_table5",
"tasks.max": "1",
"database.history.kafka.topic": "spli-debezium_schema-changes",
"transforms": "TABLE1,TABLE2,TABLE3,TABLE4,TABLE5",
"database.tablename.case.insensitive": "true",
"transforms.TABLE2.regex": ".*SCHEMA.TABLE2",
"transforms.TABLE3.replacement": "spli-debezium_table3",
"transforms.TABLE4.type": "org.apache.kafka.connect.transforms.RegexRouter",
"include.schema.changes": "false",
"decimal.handling.mode": "string",
"internal.database.oracle.version": "11",
"database.history.skip.unparseable.ddl": "true",
"database.out.server.name": "OUTBOUND_SERVER",
"database.oracle.version": "11",
"database.user": "DEBEZIUM_PROXY",
"database.dbname": "DBNAME",
"database.connection.adapter": "xstream",
"database.history.kafka.bootstrap.servers": "KAFKA",
"transforms.TABLE1.regex": ".*SYMBOLS.TABLE1",
"database.server.name": "DBNAME",
"database.port": "PORT",
"key.converter.schemas.enable": "false",
"transforms.TABLE1.replacement": "spli-debezium_table1",
"transforms.TABLE4.regex": ".*SCHEMA.TABLE4",
"database.hostname": "DBHOST",
"transforms.TABLE2.type": "org.apache.kafka.connect.transforms.RegexRouter",
"database.password": "PW",
"value.converter.schemas.enable": "false",
"transforms.TABLE3.type": "org.apache.kafka.connect.transforms.RegexRouter",
"name": "xstream-connector-2",
"table.include.list": ".*SCHEMA.TABLE1,.*SCHEMA.TABLE2,.*SCHEMA.TABLE3,.*SCHEMA.TABLE4,.*SCHEMA.TABLE5,.*SCHEMA.TABLE6",
"transforms.TABLE3.regex": ".*SCHEMA.TABLE3",
"transforms.TABLE5.regex": ".*SCHEMA.TABLE5",
"transforms.TABLE2.replacement": "spli-debezium_table2",
"key.converter": "org.apache.kafka.connect.json.JsonConverter",
"value.converter": "org.apache.kafka.connect.json.JsonConverter",
"poll.interval.ms":"5000",
"max.batch.size":"8192",
"max.queue.size":"327680"
}
}