Just an update on this. This is what we got back from AWS and are testing now. I will update if this resolves our problem.
There is a known issue (in case of Debezium Streaming) regarding WAL sender error when decoding.
There is a workaround for this issue which is to turn off the write-through cache by modifying your parameter group, setting the rds.logical_wal_cache parameter to 0 and then restart your writer instance.
Without the write-through cache, Aurora PostgreSQL uses the Aurora storage layer in its implementation of the native PostgreSQL logical replication process.
It does so by writing WAL data to storage and then reading the data back from storage to decode it and send (replicate) to its targets (subscribers).
This can result in small performance bottleneck during logical replication for Aurora PostgreSQL DB clusters but it does not cause any loss of data from existing logical replication.
Thank you,