Thanks for the update, Rob.
On Wed, 9 Jul 2025 05:55:30 -0700 (PDT)
"'Rob Stradling' via Certificate Transparency Policy"
> Snippet <
https://developers.cloudflare.com/rules/snippets/>. This has
> a very similar goal to Let's Encrypt's CTile
> <
https://github.com/letsencrypt/ctile> project, but it uses
> Cloudflare's CDN caching instead of S3 object storage.
CF_CTile is extremely cool (only 83 lines!) and is something I had long hoped someone would write.
> Our metrics show that our PostgreSQL-based logs (the Elephant and
> Tiger shards) are now handling their current traffic volumes well,
> with room for growth and to handle traffic peaks. The rate limits
> for the Elephant and Tiger shards are now set as follows:
>
> - Per IP: no limits defined.
> - Per shard: 3,000 req/sec.
> - Per shard, for "old" submissions
> <
https://github.com/google/certificate-transparency-go/blob/master/CHANGELOG.md#ctfe-rate-limiting-of-non-fresh-submissions>:
> 40 req/sec for add-(pre-)chain requests where the certificate's
> notBefore timestamp is older than 28 hours at time of submission.
>
> Sadly, our MariaDB-based logs (the Mammoth and Sabre shards) are
> still struggling to handle their current traffic volumes reliably,
> even with much lower rate limits (Per shard: 400 req/sec).
I'm surprised, since AIUI CTile solved all of Let's Encrypt's read path problems. Are you still seeing significant amounts of read traffic hitting the origin, or is it the write load that's causing the problems?
Do you have Tiered Caching <
https://developers.cloudflare.com/cache/how-to/tiered-cache/> enabled in Cloudflare? It looks like that can reduce the number of requests hitting the origin.
> Now that we have built confidence in our PostgreSQL-based logs, we
> feel the time has come to start making plans to deprecate our use of
> MariaDB. Our plan so far has been to wind down the Sabre and Mammoth
> shards once the Tiger and Elephant shards have proven themselves and
> transitioned to Usable. However, we're now wondering if it would be
> less disruptive for the ecosystem if we instead look to migrate the
> existing Sabre and Mammoth shard databases from MariaDB to
> PostgreSQL, and let these shards continue to operate as Usable logs.
> If any community members have opinions on which approach to take,
> we're keen to hear from you.
It will be 70 days until Tiger is Usable in Chrome, which IMO is too long for Sabre and Mammoth to continue in this state. If migrating them to PostgreSQL is the fastest way to get their availability back to an acceptable level, then I think that's the preferable option.
Regards,
Andrew