I just wanted to chime in on this issue and give a few more details on the technical challenges we are facing with this log. The main challenge we are facing are the IOPS on our Cassandra nodes, mostly due to the get-entries endpoint load. This traffic is not only slowing down the get-entries calls but is also causing the backlog of new entries from being merged into the tree. When we turn off the get-entries endpoint the signer is able to empty the backlog and catch up. When we turn it back on, the signing backlog grows and struggles again. We agree with Andrew’s suggestion and turned off accepting any new entries again to protect against a MMD violation.
We are committing to doing everything we can to save this log but in order to do this the immediate need would be to increase our IOPS and the number Cassandra nodes. However this puts us into a difficult spot in that in order to add more nodes we will need to keep the add-chain and add-pre-chain endpoints off for a significant amount of time.
Cassandra only allows one node to be added to the cluster at a time, each node could take up to a day to complete (due to the IO load it takes to add a new node) and we are thinking we would need to add up to 12 new nodes. This means this whole process would take up to about 12 days to complete. We also will most likely need to take the get-entries endpoint on and off a bit during this time depending on the IO needs to add the new nodes.
This would obviously put us very much over our uptime requirements and thus we would like to get everyone feedback and thoughts on allowing this log to take this amount of down time to get into a more stable state.
--You received this message because you are subscribed to the Google Groups "Certificate Transparency Policy" group.To unsubscribe from this group and stop receiving emails from it, send an email to ct-policy+...@chromium.org.To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/ct-policy/7fc9fa65-b60c-4406-bd50-c5a46c1212d3n%40chromium.org.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/ct-policy/278e9f20-6b2d-4dd6-90ad-7d07c2b6e17a%40app.fastmail.com.
To view this discussion visit https://groups.google.com/a/chromium.org/d/msgid/ct-policy/278e9f20-6b2d-4dd6-90ad-7d07c2b6e17a%40app.fastmail.com.