Thanks so much for sharing back! This does make sense. I have also experienced issues with EFS related to things like this. I would have suggested checking disk performance if I had realised a GoCD server replacement+upgrade was part of what had changed :-)
While it has some challenges in terms of being AZ specific, generally I have had better experience mounting EBS volumes for use by GoCD (rather than network-based stores such as EFS), although that does limit which AZ your GoCD server can run in (without manual intervention), so it depends on your wider deployment architecture whether that is acceptable.
I can't think of any major reason the GoCD server version change on its own would cause higher throughput usage than your older version GoCD Server. One thing worth thinking about is that, in my recollection, EFS in bursting mode does vary speeds a lot based on the size of the storage. If your
new server has much lower storage/use of EFS than your
old server then the limits may be different. (e.g if you wiped a lot of artifacts while re-using the same EFS volume or created a new EFS volume which is a lot smaller) I'd suggest comparing the AWS side metrics for your EFS throughput between the two to compare what their usage, credits, and limits are per
https://docs.aws.amazon.com/efs/latest/ug/performance.html.
For small EFS volumes, the baseline throughput is pretty terrible (15 MiBps read, 5 MiBps continuously), and GoCD servers tend to be rather write heavy if you have heavy use of artifacts within GoCD itself.
I am not sure if use of https-git has any major implications for disk usage on the git side of things compared to ssh , but I would not have thought it'd majorly change the throughput requirements. If EFS volume size doesn't explain issues, and you have changed all of the material URLs to https://, perhaps you want to compare other aspects of your material configuration for changes in
- the # of distinct materials known to GoCD on the Materials tab (old vs new)
- the # of these materials that are auto-updating (polling, the default) compared to having auto-update disabled (e.g if you also use Webhooks)
-Chad