Hello XNAT Experts,
We are experiencing performance issues with our XNAT instance and I was hoping to get some guidance or suggestions.
Our specs:
version 1.9.0, build 407
CentOS 7
Postgres 11.21
Several days ago we started seeing CPU usage spike for our server; the tomcat process was routinely consuming 100% across almost every core. At this point the VM had 64 GB of ram (32 for tomcat), and 10 cores. In response we more than doubled the core count (24 now), but the issue is still persisting.
Anecdotally, it looks like CPU use spikes every time someone tries to download data, or even look at some data (e.g. clicking "Details" in the prearchive). We have seen an increase in users over the past month, but it seems to be happening when even one user does this; the site slows to a crawl, our uploads & archiving slow down, &c.
Importantly, we have not changed anything about our configuration -- except for adding CPUs to the VM in response. It might just be that we need to start scaling to multiple servers, but the increase in users (2 - 3 frequent to 5 - 7 frequent) feels like it should not be impacting the service this much.
Any tips/thoughts/suggestions are much appreciated.
Thank you!
Alex
PS - I've attached a photo from the Java Melody CPU chart from today for reference. This type of activity was also happening across the weekend.