Hi,
We are looking at importing a large number of items as part of our launch (~200,000). Imports seemed to be slow from the start but were at least tolerable. As the size of the repository has grown import time has grown significantly along with it.
I've tried different "batch" sizes to see if that had an impact but the pattern still seems to be the same.
Currently importing 100 items is taking well over 1 hour. I should mention that the resources involved could be scaled up further -- but I assume they should be sufficient for the tasks this involves (exception maybe SOLR as that's less familiar to me). Based on how fast SOLR indexes items using the "index-discovery" command I can't see it being so slow here.
Is this a known or common problem? Is there anything others have done to speed this up?
To be clear in this instance I am referring to Item Import via Simple Archive Format -- though I've noticed similar behaviour with the CSV import capabilities via the UI.
We are on v7.3 currently.
Thanks,
Steve