Curation tasks in DSpace 5 vs DSpace 6

17 views
Skip to first unread message

Alan Orth

unread,
Aug 20, 2020, 2:01:16 AM8/20/20
to DSpace Technical Support
Dear list,

I've implemented a curation task to read country names from item metadata and add new metadata fields with appropriate ISO 3166-1 Alpha2 codes if they don't already exist. On DSpace 5 the task finishes in an hour or sometimes two, but on DSpace 6 it runs for twelve hours and I end up killing it. As far as I can tell I ported the DSpace 5 version¹ to DSpace 6 faithfully², though I'm wondering if I missed something with regards to caching, as that seems to have been removed (or internalized) with the service API / Hibernate overhaul. I would be grateful if someone could take a look.

Another thing I note is that when I do "-i all" to process all items in the repository the curation task will curate each item multiple times, one for each collection it is mapped to. Our repository has ~90,000 items and in our case that results in reprocessing ~25,000 items(!). Would it be better to write a standalone Java utility for this rather than using the curation interface?

Thank you,

Reply all
Reply to author
Forward
0 new messages