Hello everyone!
I have what might be a silly question. My organization stores AIPs as compressed bags and we are looking into ways to run fixity checks on them in cloud storage environments, specifically in AWS S3. Can fixity checks be run on compressed files without unzipping them first or do they have to be uncompressed for fixity tools to run? Does anyone know if fixity checks on compressed AIPs can be run in Archivematica’s Fixity tool or in any other program?
- Ima Oduok--
You received this message because you are subscribed to the Google Groups "archivematica" group.
To unsubscribe from this group and stop receiving emails from it, send an email to archivematic...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/archivematica/7d34fa46-755b-45b7-9e2e-d67103ec70f5n%40googlegroups.com.
Hi all,
MIT Libraries has been experimenting with fixity in the cloud – but we’re using uncompressed bags, so unfortunately our use case doesn’t align with Ima’s. We’ve developed a toolset that takes Archivematica AIP UUIDs as input and uses AWS Lambda to verify AIPs stored in S3. The code is here https://github.com/MITLibraries/s3-bagit-validator. Large files and AIPs sometimes time out, but we’ve got local workarounds for those. We’re just getting started in production so I don’t yet have useful data about costs and time, but I hope to share more in the future. If you’d like more information feel free to contact me!
Take care,
Charlie
To view this discussion visit https://groups.google.com/d/msgid/archivematica/CAAr2QtspK6Wh56%3DpYdHM7ff5b%2B2yDNEOTFxWcH%2Bu9nCQb08gCQ%40mail.gmail.com.
Hi Sarah,
Thank you for that information. I have a follow up question about fixity checking compressed packages.
It sounds like the Fixity app unzips the package to run the checks but also creates a checksum for the zipped packages as well. Say you have a zipped package and one of the files within it is changed. Would the checksum for the zipped package change, indicating that one of the files is not as it should be? Or does the zipped package checksum remain the same so long as all the files are present, regardless of any changes to the files themselves?
From:
'Sarah Romkey' via archivematica <archiv...@googlegroups.com>
Date: Friday, July 25, 2025 at 8:26 AM
To: archiv...@googlegroups.com <archiv...@googlegroups.com>
Subject: Re: [archivematica] Fixity checking compressed AIPs
--
You received this message because you are subscribed to a topic in the Google Groups "archivematica" group.
To unsubscribe from this topic, visit
https://groups.google.com/d/topic/archivematica/stqielB2X_s/unsubscribe.
To unsubscribe from this group and all its topics, send an email to
archivematic...@googlegroups.com.
To view this discussion visit
https://groups.google.com/d/msgid/archivematica/CAAr2QtspK6Wh56%3DpYdHM7ff5b%2B2yDNEOTFxWcH%2Bu9nCQb08gCQ%40mail.gmail.com.
Hi Mariecris,
We’ve only had it in production since May but so far the Lambda costs are lower than $10 for fixity checking 5 TB of content. That said, the download and local workflow costs for edge cases like the big AIPs could be significant enough to impact budget.
Have a nice weekend,
Charlie Hosale
Digital Preservation Coordinator
MIT Libraries | Scholarly Communications & Collections Strategy
To view this discussion visit https://groups.google.com/d/msgid/archivematica/8f83ab21-495c-44c2-b177-9c04dd804cd1n%40googlegroups.com.