Hi all,
We’re in the process of establishing our Archivematica processing workflow. One of the main file types we’ll be working with is the raw disk image (.img) that we create using our KryoFlux. Ideally we’d like to have Archivematica extract the files from each disk image upon transfer since, at least in most cases, we're more interested in preserving the contents of the disk image than the disk image itself.
Our dilemma is that while you can configure Archivematica to identify .img files by their extension (this was addressed in Dorothy Waugh's query to the DC Google Group last year), none of Archivematica’s existing commands can currently extract files from raw disk images, or from E01 disk images, for that matter.
Theoretically, we could solve this problem simply by changing our workflow so that we extract the files from our disk images before ingesting them into Archivematica. While doing file extraction manually may not be such a huge deal in the short term, it would both slow down our workflow to some degree and introduce the opportunity for human error.
I'm interested to hear from other folks who are using Archivematica to process raw/E01 disk images: what does your current workflow look like? Do you extract the files before transferring them into Archivematica, or do you process and preserve just the disk images themselves? If you do extract the files, at what point do you do this, and what tool do you use to do so?
I'd appreciate any thoughts you guys have to share about the above.
Thanks in advance!
Shira Peltzman
Digital Archivist, UCLA Library
Hi Shira,
I just wrote a paper on that very thing, published last month in the Code4Lib Journal:
http://journal.code4lib.org/articles/11986
You are correct that Archivematica isn’t well suited to extracting files from KryoFlux images. I’ve come up with some semi-automated processes using a variety of tools, depending on which disk formats I’m working with. I describe those processes in a reasonable amount of detail in the paper. I'm not sure if any of them are relevant to your particular case, but I’d be happy to answer questions if you have any.
Best,
John
--
You received this message because you are subscribed to the Google Groups "Digital Curation" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
digital-curati...@googlegroups.com.
To post to this group, send email to
digital-...@googlegroups.com.
Visit this group at https://groups.google.com/group/digital-curation.
For more options, visit https://groups.google.com/d/optout.