Truncated EAD XML export

35 views
Skip to first unread message

Jenny Mitcham

unread,
Jun 30, 2014, 12:13:55 PM6/30/14
to ica-ato...@googlegroups.com
Hi,

We are in the process of setting up and testing AtoM 2.0.1.

I've just tried importing one of our catalogues as EAD and it worked fine. Now I am having a go at exporting it back out again as EAD and am encountering some quirks. 

When I use Chrome to carry out the export it saves the file as .htm with html start and end tags (though the DOCTYPE is EAD). If I just change the <html> tags to <xml> I'm hoping that will just solve the problem?

When I use Internet Explorer to carry out the same export, it does save it as xml but it truncates the end of the file - so the resulting file doesn't hold the whole catalogue. It is a big catalogue so I wondered if there was an export limit in place?

I note the documentation on the wiki says that there may be problems in certain browsers, but it should work fine in Chrome and IE.

Cheers,
Jen

--
Jenny Mitcham
Digital Archivist
Borthwick Institute for Archives
University of York
Heslington
York
YO10 5DD

Telephone: 01904 321170

Borthwick Institute website: http://www.york.ac.uk/borthwick/
Digital archiving blog: http://digital-archiving.blogspot.co.uk/
Tools for Research Data Management blog: http://uoy-rdmproject.blogspot.co.uk/
Twitter: @Jenny_Mitcham

Dan Gillean

unread,
Jun 30, 2014, 5:43:23 PM6/30/14
to ica-ato...@googlegroups.com
Hi Jen,

What versions of each browser are you using? How big are the EAD files you are trying to export?

I did some testing on our development branch this morning, using IE (11.0.9) and Chrome (35.0.1916.153), and in both cases, I succeeded in exporting a full (non-truncated) EAD file, and saving it as XML. The file was not giant, but large enough for testing (5 series, a decent number of lower level files and items - you can see a copy of it in our ArchivesCanada Beta demo site here: http://archivescanada.accesstomemory.org/george-gale-fonds)  - when saved it was about 210KB.

A couple things of note:

1) There is a known issue with roundtripping EAD files in AtoM right now, where the <!DOCTYPE> and <ead> element namespace declarations are causing roundtripping (e.g. export, then re-import) issues. There is a simple workaround to this, as well as more backstory, on the related issue ticket: https://projects.artefactual.com/issues/6064

Please note especially the comments at note #11 on the ticket for the roundtripping workaround.

2) Performing import and export through the user interface will mean that everything must run through the browser. The browser generally has execution timeout limits in place, so if you were trying to export a very large fonds/collection, it's possible that the EAD is truncated due to a browser timeout during the request.

We do have the ability to export via the command-line as well - you might try this for a very large description. I'm still working on our documentation review and rewrite for 2.0, and haven't yet had the chance to properly document this functionality (coming soon!) but some minimal instructions are available here: https://www.qubit-toolkit.org/wiki/XML_import/export#Bulk_Export_via_Command_Line_Interface_.28CLI.29


Regards,

Dan Gillean, MAS, MLIS
AtoM Product Manager / Systems Analyst,
Artefactual Systems, Inc.
604-527-2056
@accesstomemory


--
You received this message because you are subscribed to the Google Groups "ICA-AtoM Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ica-atom-user...@googlegroups.com.
To post to this group, send email to ica-ato...@googlegroups.com.
Visit this group at http://groups.google.com/group/ica-atom-users.
To view this discussion on the web visit https://groups.google.com/d/msgid/ica-atom-users/CAF8JE%3Do1BLx0bd8tAOzQ4w%3DcV3Z86m_nJD_ZBMM9rbj-o3PV3A%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages