Exporting/Importing ead to version 2.4


hannah...@googlemail.com

Jul 24, 2018, 7:46:36 AM
to AtoM Users
Hi

I'm testing import and export using version 2.4 in Vagrant. I exported some EAD files and then purged the database from the command line. I then tried to import the same EAD files through the user interface. Most of these jobs failed with the error message "Exception: field name is null or empty". Is there something basic that I've missed?

With thanks, H

Dan Gillean

Jul 30, 2018, 10:51:52 AM
to ICA-AtoM Users
Hi Hannah, 

Roundtripping EAD XML files in AtoM should work. I've been away so apologies for the delay, but I'll do some local testing to see if I can reproduce this issue and get back to you.

Regards, 

Dan Gillean, MAS, MLIS
AtoM Program Manager
Artefactual Systems, Inc.
604-527-2056
@accesstomemory

--
You received this message because you are subscribed to the Google Groups "AtoM Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ica-atom-users+unsubscribe@googlegroups.com.
To post to this group, send email to ica-atom-users@googlegroups.com.
Visit this group at https://groups.google.com/group/ica-atom-users.
To view this discussion on the web visit https://groups.google.com/d/msgid/ica-atom-users/77b04aa0-04de-41e2-9c6a-967ea3da06c6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

hannah...@googlemail.com

Jul 31, 2018, 8:20:35 AM
to AtoM Users
Hi Dan

Thank you for looking into this for me. I've just tried again just in case and it's the same problem. Here's a copy of the log:


[info] [2018-07-31 05:11:34] Job 1616 "arFileImportJob": Job started.
[info] [2018-07-31 05:11:34] Job 1616 "arFileImportJob": Importing XML file: papers-of-the-clifton-suspension-bridge-trust July 2018.xml.
[info] [2018-07-31 05:11:34] Job 1616 "arFileImportJob": Indexing imported records.
[info] [2018-07-31 05:11:34] Job 1616 "arFileImportJob": Update type: import-as-new
[info] [2018-07-31 05:11:35] Job 1616 "arFileImportJob": Exception: field name is null or empty

With many thanks, 

Hannah

Dan Gillean

Jul 31, 2018, 5:30:48 PM
to ICA-AtoM Users
Hi Hannah, 

I'm going to have to follow up with our developers to see if they have ideas, as I haven't seen that particular error message before. In the meantime, would you be willing to send me one of your test EAD XML files off-list? I've just successfully exported some EAD XML from the clipboard, unzipped the download, and imported the files back in via the user interface, all in my 2.4.x Vagrant box. I'd like to look at one of your files to see if I can recreate the issue that way. You can reach me at dan [at] artefactual [dot] com.

Thanks! 



Dan Gillean, MAS, MLIS
AtoM Program Manager
Artefactual Systems, Inc.
604-527-2056
@accesstomemory


hannah...@googlemail.com

Aug 2, 2018, 5:04:11 AM
to AtoM Users
Hi Dan

Just to let you know that I've emailed the file. 

Kind regards, Hannah

Dan Gillean

Aug 2, 2018, 6:16:54 PM
to ICA-AtoM Users
Hi Hannah, 

I received your file, and got a different error message when I first tried to import it - my error message said: "Unable to resolve host: undefined."

So I opened your EAD file and searched for "undefined", and sure enough, it is listed as the base URL every time your site is referenced, whether in the header or when pointing to a digital object. An example:



Have you set the Base URL setting in your AtoM instance? This setting is found in Admin > Settings > Site information, and there should have been a big orange banner reminding you to do so when you first installed AtoM. See: 

The base URL is used to create absolute URLs included in XML exports (e.g. MODS and EAD exports). For example, your AtoM site is made up of a series of web pages. Each page has a full Uniform Resource Locator (URL), something like http://www.your-atom-site.com/your-description. The Base URL is the part of this URL that does not change - in this example, http://www.your-atom-site.com.

Setting this value will ensure that links included in your XML exports will be properly formed. Do not include a slash / at the end of your base URL - AtoM will automatically add this when building the absolute URLs.

I would try doing a find/replace in your EAD XML document to add any valid URL to this path - e.g. find "undefined" and replace it with "www.example.com" and then try your import again. 
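
If you would rather script the find/replace than do it in a text editor, here is a minimal Python sketch. The function names are made up for illustration, and it assumes the EAD file is UTF-8 encoded and that "undefined" appears only where the base URL should be. It is a plain text substitution, not an XML-aware edit, so check the returned count against what you expect.

```python
from pathlib import Path


def replace_placeholder(xml_text, placeholder="undefined", replacement="www.example.com"):
    """Return (new_text, count) with every occurrence of placeholder replaced.

    Plain text substitution: it will also change the word if it happens to
    appear in descriptive content, so review the count before importing.
    """
    count = xml_text.count(placeholder)
    return xml_text.replace(placeholder, replacement), count


def fix_ead_file(source, target):
    """Read source, replace the placeholder host, write the result to target.

    Assumes UTF-8; writes to a separate file so the original export is kept.
    """
    text = Path(source).read_text(encoding="utf-8")
    fixed, count = replace_placeholder(text)
    Path(target).write_text(fixed, encoding="utf-8")
    return count
```

Calling fix_ead_file("export.xml", "export-fixed.xml") (both file names hypothetical) would leave the original export untouched and return how many replacements were made.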




One thing to note about roundtripping like this: the EAD XML file includes file path links to digital objects you'd uploaded in your original description... but if you've purged your data, then you've also purged these objects, so they will not be automatically pulled in and linked to your description. Instead, AtoM will go look for the object at the given URL, find it missing or inaccessible, note it in the log, and continue the import.

The CSV export will also provide a URL as the default path for a digital object. However, with the CSV, you can also optionally add a digitalObjectPath column. This column can point to a local file path on your AtoM server, and can upload your digital objects from there. Since it looks like your digital object file names are unique (based on the related reference code), this should be fairly simple to do. 

This means you could create a temporary subdirectory below the root AtoM directory and put a copy of all your digital objects in it, and then use this to re-link them once you re-import. For example, the default installation directory for AtoM (if you followed our recommended instructions) is generally /usr/share/nginx/atom. You could add a subdirectory here called "images", and then add your images there - e.g. A065600-16-001_141.jpg. You would then reference the full file path in your import CSV, in the digitalObjectPath column - e.g. /usr/share/nginx/atom/images/A065600-16-001_141.jpg

So, if you wanted to roundtrip with CSV instead of EAD, you could: 
  • Export your descriptive hierarchy as CSV
  • Add a copy of all your digital objects into a local folder on your AtoM server - "images"
  • Open the file in a spreadsheet application
  • Find the digitalObjectURI column
  • Change the header from digitalObjectURI to digitalObjectPath
  • Update each value in this column to include a file path instead of a URL
  • Save the CSV and reimport
For example, this upload in your EAD file: 

http://[BASE URL HERE]/uploads/r/clifton-suspension-bridge-archives/9/2/d/92df8a8fb4bb9ca1c404ee0e1065d9bde47a52a93435c03f39c9bad8373946d2/A065600-16-001_141.jpg

Would become: 

/usr/share/nginx/atom/images/A065600-16-001_141.jpg
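
For a large export, the header rename and path rewrite described above could be scripted rather than done by hand in a spreadsheet. The following is a rough Python sketch, not an AtoM tool: the function names are hypothetical, and it assumes the export has a digitalObjectURI column, does not already have a digitalObjectPath column, and that every file name is unique so that only the last segment of each URL is needed.

```python
import csv
import posixpath
from urllib.parse import urlparse


def uri_to_local_path(uri, local_dir):
    """Keep only the file name from the URI and place it under local_dir."""
    if not uri or not uri.strip():
        return ""  # rows without a digital object stay empty
    filename = posixpath.basename(urlparse(uri.strip()).path)
    return posixpath.join(local_dir, filename)


def convert_csv(src, dst, local_dir="/usr/share/nginx/atom/images"):
    """Copy CSV rows from src to dst, turning digitalObjectURI into digitalObjectPath.

    src and dst are open text file objects; all other columns pass through unchanged.
    """
    reader = csv.DictReader(src)
    names = reader.fieldnames or []
    if "digitalObjectURI" not in names:
        raise ValueError("no digitalObjectURI column found")
    if "digitalObjectPath" in names:
        raise ValueError("digitalObjectPath column already present")
    # Rename the header in place so the column order is preserved.
    out_names = ["digitalObjectPath" if n == "digitalObjectURI" else n for n in names]
    writer = csv.DictWriter(dst, fieldnames=out_names)
    writer.writeheader()
    for row in reader:
        row["digitalObjectPath"] = uri_to_local_path(row.pop("digitalObjectURI"), local_dir)
        writer.writerow(row)
```

To use it, open both files with newline="" as the csv module documentation recommends, and pass a different local_dir if your objects are not in /usr/share/nginx/atom/images.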


Anyway, let's see if we can get roundtripped imports working for you. Let me know if: 
  1. doing a find/replace in your EAD file to add a proper URL value (even if it is example.com) will allow you to import the file, and
  2. if adding or updating your base URL will allow you to export and re-import an EAD file properly without manual edits

Cheers, 

Dan Gillean, MAS, MLIS
AtoM Program Manager
Artefactual Systems, Inc.
604-527-2056
@accesstomemory


hannah...@googlemail.com

Aug 8, 2018, 11:04:16 AM
to AtoM Users
Hi Dan 

Thanks for this. I replaced every 'undefined' in the XML file with www.example.com, and I'm really sorry, but now I'm getting a 500 internal server error.

With thanks,

Hannah

Dan Gillean

unread,
Aug 8, 2018, 12:07:18 PM8/8/18
to ICA-AtoM Users
Hi Hannah, 

Are you able to access and share the 500 error message? See the following for information on where to look: 

Hopefully with the full error message, I can get some input from our team on next steps. 

Cheers, 


Dan Gillean, MAS, MLIS
AtoM Program Manager
Artefactual Systems, Inc.
604-527-2056
@accesstomemory
