Hi Mark,
You'll be happy to learn that there are no changes in the ISAD(G) CSV import template between 2.3 and 2.4. The only changes in our templates between 2.3 and 2.4 are the following:
Authority records
- Add new CSV template for import/export of repository data
I have yet to add these to the
wiki - my apologies. Part of the delay is that we've discovered that a number of fields are not roundtripping in the new repository CSV import/export. We're currently working on a fix for this that will be included in the next version, and backported to our stable/2.4.x branch when completed. We won't be creating a new 2.4 tarball, but anyone upgrading or installing AtoM for the first time can follow option 2 in our installation instructions (
install from our git repository), and can always later easily pull in any other patches and fixes we merge back into the stable branch without a full upgrade.
In the meantime, users can always find our CSV templates included in the AtoM code (or via our GitHub page), at l
ib/task/import/example. You can also use the clipboard to export a repository record to see a sample of the repository CSV as it currently is, and/or add occupation access points to an authority record and export it, or manually add the 2 new columns to your template for import.
I'll try to get the new samples on our wiki soon.
Regarding OpenRefine, it is a GREAT tool for normalizing your data prior to a migration, and we use it internally at Artefactual whenever we can. The facets alone are a powerful way to look for names and terms with variant spellings etc that you can easily resolve and merge with a couple clicks. You can split data into different columns or concatenate data into one column, and much much more.
There are many great free resources out there worth checking out! Start on the OpenRefine homepage - you'll find a whole free book as well as some videos linked there:
The OpenRefine GitHub wiki also has a huge list of resources, including a curated list of external resources such as tutorials and the like, here:
There's also a public user forum, here:
If you do a web search you will find even more resources. Here's a simple introductory tutorial I found online, from a Canadian archivist (though it doesn't use archival examples at all):
If you are looking to perform a specific action (like concatenating 2 columns together), you can generally just do a web search for what you want, and usually find a pre-crafted example of how to construct the query. Good luck!