Issues with apostrophes and hypens

515 views
Skip to first unread message

t.jo...@gmail.com

unread,
Nov 1, 2017, 9:30:14 PM11/1/17
to AtoM Users
Hi, 

On a recent upload to AtoM, we noticed that some entries were missing large areas of text and generally followed the use of an apostrophe or sometimes a hyphen or colon. This didn't happen every time an apostrophe was used but this issue appeared under both 'Collections' and 'Items'. Has anyone else had this issue and is there a work around?

Looking forward to any help because inputting the data manual is quite time consuming!

T

GR Mulcaster

unread,
Nov 1, 2017, 11:36:43 PM11/1/17
to AtoM Users

What delimiter did you use in building your CSVs and did you encode in UTF 8?
Which spreadsheet did you use to build your CSVs?

Dan Gillean

unread,
Nov 2, 2017, 11:45:55 AM11/2/17
to ICA-AtoM Users
Hi T, 

I think that GR's questions are on the right track. AtoM expects CSV files to be UTF-8 encoded, and using unix-style line endings. If you've used a spreadsheet application like Microsoft Excel to prepare your data, it's possible these settings are not being preserved - Microsoft by default uses its own custom character encodings, for example, which can cause issues during import. See: 

If at all possible, I suggest you consider using LibreOffice Calc as your spreadsheet application for data prep - it's free/open source and available on Linux, Mac, and Windows. It gives you a lot more control over delimiter and encoding settings. 

Another thing to consider - even if you use Calc, but you are cutting and pasting from something like a Word document, it's possible to accidentally copy non-UTF-8 characters into your document, which can cause issues. For example, Microsoft loves its "smart quotes" - the curly apostrophes and quotations that are angled differently depending on whether they are opening or closing characters. There are ways to disable this in applications like Word, but this is just one example of things to watch out for! 

Cheers, 

Dan Gillean, MAS, MLIS
AtoM Program Manager
Artefactual Systems, Inc.
604-527-2056
@accesstomemory

--
You received this message because you are subscribed to the Google Groups "AtoM Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ica-atom-users+unsubscribe@googlegroups.com.
To post to this group, send email to ica-atom-users@googlegroups.com.
Visit this group at https://groups.google.com/group/ica-atom-users.
To view this discussion on the web visit https://groups.google.com/d/msgid/ica-atom-users/aaa0cc81-04e5-4052-9934-63f7c70f9d45%40googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Nathanael

unread,
Nov 3, 2017, 6:15:11 AM11/3/17
to AtoM Users
I've found the text after apostrophe's goes missing when importing from CSV, so now I always check and replace any 'curly' apostrophes with straight ones. The issue doesn't seem to arise when typing straight into Excel, only when copying text in from Word or other places.

T Jones

unread,
Nov 7, 2017, 1:59:51 AM11/7/17
to ica-ato...@googlegroups.com
Hi, 

Thank you to everyone for your input and advice! We've learnt a lot and the problem seems to have been solved with all the workarounds.

T :)

On Fri, Nov 3, 2017 at 9:15 PM, Nathanael <nathana...@millsarchive.org> wrote:
I've found the text after apostrophe's goes missing when importing from CSV, so now I always check and replace any 'curly' apostrophes with straight ones. The issue doesn't seem to arise when typing straight into Excel, only when copying text in from Word or other places.

--
You received this message because you are subscribed to the Google Groups "AtoM Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ica-atom-users+unsubscribe@googlegroups.com.
To post to this group, send email to ica-atom-users@googlegroups.com.
Visit this group at https://groups.google.com/group/ica-atom-users.
Reply all
Reply to author
Forward
0 new messages