Handling non-UTF characters in ingest form

10 views
Skip to first unread message

Kelsey

unread,
Nov 30, 2016, 10:54:13 AM11/30/16
to islandora
Hi everyone,
I have a use case that involves uploading a large number of mathematical papers that include lots of special, non xml friendly characters. We will have people uploading, generally copy-pasting from the papers into the ingest forms. In cases where there are special characters in the abstract, this causes issues as the form doesn't seem to sanitize anything but does block the ingest process until the character is removed. Right now my workaround is to have them paste the text into word/notepad++ first to check for the characters, then add to the form. Would be curious to see if anyone has a better process or workaround, or if perhaps I've got things configured wrong and this should not be an issue after all. Thanks for your thoughts!

Kelsey
Reply all
Reply to author
Forward
0 new messages