[Dspace-tech] unicode error with importer

1 view
Skip to first unread message

Jose Blanco

unread,
Aug 24, 2015, 12:36:29 PM8/24/15
to dspac...@lists.sourceforge.net

I am trying to import some records into our 1.2 installation and I’m getting the following error:

 

org.xml.sax.SAXParseException: An invalid XML character (Unicode: 0x1a8bab) was found in the element content of the document.

        at org.apache.xerces.framework.XMLParser.reportError(XMLParser.java:1060)

        at org.apache.xerces.framework.XMLDocumentScanner.reportFatalXMLError(XMLDocumentScanner.java:644)

        at org.apache.xerces.framework.XMLDocumentScanner$ContentDispatcher.dispatch(XMLDocumentScanner.java:1356)

        at org.apache.xerces.framework.XMLDocumentScanner.parseSome(XMLDocumentScanner.java:381)

        at org.apache.xerces.framework.XMLParser.parse(XMLParser.java:952)

        at org.apache.xerces.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:123)

        at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:151)

        at org.dspace.app.itemimport.ItemImport.loadXML(ItemImport.java:838)

        at org.dspace.app.itemimport.ItemImport.loadDublinCore(ItemImport.java:559)

        at org.dspace.app.itemimport.ItemImport.addItem(ItemImport.java:452)

        at org.dspace.app.itemimport.ItemImport.addItems(ItemImport.java:334)

        at org.dspace.app.itemimport.ItemImport.main(ItemImport.java:282)

org.xml.sax.SAXParseException: An invalid XML character (Unicode: 0x1a8bab) was found in the element content of the document.

 

I think the cause of this is an o with an umlaut over it.  I searched the mailing list archive and found an email where it says to put encoding=”iso-8859-1” in the header and I did this, but I’m still getting the error.  Here is my dublin_core.xml file:

 

<dublin_core encoding="iso-8859-1">

<dcvalue element="identifier" qualifier="none">http://www.hti.umich.edu/cgi/t/te

xt/text-idx?c=busadwp;idno=B2036022.0001.001</dcvalue>

<dcvalue element="publisher" qualifier="none">University of Michigan. Business S

chool.</dcvalue>

<dcvalue element="format" qualifier="none">sgml</dcvalue>

<dcvalue element="rights" qualifier="none">These pages may be freely searched an

d displayed.  Permission must be received for subsequent distribution in print o

r electronically.  Please go to http://www.umdl.umich.edu/ for more information.

</dcvalue>

<dcvalue element="title" qualifier="none">International portfolio investment : t

heory, evidence, and institutional framework / Söhnke M. Bartram, Gunter Dufey.<

/dcvalue>

<dcvalue element="type" qualifier="none">text</dcvalue>

<dcvalue element="date" qualifier="none">2001.</dcvalue>

<dcvalue element="description" qualifier="none">University of Michigan. Business

 School. Faculty Research.</dcvalue>

<dcvalue element="description" qualifier="none">"February 2001. Revised."</dcval

ue>

<dcvalue element="description" qualifier="none">Includes bibliographical referen

ces (p. 75-104)</dcvalue>

<dcvalue element="description" qualifier="none">Also available online.</dcvalue>

<dcvalue element="language" qualifier="none">ENG</dcvalue>

<dcvalue element="creator" qualifier="none">Bartram, Söhnke M.</dcvalue>

<dcvalue element="creator" qualifier="none">Dufey, Gunter.</dcvalue>

<dcvalue element="description" qualifier="none">University of Michigan Business

Administration Working Papers; Working paper (University of Michigan. Business S

chool. Faculty Research) ;  -- no. 01-006.; </dcvalue>

</dublin_core>

 

Any help will be greatly appreciated.

 

Thanks!

Jose

Reply all
Reply to author
Forward
0 new messages