java.io.IOException: Invalid UTF-16 surrogate detected: d835


Mengu Yazicioglu

Dec 6, 2022, 10:00:02 AM
to DSpace Community

Hi all,

We're facing an invalid character error, along with problems in OAI.

When we request something like http:// ... /oai/request?verb=ListRecords&resumptionToken=oai_dc////17322, we get:

HTTP Status 500 – Internal Server Error

Type Exception Report

Message com.lyncode.xoai.dataprovider.exceptions.WritingXmlException: Error trying to output ''

Description The server encountered an unexpected condition that prevented it from fulfilling the request.

Exception

java.io.IOException: com.lyncode.xoai.dataprovider.exceptions.WritingXmlException: Error trying to output ''

This comes from the DSpace OAI-PMH Data Provider.

The log files contain: java.io.IOException: Invalid UTF-16 surrogate detected: d835

Strangely, when we correct the offending record, the following 100 records then give the error.

In addition, OAI records have not been created for a while, which coincides with these errors.
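
For reference, d835 is a UTF-16 high surrogate, which is only valid when a low surrogate (dc00 to dfff) follows it immediately. The sketch below is one way to test a metadata value for an unpaired surrogate. The class and method names are made up for illustration and are not part of DSpace or XOAI, and it assumes the suspect value has already been read into a Java String.

```java
public final class SurrogateCheck {
    private SurrogateCheck() {}

    /** Returns the index of the first unpaired surrogate in s, or -1 if there is none. */
    public static int firstLoneSurrogate(String s) {
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (Character.isHighSurrogate(c)) {
                if (i + 1 < s.length() && Character.isLowSurrogate(s.charAt(i + 1))) {
                    i++; // valid pair: skip the low half
                } else {
                    return i; // high surrogate with nothing valid after it
                }
            } else if (Character.isLowSurrogate(c)) {
                return i; // low surrogate without a preceding high surrogate
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        String ok = "x = \uD835\uDC00"; // complete pair (U+1D400)
        String broken = "x = \uD835";   // high surrogate cut off from its low half
        System.out.println(firstLoneSurrogate(ok));
        System.out.println(firstLoneSurrogate(broken));
    }
}
```

A value that returns anything other than -1 is a candidate for the record that breaks the XML output.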


Does anyone have an idea about this issue?

We're on DSpace 6.4 on Debian Linux.

Thanks in advance

Mengü


Mengu Yazicioglu

Dec 16, 2022, 1:47:21 AM
to DSpace Community
Hi all,

This situation is getting worse.
We deleted the 100 items that seemed wrong in OAI, and the error then moved to the next 100 items, which had been retrieved correctly before.
Whenever we change or delete an item within the 100-item boundary, the error shifts to the following 100 items.

I've deleted and re-imported all OAI data, but nothing changed.

Does anyone have any idea?

Thanks and best regards


Tim Donohue

Dec 16, 2022, 11:14:11 AM
to Mengu Yazicioglu, DSpace Community
Hi,

Have you tried clearing the OAI cache as well?  Try running "./dspace oai clean-cache", see docs at: https://wiki.lyrasis.org/display/DSDOC6x/OAI+2.0+Server#OAI2.0Server-IndexingOAIcontent

This error is really odd to me, as it almost sounds like DSpace's OAI indexing is hitting invalid data...  You might want to look closely at the DSpace logs when you re-import everything into OAI.  I wonder if there's some issue with your data and the OAI import is failing?

In searching Google for this "Invalid UTF-16 surrogate" error, I also noticed that there was a similar bug in older versions of the JDK (https://bugs.java.com/bugdatabase/view_bug.do?bug_id=JDK-8207760). Maybe check what version of JDK you are running and make sure you are on the latest version of JDK 8?
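
If it helps, here is a minimal sketch for reporting the Java runtime that is actually executing the code. The class name is made up for illustration. Keep in mind that the JVM running your servlet container may differ from the one on your shell's PATH, so the useful answer is the one from the process that serves OAI.

```java
public final class RuntimeInfo {
    private RuntimeInfo() {}

    /** Builds a one-line description of the running JVM from standard system properties. */
    public static String describe() {
        return "java.version=" + System.getProperty("java.version")
                + ", java.vendor=" + System.getProperty("java.vendor")
                + ", java.home=" + System.getProperty("java.home");
    }

    public static void main(String[] args) {
        System.out.println(describe());
    }
}
```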

Those are my best guesses right now based on limited information.  I suspect there may be more errors in your logs though, so I'd recommend looking more closely at your logs to see if any more clues can be found there.  See our troubleshooting guide for help: https://wiki.lyrasis.org/display/DSPACE/Troubleshoot+an+error

If you find more detailed errors, let us know on this list.

Tim
