"Caused by: com.fasterxml.jackson.core.JsonParseException: Unexpected character (',' (code 44)): expected a value",
" at [Source: (byte[])\"{\"id\":\"W2143785713\",\"doi\":\"10.1093/pan/2.1.97\",\"doi_registration_agency\":\"Crossref\",\"display_name\":\"Traits versus Issues: Factor versus Ideal-Point Analysis of Candidate Thermometer Ratings\",\"title\":\"Traits versus Issues: Factor versus Ideal-Point Analysis of Candidate Thermometer Ratings\",\"publication_year\":1990,\"publication_date\":\"1990-01-01\",\"language\":\"en\",\"language_id\":\"
https://openalex.org/languages/en\",\"ids\":{\"openalex\":\"
https://openalex.org/W2143785713\",\"doi\":\"
https://doi.org/10.1093/pan\"[truncated 13991 bytes]; line: 1, column: 880]",
"at com.fasterxml.jackson.core.JsonParser._constructError(JsonParser.java:1840) ~[jackson-core-2.10.4.jar:2.10.4]",
"at com.fasterxml.jackson.core.base.ParserMinimalBase._reportError(ParserMinimalBase.java:712) ~[jackson-core-2.10.4.jar:2.10.4]",
"at com.fasterxml.jackson.core.base.ParserMinimalBase._reportUnexpectedChar(ParserMinimalBase.java:637) ~[jackson-core-2.10.4.jar:2.10.4]",
"at com.fasterxml.jackson.core.json.UTF8StreamJsonParser._handleUnexpectedValue(UTF8StreamJsonParser.java:2622) ~[jackson-core-2.10.4.jar:2.10.4]",
"at com.fasterxml.jackson.core.json.UTF8StreamJsonParser._nextTokenNotInObject(UTF8StreamJsonParser.java:857) ~[jackson-core-2.10.4.jar:2.10.4]",
"at com.fasterxml.jackson.core.json.UTF8StreamJsonParser.nextToken(UTF8StreamJsonParser.java:754) ~[jackson-core-2.10.4.jar:2.10.4]",
"at com.fasterxml.jackson.core.base.ParserMinimalBase.skipChildren(ParserMinimalBase.java:237) ~[jackson-core-2.10.4.jar:2.10.4]",
It is not of much help because the JSON is truncated and does not show the problem.
I'm getting the S3 objects by the AWS Java SDK version 2.30.16. It is not of great help that OpenAlex JSON data is published without checksums.
But I can not tell if the error is triggered by AWS SDK or my program.
I will try to to verify the JSON syntax directly after pulled by AWS SDK.and to make sure there is no Elasticsearch quirk (I never encountered one).
In the end, I have to skip the whole chunk with the error, in the hope there are not many more.
Thanks for your interest.