Hello,
We have a problem with one dataset. When I trying to publish the dataset I got the following error:
Failed to Publish Dataset – The dataset could not be published because one or more of the datafiles in the dataset could not be validated (physical file missing, checksum mismatch, etc.) Please contact support for further assistance.
I cannot delete the Draft, or publish it. If the admin deletes the draft, and attempts to update the files, this occurs again. I'm not sure what to do.
The storage service used is AWS S3. Is there any solution for this problem?
Dataverse version: 5.12.1
S3 cmd version: s3cmd-2.3.0-1.el7.noarch
Log error during the publish process:
[2022-11-16T22:17:11.830-0300] [Payara 5.2021.6] [INFO] [] [edu.harvard.iq.dataverse.util.FileUtil] [tid: _ThreadID=308 _ThreadName=__ejb-thread-pool1] [timeMillis: 1668647831830] [levelValue: 800] [[
Failed to open datafile id 4247 for reading]]
[2022-11-16T22:17:11.873-0300] [Payara 5.2021.6] [WARNING] [] [edu.harvard.iq.dataverse.DatasetServiceBean] [tid: _ThreadID=308 _ThreadName=__ejb-thread-pool1] [timeMillis: 1668647831873] [levelValue: 900] [[
CommandException caught when executing the asynchronous portion of the Dataset Publication Command.]]
[2022-11-16T22:17:33.674-0300] [Payara 5.2021.6] [WARNING] [] [edu.harvard.iq.dataverse.dataaccess.ImageThumbConverter] [tid: _ThreadID=88 _ThreadName=http-thread-pool::http-listener-1(6)] [timeMillis: 1668647853674] [levelValue: 900] [[
could not read image with ImageIO.read()]]
[2022-11-16T22:17:33.675-0300] [Payara 5.2021.6] [WARNING] [] [com.amazonaws.services.s3.internal.S3AbortableInputStream] [tid: _ThreadID=88 _ThreadName=http-thread-pool::http-listener-1(6)] [timeMillis: 1668647853675] [levelValue: 900] [[
Not all bytes were read from the S3ObjectInputStream, aborting HTTP connection. This is likely an error and may result in sub-optimal behavior. Request only the bytes you need via a ranged GET or drain the input stream after use.]]
[2022-11-16T22:17:34.859-0300] [Payara 5.2021.6] [WARNING] [] [edu.harvard.iq.dataverse.dataaccess.ImageThumbConverter] [tid: _ThreadID=88 _ThreadName=http-thread-pool::http-listener-1(6)] [timeMillis: 1668647854859] [levelValue: 900] [[
could not read image with ImageIO.read()]]
[2022-11-16T22:17:34.859-0300] [Payara 5.2021.6] [WARNING] [] [com.amazonaws.services.s3.internal.S3AbortableInputStream] [tid: _ThreadID=88 _ThreadName=http-thread-pool::http-listener-1(6)] [timeMillis: 1668647854859] [levelValue: 900] [[
Not all bytes were read from the S3ObjectInputStream, aborting HTTP connection. This is likely an error and may result in sub-optimal behavior. Request only the bytes you need via a ranged GET or drain the input stream after use.]]
I appreciate any help or suggestion.
Best Regards,