Hello everyone,
At UMD, we are looking at the Fedora 4 to Fedora 5 migration using the fcrepo-import-export tool.
Here are some of my finding after initial investigation:
1. Import into fcrepo5 mostly fails with 409 and 412 errors
After exporting a small dataset from fcrepo4, I removed all acl:accessControl, ldp:Resource, ldp:Container, ldp:contains triples from the exported ttl files, but the import failed with 409/412 errors. After enabling DEBUG logs on the repository, there were stacktraces with "predicate may not take an object in the server managed namespaces (
http://www.w3.org/ns/ldp#)" even though none of the ttl files had "ldp:Container". See attached file for full stacktace.
2. Binary files are imported but files are 0 bytes.
3. Import/export operations should support retry and resume options.
When I tried exporting a larger container (rdfType: n-triples), the export failed due to a spurious error after about 4 hours. A retry option with a backoff timer could help prevent spurious errors from leading to failure. Also, in our dev instance, the export rate seemed to be about 25K items/hour. At that rate, it would take about a week to finish the export of our current production data. So, we would need to have a way to track progress to be able to resume the operation in case of failure.
More information:
- I tried export / import using both 0.3.0 and 0.4.0-SNAPSHOT versions.
- I enabled "relaxed" mode on the repository before attempting imports.
- I tried fcrepo4 export containing a just couple of simple no-relationship items was successfully imported into fcrepo5.
- The fcrepo4 sample dataset export that was failing to import to fcrepo5 did successfully import to a fcrepo4 instance, but it took 2 passes (dependent items were imported in the secondd pass).
Has anyone encountered similar errors during imports into fcrepo5? Also, If anyone else has experimented with either import-export or other options to migrate from 4-to-5, please share your thoughts.
Thanks
Mohamed Abdul RasheedSoftware Systems Development & Research
B0112 McKeldin Library
University of Maryland, College Park
College Park 20742