Dear Dataverse Community,
so far we tested Dataverse in Göttingen, and within weeks we would
like to launch it as a more visible and usable service (still as a
beta service). During the preparation period we had lots of testers
who uploaded and sometimes published unwanted materials just to help
us to test the system. Some of the users in this period has real data
published along academic papers. We would like to clear the database,
and delete the test data, but keep the important ones.
The documentation suggests that we should drop the whole database and
recreate a fresh one, but apparently with this step we would loose the
important materials as well. Once a data were published, we could only
do the deaccession action, which - I guess keeps the data.
The third way (and that's why I've tried to understand the database
schema) would be manipulate Dataverse with SQL commands, delete the
unwanted files from the storage, and then reindex what is left. I hope
that we are not the first who try to do something like that.
Do you happen to have any script for this task?
Best,
Péter
--
Péter Király
software developer
GWDG, Göttingen - Europeana - eXtensible Catalog - The Code4Lib Journal
http://linkedin.com/in/peterkiraly