question about re-harvesting

25 views
Skip to first unread message

Jacek Chudzik

unread,
Oct 23, 2025, 9:19:36 AMOct 23
to Dataverse Users Community
Hi, 

we have a bunch of Dataverse installations which are harvested to one main instance.

Now, we are in need to enable access to this cumulated data. I thought, that /oai server will work well for that, but in guide there is information that: 
"Only the published datasets in your Dataverse installation can be made harvestable".
Also when creating query you "define a set of local datasets"

And yes, default "no name" OAI setSpec/Description will give me no results.

Does this mean that it is impossible to harvest data from Dataverse harvester by /oai?

Best regards,
Jacek

Leonid Andreev

unread,
Oct 28, 2025, 5:39:38 PM (11 days ago) Oct 28
to Dataverse Users Community
Hello, 
I just looked at the code and no, it won't allow you to create a set containing non-local datasets. (There's a hard-coded "IS_HARVESTED:false" that we insert into the Solr search query that's used to define an OAI set). 
But, thinking about it, I don't see a problem with making this possible. Please go ahead and open a GitHub issue, something like "Allow creating OAI sets with harvested datasets". 
I would keep the current behavior as the default (simply because I'm assuming this is what most Dataverse instances want; as I can't remember anyone else asking about or requesting this). But we should be able to add this as an optional feature.

All the best, 
-Leo

Jacek Chudzik

unread,
Oct 29, 2025, 2:47:47 AM (11 days ago) Oct 29
to Dataverse Users Community
Hi,

thank you very much for acknowledging that this is not possible at the moment and being open to change it.


Have a nice day,
Jacek
Reply all
Reply to author
Forward
0 new messages