search in pdf files


Christian Bischof

Aug 20, 2025, 5:59:34 AM
to Dataverse Users Community
Hello,

as far as I understand, the search only covers the metadata (from datasets and ingested files) and not the contents of uploaded PDF files. Is there a way to make PDFs searchable, for example an option in solrconfig.xml? Or is the only way to use an external search service? https://guides.dataverse.org/en/6.7/developers/search-services.html

Best
Christian

James Myers

Aug 20, 2025, 6:59:04 AM
to dataverse...@googlegroups.com

The Dataverse software has a setting to enable full-text indexing (of published, non-restricted, non-embargoed files) in Solr; see https://guides.dataverse.org/en/6.7/installation/config.html#solrfulltextindexing. There is also a setting to limit the size of files that are indexed. As full-text indexing is resource intensive, larger installations may not want to turn this on.
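
For illustration, here is a minimal Python sketch of how these two database settings might be applied through the admin settings API. Several things here are assumptions rather than something stated in this thread: the base URL `http://localhost:8080`, the setting name `:SolrMaxFileSizeForFullTextIndexing` for the size limit, the example limit of 10 MB, and the helper names `build_setting_request` and `apply_settings`. Please check the names and value formats against the Installation Guide for your version before relying on this.

```python
import urllib.request

# Assumed base URL; the admin API is usually only reachable from localhost.
BASE_URL = "http://localhost:8080"


def build_setting_request(base_url, name, value):
    """Build (but do not send) a PUT request for one database setting."""
    url = f"{base_url.rstrip('/')}/api/admin/settings/{name}"
    return urllib.request.Request(url, data=value.encode("utf-8"), method="PUT")


def apply_settings(base_url=BASE_URL):
    """Hypothetical helper: send both settings to a running installation."""
    settings = {
        # Setting named in the linked section of the Installation Guide.
        ":SolrFullTextIndexing": "true",
        # Assumed name and example value (10 MB in bytes); verify in the guide.
        ":SolrMaxFileSizeForFullTextIndexing": str(10 * 1024 * 1024),
    }
    for name, value in settings.items():
        req = build_setting_request(base_url, name, value)
        with urllib.request.urlopen(req) as resp:
            print(name, resp.status)
```

Calling `apply_settings()` on the server itself would send both requests. Files that are already published will probably only become searchable after they are reindexed.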

 

-- Jim


Christian Bischof

Aug 22, 2025, 7:21:00 AM
to Dataverse Users Community
Thanks, that is quite an easy solution. Carefully reading the documentation is usually helpful...
Best