After Upgrade from DSpace 7.6.0 to 9.1, Search Results Return Fewer Documents (Full-Text Indexing Issue)

60 views
Skip to first unread message

Erivelto Henrique

unread,
Aug 14, 2025, 10:13:48 AMAug 14
to DSpace Technical Support

Hello DSpace Community,

I recently upgraded our DSpace instance from version 7.6.0 to 9.1. Before the upgrade, searching for a specific term would return 105 documents. After the upgrade, the same search only returns 79 documents. Our repository contains a total of 41,356 documents, and it is essential that 100% of the document content is indexed and searchable.

To ensure full-text extraction and indexing, I have set the following parameters in our local.cfg:

textextractor.max-chars = -1 
textextractor.use-temp-file = true

After the upgrade, I performed a full re-extraction and reindexing using:

./dspace filter-media -f 
./dspace index-discovery -b

No errors appear in the Solr logs during indexing, and the process completes successfully. However, the number of search results is still significantly lower than before the upgrade.

Has anyone experienced similar issues with full-text indexing or search result discrepancies after upgrading to DSpace 9.x? Are there any additional steps or configuration changes required to ensure all document content is indexed and searchable in the new version?

Any advice or suggestions would be greatly appreciated!

Thank you,

Erivelto Alves

DSpace Technical Support

unread,
Aug 27, 2025, 12:00:18 PM (11 days ago) Aug 27
to DSpace Technical Support
Hi Erivelto,

I'm unaware of any significant changes to the indexing processes that have occurred between 7.6.x and 9.x.  The overall behavior should be the same.  My immediately "guess" is that something is throwing an error **during the indexing process**, or the indexing process is somehow failing to complete.  I'd recommend looking closely at your dspace.log file (on the backend) when the "./dspace index-discovery -b" command is run.  If errors are being logged during that process, it can impact what is able to be indexed.

Tim

Reply all
Reply to author
Forward
0 new messages