Hi,
I'm having some trouble with searches inside of PDF files.
For some reason, the search only seems to work when the PDF is small
(less than 15MB or 100 pages).
I've uploaded some small files, and after running "dspace filter-media",
I get results for searches inside of files with these sizes:
1.4MB and 10 pages
11MB and 57 pages
I uploaded somewhat larger files:
17MB and 101 pages
58MB and 218 pages
15MB and 137 pages
For these documents, the search does not find anything at all inside the
documents, even if I search for something in the first page.
Additionally, I've downloaded a large file from another DSpace from a
contact of mine. This file has 625MB and 188 pages. In his DSpace
(running dspace 7) the search works. In mine it does not.
When I run "dspace filter-media" I don't get any errors, and
thumbnails/miniatures are being generated for all documents.
Does anyone know what could be happening?
Thank you,
Roberto Greiner
Notes:
All files are PDF with OCR. My platform:
Ubuntu 24.04.1
Dspace 8.0
Installed based in
https://wiki.lyrasis.org/display/DSDOC8x/Installing+DSpace
Using Apache as proxy
--
-----------------------------------------------------
Marcos Roberto Greiner
Os otimistas acham que estamos no melhor dos mundos
Os pessimistas tem medo de que isto seja verdade
James Branch Cabell
-----------------------------------------------------