Fulltext search

34 views
Skip to first unread message

bdo...@univ.haifa.ac.il

unread,
Jan 23, 2020, 9:17:50 AM1/23/20
to dspac...@googlegroups.com
Hello,

I followed the instruction in this post: http://dspace.2283337.n4.nabble.com/Full-Text-Search-issue-td4689285.html but still I can't perform a search on my fulltext pdf files, most of them are in Hebrew.

Please advise,
Regards,
Boaz


_____________________________________
Sent from http://dspace.2283337.n4.nabble.com

Tim Donohue

unread,
Jan 27, 2020, 5:02:34 PM1/27/20
to bdo...@univ.haifa.ac.il, dspac...@googlegroups.com
We unfortunately don't have enough information to help you. 

Have you checked your log files to see if any errors occurred during the indexing? What version of DSpace are you using?  Additionally, please be aware that PDFs must already have the text embedded, so image based PDFs are not possible to index in DSpace at this time.

You can also run the "filter-media" command in verbose mode (-v) to see exactly what text it is able to extract.  This can be useful to debug whether it is able to get any text from a specific PDF document.  Again, see the documentation at https://wiki.lyrasis.org/display/DSDOC6x/Mediafilters+for+Transforming+DSpace+Content#MediafiltersforTransformingDSpaceContent-Executing(viaCommandLine)

Tim

From: dspac...@googlegroups.com <dspac...@googlegroups.com> on behalf of bdo...@univ.haifa.ac.il <bdo...@univ.haifa.ac.il>
Sent: Thursday, January 23, 2020 3:53 AM
To: dspac...@googlegroups.com <dspac...@googlegroups.com>
Subject: [dspace-tech] Fulltext search
 
--
All messages to this mailing list should adhere to the DuraSpace Code of Conduct: https://duraspace.org/about/policies/code-of-conduct/
---
You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dspace-tech...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dspace-tech/733009456.157349.1579773219706.JavaMail.administrator%40n4.nabble.com.
Reply all
Reply to author
Forward
0 new messages