IR+ full-text searching?

13 views
Skip to first unread message

Paul Hoffman

unread,
Jul 27, 2011, 11:47:17 AM7/27/11
to irp...@googlegroups.com
Has any thought been given to implementing full-text search of published
files' contents?

Thanks,

Paul.

--
Paul Hoffman <pa...@flo.org>
Systems Librarian
Fenway Libraries Online
c/o Wentworth Institute of Technology
550 Huntington Ave.
Boston, MA 02115
(617) 445-2914
(617) 442-2384 (FLO main number)

Nate Sarr

unread,
Jul 27, 2011, 11:57:22 AM7/27/11
to irp...@googlegroups.com
Hi Paul,
 
   IR+ does it's best to do full text indexing of file contents of the following:
 
    .pdf, ppt, pptx, .doc, .docx, .xls, .xlsx, .rtf
 
   There are some caveats.  There is a file size limit to prevent memory problems on very large files above 6mb but this can be configured.   
   If the pdf is an image or password protected, the text cannot be extracted. 
 
   IR+ does it's best to pull the text out and index it - if it can't this, it does not stop publishing or upload.
 
Hope this helps
-Nate

--
You received this message because you are subscribed to the Google Groups "irplus" group.
To post to this group, send email to irp...@googlegroups.com.
To unsubscribe from this group, send email to irplus+un...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/irplus?hl=en.


Reply all
Reply to author
Forward
0 new messages