For the past 2 months I've been having problems again with searching PDF content in attachments in Outlook.
These problems have come and gone over the past year. I managed to fix them each time but, alas, search is broken again.
In Control Panel --> Software you can clearly see the Adobe iFilter (newest version) installed, but it is never recognized in the indexing menu. It just doesn't work. I've tried reinstalling it, reinstalling Adobe, trying a different W10 PC, etc.
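One way to check whether a filter is actually registered for .pdf, rather than merely installed, is to look at the PersistentHandler entry for the extension in the registry, which is where IFilter hosts such as Windows Search normally start their lookup. The following is only a rough sketch in Python, assuming Windows and a Python build of the same bitness as the indexer (a 32-bit Python may see a different registry view); the function names are made up for illustration.

```python
import sys


def persistent_handler_key(extension: str) -> str:
    """Build the HKEY_CLASSES_ROOT subkey that holds the persistent handler."""
    ext = extension if extension.startswith(".") else "." + extension
    return ext.lower() + r"\PersistentHandler"


def read_persistent_handler(extension: str = ".pdf"):
    """Return the persistent handler GUID for the extension, or None if absent."""
    import winreg  # Windows only, so imported lazily

    try:
        with winreg.OpenKey(winreg.HKEY_CLASSES_ROOT,
                            persistent_handler_key(extension)) as key:
            # An empty value name reads the key's default value.
            value, _value_type = winreg.QueryValueEx(key, "")
            return value
    except FileNotFoundError:
        return None


if __name__ == "__main__" and sys.platform == "win32":
    guid = read_persistent_handler(".pdf")
    if guid is None:
        print("No PersistentHandler registered for .pdf in this registry view")
    else:
        print("PersistentHandler for .pdf:", guid)
```

If this prints no handler even though the iFilter shows up under installed software, that would at least be consistent with the indexing menu not recognizing it.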
To JoeP: I have used Copernic Desktop Search for several years, including before I went to the full Acrobat and was just using Reader. I think that Copernic has its own PDF interpreter as I never installed a PDF ifilter.
If you just google ifilter you will find that different versions are available. For the current Acrobat 9 there are two: one for 64-bit, the other for 32-bit. I think you will find that the right filter is available for your version of Acrobat as well.
If the indexer/search engine for MS has similar problems, it could be related to the PDF version of the files on your system. Try looking at the PDF version of each file and see if the hits and misses fall into any clear groupings.
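To make that grouping less tedious, something like the following could list the files in a folder by the version in their `%PDF-x.y` header. This is only a sketch: the function names are invented here, it reads just the first 1024 bytes of each file, and a newer PDF can override the header version in its document catalog, so treat the result as a first approximation.

```python
import re
import sys
from collections import defaultdict
from pathlib import Path

# A PDF normally starts with a header such as "%PDF-1.4".
HEADER_RE = re.compile(rb"%PDF-(\d+\.\d+)")


def pdf_header_version(data: bytes) -> str:
    """Return the version from a '%PDF-x.y' header, or 'unknown'."""
    match = HEADER_RE.search(data[:1024])
    return match.group(1).decode("ascii") if match else "unknown"


def group_by_version(folder: str) -> dict:
    """Map each header version to the names of the PDF files that declare it."""
    groups = defaultdict(list)
    for path in sorted(Path(folder).rglob("*.pdf")):
        with path.open("rb") as handle:
            groups[pdf_header_version(handle.read(1024))].append(path.name)
    return dict(groups)


if __name__ == "__main__" and len(sys.argv) > 1:
    for version, names in sorted(group_by_version(sys.argv[1]).items()):
        print(f"PDF {version}: {len(names)} file(s)")
        for name in names:
            print("   ", name)
```

Comparing this list against the documents that search does and does not find should show quickly whether the misses cluster around one version.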
I apologize if I have missed it, but in all of those posts I am still not clear on whether you have Adobe Acrobat but not the latest, or Adobe Reader. We do know that you have Windows 7. Whether or not you have Acrobat, you can still get the latest Reader, which is version 9.
Adobe PDF IFilter 6.0. Once indexing was complete, Search was unable to find the same document. I then uninstalled the Adobe IFilter and reinstalled the Foxit IFilter and confirmed that once again I could search successfully after the index was rebuilt.
I've got my Solr indexes working correctly. I'm able to search text from pages (content items), and I'm able to locate PDFs by extension in my queries, but I'm not able to crawl the PDFs and return the text from within them.
I admit I'm very new to Solr/Sitecore, and this is probably easy to do and it's just escaping me. I would like to try Sitecore's built-in filters before branching out to Tika or PDFBox or something else.
So it would seem that it is configured correctly for crawling PDF files. I read on the Sitecore doc site that you also need Adobe PDF iFilter v9 (as v11 has issues), so I have installed v9, set the path in my environment variables, and rebooted.
The questions I have are:
1) How do I know the ifilter is doing anything / configured properly?
2) How do I activate it? (Rebuild indexes? Restart Solr / Sitecore? Which I've done, multiple times.)
3) What else do I need to do to be able to perform the above query?
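For question 1, one rough way to see whether any PDF text is reaching the index is to query Solr directly for a phrase that only appears inside a known PDF. The sketch below is an assumption-laden example, not a confirmed Sitecore recipe: the base URL, the core name `sitecore_master_index`, and the content field `_content` are guesses that need to be replaced with whatever the actual index uses.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_select_url(base_url: str, core: str, phrase: str,
                     content_field: str = "_content") -> str:
    """Build a Solr /select URL that searches one field for an exact phrase."""
    escaped = phrase.replace('"', r'\"')  # keep embedded quotes from ending the phrase
    params = {
        "q": f'{content_field}:"{escaped}"',  # field name is an assumption
        "rows": 5,
        "wt": "json",
    }
    return f"{base_url.rstrip('/')}/{core}/select?{urlencode(params)}"


def count_hits(url: str) -> int:
    """Run the query and return Solr's numFound for it."""
    with urlopen(url, timeout=10) as response:
        return json.load(response)["response"]["numFound"]


if __name__ == "__main__":
    # Hypothetical host and core name; adjust to the local setup.
    url = build_select_url("http://localhost:8983/solr",
                           "sitecore_master_index",
                           "a phrase that only occurs inside one PDF")
    print(url)
    print("hits:", count_hits(url))
```

Zero hits after a full index rebuild would suggest the PDF text is never extracted, which points back at the iFilter setup rather than at the query itself.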