Mini not searching text of PDFs

3 views
Skip to first unread message

jfray

unread,
Dec 15, 2009, 11:24:13 AM12/15/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
My situation:
- Google mini is running on our intranet
- We have many PDFs, which google has crawled in the past and indexed.
- Mini is not indexing/presenting results for newer PDFs.
- These newer PDFs were created using Acrobat 9.
- When I check properties for these PDF, they show "pdf producer" as
Distiller 9, and the "application" as Pscrip5.dll, and PDF version 1.5
- The good PDFs show "application" as MS Word, "producer" as Acrobat
PDF Writer 4.05, and PDF Version 1.2.

Thanks.

jfray

unread,
Dec 15, 2009, 11:55:53 AM12/15/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
My Original Post:
Update:
- I went into mini diagnostics.
- The "missing" PDFs are being successfully crawled as new documents
and indexed by mini.
- The problem is that they are not appearing in search results.
- I've created compare/contrast versions of PDFs -- old vs.new -- and,
indeed, mini presents the older PDFs, but does not present the newer
versions.

Thanks.

jfray

unread,
Dec 18, 2009, 2:18:45 PM12/18/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Does anyone have experience with this? Insight?

I have hundreds -- dare I say, thousands -- of policies on our
intranet in PDF format. They are regularly updated. Those created with
early versions of Acrobat display fine. Those created within the last
couple months with Acrobat 9 do not display, despite being crawled and
indexed.

Help is desperately needed. I can provide samples of good and bad
documents, if need be.

brianb

unread,
Dec 21, 2009, 9:50:44 PM12/21/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
It might be a good idea to contact support and see if they can help
(if you have not already). You may also want to make sure you are on
the latest Mini version of 5.0.4.G.24 + VM Patch#1 as I believe the
PDF indexer was updated then too.

Brian

jfray

unread,
Dec 23, 2009, 9:33:58 AM12/23/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
I would contact Google support, except they won't let me. I'm out of
my two year window. Instead, as part of their Cash for Google business
model, I'm required to purcahse a new mini in order to enjoy full
Google support. Same for downloading upgrades.

Instead, I'm forced to grope around in these forums.

A nice customer server model, eh? Makes me want to run out and by a
Droid..... not.

> > > Thanks.- Hide quoted text -
>
> - Show quoted text -

MJQ

unread,
Jan 6, 2010, 11:52:46 AM1/6/10
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
I have a similar issue and have submitted it through support...I will
let you know what they say.

JMarkham

unread,
Jan 7, 2010, 11:53:19 AM1/7/10
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Hi,

Sorry for interrupting the conversation, but I'm confused by a few
things.

> I would contact Google support, except they won't let me. I'm out of
> my two year window. Instead, as part of their Cash for Google business
> model, I'm required to purcahse a new mini in order to enjoy full
> Google support. Same for downloading upgrades.

I understand your chagrine, being in the position to use something
that isn't actively supported. It's a messy place to find yourself.
I know, I've been there a time or three. But...

Am I reading you right, and you actually think Google should endlessly
give you things for free after your agreed-upon contract term was up?
I challenge you to find a company that supports things for free
endlessly. If you (or your company) wanted more than two years
support, perhaps a longer term contract should have been purchased, or
the contract renewed when it ended.

Now, if your mini is over two years old, it probably can no longer
handle the latest software updates. That piece of hardware only cost
$3K for a reason (which included very powerful search software AND two
years support). Purchase hardware for yourself, and you're likely out
at least a minimum of $5K (you might get away with $3K for a decent
server, if you can build it yourself). Don't forget, now you will
need a separate service contract for your hardware, unless you built
it yourself, and can support it yourself, which still costs money in
the form of your pay and parts purchases.

Heaven forbid if you want Microsoft server software on your new
hardware. Last I checked a for-profit Microsoft server license was in
excess of $10K (It was $24K at one time in my past). So, there's $15K
if you go with a Microsoft setup and purchase your own hardware, and
they still won't support you directly (aside from Q/A forums like this
one, of course, and the occasional KB publication) unless you ALSO
take out a support contract or negotiate a corporate licensing bundle
(those run in the millions of dollars, just for the record). Oh, but
WAIT! We still don't have any Search software yet. I think this
point is belabored enough... :)

> Instead, I'm forced to grope around in these forums.
>
> A nice customer server model, eh? Makes me want to run out and by a
> Droid..... not.

I can only speak for myself, of course, but I find Google's model to
be very economical and well thought out (as if you couldn't tell
already :D ). Why else would you deprecate hardware after two years
or so (and charge a pittance for the update compared to alternatives,
I feel obligated to point out), unless you want your ongoing customers
to keep up to date on both hardware and software, thereby doing your
best to ensure that your customers (and their customers) enjoy the
best experience from your product? Did I mention they do this for a
pittance compared to other alternatives? By the way, Google's charges
are a pittance compared to other alternatives, except....

Oh wait, there IS no decent alternative to Google Search out there
yet, and certainly nothing that you can purchase and use yourself, and
definitely nothing that also comes with hardware and support for that
hardware all in one package... Hmmm....

Ending thought is, it's infinitely more enjoyable to read suggested
changes to Google's model (and the rationale for those changes),
rather than complaints. That is all. :)

Thanks for listening (or not),

Jeff

P.S. No, I am not a "Fan Boi." I simply appreciate good business
when I see it.

Reply all
Reply to author
Forward
0 new messages