Hi Jan,
If the record is being indexed by Google already, then they should be aware of the PDF already, and there's not much DSpace can do to force Google to full text index the PDF. That said, it's worth noting there are two main types of PDFs, and only one of which
is easily indexed:
- PDFs created from digital files or OCRed images. These PDFs have embedded text and are more easily full text indexed.
- PDFs created from scanned files (without OCR). These are image-based PDFs with no embedded text, and they are often
not able to be full text indexed, unless the system which grabs the PDF is able to OCR it reliably in an automatic fashion.
So, if the PDFs you are talking about were created from scanned images, then make sure to OCR them so that they are easier to index.
If you have other questions let us know on this list.
Tim