PDF extract/thumbnail problems with Mac OS X created PDFs

406 views
Skip to first unread message

ddbtest...@gmail.com

unread,
Feb 23, 2015, 10:14:51 AM2/23/15
to resour...@googlegroups.com

I can upload PDFs created by Adobe apps into RS without issues.  The thumbnail gets correctly created and the document is successfully text indexed.

However, from PDFs created by OS X (10.10 in this case, and most likely earlier versions), the generic PDF icon appears with no preview, and text indexing does not work.

From my error logs:

  **** Warning:  File has an invalid xref entry:  8.  Rebuilding xref table.

   **** This file had errors that were repaired or ignored.
   **** The file was produced by: 
   **** >>>> Mac OS X 10.10.2 Quartz PDFContext <<<<
   **** Please notify the author of the software that produced this
   **** file that it does not conform to Adobe's published PDF
   **** specification.

However, when I attempt the convert on the command line with this syntax, it works in my test directory (although the above error is still displayed, so perhaps this is just a warning):

convert -colorspace rgb -density 400 test2.pdf -resize 25% test2.pdf

I have this in my config:

$imagemagick_path = '/usr/bin';
$ghostscript_path = '/usr/bin';
$ffmpeg_path = '/usr/bin';
$exiftool_path = '/usr/bin';
$antiword_path = '/usr/bin';
$pdftotext_path = '/usr/bin';
$unoconv_path="/usr/bin"; 
$unoconv_extensions=array("doc","docx","odt","odp","html","rtf","txt","ppt","pptx","sxw","sdw","html","psw","rtf","sdw","pdb","bib","txt","ltx","sdd","sda","odg","sdc"); 
$alt_types=array("","pdf");

Ghostscript and ImageMagick claim to be up to date according to yum.
Server is running CentOS 6 and installation check passes successfully.


Does anyone have any ideas how I can troubleshoot this further, or obtain a more verbose conversion log?


Many thanks in advance.

Jeff Nova

unread,
Feb 23, 2015, 1:54:58 PM2/23/15
to resour...@googlegroups.com
Please send me a sample trouble file, and we'll have a look. 

Thanks!
Jeff

--
Jeff Nova
Chief Executive Officer
Colorhythm LLC

Main Office:  +1 415-399-9921
Mobile:  +1 510-710-9590

--
ResourceSpace: Open Source Digital Asset Management
http://www.resourcespace.org
---
You received this message because you are subscribed to the Google Groups "ResourceSpace" group.
To unsubscribe from this group and stop receiving emails from it, send an email to resourcespac...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

ddbtest...@gmail.com

unread,
Feb 24, 2015, 5:25:03 AM2/24/15
to resour...@googlegroups.com
Thanks Jeff, your help is much appreciated!

I've attached three PDFs - one which works for me (encoded by Indesign CC 2014), and another two that don't (one encoded by OS X by "printing" to a PDF from MS Word, and the other from our multi function device scanner/copier/printers).
I've also included a screen grab of how the unrecognised documents appear within my RS install.

Many thanks in advance for any tips or troubleshooting suggestions.

Some more from my apache error log since uploading these test files:

   **** Page has no /MediaBox attribute. Using the current page size.
GPL Ghostscript 8.70: Unrecoverable error, exit code 1
   **** Warning:  File has an invalid xref entry:  8.  Rebuilding xref table.
   **** Page has no /MediaBox attribute. Using the current page size.
GPL Ghostscript 8.70: Unrecoverable error, exit code 1
test2-osx.pdf
test1-adobe.pdf
test3-mfd.pdf
in-my-rs.png

Jeff Nova

unread,
Feb 24, 2015, 5:02:52 PM2/24/15
to ResourceSpace
They work here perfectly.  Can you upgrade your Ghostscript?  We are using 9.10 and you are at 8.70.  I suspect once you upgrade, these will simply work.

Best,
Jeff

ddbtest...@gmail.com

unread,
Feb 25, 2015, 8:25:57 AM2/25/15
to resour...@googlegroups.com

Thanks Jeff, very useful info.  And I appreciate you taking the time to test this.

Annoyingly Ghostscript 9 doesn't seem available via the standard yum repositories on CentOS.
I tried a manual installation (building and compiling myself), but RS doesn't seem to then recognise it as a valid install..   ( as per:  http://superuser.com/questions/752951/how-to-upgrade-ghostscript-to-9-x-on-centos-6-5 )

Oh well, at least I have a direction to pursue now!   I'll try and figure out how to get the server correctly working with GS 9.   Thanks!

Søren Grønning

unread,
Feb 25, 2015, 8:48:27 AM2/25/15
to resour...@googlegroups.com

Did you try editing the path to your newly compiled Ghostscript install in config.php, in case your new path is different from the yum installed version from the official install?

That might be what's missing!

ddbtest...@gmail.com

unread,
Feb 25, 2015, 9:16:14 AM2/25/15
to resour...@googlegroups.com
Yes, that was it!  All working now.
You guys rock, thanks!   :)
Reply all
Reply to author
Forward
0 new messages