Dear All,
So far we've been uploading jpg images into our DSpace system and had
no problems with getting thumbnails for them later.
Unfortunately, recently after uploading a dozen of items with tiff
images (their size is between 4 and 15 Mb) couldn't get thumbnails for
them. Filter-media script returns error message. Here is the portion of
the log file, with some critical messages:
ERROR filtering, skipping bitstream #7542
java.io.FileNotFoundException: no such entry: "0Table"
java.io.FileNotFoundException: no such entry: "0Table"
at
org.apache.poi.poifs.filesystem.DirectoryNode.getEntry(DirectoryNode.java
:283)
at
org.textmining.text.extraction.WordExtractor.extractText(WordExtractor.java:60)
at
org.dspace.app.mediafilter.WordFilter.getDestinationStream(WordFilter.java:97)
at
org.dspace.app.mediafilter.MediaFilter.processBitstream
(MediaFilter.java:155)
at
org.dspace.app.mediafilter.MediaFilterManager.filterBitstream(MediaFilterManager.java:327)
at
org.dspace.app.mediafilter.MediaFilterManager.filterItem(MediaFilterManager.java:296)
at
org.dspace.app.mediafilter.MediaFilterManager.applyFiltersItem(MediaFilterManager.java:266)
at
org.dspace.app.mediafilter.MediaFilterManager.applyFiltersAllItems(MediaFilterManager.java:234)
at
org.dspace.app.mediafilter.MediaFilterManager.main(MediaFilterManager.java:185)
java.lang.Throwable: Warning: You did not close the PDF Document
at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:384)
at gnu.gcj.runtime.FinalizerThread.run(libgcj.so.70)
java.lang.Throwable: Warning: You did not close the PDF Document
at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:384)
at gnu.gcj.runtime.FinalizerThread.run
(libgcj.so.70)
java.lang.Throwable: Warning: You did not close the PDF Document
at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:384)
at gnu.gcj.runtime.FinalizerThread.run(libgcj.so.70)
java.lang.Throwable
: Warning: You did not close the PDF Document
at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:384)
at gnu.gcj.runtime.FinalizerThread.run(libgcj.so.70)
java.lang.Throwable: Warning: You did not close the PDF Document
at org.pdfbox.cos.COSDocument.finalize(COSDocument.java:384)
at gnu.gcj.runtime.FinalizerThread.run(libgcj.so.70)
FILTERED: bitstream 7682 and created
'articles_bridging_20000615.pdf.txt'
FILTERED: bitstream 7683 and created
'articles_sustainable_developement_20000815.pdf.txt'
GC Warning: Repeated allocation of very large block (appr. size
20230144):
May lead to memory leak and poor performance.
FILTERED: bitstream 7684 and created
'articles_venture_20001215.pdf.txt'
FILTERED: bitstream 7685 and created
'articles_rethinking_20010215.pdf.txt'
FILTERED: bitstream 7686 and created
'articles_relationship_20010515.pdf.txt'
FILTERED: bitstream 7687 and created
'articles_org_capacity_20021115.pdf.txt'
GC Warning: Out of Memory! Returning NIL!
Exception in thread "main" java.lang.OutOfMemoryError
<<No stacktrace available>>
Is there any limit of the file size filtering?
Any help is highly appreciated.
Best regards,
Branko Kovacevic
Records Coordinator
Open Society Archives
Arany Janos u. 32
1051
Budapest, Hungary
phone: (36-1) 327-3266 or 327-2029
e-mail:
kov...@ceu.hu website:
www.osa.ceu.hu++++++++++++++++++++++++++++