PDF extracted texts disappearing

19 views
Skip to first unread message

apbw

unread,
Aug 10, 2022, 9:55:03 PM8/10/22
to DSpace Community
My filter-media job runs nightly (cron job owned by the user dspace), reporting success, but the PDF extracted text files that it claims to have created cannot be found the next morning. 

When I re-run the command manually for some of these handles, it will create the extracted text files, and I can view the bitstreams in the item record immediately, and the contents show up in search results immediately. 

Any ideas about what could be happening to the ones it thinks it created overnight? 

Thanks!

~Amy



Mark H. Wood

unread,
Aug 11, 2022, 8:33:47 AM8/11/22
to dspace-c...@googlegroups.com
I would check the ownership and permission masks of the assetstore
directories. Though I would expect a permission problem to cause an
error message. Does the cron job leave any clues in the DSpace log?

--
Mark H. Wood
Lead Technology Analyst

University Library
Indiana University - Purdue University Indianapolis
755 W. Michigan Street
Indianapolis, IN 46202
317-274-0749
www.ulib.iupui.edu
signature.asc
Reply all
Reply to author
Forward
0 new messages