PDF Text Extraction not automatically occurring on upload


akshay ts

Jul 9, 2025, 7:05:50 PM
to AtoM Users

Hello everyone,

I’m encountering an issue with the PDF text extraction in Access to Memory. When I upload a PDF, the system doesn’t automatically extract the text as I expected. As a result, when I search for a word that’s present in the PDF, the search doesn’t return the item containing the PDF with that word.

However, if I run the text extraction manually with the command "php symfony digitalobject:extract-text" and then re-index the search, the search does return the items containing the PDF with that word.
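
For reference, the manual workaround amounts to roughly the following sketch. The install path is an assumption (adjust ATOM_DIR to your own AtoM root), the function name is only illustrative, and I am assuming "re-index the search" means the standard search:populate task.

```shell
#!/bin/sh
# Sketch of the manual workaround: extract PDF text, then rebuild the search index.
# ATOM_DIR is an assumption; point it at your own AtoM installation root.
ATOM_DIR="${ATOM_DIR:-/usr/share/nginx/atom}"

# Hypothetical helper name, used here only for illustration.
reindex_pdf_text() {
    cd "$ATOM_DIR" || return 1
    # Extract text from the uploaded digital objects (PDFs).
    php symfony digitalobject:extract-text || return 1
    # Rebuild the search index so the extracted text becomes searchable.
    php symfony search:populate
}
```

Calling reindex_pdf_text after an upload makes the PDF content searchable, but it has to be run by hand each time, which is what I am hoping to avoid.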

Has anyone else experienced this? Is there a setting or configuration that I'm possibly missing to ensure the text extraction happens automatically upon upload? Or could there be a process that's not running properly in the background?

Can anyone please provide some insights or guidance on this?

Thanks in advance.

Best Regards,
Akshay Karthik
The Australian National University.