Hi Tim,
Thank you for the details you provided. We have made progress on this issue.
I configured the lines "textextractor.max-chars = -1" and "textextractor.use-temp-file = true", then restarted Tomcat.
On a machine with 8GB of RAM and a maximum heap size of 6G, I ran the "filter-media -f" command. It ran for a while, then failed with the output below:
----------------------------------------------------------------------------------
File: SundararajA.pdf.jpg
FILTERED: bitstream 84c9128e-34a7-42e5-a83d-64be008bb082 (item: 10292/14803) and created 'SundararajA.pdf.jpg'
File: ATEM Poster - Serena OP.pdf.txt
FILTERED: bitstream 76432201-ee2b-481c-8c18-2889c935b2df (item: 10292/4602) and created 'ATEM Poster - Serena OP.pdf.txt'
File: ATEM Poster - Serena OP.pdf.jpg
#
# There is insufficient memory for the Java Runtime Environment to continue.
# Native memory allocation (malloc) failed to allocate 837021948 bytes for AllocateHeap
# An error report file with more information is saved as:
-----------------------------------------------------------------------------------
I have attached the error report "hs_err_pid4031.log".
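For scale, here is a rough sketch of the arithmetic behind the figures above. It assumes the 8GB and 6G values are binary units (GiB), and the variable names are only illustrative. It shows how the failed native allocation compares with the RAM nominally left outside the Java heap, which the OS and other processes also share; it does not by itself identify the cause of the failure.

```python
# Figures quoted in this message. Treating "8GB" and "6G" as GiB is an assumption.
FAILED_ALLOC_BYTES = 837021948   # from the JVM "insufficient memory" output
TOTAL_RAM_GIB = 8                # physical RAM on the machine
MAX_HEAP_GIB = 6                 # configured maximum Java heap size

MIB = 1024 ** 2
GIB = 1024 ** 3


def to_mib(n_bytes: int) -> float:
    """Convert a byte count to mebibytes."""
    return n_bytes / MIB


# Size of the native allocation that malloc could not satisfy.
failed_mib = to_mib(FAILED_ALLOC_BYTES)

# RAM nominally left for everything outside the Java heap
# (JVM native memory, the OS, and any other processes).
outside_heap_mib = to_mib((TOTAL_RAM_GIB - MAX_HEAP_GIB) * GIB)

print(f"failed native allocation: {failed_mib:.1f} MiB")
print(f"RAM outside the Java heap: {outside_heap_mib:.0f} MiB")
print(f"share of that remainder:   {failed_mib / outside_heap_mib:.0%}")
```

So the single allocation that failed is a sizeable fraction of whatever memory remains once the heap is fully committed.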
Disk usage on the machine is as follows:
--------------------------------------------------------
Filesystem                      Type  Size  Used  Avail  Use%  Mounted on
/dev/mapper/vg_root-lv_root     xfs   20G   6.9G  14G    35%   /
/dev/sda1                       xfs   507M  221M  287M   44%   /boot
/dev/mapper/vg_root-lv_var      xfs   8.0G  3.0G  5.1G   38%   /var
/dev/sdc                        xfs   1.0T  454G  570G   45%   /DISK2
/dev/mapper/vg_root-lv_tmp      xfs   16G   1.1G  15G    7%    /tmp
/dev/mapper/vg_root-lv_var_log  xfs   4.0G  597M  3.5G   15%   /var/log
---------------------------------------------------------
Any ideas or suggestions would be much appreciated.
Regards,
Bryan