Dear Support team,
We are running dspace 6.3 and have noticed that the filter-media function that generates the .TXT files has some corrupted characters. Please check the attached file for a sample of the generated text. I was looking for an encoding configuration that can be applied at the level of filter-media function, however, I applied the encoding at the level of java used to run filter-media command using -Dfile-encoding param as shown below. But this doesn't solve the problem.
[dspace@quspace ~]$ echo $JAVA_OPTS
-Xms2048m -Xmx2048m -Dfile.encoding=UTF-8
Please note that the attached file was generated from an Adobe PDF bitstream.
Any advise?