Hi,
I'm focus on generate a searchable pdf file in Right to Left language (e.g. Hebrew and Arabic)
I'm working with python on ubuntu and windows.
while I'm using tesseract or pytesseract I'm getting the results that are in the wrong orientation. (Left to right instead RTL)
should i add any language type or something else ? there is a another way to extract text in Alto xml or hocr and after that combine with the jpg file and create a searchable pdf file?
looking forward your advice,
thanks in advance,
Elishai