Hello,
AFAIK, French includes all English characters, you can just set the OCR
to French.
I'm not sure there is anything else you can do. Based on my experience
(I'm French myself), Tesseract always had some issues with accents.
Maybe a solution would be to improve Tesseract training, but it is quite
difficult and it is out of the scope of Paperwork.
Anyway, when Paperwork indexes a document, Paperwork strips all the
accents. When you search something, it also strips all the accents from
your query. So for searching, accents don't matter.
Regarding languages in Flatpak, when installing an application, Flatpak
looks at the system locale and install the corresponding files but
nothing more. I don't know of any method to add additional locales.
If you need support for more languages, I would suggest installing
Paperwork "the manual way":
- Install Libinsane (
https://doc.openpaper.work/libinsane/latest/libinsane/install.html )
- Install Paperwork (
https://gitlab.gnome.org/World/OpenPaperwork/paperwork/blob/master/doc/install.debian.markdown#build-dependencies
)
- You can then install all the packages tesseract-ocr-XXXX you need on
your system
The drawback is that it's much harder to uninstall cleanly.
Best regards,
> --
> You received this message because you are subscribed to the Google
> Groups "Paperwork" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to
paperwork-gu...@googlegroups.com.
> To view this discussion on the web visit
>
https://groups.google.com/d/msgid/paperwork-gui/7546a828-6d43-4ebc-9187-1f06020b255f%40googlegroups.com
> [1].
>
>
> Links:
> ------
> [1]
>
https://groups.google.com/d/msgid/paperwork-gui/7546a828-6d43-4ebc-9187-1f06020b255f%40googlegroups.com?utm_medium=email&utm_source=footer