I know that the Tesseract v4 CLI can produce a PDF with a text layer as output. Is this functionality also available through its API?
If so, does Tess4J expose that API to Java as well? (I know Tess4J is a separate product, but maybe someone here is familiar with both. Otherwise I will ask on the Tess4J forum whether exposing such an API is planned.)