Hi everyone. I'm making a program on that uses tesseract to get a word from a manga with a snipping-tool like program, and translates that word with JMdict.
The thing is tesseract gives weird values for vertical, small selections. I'm going to explain it in more detail:
Say I get a full horizontal line in Japanese, like the following one:

The output "元来日本語は漢文に倣い、文字を上" is perfect
Getting a full vertical line gives no problems either:

Gives the same correct output. Now if I want to get only words, when examining horizontal text there are no problems, while with the vertical text the output is almost always (except when examining a Kanji alone) wrong, like this:


The first one returns 日本語 while the second one returns 髑升田.
They are both from the same file, same size, same font, yet the results vary greatly-
Another example, this time from a manga:

The output is 今日の勝敗よりも, again, correct.
But going word by word we start to have errors:

Output 由」〉
and

Output 健雛
Why is it that it can examine the full line without problem, but have so much trouble getting vertical words? I am using psm 8 for words, but it only seems to work with horizontal ones, and I can't get my head around it. I've been trying to find a solution to this all day, but without success. I'm not an expert programmer by any means, this is more of a college project, but any insight would be really, really appreciated. Thank you for reading.