Hi all.
I wrote fast editor for the boxfiles. I wrote it using QT library, so
it can be built on most popular platforms.
I use it instead of TessBoxer.
Please, look at
http://code.google.com/p/owlboxer/
> Hi all.
> I wrote fast editor for the boxfiles. I wrote it using QT library, so
> it can be built on most popular platforms.
> I use it instead of TessBoxer.
> Please, look athttp://code.google.com/p/owlboxer/
If you download the QT sdk, then it includes an editor which can build
the owlboxer code.
My issues with it (please correct me if I'm doing something wrong):
1. Loading the box file, didn't load the image. I loaded them image
first, then loaded the box file.
2. Entering unicode characters could only be done through copy&paste
which is slower than just typing in the hex values.
1. Yes. Loading the box file, didn't load the image. You can load any
available image. And you can load image first.
2. You can edit current box just press any letter key on your
keyboard. For example, now I edit box file for the russian symbols. I
have the russian keyboard layout on my windows XP, and I just typing
russian letter for the current box. It's faster than typing hex
values. So, copy&paste is not the best way.
On 7 май, 21:13, "Rob H." <hksny...@gmail.com> wrote:
> If you download the QT sdk, then it includes an editor which can build
> the owlboxer code.
> My issues with it (please correct me if I'm doing something wrong):
> 1. Loading the box file, didn't load the image. I loaded them image
> first, then loaded the box file.
> 2. Entering unicode characters could only be done through copy&paste
> which is slower than just typing in the hex values.
1. Ergonomically speaking, If you load a box then the corresponding
image should be loaded... and vice versa.
I'm not aware of any reason that someone would want to load an image
without a box file... or vice versa.
Since Tesseract generates a box/txt file with the same name as the
image, your editor should try to load both the image+box file at the
same time by default.
If both files are not in the same directory (e.g. if you keep images
in one directory and box files in another), then display a file
browser window to have the user select the corresponding box or image.
2. The characters I want to use are not mapped to any known keyboard
layouts. So I can't type them directly.
The only option is to copy/paste which is more tedious than typing the
actual unicode hex value.
Maybe you could show both the character and hex value on your pop-up
and use the TAB key to switch into "hex mode" where the user would
type 4 hex values?
> 1. Ergonomically speaking, If you load a box then the corresponding
> image should be loaded... and vice versa.
> I'm not aware of any reason that someone would want to load an image
> without a box file... or vice versa.
> Since Tesseract generates a box/txt file with the same name as the
> image, your editor should try to load both the image+box file at the
> same time by default.
> If both files are not in the same directory (e.g. if you keep images
> in one directory and box files in another), then display a file
> browser window to have the user select the corresponding box or image.
> 2. The characters I want to use are not mapped to any known keyboard
> layouts. So I can't type them directly.
> The only option is to copy/paste which is more tedious than typing the
> actual unicode hex value.
> Maybe you could show both the character and hex value on your pop-up
> and use the TAB key to switch into "hex mode" where the user would
> type 4 hex values?
Tested for Kannada in XP.
EXcellent works for Indic languages which are complex scripts It is boon
for languages which have consonants plus dependent vowels just like Russian,
Hindi, Bangla etc.
Many thanks to Aleksey Sytchev for his good work.
-sriranga(76yrsold)