Before anyone asks, the image is part of the CIA's CREST dataset. I noticed that Tesseract seems to skip over some text. The command I am using is:
E:\Tesseract\build\bin\Release\tesseract.exe --psm 1 --oem 1 "D:\split\Folder 001\1946-06-21.tiff" test.txt
The output is:
21 June 1946
MEMORANDUM For SUPERVISING AGENT,
U. S. SECRET SERVICE,
WHITE Hous®.
1. - It is requested that a White House pass be issued to
Lieutenant General Hoyt S. VANDENBERG, Director of Central Intel-
ligence.
2. - In connection with his official duties, it is necessary
for General Vandenberg to visit the White House frequently,.
3% His physical description is:
Height =-- 6 feet.
Hair «-- _ @FAY ,
Eyes -- _- blue.
Enclosed herewith is his photograph.
THOMAS F, CULLEN
Captain, USNR
Asgistant to the Director.
As you can see, it skips the "weight -- 165 lbs" line entirely. I'm not sure whether this qualifies as a bug. Is there anything I can do to improve the results so that the line is included?
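In case it helps anyone reproduce this, here is a minimal sketch of how the same image could be run through several page segmentation modes to check which runs drop the line. The helper names (build_command, missing_keywords, compare_modes) and the list of modes are my own choices for illustration, and it is only an assumption that a different --psm value might recover the missing text. It uses "stdout" as the output base so the text can be captured directly.

```python
import subprocess

# Page segmentation modes to compare. Whether any of them recovers the
# missing line is an assumption to be tested, not a known result.
PSM_MODES = [1, 3, 4, 6, 11]


def build_command(exe, image, psm, oem=1):
    """Return the argument list for one tesseract run that prints to stdout."""
    return [exe, image, "stdout", "--psm", str(psm), "--oem", str(oem)]


def missing_keywords(text, keywords):
    """Return the keywords that do not appear in the OCR text (case-insensitive)."""
    lowered = text.lower()
    return [k for k in keywords if k.lower() not in lowered]


def compare_modes(exe, image, keywords, modes=PSM_MODES):
    """Run tesseract once per mode and record which keywords each run missed."""
    results = {}
    for psm in modes:
        proc = subprocess.run(
            build_command(exe, image, psm),
            capture_output=True,
            text=True,
            check=False,
        )
        results[psm] = missing_keywords(proc.stdout, keywords)
    return results


if __name__ == "__main__":
    exe = r"E:\Tesseract\build\bin\Release\tesseract.exe"
    image = r"D:\split\Folder 001\1946-06-21.tiff"
    expected = ["Height", "Weight", "Hair", "Eyes"]
    for psm, missing in compare_modes(exe, image, expected).items():
        print("psm", psm, "missing:", missing or "none")
```

A run that reports "Weight" as missing for every mode would suggest the problem is not limited to the page layout analysis used with --psm 1.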