Re: list_available_fonts.

798 views
Skip to first unread message

Sriranga(80yrs)

unread,
Mar 21, 2014, 1:06:33 PM3/21/14
to tesser...@googlegroups.com
Hello members,
Thanks to Nick for his patches for text2image - under issues - now works fine for kannada project - for me. Only  cheluvi font out of kannada fonts installed could not find by the program. I have successfully generated box/tif files(upto 47 pages at a time) using text2 image without using box editor - since box file generated are 100% correct. Anyway I am happy with wonderful text2image program.

 Attached  cheluvi.tff  for testing and find solution for my problem - for which i shall be thankful to you.
I am using only in unbuntu 14.04(daily build) and downloaded tesseract-ocr  r-1054.from SVN. 

With regards,
sriranga(80)


On Fri, Mar 21, 2014 at 9:22 AM, Sriranga(80yrs) <withblessing....@gmail.com> wrote:
Dear Nick,

When I wanted to generate box/tif using Cheluvi fonts, error displayed - extract of terminal is reproduced below for your information. Where I made mistake?

dell-14-04@dell1404-OptiPlex-330:~/tesseract-ocr-read-only1054a$ text2image --list_available_fonts --fonts_dir= /usr/share/fonts/
  0: Abyssinica SIL
  1: Abyssinica SIL Bold
  2: Andale Mono
  3: Andale Mono Bold
  4: Arial
  5: Arial Black,
  6: Arial Black, Bold
  7: Arial Bold
  8: Arial Bold Italic
  9: Arial Italic
 10: Bitstream Charter
 11: Bitstream Charter Bold
 12: Bitstream Charter Bold Italic
 13: Bitstream Charter Italic
 14: Century Schoolbook L Bold
 15: Century Schoolbook L Bold Italic
 16: Century Schoolbook L Italic
 17: Century Schoolbook L Medium
 18: Cheluvi Bold
 19: Cheluvi Medium
 20: Comic Sans MS
 21: Comic Sans MS Bold
 22: Courier 10 Pitch
 23: Courier 10 Pitch Bold
 24: Courier 10 Pitch Bold Italic
 25: Courier 10 Pitch Italic
 26: Courier New
 27: Courier New Bold
 28: Courier New Bold Italic
 29: Courier New Italic
 30: DejaVu Sans
 31: DejaVu Sans Bold
 32: DejaVu Sans Mono
 33: DejaVu Sans Mono Bold
 34: DejaVu Serif
 35: DejaVu Serif Bold
 36: Dingbats
 37: Dingbats Bold
 38: Droid Arabic Naskh
 39: Droid Arabic Naskh Bold
 40: Droid Sans
 41: Droid Sans Armenian
 42: Droid Sans Armenian Bold
 43: Droid Sans Bold
 44: Droid Sans Ethiopic
 45: Droid Sans Ethiopic Bold
 46: Droid Sans Fallback
 47: Droid Sans Fallback Bold
 48: Droid Sans Georgian
 49: Droid Sans Georgian Bold
 50: Droid Sans Hebrew
 51: Droid Sans Hebrew Semi-Bold
 52: Droid Sans Japanese
 53: Droid Sans Japanese Bold
 54: Droid Sans Mono
 55: Droid Sans Mono Bold
 56: Droid Sans Thai
 57: Droid Sans Thai Bold
 58: Droid Serif
 59: Droid Serif Bold
 60: Droid Serif Bold Italic
 61: Droid Serif Italic
 62: FreeMono
 63: FreeMono Bold
 64: FreeMono Bold Italic
 65: FreeMono Italic
 66: FreeSans
 67: FreeSans Italic
 68: FreeSans Semi-Bold
 69: FreeSans Semi-Bold Italic
 70: FreeSerif
 71: FreeSerif Bold
 72: FreeSerif Bold Italic
 73: FreeSerif Italic
 74: Garuda
 75: Garuda Bold
 76: Garuda Bold Oblique
 77: Garuda Oblique
 78: Georgia
 79: Georgia Bold
 80: Georgia Bold Italic
 81: Georgia Italic
 82: Gubbi
 83: Gubbi Bold
 84: Impact Condensed
 85: KacstArt Bold
 86: KacstArt Medium
 87: KacstBook Bold
 88: KacstBook Medium
 89: KacstDecorative Bold
 90: KacstDecorative Medium
 91: KacstDigital Bold
 92: KacstDigital Medium
 93: KacstFarsi Bold
 94: KacstFarsi Medium
 95: KacstLetter Bold
 96: KacstLetter Medium
 97: KacstNaskh Bold
 98: KacstNaskh Medium
 99: KacstOffice Bold
100: KacstOffice Medium
101: KacstOne
102: KacstOne Bold
103: KacstPen Bold
104: KacstPen Medium
105: KacstPoster Bold
106: KacstPoster Medium
107: KacstQurn
108: KacstQurn Bold
109: KacstScreen Bold
110: KacstScreen Medium
111: KacstTitle Bold
112: KacstTitle Medium
113: KacstTitleL
114: KacstTitleL Bold
115: Kedage
116: Kedage Bold
117: Kedage Bold Italic
118: Kedage Italic
119: Khmer OS
120: Khmer OS Bold
121: Khmer OS System
122: Khmer OS System Bold
123: Kinnari Bold
124: Kinnari Bold Italic
125: Kinnari Bold Oblique
126: Kinnari Medium
127: Kinnari Medium Italic
128: Kinnari Medium Oblique
129: LKLUG
130: LKLUG Bold
131: Liberation Mono
132: Liberation Mono Bold
133: Liberation Mono Bold Italic
134: Liberation Mono Italic
135: Liberation Sans
136: Liberation Sans Bold
137: Liberation Sans Bold Italic
138: Liberation Sans Italic
139: Liberation Sans Narrow Bold Condensed
140: Liberation Sans Narrow Bold Italic Condensed
141: Liberation Sans Narrow Condensed
142: Liberation Sans Narrow Italic Condensed
143: Liberation Serif
144: Liberation Serif Bold
145: Liberation Serif Bold Italic
146: Liberation Serif Italic
147: Lohit Bengali
148: Lohit Bengali Bold
149: Lohit Devanagari
150: Lohit Devanagari Bold
151: Lohit Gujarati
152: Lohit Gujarati Bold
153: Lohit Kannada
154: Lohit Kannada Bold
155: Lohit Punjabi
156: Lohit Punjabi Bold
157: Lohit Tamil
158: Lohit Tamil Bold
159: Loma
160: Loma Bold
161: Loma Bold Oblique
162: Loma Oblique
163: Mallige
164: Mallige Bold
165: Mallige Bold Italic
166: Mallige Italic
167: Meera
168: Meera Bold
169: Mukti Narrow
170: Mukti Narrow
171: NanumBarunGothic
172: NanumBarunGothic Semi-Bold
173: NanumGothic
174: NanumGothic Semi-Bold
175: NanumMyeongjo
176: NanumMyeongjo Semi-Bold
177: Navilu
178: Navilu Bold
179: Nimbus Mono L
180: Nimbus Mono L Bold
181: Nimbus Mono L Bold Oblique
182: Nimbus Mono L Oblique
183: Nimbus Roman No9 L
184: Nimbus Roman No9 L Bold
185: Nimbus Roman No9 L Bold Italic
186: Nimbus Roman No9 L Italic
187: Nimbus Sans L
188: Nimbus Sans L Bold
189: Nimbus Sans L Bold Condensed
190: Nimbus Sans L Bold Italic
191: Nimbus Sans L Bold Italic Condensed
192: Nimbus Sans L Condensed
193: Nimbus Sans L Italic
194: Nimbus Sans L Italic Condensed
195: Norasi
196: Norasi Bold
197: Norasi Bold Italic
198: Norasi Bold Oblique
199: Norasi Italic
200: Norasi Oblique
201: OpenSymbol
202: OpenSymbol Bold
203: Padauk
204: Padauk Bold
205: Padauk Book,
206: Padauk Book, Bold
207: Phetsarath OT
208: Phetsarath OT Bold
209: Pothana2000
210: Pothana2000 Bold
211: Purisa
212: Purisa Bold
213: Purisa Bold Oblique
214: Purisa Oblique
215: Rachana
216: Rachana Bold
217: Rekha Bold
218: Rekha Medium
219: Saab
220: Saab Bold
221: Sawasdee
222: Sawasdee Bold
223: Sawasdee Bold Oblique
224: Sawasdee Oblique
225: Standard Symbols L
226: Standard Symbols L Bold
227: Symbol
228: Symbol Bold
229: TakaoPGothic
230: TakaoPGothic Bold
231: Tibetan Machine Uni
232: Tibetan Machine Uni Bold
233: Times New Roman,
234: Times New Roman, Bold
235: Times New Roman, Bold Italic
236: Times New Roman, Italic
237: Tlwg Typist Bold
238: Tlwg Typist Bold Oblique
239: Tlwg Typist Medium
240: Tlwg Typist Medium Oblique
241: Tlwg Typo Bold
242: Tlwg Typo Bold Oblique
243: Tlwg Typo Medium
244: Tlwg Typo Medium Oblique
245: TlwgMono Bold
246: TlwgMono Bold Oblique
247: TlwgMono Medium
248: TlwgMono Oblique
249: TlwgTypewriter Bold
250: TlwgTypewriter Bold Oblique
251: TlwgTypewriter Medium
252: TlwgTypewriter Oblique
253: Trebuchet MS
254: Trebuchet MS Bold
255: Trebuchet MS Bold Italic
256: Trebuchet MS Italic
257: Tunga Bold
258: Tunga Bold
259: URW Bookman L
260: URW Bookman L Bold
261: URW Bookman L Bold Italic
262: URW Bookman L Italic
263: URW Chancery L Bold Italic
264: URW Chancery L Medium Italic
265: URW Gothic L Book
266: URW Gothic L Book Oblique
267: URW Gothic L Semi-Bold
268: URW Gothic L Semi-Bold Oblique
269: URW Palladio L Bold
270: URW Palladio L Bold Italic
271: URW Palladio L Italic
272: URW Palladio L Medium
273: Ubuntu
274: Ubuntu Condensed,
275: Ubuntu Condensed, Bold
276: Ubuntu Italic
277: Ubuntu Light
278: Ubuntu Light Italic
279: Ubuntu Medium
280: Ubuntu Medium
281: Ubuntu Medium Italic
282: Ubuntu Medium Italic
283: Ubuntu Mono
284: Ubuntu Mono Bold
285: Ubuntu Mono Bold Italic
286: Ubuntu Mono Italic
287: Umpush
288: Umpush Bold
289: Umpush Bold Oblique
290: Umpush Light
291: Umpush Light Oblique
292: Umpush Oblique
293: Vemana2000
294: Vemana2000 Bold
295: Verdana
296: Verdana Bold
297: Verdana Bold Italic
298: Verdana Italic
299: Waree
300: Waree Bold
301: Waree Bold Oblique
302: Waree Oblique
303: Webdings
304: Webdings Bold
305: gargi Bold
306: gargi Medium
307: mry_KacstQurn
308: mry_KacstQurn Bold
309: ori1Uni Bold
310: ori1Uni Medium
dell-14-04@dell1404-OptiPlex-330:~$ cd tesseract-ocr-read-only1054a/working_files/
---------------------------------------------------------------------------------------------------------------------------------

dell-14-04@dell1404-OptiPlex-330:~/tesseract-ocr-read-only1054a/working_files$ ../training/text2image --text=training_text.txt --outputbase=kan.CheluviBold.exp0 --font='Cheluvi Bold' --fonts_dir=/usr/share/fonts/
Initializing fontconfig
Could not find font named Cheluvi Bold
FLAGS_find_fonts || FontUtils::IsAvailableFont(FLAGS_font.c_str()):Error:Assert failed:in file text2image.cpp, line 417
Segmentation fault (core dumped)
dell-14-04@dell1404-OptiPlex-330:~/tesseract-ocr-read-only1054a/working_files$
===================================================================
It could clearly be seen that "list_available_fonts"-  has listed font  'cheluvi Bold' also along with other fonts. I could not understand why program unable to  find font "Cheluvi Bold"? Where I made mistake? waiting for your valuable guidance.

How to add fonts(extra/missing, if any) to the "list_available_fonts"  - request for valuable solution
With Warmest Regards,
sriranga(80)




Cheluvi.ttf

Dovhani Foneworx

unread,
Aug 18, 2014, 10:51:03 AM8/18/14
to tesser...@googlegroups.com
Hi did you find the solution for this?, experiencing the same problem

fone...@foneworxtest.foneworx.co.za:~/DM/Tesseracting/TESTDIR/Test1$ text2image --text=training_text.txt --outputbase=eng.DejaVuSansMono-Bold.ttf.exp0 --font='DejaVuSansMono-Bold.ttf' --fonts_dir=/usr/share/fonts/truetype/dejavu/
Initializing fontconfig
Could not find font named DejaVuSansMono-Bold.ttf

FLAGS_find_fonts || FontUtils::IsAvailableFont(FLAGS_font.c_str()):Error:Assert failed:in file text2image.cpp, line 417
Segmentation fault (core dumped)


Thanks.

zdenko podobny

unread,
Aug 18, 2014, 5:03:33 PM8/18/14
to tesser...@googlegroups.com
Do you have font named 'DejaVuSansMono-Bold.ttf'?

Zdenko


--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-oc...@googlegroups.com.
To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/26c30daa-ded8-45c8-b199-3544bd46de54%40googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Dovhani Foneworx

unread,
Aug 19, 2014, 1:55:11 AM8/19/14
to tesser...@googlegroups.com
Hi Zdenko, I have this font, look at what I have on my system:


fone...@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype$ cd dejavu/
fone...@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype/dejavu$ ls
DejaVuSans-Bold.ttf  DejaVuSansMono-Bold.ttf  DejaVuSansMono.ttf  DejaVuSans.ttf  DejaVuSerif-Bold.ttf  DejaVuSerif.ttf
fone...@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype/dejavu$



On Monday, August 18, 2014 11:03:33 PM UTC+2, zdenop wrote:
Do you have font named 'DejaVuSansMono-Bold.ttf'?

Zdenko


On Mon, Aug 18, 2014 at 4:51 PM, Dovhani Foneworx <dfon...@gmail.com> wrote:
Hi did you find the solution for this?, experiencing the same problem

foneworx@foneworxtest.foneworx.co.za:~/DM/Tesseracting/TESTDIR/Test1$ text2image --text=training_text.txt --outputbase=eng.DejaVuSansMono-Bold.ttf.exp0 --font='DejaVuSansMono-Bold.ttf' --fonts_dir=/usr/share/fonts/truetype/dejavu/

zdenko podobny

unread,
Aug 19, 2014, 2:31:22 AM8/19/14
to tesser...@googlegroups.com
Please read my question one again. I did not asked you about filenames but about font names.

Zdenko


Dovhani Foneworx

unread,
Aug 19, 2014, 3:13:10 AM8/19/14
to tesser...@googlegroups.com
Good day Zdenko, maybe I am misunderstanding this part.

with this command:


 text2image --text=training_text.txt --outputbase=eng.FiraSans-Italic.exp0 --font='TimesNewRomanBold' --fonts_dir=/usr/share/fonts/truetype/FiraSans

and on this directory with this files:

fone...@foneworxtest.foneworx.co.za:/usr/share/fonts/opentype/FiraSans$ ls
FiraSans-BoldItalic.otf   FiraSans-ExtraBoldItalic.otf   FiraSans-HairItalic.otf   FiraSans-Light.otf           FiraSans-ThinItalic.otf        FiraSans-UltraLight.otf
FiraSans-Bold.otf         FiraSans-ExtraBold.otf         FiraSans-Hair.otf         FiraSans-MediumItalic.otf    FiraSans-Thin.otf              FiraSans-Ultra.otf
FiraSans-BookItalic.otf   FiraSans-ExtraLightItalic.otf  FiraSans-HeavyItalic.otf  FiraSans-Medium.otf          FiraSans-TwoItalic.otf
FiraSans-Book.otf         FiraSans-ExtraLight.otf        FiraSans-Heavy.otf        FiraSans-Regular.otf         FiraSans-Two.otf
FiraSans-EightItalic.otf  FiraSans-FourItalic.otf        FiraSans-Italic.otf       FiraSans-SemiBoldItalic.otf  FiraSans-UltraItalic.otf
FiraSans-Eight.otf        FiraSans-Four.otf              FiraSans-LightItalic.otf  FiraSans-SemiBold.otf        FiraSans-UltraLightItalic.otf


is it not the file name referring to the font name? on the specified directory?

I am a bit confused here.

please help.


On Tuesday, August 19, 2014 8:31:22 AM UTC+2, zdenop wrote:
Please read my question one again. I did not asked you about filenames but about font names.

Zdenko


On Tue, Aug 19, 2014 at 7:55 AM, Dovhani Foneworx <dfon...@gmail.com> wrote:
Hi Zdenko, I have this font, look at what I have on my system:


foneworx@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype$ cd dejavu/
foneworx@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype/dejavu$ ls

DejaVuSans-Bold.ttf  DejaVuSansMono-Bold.ttf  DejaVuSansMono.ttf  DejaVuSans.ttf  DejaVuSerif-Bold.ttf  DejaVuSerif.ttf
foneworx@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype/dejavu$

Dovhani Foneworx

unread,
Aug 19, 2014, 3:40:34 AM8/19/14
to tesser...@googlegroups.com
Where do i get this font names?

from my font directory,  I have this;

fone...@foneworxtest.foneworx.co.za:/usr/share/fonts$ ls
cmap  opentype  truetype  type1
fone...@foneworxtest.foneworx.co.za:/usr/share/fonts$ cd truetype/
fone...@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype$ ls
dejavu  FiraSans  unfonts-core

fone...@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype$ cd dejavu/
fone...@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype/dejavu$ ls
DejaVuSans-Bold.ttf  DejaVuSansMono-Bold.ttf  DejaVuSansMono.ttf  DejaVuSans.ttf  DejaVuSerif-Bold.ttf  DejaVuSerif.ttf
fone...@foneworxtest.foneworx.co.za:/usr/share/fonts/truetype/dejavu$

Thank you.






--
You received this message because you are subscribed to a topic in the Google Groups "tesseract-ocr" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/tesseract-ocr/cfyZ6_HJtag/unsubscribe.
To unsubscribe from this group and all its topics, send an email to tesseract-oc...@googlegroups.com.

To post to this group, send email to tesser...@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.

zdenko podobny

unread,
Aug 19, 2014, 3:45:44 AM8/19/14
to tesser...@googlegroups.com
Please read the the thread, you replied... It is there.

Zdenko


zdenko podobny

unread,
Aug 19, 2014, 3:49:05 AM8/19/14
to tesser...@googlegroups.com
font name and font file are independet. E.g. font named "Abraca dabra" could have filename "123654987.ttf"

Zdenko


Shree Devi Kumar

unread,
Aug 19, 2014, 4:22:03 AM8/19/14
to tesser...@googlegroups.com

Maybe that will provide the info u need.

Shree Devi Kumar
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


Shree Devi Kumar

unread,
Aug 19, 2014, 4:22:48 AM8/19/14
to tesser...@googlegroups.com
or ...

text2image --list_available_fonts --fonts_dir= /usr/share/fonts/ 

Shree Devi Kumar
____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com


Reply all
Reply to author
Forward
0 new messages