Detecting simple phone number
From: Flipz <svet...@icepique.com>
Date: Sat, 4 Jul 2009 02:52:47 -0700 (PDT)
Local: Sat, Jul 4 2009 5:52 am
Subject: Detecting simple phone number
Hi,

I installed tesseract to do some simple OCR on a very basic image: a
phone number in Arial 11pt in PNG format
(http://www.autohop.bg/OCR/phone.png), which I convert to an
uncompressed TIFF using:

convert -monochrome -normalize ./phone.png ./phone.tif

When I run it through tesseract I get an empty file from the following
command:

/usr/bin/tesseract ./phone.tif phone

I tried training the program, but I can't follow the logic behind the
box files. I only need to detect the digits in the PNG. Do I need to
train tesseract additionally for that, or is it built into the current
eng support? It looked very simple from the tesseract documentation, but
apparently I am missing something, because I get an empty
phone.txt file.

Any help will be appreciated.

Thanks,

CJ


From: Yury Tarasievich <yury.tarasiev...@gmail.com>
Date: Sat, 04 Jul 2009 13:10:23 +0300
Local: Sat, Jul 4 2009 6:10 am
Subject: Re: Detecting simple phone number
For tesseract to produce anything, you'll need
an image with a resolution in the hundreds
of dpi. I find that 300 dpi works
satisfactorily, and 200 dpi also seems okay. But
you have something like a 72 dpi image there.
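
To put numbers on this advice, here is a small shell sketch of the arithmetic. The function name scale_percent is made up for illustration, and it assumes the image should keep its physical size while its resolution changes.

```shell
#!/bin/sh
# scale_percent SRC_DPI TARGET_DPI
# Prints the resize percentage (rounded to the nearest integer) needed to
# take an image from SRC_DPI to TARGET_DPI at the same physical size.
# Hypothetical helper, not part of ImageMagick or tesseract.
scale_percent() {
    src=$1
    target=$2
    echo $(( (target * 100 + src / 2) / src ))
}

scale_percent 72 300    # typical web image up to 300 dpi: prints 417
scale_percent 150 300   # 150 dpi photo scaled 2x: prints 200
```

The result could then be handed to convert as a percentage, for example -resize 417%.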

Flipz wrote:
> Hi,

> I installed tesseract to do some simple OCR on a very basic image - a
> phone number in Arial 11pt in a png format (http://www.autohop.bg/OCR/
> phone.png) which I convert to a uncompressed tiff using:

...

From: Flipz <svet...@icepique.com>
Date: Sat, 4 Jul 2009 03:12:16 -0700 (PDT)
Local: Sat, Jul 4 2009 6:12 am
Subject: Re: Detecting simple phone number
Do you know a way to upscale this PNG to 300 dpi with a Linux
command (maybe convert)? As far as I know, web images are mostly
72 dpi, which means I won't be able to use tesseract on any of them,
correct?

If this image were at 300 dpi, do you think tesseract could manage
to extract the numbers?

On Jul 4, 1:10 pm, Yury Tarasievich <yury.tarasiev...@gmail.com>
wrote:


From: Yury Tarasievich <yury.tarasiev...@gmail.com>
Date: Sat, 04 Jul 2009 13:47:14 +0300
Local: Sat, Jul 4 2009 6:47 am
Subject: Re: Detecting simple phone number

Flipz wrote:
> Do you know a way to upscale this png to 300dpi through a Linux
> command (maybe in convert?). As far as I know web images are mostly
> 72dpi and this means I won't be able to use tesseract for any of them,
> correct?

> If this image is at 300dpi, do you think it is managable for tesseract
> to extract the numbers?

I don't really know; I haven't yet tried to run the
full OCR cycle on a 4x scaled image. I speak
only from my limited experience processing scans and
digital photos in tesseract. E.g., I
had some 150 dpi photos of book pages, and I
scaled them 2x in Gimp with standard settings,
and tesseract processed those fairly well.


From: WHEAT <netwh...@163.com>
Date: Sun, 5 Jul 2009 11:40:04 +0800 (CST)
Local: Sat, Jul 4 2009 11:40 pm
Subject: Re:Detecting simple phone number

Hi,
We're Sky Studio, a professional OCR development team. We have checked
http://www.autohop.bg/OCR/phone.png and it is no problem at all for us to
build an OCR for these simple phone number pictures, with a >=95% success
rate, within 1 day! We will charge only 100 USD for this OCR development,
with 1 year of free tech support. You can choose a DLL or a command-line
program. We will provide detailed samples on how to use the DLL or
command-line program in C#, VB, and Delphi.

Looking forward to your reply .

Best Regards
Richard
Sky Studio Inc.
http://www.yahoobot.net/
7/5/2009

On 2009-07-04, Flipz <svet...@icepique.com> wrote:


From: Yury Tarasievich <yury.tarasiev...@gmail.com>
Date: Sun, 05 Jul 2009 10:27:46 +0300
Local: Sun, Jul 5 2009 3:27 am
Subject: Re: Detecting simple phone number

Flipz wrote:
> Do you know a way to upscale this png to 300dpi through a Linux
> command (maybe in convert?). As far as I know web images are mostly
> 72dpi and this means I won't be able to use tesseract for any of them,
> correct?

> If this image is at 300dpi, do you think it is managable for tesseract
> to extract the numbers?

Can't see why not. And upscaling can indeed be
done with convert or, possibly, Gimp. I can't
point out the best algorithm for the upscaling,
however (you know, bilinear, bicubic, etc.).
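
Since the best algorithm is an open question here, one way to decide is to try a few and compare the OCR results. The sketch below only prints one convert command per filter so they can be reviewed before running. The function name upscale_commands and the output naming scheme are assumptions for illustration; Triangle, Catrom, and Lanczos are ImageMagick filter names (Triangle is its linear filter, Catrom a cubic one). It assumes file names without single quotes.

```shell
#!/bin/sh
# upscale_commands INPUT PERCENT
# Prints (does not run) one ImageMagick command per resampling filter,
# each writing to a separate output file named after the filter.
# Hypothetical helper for comparing filters side by side.
upscale_commands() {
    input=$1      # e.g. phone.png
    percent=$2    # e.g. 417 to go from roughly 72 dpi to 300 dpi
    for filter in Triangle Catrom Lanczos; do
        printf "convert '%s' -filter %s -resize %s%% '%s'\n" \
            "$input" "$filter" "$percent" "${input%.*}_$filter.png"
    done
}

upscale_commands phone.png 417
```

Each printed line can be pasted into a shell, and each result run through tesseract to see which filter reads best for a given image.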

--


From: Flipz <svet...@icepique.com>
Date: Sun, 5 Jul 2009 00:37:24 -0700 (PDT)
Local: Sun, Jul 5 2009 3:37 am
Subject: Re: Detecting simple phone number
I did it with convert; it has a very nice set of upscaling algorithms,
and I managed to upscale the image to 300 dpi, which gives a 98%+ accuracy
rate.

Thanks for all your help, especially to Yury for pointing me in the
direction of upscaling the image. I was trying to create a custom
tesseract box file for something already built into the eng distribution
of the software :)

On 5 Jul, 10:27, Yury Tarasievich <yury.tarasiev...@gmail.com> wrote:


From: zhi <simonz...@gmail.com>
Date: Wed, 8 Jul 2009 08:44:46 -0700 (PDT)
Local: Wed, Jul 8 2009 11:44 am
Subject: Re: Detecting simple phone number
Hi Flipz,

I ran into the same problem as you. Can you please tell me how to
increase the dpi of an image in Linux with convert, and how to check
the dpi of an image from the command line?

Thanks

On Jul 5, 3:37 am, Flipz <svet...@icepique.com> wrote:


From: Flipz <svet...@icepique.com>
Date: Wed, 8 Jul 2009 08:59:08 -0700 (PDT)
Local: Wed, Jul 8 2009 11:59 am
Subject: Re: Detecting simple phone number
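Here are the commands I use:

// Input image, intermediate files, and tesseract's output base name
// (tesseract appends .txt to the output base).
$phone_image   = "test.png";
$phone_image_1 = "test_upscale.png";
$phone_image_2 = "test_upscale.tif";
$phone_extract = "test_phone";

// 1. Resample to 300 dpi with the Lanczos filter, then reduce noise.
exec("/usr/bin/convert -filter Lanczos -resample 300x300 -enhance " .
    escapeshellarg($phone_image) . " " . escapeshellarg($phone_image_1));
// 2. Convert the upscaled PNG to a normalized monochrome TIFF.
exec("/usr/bin/convert -monochrome -normalize " .
    escapeshellarg($phone_image_1) . " " . escapeshellarg($phone_image_2));
// 3. Run OCR; the recognized text is written to test_phone.txt.
exec("/usr/bin/tesseract " .
    escapeshellarg($phone_image_2) . " " . escapeshellarg($phone_extract));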

On 8 Jul, 18:44, zhi <simonz...@gmail.com> wrote:


From: zhi <simonz...@gmail.com>
Date: Wed, 8 Jul 2009 09:09:16 -0700 (PDT)
Local: Wed, Jul 8 2009 12:09 pm
Subject: Re: Detecting simple phone number
Flipz, thanks so much. Is there any way to check the dpi of an
image from the command line?
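
One possible way to do this, offered as a sketch since the thread itself gives no answer: ImageMagick's identify tool, which ships alongside convert, reports an image's resolution in its verbose output. The wrapper name show_dpi is made up for illustration.

```shell
#!/bin/sh
# show_dpi IMAGE
# Prints the Resolution and Units lines from ImageMagick's verbose report.
# Hypothetical wrapper; requires the identify tool from ImageMagick.
show_dpi() {
    if ! command -v identify >/dev/null 2>&1; then
        echo "identify (ImageMagick) not found" >&2
        return 127
    fi
    # Keep only the lines that describe the stored resolution and its units.
    identify -verbose "$1" | grep -E '^ *(Resolution|Units):'
}
```

It would be called as show_dpi test.png. If the file stores no resolution metadata, the Resolution line may be missing from the output.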

On Jul 8, 11:59 am, Flipz <svet...@icepique.com> wrote:

