Fwd: Bangla Unicode Converter

78 views
Skip to first unread message

Abu Mohammad Omar Shehab Uddin Ayub

unread,
May 2, 2008, 6:34:54 AM5/2/08
to CSE SOCIETY, bd...@googlegroups.com
FYI

---------- Forwarded message ----------
From: Russell Anam <russel...@gmail.com>
Date: Fri, May 2, 2008 at 3:56 PM
Subject: Bangla Unicode Converter
To: bangla...@googlegroups.com


 
Hello All,
 
It has been a long time since there has been any email to this group.
 
I am in the process of starting to make a Bangla Unicode Converter. Initially it will only encode Bijoy to Unicode Bangla. We are starting from scratch. Ofcourse we don't want to reinvent the wheel. So if anyone has a character conversion table (partial or full) ready, then I would appreciate it if that was sent to me. I already acquired one. It is pretty comprehensive but not complete. Please send me conversion table for any font/system not only Bijoy if available.
 
Ofcourse the basic question comes, why make yet another bangla converter when there is so many already in the market. None of the ones available provides near 100% conversion, not is any product available that is fast. Most use early binding to word com interface and hence for large document with lots of table and formatting, the conversion time may take more than 1 hour for a single document. My goal is to minimize this.
 
Hope to hear from you in this regard soon.
 
Thinking about setting up a blog so people can download and leave comments as it is developed.
 
-M. Ashraful Anam
Team Manager (IT)
SEPB Project
Election Commission Secretariat
 
 





--
|=============|
Regards,
Abu Mohammad Omar Shehab Uddin Ayub
Software Engineer, Nilavo Technologies, Banani, Dhaka
Bangladesh Open Source Network, Dhaka
2000 batch, Dept. of CSE, SUST
www.nilavo.com
www.bdosn.org
www.sust.edu

S. M. Mahbub Murshed

unread,
May 2, 2008, 8:27:52 AM5/2/08
to bd...@googlegroups.com, CSE SOCIETY

mokaddim

unread,
May 2, 2008, 9:12:03 AM5/2/08
to bd...@googlegroups.com, CSE SOCIETY
I wanned to tell the same thing, Use, http://bnwebtools.sourceforge.net
:D
--
Mokaddim AKM
http://talk.cmyweb.net/
All time available for Hire/Contract/Full Time

Jamil Ahmed

unread,
May 2, 2008, 9:51:14 AM5/2/08
to bd...@googlegroups.com
Sumon,

there is no package listed in the download area..

http://sourceforge.net/project/showfiles.php?group_id=177292

Best,
Jamil

S. M. Mahbub Murshed

unread,
May 2, 2008, 10:13:44 AM5/2/08
to bd...@googlegroups.com
Yup. The last ALPHA version had an issue. People were downloading and getting frustrated. I removed it. But forgot to upload a good copy after the fix. I will do it soon.
--
Mahbub

Abu Mohammad Omar Shehab Uddin Ayub

unread,
May 6, 2008, 4:55:40 AM5/6/08
to csesociety, SUST, bd...@googlegroups.com


Interesting survey and comments.

Shehab
----------------------------------------------------------------------------------------------------------------
 
Hello All,
 
I tried to convert two documents with Avro, Shabdik and our prototype converter. Below are the results:
 

Program

Document 1 (8 page)

Document 2 (30 page)

Avro

7min 12 sec

56 min

Shabdik

Failed (error attached)

Failed after 3 min 51 sec (error attached)

Our prototype

3 sec

17 sec

 
My question to Omi Azad: So you give me permission to use the table right?
My question to Omi Azad and Mr. M. Foyzur Rahman: Would you be kind enough to explain the intelligence you mentioned? or do you consider it your intellectual property? I would definitely like to know, I could not test if with the limited conversion table set if Shabdik is able to handle jofala rofala etc everything properly since all conversions failed.
 
P.S. about the statement "About the Hindi character, that is Prothom-alo's font, which is stupid hacked font based on multiple Unicode codepages.", the table itself is named "hindi" (screenshot attached).
 
Thanks
 
M. Ashraful Anam

On Sat, May 3, 2008 at 9:05 PM, M. Foyzur Rahman <foy...@iecbd.net> wrote:
It's now totally free.


On Sat, May 3, 2008 at 1:42 PM, M Ashraful Anam <russel...@gmail.com> wrote:
No, I know it's not open source but I was wondering it is is like Bijoy (commerially sold product) or a free product. I did go to the website. while I was able to download it without any issue, I also noticed the payment section on the left, hence my question.


 
On Sat, May 3, 2008 at 1:20 PM, Omi Azad <o...@ekushey.org> wrote:
I think you asked a wrong question, the question could be "Is Shabdik a Open Source product or a Closed Source product"

If this is a commercial product, you should have get notice or something. You won't be able to download it and use it. But before asking the question perhaps you could also visit the related website. :)
--
Omi
http://omi.net.bd

Bangla Computing Projects: http://ekushey.org
OSS News in Bangla: http://mukto.org


M Ashraful Anam wrote:
 
Thanks Mr. Foyzur Rahman,
 
I will test is with some word document with complex formating the next working day. I haven't used it yet.
 
Just a quick question, is shabdik a commercial product or a free product?
 
Thanks
 
M. Ashraful Anam


 
On Sat, May 3, 2008 at 9:55 AM, M. Foyzur Rahman <foy...@iecbd.net> wrote:
I would like to add  one thing here. The Shabdik converter is very fast and reliable and provides option for both plain text conversion and word document conversion. Also the converter is intelligent and extensible, providing options for adding new font conversion without ever modifying the code. If there is any bug, we will be happy to fix that.

Thanks and best regards,


On Sat, May 3, 2008 at 2:55 AM, Omi Azad <o...@ekushey.org> wrote:
We need only Ka's code. Cause Ka-Ecca, Ka-Occa has been solved through converter's intelligence.

--
Omi
http://omi.net.bd

Bangla Computing Projects: http://ekushey.org
OSS News in Bangla: http://mukto.org


M Ashraful Anam wrote:
 
No I got that part. I am just asking if 399 combinations per font is enough. Do we need to save only "ka"'s code or do we need to save for "ka - akar", "ka -ekar", "ka oukar" etc too?

On Sat, May 3, 2008 at 1:12 AM, Omi Azad <o...@ekushey.org> wrote:
I think you did not get the point of the converter DB. The DB is made for converting multiple hacked code charts (such as Bijoy 99, Bijoy 2003, Lekhani, Proshika etc.) into Unicode. About the Hindi character, that is Prothom-alo's font, which is stupid hacked font based on multiple Unicode codepages.

:)

--
Omi
http://omi.net.bd

Bangla Computing Projects: http://ekushey.org
OSS News in Bangla: http://mukto.org


M Ashraful Anam wrote:
 
Thanks! There seems 399 entries per font. Is that enough for 100% conversion? What about juktakkhor? I also saw that the converison table has a field named isreverse. How is that processed differently?
 
Also how come the DB contains 15990 hindi words??? And the Bangla table is named "zzz". Was the idea taken from an Indian software?
 
Thanks
 
M. Ashraful Anam


 
On Fri, May 2, 2008 at 8:49 PM, Omi Azad <o...@ekushey.org> wrote:
Please download Shabdik from here.

I made the conversation table there and that can be updated anytime if anyone wants to.
--
Omi
http://omi.net.bd

Bangla Computing Projects: http://ekushey.org
OSS News in Bangla: http://mukto.org


Anam wrote:


---------- Forwarded message ----------
From: "Russell Anam" <russell.a...@gmail.com>

Date: May 2, 3:56 pm
Subject: Bangla Unicode Converter
To: Use of Bangla in ICT : Prospects, Problems and Solutions















--
M. Foyzur Rahman
Shabdik Team @ IECB














--
M. Foyzur Rahman






errorInShabdik.gif
shabdik_tables.gif
Reply all
Reply to author
Forward
0 new messages