Hello everyone! :)
This message was written by Kevin Scannell, an Irish Gaelic
translator (Irish being another language with very few
resources).
It concerns a Perl script that adds diacritics to
plain (ASCII) text. The script runs on Mac OS X and
Linux but has not been tested on Windows (please try it ;) ); it
requires two packages, charlifter-0.04.tar.gz and charlifter-vi-0.01.tar.gz
(the download link is in Kevin's message below).
Usage example:
> $ echo "My tu bo ke hoach la chan ten lua" | sf.pl -r vi
> Mỹ từ bỏ kế hoạch lá chắn tên lửa
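For anyone curious how statistics can restore diacritics at all, here
is a minimal sketch in Python. It is not charlifter's actual algorithm
(Kevin's note only says the program "uses statistics"); it is the
simplest approach I can think of, a lookup of the most frequent
accented form of each stripped word. The function names and the toy
corpus are made up for illustration.

```python
import unicodedata
from collections import Counter, defaultdict

# Toy training text (an assumption for illustration, not charlifter's data).
# "lá" appears twice and "là" once, so "la" should be restored as "lá".
TOY_CORPUS = "Mỹ từ bỏ kế hoạch lá chắn tên lửa lá chắn tên lửa là"


def strip_diacritics(word):
    """Reduce a word to its unaccented skeleton, e.g. 'lửa' -> 'lua'."""
    # đ/Đ have no canonical decomposition, so map them by hand.
    word = word.replace("đ", "d").replace("Đ", "D")
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def train(corpus):
    """Count how often each accented form occurs for each stripped form."""
    counts = defaultdict(Counter)
    for word in corpus.split():
        word = unicodedata.normalize("NFC", word)
        counts[strip_diacritics(word)][word] += 1
    return counts


def restore(text, counts):
    """Replace each word with its most frequent accented form, if known."""
    restored = []
    for word in text.split():
        candidates = counts.get(word)
        # Unknown words are passed through unchanged.
        restored.append(candidates.most_common(1)[0][0] if candidates else word)
    return " ".join(restored)


if __name__ == "__main__":
    model = train(TOY_CORPUS)
    print(restore("My tu bo ke hoach la chan ten lua", model))
```

With this toy corpus the script prints the same sentence as the usage
example above. A word-by-word lookup like this ignores context, so a
real tool would need something smarter to choose between forms such as
"lá" and "là" in different sentences.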
You can also reply to him directly:
Kevin Scannell <ksca...@gmail.com>
BFN
Clytie
On 22/09/2009, at 3:11 AM, Kevin Scannell wrote:
> Dear Clytie,
> Haven't seen you about in l10n circles for a while, hoping that you
> are in good health.
> Forwarding a note about some software I've written related to
> Vietnamese, details below. Please forward to any contacts in the
> Vietnamese FLOSS world!
> All the best
> Kevin
> ---------- Forwarded message ----------
> From: Kevin Scannell <ksca...@gmail.com>
> Date: Mon, Sep 21, 2009 at 12:38 PM
> Subject: Re: Tagger for under-resourced languages
> To: lequang...@yahoo.com
> Dear Van Le,
> We corresponded earlier this year about language technology for
> Vietnamese. I thought you might be interested in some software I
> wrote recently, called "charlifter". It is a program that uses
> statistics to convert ASCII text into Unicode text with proper
> diacritical marks, etc. It can be trained for any language, and I
> just trained it for Vietnamese, using several million words
> crawled from the web.
> It works like this, for example:
> $ echo "My tu bo ke hoach la chan ten lua" | sf.pl -r vi
> Mỹ từ bỏ kế hoạch lá chắn tên lửa
> I'd imagine this could be useful if a user doesn't have a proper
> Vietnamese keyboard.
> It's a Perl script and runs on Linux or Mac OS X. I haven't tried it
> on a Windows platform. You'll need to install charlifter-0.04.tar.gz
> and charlifter-vi-0.01.tar.gz from here:
> https://sourceforge.net/projects/lingala/files/
> Read more here (the announcement when I released in the spring):
> http://www.nabble.com/Diacritic-restoration-%2B-new-spell-checking-pa...
> Enjoy. Please pass this note on to anyone who might be interested.
> Kevin