CafeTran upade

38 views
Skip to first unread message

Igor Kmitowski

unread,
Mar 17, 2012, 7:03:15 AM3/17/12
to cafetra...@googlegroups.com
Dear Cafetranslators,

I have added a bit of functionality to the program to deal with remembering tags. The issue is especially important when you translate the new version of a document which has some changes compared to the previous version. So far, reusing the memory has involved reinserting tags manually in exact matches. Now CafeTran can store tag positions in TMX memories (in ProjectTM by default) and the tags are inserted automatically provided that they are present in the source segment. The new option called Tags processing and can be switched on/off in the memory start-up panel.

The above function also calculates the tag positions when there are internal tags in TMX segments coming from other tools.

Besides, the Glossary function has some new utilities such as sorting and merging alternative targets.

The current CT version is 2012031301 and it is available for download. The CT users can contact me to get this update soon.

Kind regards,
Igor

--
Igor Kmitowski
Translator and Java developer
CafeTran website: http://www.cafetran.com
CafeTran support: cafetran...@gmail.com


adrian bedford

unread,
Mar 22, 2012, 3:44:33 AM3/22/12
to cafetra...@googlegroups.com
Hi Igor, is the update ready yet?
thanks
Adrian

Igor Kmitowski

unread,
Mar 22, 2012, 3:54:24 AM3/22/12
to cafetra...@googlegroups.com
Hi Adrian,

Expect it by the end of the week.

Cheers,
Igor

adrian bedford

unread,
Mar 22, 2012, 3:55:07 AM3/22/12
to cafetra...@googlegroups.com
thanks!

Wolfgang Schoene

unread,
Apr 4, 2012, 2:26:58 AM4/4/12
to cafetra...@googlegroups.com
Hi Igor
isn’t there an easier way to continue working on a large project than having to choose term.tmx/Project.tmx/external.tmx all over again every time I open the project?
Why can these databases not be linked to the project so that on opening the project file everything is there?
This is the case with most other CAT tools.
Would be very helpful and time saving. Importing my external tmx for segment takes about 15/20 minutes because it has to change languages, don’t ask me why.

Best
Wolfgang

Igor Kmitowski

unread,
Apr 4, 2012, 6:13:43 AM4/4/12
to cafetra...@googlegroups.com
Hi Wolfgang,

> Hi Igor
> isn¹t there an easier way to continue working on a large project than
> having
> to choose term.tmx/Project.tmx/external.tmx all over again every time I
> open
> the project?
> Why can these databases not be linked to the project so that on opening
> the
> project file everything is there?

They are linked if you tick the relevant boxes in the Project manager
window - see the Memories section in the upper part of the panel. Perhaps
I should add one more box for the "Term memory" there.

> This is the case with most other CAT tools.
> Would be very helpful and time saving. Importing my external tmx for
> segment
> takes about 15/20 minutes because it has to change languages, don¹t ask
> me
> why.

I know why :) Currently, the TMX language pairs are case sensitive (for
speed reasons). The internal language mapping is like the following:
English (United States) = en-US. The language code is lowercase and the
country code is uppercase. I suspect your imported TMX has all in
uppercase (like EN-US). CT cannot find such segments in the first pass, so
it detects the correct language pair code and makes the second pass. If
you type your TMX language codes in the language pair field, the program
will load the segments in the first pass, that is, faster.

Cheers,

Wolfgang Schoene

unread,
Apr 4, 2012, 7:43:59 AM4/4/12
to cafetra...@googlegroups.com
Thanks for your answer, Igor.
Yes, TMX import is probably due to the cause you mention.
Will try to get that right.

Regards
Wolfgang


Am 04/04/12 12:13 schrieb "Igor Kmitowski" unter
<cafetran...@gmail.com>:

Wolfgang Schoene

unread,
Apr 5, 2012, 5:29:40 AM4/5/12
to cafetra...@googlegroups.com
Hi Igor
does it make sense to create a Memory for Terms
AND
an External DB (menu Tables)?

Regards
Wolfgang

Am 04/04/12 12:13 schrieb "Igor Kmitowski" unter
<cafetran...@gmail.com>:


>

> They are linked if you tick the relevant boxes in the Project manager
> window - see the Memories section in the upper part of the panel. Perhaps
> I should add one more box for the "Term memory" there.

I'd second that.

>
>> This is the case with most other CAT tools.
>> Would be very helpful and time saving. Importing my external tmx for
>> segment
>> takes about 15/20 minutes because it has to change languages, don¹t ask
>> me
>> why.
>
> I know why :) Currently, the TMX language pairs are case sensitive (for
> speed reasons). The internal language mapping is like the following:
> English (United States) = en-US. The language code is lowercase and the
> country code is uppercase. I suspect your imported TMX has all in
> uppercase (like EN-US). CT cannot find such segments in the first pass, so
> it detects the correct language pair code and makes the second pass. If
> you type your TMX language codes in the language pair field, the program
> will load the segments in the first pass, that is, faster.

Yes, AAMOF, this was the case. So what is the standard internal language
mapping of CT?
Other question: When importing a TMX memory, why does it say: UTF-8 instead
of UTF-16? Aren't TMX files supposed to be UTF-16?


Wolfgang Schoene

unread,
Apr 5, 2012, 6:20:08 AM4/5/12
to cafetra...@googlegroups.com
Hi again Igor
is it safe for CT to install this http://support.apple.com/kb/DL1515
Java update for the Mac?

Wolfgang


Hans van den Broek

unread,
Apr 5, 2012, 6:28:03 AM4/5/12
to cafetra...@googlegroups.com

On 5 Apr 2012, at 16:29, Wolfgang Schoene wrote:

> does it make sense to create a Memory for Terms
> AND
> an External DB (menu Tables)?

Hi Wolfgang, I'm not Igor, and I readily admit I don't understand much about external databases, but so far, it seems I don't need them anyway. Correct me if I'm wrong, but I think those databases are useful if you work on a project with several translators, or if the database is on a server somewhere.


Cheers,

Hans

--

Hans van den Broek
Schrijf-, vertaal- en redigeerwerk
Peleman Rejowinangun KG1/513
RT029 RW009
Yogyakarta 55171
Indonesia
T +62 (0)85868607333
SKYPE: hanstranslations
transl...@hansvandenbroek.eu
http://cafetran4mac.blogspot.com

Hans van den Broek

unread,
Apr 5, 2012, 6:30:14 AM4/5/12
to cafetra...@googlegroups.com

Not only safe, you should do it instantly. There's a rumour of a virus for Mac that spreads via Java and Flash. The update will fix it, if there's a problem at all.

Igor Kmitowski

unread,
Apr 5, 2012, 6:58:04 AM4/5/12
to cafetra...@googlegroups.com
Hello Wolfgang,

> Hi Igor
> does it make sense to create a Memory for Terms
> AND
> an External DB (menu Tables)?

External DB is an option for users who prefer to keep their terminology in
SQL Databases for various reasons (Hans v.d. B. just mentioned a few).

>>
>> They are linked if you tick the relevant boxes in the Project manager
>> window - see the Memories section in the upper part of the panel.
>> Perhaps
>> I should add one more box for the "Term memory" there.
>
> I'd second that.

I take it as a RFE.

> Yes, AAMOF, this was the case. So what is the standard internal language
> mapping of CT?

The language code - lower case.
The country code - upper case.

e.g. en-US

> Other question: When importing a TMX memory, why does it say: UTF-8
> instead
> of UTF-16? Aren't TMX files supposed to be UTF-16?

I don't think UTF-16 is a standard for TMX. Other tools experts, please
correct me if I am wrong. But I am sure that UTF-8 is sufficient in most
cases and takes less RAM.

Cheerio,

Igor Kmitowski

unread,
Apr 5, 2012, 7:00:22 AM4/5/12
to cafetra...@googlegroups.com
Yes, go ahead with update.

Igor

>
> On 5 Apr 2012, at 17:20, Wolfgang Schoene wrote:
>
>> Hi again Igor
>> is it safe for CT to install this http://support.apple.com/kb/DL1515
>> Java update for the Mac?
>
> Not only safe, you should do it instantly. There's a rumour of a virus
> for Mac that spreads via Java and Flash. The update will fix it, if
> there's a problem at all.
>
>
> Cheers,
>
> Hans
>


--

Hans van den Broek

unread,
Apr 5, 2012, 7:04:30 AM4/5/12
to cafetra...@googlegroups.com

On 5 Apr 2012, at 17:58, Igor Kmitowski wrote:

> I don't think UTF-16 is a standard for TMX. Other tools experts, please correct me if I am wrong. But I am sure that UTF-8 is sufficient in most cases and takes less RAM.

How do I find out if the TMX is in UTF-8? Can't find it when I open a TMX in TextWrangler.

Elmars Sumanis

unread,
Apr 5, 2012, 7:10:03 AM4/5/12
to cafetra...@googlegroups.com

Hans, you may open the TMX in TextWrangler and look at the status bar in the lower part of TextWrangler window.

HTH,
Elmārs


2012/4/5 Hans van den Broek <ir...@indo.net.id>

Igor Kmitowski

unread,
Apr 5, 2012, 7:10:39 AM4/5/12
to cafetra...@googlegroups.com
Dnia 05-04-2012 o 13:04:30 Hans van den Broek <ir...@indo.net.id>
napisał(a):

>
> On 5 Apr 2012, at 17:58, Igor Kmitowski wrote:
>
>> I don't think UTF-16 is a standard for TMX. Other tools experts, please
>> correct me if I am wrong. But I am sure that UTF-8 is sufficient in
>> most cases and takes less RAM.
>
> How do I find out if the TMX is in UTF-8? Can't find it when I open a
> TMX in TextWrangler.

Yes, see the first line:

e.g. <?xml version="1.0" encoding="utf-8"?>

If the "encoding=" is missing, it is assumed to be utf-8.

Jean-Christophe Helary

unread,
Apr 5, 2012, 7:14:50 AM4/5/12
to cafetra...@googlegroups.com

On Apr 5, 2012, at 8:04 PM, Hans van den Broek wrote:

> How do I find out if the TMX is in UTF-8? Can't find it when I open a TMX in TextWrangler.

TextWrangler gives you the encoding at the bottom left of the window. In the status bar.

Jean-Christophe Helary
----------------------------------------
fun: http://mac4translators.blogspot.com
work: http://www.doublet.jp (ja/en > fr)
tweets: http://twitter.com/brandelune

Hans van den Broek

unread,
Apr 5, 2012, 7:21:06 AM4/5/12
to cafetra...@googlegroups.com

On 5 Apr 2012, at 18:10, Elmars Sumanis wrote:


Hans, you may open the TMX in TextWrangler and look at the status bar in the lower part of TextWrangler window.

Great! And it says UTF-8. Is there any chance that not everything is UTF-8?

Terima kasih,

Hans

-- 

Hans van den Broek
Schrijf-, vertaal- en redigeerwerk

Hans van den Broek

unread,
Apr 5, 2012, 7:23:48 AM4/5/12
to cafetra...@googlegroups.com
On 5 Apr 2012, at 18:10, Igor Kmitowski wrote:

e.g. <?xml version="1.0" encoding="utf-8"?>

If the "encoding=" is missing, it is assumed to be utf-8.

It is missing. But Elmars' suggestion saved me.

Cheers,

Hans

-- 

Hans van den Broek
Schrijf-, vertaal- en redigeerwerk
Peleman Rejowinangun KG1/513
RT029 RW009
Yogyakarta 55171
Indonesia
T  +62 (0)85868607333
SKYPE: hanstranslations
transl...@hansvandenbroek.eu
http://cafetran4mac.blogspot.com

Wolfgang Schoene

unread,
Apr 5, 2012, 7:25:54 AM4/5/12
to cafetra...@googlegroups.com
Hi Hans,
just saw a message in some forum about this virus and as I'm in the middle
of a longish project with CT I don't want to do anything wrong.
And thanks for the answer about External database, I somehow suspected I
didn't need it. Haven't succeeded in importing CSV glossaries so far into my
terms memory, though.

Cheers
Wolfgang


Am 05/04/12 12:30 schrieb "Hans van den Broek" unter <ir...@indo.net.id>:

Jean-Christophe Helary

unread,
Apr 5, 2012, 7:42:27 AM4/5/12
to cafetra...@googlegroups.com

On Apr 5, 2012, at 8:21 PM, Hans van den Broek wrote:

> Great! And it says UTF-8. Is there any chance that not everything is UTF-8?

No. Otherwise TextWrangler would have complained.

Hans van den Broek

unread,
Apr 5, 2012, 7:44:36 AM4/5/12
to cafetra...@googlegroups.com

On 5 Apr 2012, at 18:25, Wolfgang Schoene wrote:

> Haven't succeeded in importing CSV glossaries so far into my
> terms memory, though.

I think you should proceed as follows (Igor?)
- Add the csv file as a glossary in Library by clicking Add Resource
- Open it in Library under Glossaries
- Convert it using Memory>Conversions>Import entries from Glossary
- Memory>Save as TMX
- Place the converted Glossary in the same folder a your terminology database (I take it that one is already in TMX format)
- Open the folder in the Project Manager so both (or more) files will show up
- In the project, select the tab with the folder, and save as [Terminology DB] (or something)

Hans van den Broek

unread,
Apr 5, 2012, 7:46:07 AM4/5/12
to cafetra...@googlegroups.com

On 5 Apr 2012, at 18:42, Jean-Christophe Helary wrote:

> No. Otherwise TextWrangler would have complained.

Thanks for confirming. Out of curiosity, what would TW do or notify when that's the case?

Cheers,

Wolfgang Schoene

unread,
Apr 5, 2012, 7:52:11 AM4/5/12
to cafetra...@googlegroups.com
Thanks Hans,
guessed you were the expert ;-)
Will try asap.

Cheers
Wolfgang


Am 05/04/12 13:44 schrieb "Hans van den Broek" unter <ir...@indo.net.id>:

Jean-Christophe Helary

unread,
Apr 5, 2012, 7:55:39 AM4/5/12
to cafetra...@googlegroups.com

On Apr 5, 2012, at 8:46 PM, Hans van den Broek wrote:

>> No. Otherwise TextWrangler would have complained.
>
> Thanks for confirming. Out of curiosity, what would TW do or notify when that's the case?

It would show an error dialog saying that some characters are not compatible with the selected encoding. It would then offer you to select a different encoding or to display the offending characters.

Hans van den Broek

unread,
Apr 5, 2012, 7:57:17 AM4/5/12
to cafetra...@googlegroups.com

On 5 Apr 2012, at 18:52, Wolfgang Schoene wrote:

> guessed you were the expert ;-)

I am NOT. That's the main problem with my blog. I will ask for advice on that problem later in this forum. In the meantime, I finally managed to write a new entry, and it's on... Language Codes (your question).

Hans van den Broek

unread,
Apr 5, 2012, 8:03:32 AM4/5/12
to cafetra...@googlegroups.com

On 5 Apr 2012, at 18:55, Jean-Christophe Helary wrote:

> It would show an error dialog saying that some characters are not compatible with the selected encoding. It would then offer you to select a different encoding or to display the offending characters.

Good to know. Thanks. This should mean that I don't have any UTF code problems since the error dialog never showed up. I consider myself lucky, because my main databases were my own DV databases plus DBs from clients, including in SDL and Trados. And especially the latter can cause UTF issues, so I understood.

Wolfgang Schoene

unread,
May 3, 2012, 6:56:40 AM5/3/12
to cafetra...@googlegroups.com
Hi Igor
my System ID is: -1266857138:-76350075:-1689726651
Will download the update soon.

Best
Wolfgang


Igor Kmitowski

unread,
May 3, 2012, 7:04:43 AM5/3/12
to cafetra...@googlegroups.com
Hi Wolfgang,

Don't download it. I will send you the update files to you current
installation.

BR
Igor
Reply all
Reply to author
Forward
0 new messages