create TMX from Excel list

1,006 views
Skip to first unread message

mark possemiers

unread,
May 15, 2015, 11:40:24 AM5/15/15
to cafetra...@googlegroups.com
I have a wordlist (bilingual) in Excel that I want to convert into a TMX (specialized list of words of a client)
Anyone has any idea? Tried to find in Google but nothing there...

Hans list

unread,
May 15, 2015, 12:19:46 PM5/15/15
to cafetra...@googlegroups.com

M.E.

unread,
May 15, 2015, 12:22:45 PM5/15/15
to cafetra...@googlegroups.com
:-)  zelfde wat the EU Commission zegt   :-)
Hartelijk dank,
Mark




--
You received this message because you are subscribed to a topic in the Google Groups "CafeTranslators" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/cafetranslators/Dr-xh9n_PWQ/unsubscribe.
To unsubscribe from this group and all its topics, send an email to cafetranslato...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Michael Beijer

unread,
May 15, 2015, 2:27:48 PM5/15/15
to cafetra...@googlegroups.com
<This email was dictated using Dragon NaturallySpeaking 13 Professional. Please excuse any typos!>

​Another easy way to quickly convert an Excel glossary into a TMX is with the small, free tool TMX_maker_3.0.exe of András Farkas (developer of both LF Aligner and TMLookup). It is available in his "LF_aligner_4.1_win.zip" package, here: http://sourceforge.net/projects/aligner/files/?source=navbar

​Michael​

-------------------------- ••• 

TMX_maker_readme.txt

TMX maker version 3.0
by András Farkas

This should be pretty self-explanatory: the TMX maker is basically the TMX generator ripped out of the aligner and made into a separate program. As such, it uses the same setup file (LF_aligner_setup.txt).
If you didn't generate a TMX after an alignment with LF Aligner but later on you decide you need one after all, use aligned_XXX.txt as your input file. Of course, you can also use any other tab-delimited UTF-8 txt file, not just ones generated by LF Aligner.
As of version 2.23, the TMX maker supports xls input files on top of UTF-8 txt. Despite that, txt is still the recommended input format as it's more reliable. To get a txt from an xls, you can copy-paste the relevant parts of your spreadsheet to a text editor and use File/Save as to save it in UTF-8 encoding. If you do use an xls for input, all worksheets will be converted to txt and used as input.


Batch alignment using command line arguments:

As of version 3.0, the TMX maker can be launched with command line arguments, just like the aligner. This allows you to generate many TMX files at once, and it also makes it possible to integrate the tmx maker with your own software (write software that calls the tmx maker in an automated fashion without human interaction). Note that you do not need to use command line arguments to generate one TMX file out of many input files (see end of readme).

The easiest way to use command line arguments in Windows is to open a text editor and type in the commands, one command per line. Save the file as a .bat in the same folder as the aligner, then just double click it to launch. On linux and mac, you need to write a simple bash script that does the same. When launching the aligner with command line arguments, the GUI is switched off.
The arguments (settings) have to be provided with the following syntax: --[argname]="[argvalue]" or --[argname]="[value1]","[value2]". Note that there is no space after the comma. You can enter the arguments in any order, and you can leave out arguments.

There are three arguments:

Infiles - the (one or more) input files. You have to provide the full file paths, separated by commas.
Codes - TMX language codes, separated by commas.
Outfile - The full path and name of the output file. If omitted, the default is set automatically.


Here's a sample command:

C:\aligner_perl\TMX_maker_3.0.exe -i "C:\aligner\new\aligned_32006L0112_en-hu-es.txt","C:\aligner\new\aligned_COM_2012_0550_en-hu-es.txt" -c
 en-gb,hu-hu,es-es


Note that file names that contain a space must be in double quotes. Therefore, it's good policy to write all file names in double quotes (unless you drag and drop them into the console). As values are comma separated, file names that contain a comma are not supported.
There isn't a lot of error checking done on command line input, so make sure you follow the syntax, don't make typos etc. If you get an "Unknown option: xxx" error message when the script starts, you've messed up.
As a general rule, perl - and thus LF Aligner - does not support non-ASCII file names on Windows. However, it appears that command line argument mode is not subject to this limitation. I.e. if you use command line arguments to specify the file names, you can process files like c:\úőóüí\file.txt or c:\folder\ííí.html, which would otherwise be rejected.



There are two noteworthy features that are not self-evident:
- The TMX maker can generate multilingual TMX files, e.g. with English, Spanish, Italian and French all in the same TMX. When you import a multilingual TMX, your CAT tool will decide which languages to import based on the setup of your TM.
- The TMX maker can also handle multiple input files in Windows. You can just select any number of input files in the browsing window while keeping Ctrl pressed, and they will all be imported into the same TMX. You can even mix xls and txt input files and make a TMX out of them in one go. If there is more than one input file, the default name of the output file will be same as the last input file.

-------------------------- ••• 
TMX_maker_3.0_changelog.txt

New in 3.0
- Graphical user interface added (enabled by default on Windows)
- Segments with no text in one of the languages skipped when writing bilingual TMX files
- Command line arguments for automatic/batch generation of TMX files
- You can specify the output file

New in 2.23
- Xls input files now accepted as well as txt - txt is still recommended for reliability

New in 2.22
- Note printing bug fixed (when taking notes from the input file, the note in the first segment was propagated to all)

New in 2.21
- GUI for choosing input and output files on Windows (can be disabled in setup)
- Support for multiple import files in GUI (just choose several files, they will be merged into one TMX)

New in 2.2
- Support for multilingual TMX files (there is no upper limit)

New in 2.11
- Malformed lines are now skipped to make use with messy files more convenient

New in 2.1
- Better Studio compatibility: creator name, creation time and Note should now be recognized by Studio (the TMX generator now spoofs the Trados o-tmf string)

******************************************************

MICHAEL BEIJER
NL>EN translator & terminologist
116 Stonefield Road, Hastings,
TN34 1QA, East Sussex, UK.
Tel. +44 (0)1424 430250
Mob. +44 (0)747 577 17 20
Email: mic...@beijer.uk
Skype/Twitter: michaelbeijer
Beijer.uk (translation/terminology work)
Proz.com profile: www.proz.com/translator/652138
Acronymbook.com (open source acronyms/abbreviations)
nederbrackets.com (Adventures in Dutch bracket (ab)use)
Wordbook.nl (terminology resources, focusing on NL/EN)
Beijerdeas.com (CAT tools + AHK scripts)
Cafetranhelp.com/changelog
CafeTran mailing list: CafeTranslators
******************************************************


On Fri, May 15, 2015 at 4:40 PM, mark possemiers <checkem...@gmail.com> wrote:
I have a wordlist (bilingual) in Excel that I want to convert into a TMX (specialized list of words of a client)
Anyone has any idea? Tried to find in Google but nothing there...

--
You received this message because you are subscribed to the Google Groups "CafeTranslators" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cafetranslato...@googlegroups.com.

Hans List

unread,
May 15, 2015, 2:33:12 PM5/15/15
to CafeTran Google Group

On 15 5 2015, at 20:27, Michael Beijer <mic...@beijer.uk> wrote:

<This email was dictated using Dragon NaturallySpeaking 13 Professional. Please excuse any typos!>

​Another easy way to quickly convert an Excel glossary into a TMX is with the small, free tool TMX_maker_3.0.exe of András Farkas (developer of both LF Aligner and TMLookup). It is available in his "LF_aligner_4.1_win.zip" package, here: http://sourceforge.net/projects/aligner/files/?source=navbar


Migrate to Windows
Install extra software
Execute easy solution

Right...

Will Helton

unread,
May 15, 2015, 2:38:50 PM5/15/15
to cafetra...@googlegroups.com
What's the emoticon for a raspberry? ;)

Will

Sent from my iPhone
Verstuurd vanaf mijn iPhone

Michael Beijer

unread,
May 15, 2015, 2:43:13 PM5/15/15
to cafetra...@googlegroups.com
Oh, sorry, forgot everyone in this Google group is on Mac except me ;>)

Michael

Hans List

unread,
May 15, 2015, 2:46:43 PM5/15/15
to CafeTran Google Group

On 15 5 2015, at 20:42, Michael Beijer <mic...@beijer.uk> wrote:

Oh, sorry, forgot everyone in this Google group is on Mac except me ;>)

Not a prob

Martin Brüggemeier

unread,
May 15, 2015, 2:56:30 PM5/15/15
to cafetra...@googlegroups.com


Am 15.05.2015 20:42:31, "Michael Beijer" <mic...@beijer.uk> schrieb:

>Oh, sorry, forgot everyone in this Google group is on Mac except me ;>)
>
Not everyone. I use Windows too (and appreciate info on Windows tools).

Have a nice evening!
Martin


--
Martin Brüggemeier
Dipl.-Übersetzer (EN<>DE, ES>DE, EO<>DE)
Josefstr. 27, 48683 Ahaus, Deutschland
Tel.: +49 (0) 2561 7559073
Fax: +49 (0) 2561 7559074
Mobil: +49 (0) 1578 7559075
http://www.martin-brueggemeier.de


M.E.

unread,
May 15, 2015, 3:46:40 PM5/15/15
to cafetra...@googlegroups.com
many thanks for all the replies.

Now another stupid question:
How do I make sure that the tmx words come up automatically when I translate a text.
This client is very peculiar …

Another question: do they still sell windows?
:-)
> --
> You received this message because you are subscribed to a topic in the Google Groups "CafeTranslators" group.
> To unsubscribe from this topic, visit https://groups.google.com/d/topic/cafetranslators/Dr-xh9n_PWQ/unsubscribe.
> To unsubscribe from this group and all its topics, send an email to cafetranslato...@googlegroups.com.

Hans list

unread,
May 16, 2015, 1:07:59 AM5/16/15
to cafetra...@googlegroups.com
As of Windows 10 they will be giving it away.

Hans van den Broek

unread,
May 16, 2015, 1:34:49 AM5/16/15
to Cafetran support

> On 16 May 2015, at 02:46, M.E. <checkem...@gmail.com> wrote:
>
> How do I make sure that the tmx words come up automatically when I translate a text.

By assigning it the highest priority, while enabling Auto-Assembly (including insert AA). To be able to check it, I usually dock the client’s TM next to the translation pane, rather than in the tabbed pane.


Cheers,

Hans

--

Hans van den Broek
Schrijf-, vertaal- en redigeerwerk
Yogyakarta
Indonesia
http://cafetran4mac.blogspot.com




Hans List

unread,
May 16, 2015, 1:44:10 AM5/16/15
to cafetra...@googlegroups.com
Would that be the Translation editor, consisting of the Source language pane and its counterpart the Target language pane?

Hans van den Broek

unread,
May 16, 2015, 1:53:18 AM5/16/15
to Cafetran support

> On 16 May 2015, at 12:44, Hans List <hans...@gmail.com> wrote:
>
> Would that be the Translation editor, consisting of the Source language pane and its counterpart the Target language pane?

Yep. It doesn’t really matter where you dock it, as long as you can keep an I on it all the time.

Hans list

unread,
May 16, 2015, 2:02:31 AM5/16/15
to cafetra...@googlegroups.com
Then I'll dock myself here:

http://en.m.wikipedia.org/wiki/%28Sittin%27_On%29_The_Dock_of_the_Bay

Igor: regarding our discussion about web resources, please note the %27 etc plus underscore here.

Hans van den Broek

unread,
May 16, 2015, 2:07:50 AM5/16/15
to Cafetran support
By the way, be sure to check the settings of your TMs after clicking Next to start translating (by right-clicking in the window of the TMs), since the Dashboard will turn the settings of all TMs into the last one applied.
Also, I set the client’s TM to Read-Only, to avoid “polluting” it with my own translation.

Hans List

unread,
May 16, 2015, 2:09:44 AM5/16/15
to cafetra...@googlegroups.com
Yup. And in a build to come, these settings will be remembered.
Reply all
Reply to author
Forward
0 new messages