Deleting duplicate TMX units


Will Helton

Aug 21, 2014, 10:05:45 AM
to cafetra...@googlegroups.com
Can I just confirm that CT will never delete anything other than
duplicates from a TM using the "Task > Delete duplicate TMX units" option?

I've run this a few times on a multi-document project today. At least
twice it reported deleting over 300 duplicates, and just now it reported
400 units.

Thanks,

Will

Igor Kmitowski

Aug 21, 2014, 10:21:09 AM
to cafetra...@googlegroups.com
Hi Will,

Yes, in this case CafeTran removes only identical TM units, that is,
units whose source and target segments are both the same. See also
Task > Remove TMX units with duplicate source segments.

Igor
--
Igor Kmitowski
Translator and Java developer
CafeTran website: http://www.cafetran.com
CafeTran support: cafetran...@gmail.com

Will Helton

Aug 21, 2014, 10:48:43 AM
to cafetra...@googlegroups.com
Hi Igor,

Thanks. Does that impact on alternative translations if I run that?

How is this different from "Remove duplicate TMX units"? I thought
duplication was based on source segments by default.

Will


Igor Kmitowski

Aug 21, 2014, 10:58:42 AM
to cafetra...@googlegroups.com
> Thanks. Does that impact on alternative translations if I run that?

Task > Remove TMX units with duplicate source segments removes units that
share the same source, whether the target is the same or different (an
alternative translation). So yes, it affects alternative translations.

Task > Remove duplicate TMX units removes only the duplicate units that
have both source and target segments the same. They are true duplicates
both on the source and target side.

Igor
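
To make the difference between the two tasks concrete, here is a minimal Python sketch. It is not CafeTran's code: the function names, the sample units, and the choice to keep the first occurrence of each duplicate are all assumptions made for illustration.

```python
# Each TM unit is modelled as a (source, target) pair.
# Real TMX units carry more data; this is a simplification.
units = [
    ("Open the file.", "Open the file (A)."),
    ("Open the file.", "Open the file (A)."),  # true duplicate: same source and target
    ("Open the file.", "Open the file (B)."),  # alternative translation: same source only
    ("Save the file.", "Save the file (A)."),
]


def remove_duplicate_units(units):
    """Drop units whose source AND target both match an earlier unit."""
    seen = set()
    kept = []
    for source, target in units:
        key = (source, target)
        if key not in seen:
            seen.add(key)
            kept.append((source, target))
    return kept


def remove_units_with_duplicate_source(units):
    """Drop units whose source matches an earlier unit, whatever the target."""
    seen = set()
    kept = []
    for source, target in units:
        if source not in seen:
            seen.add(source)
            kept.append((source, target))
    return kept


print(len(remove_duplicate_units(units)))              # 3: the alternative survives
print(len(remove_units_with_duplicate_source(units)))  # 2: the alternative is removed
```

Under these assumptions, only the second function touches alternative translations, which matches the description above.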



Will Helton

Aug 21, 2014, 11:20:09 AM
to cafetra...@googlegroups.com
Hi Igor,

Thanks. Ok, I'm a little stumped again.

I've just completed a project and done the following:

1. Memory > Import > Import segments from Project

2. Task > Remove duplicate TMX units (54 hits returned)

3. Memory > Save memory as (saved this by project name to my customer
repository)

I have now opened this newly named TM file with Heartsome and it shows
exactly 4 translated segments out of some 28 document segments.

If I open the same file with TextWrangler it shows 32 segments -
duplicates of the first 3-4 segments and then my later work.

I'm going to try restarting my computer and see if that makes any
difference in the results I'm getting. I really hope so.

Will


Will Helton

Aug 21, 2014, 11:22:22 AM
to cafetra...@googlegroups.com
Wait, wait, 86 that.

The target language setting was en-GB, but the project was en-US.

I would never have thought that would stop Heartsome from displaying all
the units.

Disaster averted - I hope. :)

Will


Michael Beijer

Aug 21, 2014, 11:28:28 AM
to cafetra...@googlegroups.com
I was just going to say the same thing: in Heartsome you have to make sure you select the correct language variants. I had that same problem. For example, plain ‘nl’ and ‘nl-NL’ will show different sets of segments.
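
One way to check which language codes a TMX file actually contains, independently of any TM editor, is to count the `xml:lang` values on its `tuv` elements. The Python sketch below does this on a small inline sample. The sample content and function names are made up for illustration, and the exact-match filter is only an assumption about how a viewer might select units, not a description of Heartsome's internals.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# ElementTree exposes the xml:lang attribute under the XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# Hypothetical sample: three units, two with en-US targets and one with en-GB.
SAMPLE_TMX = """<tmx version="1.4">
  <header creationtool="example" creationtoolversion="1.0" segtype="sentence"
          o-tmf="example" adminlang="en-US" srclang="nl-NL" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="nl-NL"><seg>kleur</seg></tuv>
      <tuv xml:lang="en-US"><seg>color</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="nl-NL"><seg>centrum</seg></tuv>
      <tuv xml:lang="en-US"><seg>center</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="nl-NL"><seg>kleur</seg></tuv>
      <tuv xml:lang="en-GB"><seg>colour</seg></tuv>
    </tu>
  </body>
</tmx>"""


def count_langs(tmx_text):
    """Count how many <tuv> elements carry each language code."""
    root = ET.fromstring(tmx_text)
    return Counter(tuv.get(XML_LANG) for tuv in root.iter("tuv"))


def units_for_lang(tmx_text, lang):
    """Return the <tu> elements that have a <tuv> with exactly this language code."""
    root = ET.fromstring(tmx_text)
    wanted = lang.lower()
    return [
        tu for tu in root.iter("tu")
        if any((tuv.get(XML_LANG) or "").lower() == wanted for tuv in tu.iter("tuv"))
    ]


print(count_langs(SAMPLE_TMX))                   # nl-NL: 3, en-US: 2, en-GB: 1
print(len(units_for_lang(SAMPLE_TMX, "en-US")))  # 2
print(len(units_for_lang(SAMPLE_TMX, "en-GB")))  # 1
```

With an exact match on the code, selecting en-GB in this sample shows only one of the three units, which is the kind of mismatch described in this thread.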

Will Helton

Aug 21, 2014, 12:41:40 PM
to cafetra...@googlegroups.com
One could argue that's a thorough approach.

I'd be likely to say it's a pain in the keister. :)