Translation of numeric data

184 views
Skip to first unread message

wraith808

unread,
Jul 15, 2010, 4:03:35 PM7/15/10
to Google AJAX APIs
I am processing translation of financial documents, and when doing the
translation, Google translate removes the separators, i.e. if a number
is 12.465.733 in Danish, then when translated to English it will be
12465733. This is very problematic for my uses, as I use the
separators to distinguish between actual data and dates.

To further confuse the issue, when translating, I sometimes get the
issue, and other times the separators are left. An example of this is
translating the following from Danish to English:

1. Udgifterne til den post var 1.345.876.

This example keeps the separators.

2. Omkostningerne i forbindelse med punkt på dagsordenen er 1.254.678.

This example loses the separators.

Does anyone have any thoughts on this, or better yet, some sort of
parameters that I can send to Google translate so that it will not
translate numeric data?

TIA

Jeff S (Google)

unread,
Jul 22, 2010, 7:57:04 PM7/22/10
to Google AJAX APIs
Hi,

I'm not aware of any parameters that can be sent to change this
behavior, so I think the best bet would be to parse the results of the
translation to add back in the original numbers with formatting (since
those don't need translation, but their position might shift). That
would be a workaround for now, but this type of issue would apply to
other content as well (like HTML tags in translated text). I'd be
interested in coming up with some ways to make translating preserve
segments of the original message. What do you think would be most
useful?

Thank you,

Jeff
Reply all
Reply to author
Forward
0 new messages