For people with limited internet access: is it possible to translate a whole TMX or XLIFF file with Google Translate in one pass, on those moments where they do have access to the internet?
--
You received this message because you are subscribed to the Google Groups "CafeTranslators" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cafetranslato...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
On 24 Dec 2014, at 15:43, Rene <Yoi...@gmail.com> wrote:Don´t know about Goggle translate, but the "MyMemory" site (which I suppose links to Goggle) offers the option of uploading a file and getting a machine translated TMX back.
On 24 Dec 2014, at 15:51, Hans van den Broek <ir...@indo.net.id> wrote:...and I don’t think you can upload them as such to an MT.
On 24 Dec 2014, at 15:56, Hans van den Broek <ir...@indo.net.id> wrote:You can of course convert the TMX/XLF, and upload the SL part.
On 24 Dec 2014, at 16:17, Rene <Yoi...@gmail.com> wrote:I don´t understand the logic here.
If your TMX and SDLXIFF are "bilingual", why do you need to translate them in the first place?
From the question, I assumed that they are untranslated, so only contain the source language, aka monolingual and not bilingual.
--
On 24 Dec 2014, at 16:17, Rene <Yoi...@gmail.com> wrote:
...so only contain the source language, aka monolingual and not bilingual.
--
On 24 Dec 2014, at 16:54, Rene <Yoi...@gmail.com> wrote:Well, just remove all the gobblydegook. What is the problem?
Fwiw, I just checked, and it is possible to upload an SDLXLIFF file to Wordfast Anywhere. It gets automatically converted to a bilingual Wordfast/Trados type file.
That, you can download, and save the source from Wordfast, Trados, or simply from Word by writing a small macro. Et voila, a text source file without additional gunk.
On 24 Dec 2014, at 17:27, Rene <Yoi...@gmail.com> wrote:->>No problem. CT can do it in a flash. But the answer to the original question would still be No.CT does this in a flash? Nice. I am glad I just renewed my license in order to get the latest version. One of these days I will have to learn how to use the beast.
It does.
Good to know. You can use MT in CT, of course, but as far as I know, you can’t “pretranslate” an XLF or TMX file against an MT.
--
I don´t think he was, but alas he has given an idea to the usual suspect(s).
On 24 Dec 2014, at 18:24, Hans list <hans...@gmail.com> wrote:You wouldn't be asking for a feature here, would you?
On Wed, Dec 24, 2014 at 8:41 AM, Hans list <hans...@gmail.com> wrote:
For people with limited internet access: is it possible to translate a whole TMX or XLIFF file with Google Translate in one pass, on those moments where they do have access to the internet?
--
--
On 24 Dec 2014, at 19:17, Michael Beijer <mic...@beijer.uk> wrote:Not sure about what formats it can import and export
--
On 24 Dec 2014, at 20:32, Rene <Yoi...@gmail.com> wrote:No, not clear conceptually to me. OK, there is an empty column. Ie. instead of a line return, there is tab and line return. Whats the big deal about it? It is still a mononlingual source file.
On 24 Dec 2014, at 18:46, Hans list <hans...@gmail.com> wrote::) I already had the idea to post a request before I asked for a solution here.
--
On 28 dec. 2014, at 16:33, Hans list <hans...@gmail.com> wrote:Is it possible to set a penalty for a TM?
On 28 dec. 2014, at 16:33, Hans list <hans...@gmail.com> wrote:
what would the best way to prevent the pseudo-EMs from the Google Translate TM to be inserted automatically too?
On 28 12 2014, at 10:49, Hans van den Broek <ir...@indo.net.id> wrote:I don’t know, but I have a few related questions:
On 28 dec. 2014, at 17:14, Hans LIST <hans...@gmail.com> wrote:
Interesting questions …
Note however, that this article (and the whole wiki) is just my personal notepad. It is by no means a Help that has been approved by CafeTran's developer
On 28 dec. 2014, at 17:45, Hans List <hans...@gmail.com> wrote:By whom? The Hans vd B Wiki Police?
On 28 12 2014, at 11:50, Hans van den Broek <ir...@indo.net.id> wrote:On 28 dec. 2014, at 17:45, Hans List <hans...@gmail.com> wrote:By whom? The Hans vd B Wiki Police?Sorry for that, I had to include Moses in the message somehow. Did you watch the screencast?
On 28 dec. 2014, at 18:10, Hans LIST <hans...@gmail.com> wrote:Nope. Must write another article first.
--
On 28 dec. 2014, at 19:57, Michael Beijer <mic...@beijer.uk> wrote:As far as I know, Apertium is great, but not ready in "our" languages (i.e. DE/NL/EN).
On 28 dec. 2014, at 20:23, Michael Beijer <mic...@beijer.uk> wrote:Wouldn't it be much easier if we asked Igor
On 28 12 2014, at 13:34, Hans van den Broek <ir...@indo.net.id> wrote:On 28 dec. 2014, at 18:10, Hans LIST <hans...@gmail.com> wrote:Nope. Must write another article first.I have this strange feeling it’ll be related to the previous one. Very much so.
On 28 dec. 2014, at 21:06, Michael Beijer <mic...@beijer.uk> wrote:Do you mean doing it the Rainbow way (is easy)?
On 28 dec. 2014, at 21:02, Hans LIST <hans...@gmail.com> wrote:With four words this can be done in the web interface too. But with 40,000 or 100,000 words?
On 28 dec. 2014, at 21:19, Hans van den Broek <ir...@indo.net.id> wrote:Copy/paste SL and TL in say, Excel.
On 28 dec. 2014, at 21:24, Hans van den Broek <ir...@indo.net.id> wrote:
That reminds me, and I can’t check it yet, do the new Excel features also work for OOo/LibreOffice?
--
You received this message because you are subscribed to the Google Groups "CafeTranslators" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cafetranslato...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
On 28 dec. 2014, at 22:04, Michael Beijer <mic...@beijer.uk> wrote:One issue, however, might be segmentation (unless you generate an HTML or bilingual DOCX from CT first). Otherwise you'll end up with Google Translate's segmentation.
On 28 dec. 2014, at 22:11, Hans van den Broek <ir...@indo.net.id> wrote:the TL should be removed, of course. Which is one of my earlier questions
On 28 dec. 2014, at 22:04, Michael Beijer <mic...@beijer.uk> wrote:
One issue, however, might be segmentation
On 28 dec. 2014, at 22:24, Hans van den Broek <ir...@indo.net.id> wrote:That makes it even easier, since then there’s only one solution.
On 28 12 2014, at 16:11, Hans van den Broek <ir...@indo.net.id> wrote:what do the Thieves of Mountain view do with partly translated TMX/XLF files?
Yes. Still 'researching'. I'd like to create a starter's glossary, say DE>FR.
Just forget it. The translations of isolated words by Google Translate and Bing Translator are useless.
On 28 dec. 2014, at 22:24, Hans van den Broek <ir...@indo.net.id> wrote:That makes it even easier, since then there’s only one solution.Which would be:
- Open or create your project
- Convert it to HTML
- Copy SL (and TL if any TL) segments into an Excel file
- (If there are TL segments, sort on TL, and delete rows (to avoid any GT problems))
- Upload to GT*
- Paste TL in Excel file
- Import in CT TMX
*There’s a file size limit, but it’ll not exceed a day’s work (probably several days)No Rainbow, no API, no money. Simple.
Cheers,Hans
--
On 28 dec. 2014, at 23:04, Hans LIST <hans...@gmail.com> wrote:Yes. It is simple. Didn't know that you can upload a file.
On 29 dec. 2014, at 04:06, Michael Beijer <mic...@beijer.uk> wrote:My solution (blue)
On 28 dec. 2014, at 21:02, Hans LIST <hans...@gmail.com> wrote:
Yes. Still 'researching'. I'd like to create a starter's glossary, say DE>FR.
On 29 dec. 2014, at 06:18, Will Helton <will....@gmail.com> wrote:I've never been able to get this to work on any doc of more than one page/less than 200-250 words.
On 28 dec. 2014, at 23:16, Hans LIST <hans...@gmail.com> wrote:BTW: Google Translate Toolkit handles TMX.
--
On 29 dec. 2014, at 12:40, Hans List <hans...@gmail.com> wrote:Op 28 dec. 2014 om 22:56 heeft Hans van den Broek investors money.Care to elaborate?
On 29 dec. 2014, at 12:40, Hans List <hans...@gmail.com> wrote:Op 28 dec. 2014 om 22:56 heeft Hans van den Broek investors money.Care to elaborate?Of course not. It’s BS.In the meantime, I extracted a DE-FR glossary from JLearnit: https://dl.dropboxusercontent.com/u/2184204/CT/JLearnit%20DE-FR.xlsx.zip (100 kB, with a few mistakes, no doubt)
<Screen Shot 2014-12-29 at 11.22.54.png>
It’s “only” 4k words, but 40k is a lot. See the Wikipedia:Native-language vocabulary sizeNative speakers' vocabularies vary widely within a language, and are especially dependent on the level of the speaker's education. A 1995 study shows that junior-high students would be able to recognize the meanings of about 10,000–12,000 words, whereas for college students this number grows up to about 12,000–17,000 and for elderly adults up to about 17,000 or more.[10]Foreign-language vocabularyThe effects of vocabulary size on language comprehensionThe knowledge of the words deriving from the 3000 most frequent English word families and the 5000 most frequent words provides a comprehension of 95% of word use,[11] and knowledge of 5000 word families is necessary for 99.9% word coverage
Cheers,Hans--Hans van den BroekSchrijf-, vertaal- en redigeerwerkYogyakartaIndonesia
--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.
--
You received this message because you are subscribed to a topic in the Google Groups "CafeTranslators" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/cafetranslators/241iLTxh2I4/unsubscribe.
To unsubscribe from this group and all its topics, send an email to cafetranslato...@googlegroups.com.
On 29 dec. 2014, at 12:49, Rene <Yoi...@gmail.com> wrote:Well, would the result be any different if you send context-less isolated words to a human translator?Or look up isolated words in a general dictionary?
In fairness, that is hardly an MT issue.
--
On 29 dec. 2014, at 13:02, Rene <Yoi...@gmail.com> wrote:Err.... aren´t the "100 German words which are most frequent" a tad different, depending on what population group and what topic you look at?
Playing around some more:
On 29 dec. 2014, at 06:18, Will Helton <will....@gmail.com> wrote:I've never been able to get this to work on any doc of more than one page/less than 200-250 words.You continue to amaze me. I just uploaded an EN document of 7,422 words, and got it back translated into Dutch in a split second.
<Screen Shot 2014-12-29 at 06.28.35.png>
Need proof? I’ll send you both the original file and the GT version.The last sentences:
<Screen Shot 2014-12-29 at 06.34.59.png>
Cheers,Hans--Hans van den BroekSchrijf-, vertaal- en redigeerwerkYogyakartaIndonesia
--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.
--
On 29 dec. 2014, at 06:18, Will Helton <will....@gmail.com> wrote:I've never been able to get this to work on any doc of more than one page/less than 200-250 words.You continue to amaze me. I just uploaded an EN document of 7,422 words, and got it back translated into Dutch in a split second.
<Screen Shot 2014-12-29 at 06.28.35.png>
Need proof? I’ll send you both the original file and the GT version.The last sentences:
<Screen Shot 2014-12-29 at 06.34.59.png>
Cheers,
Hans--Hans van den BroekSchrijf-, vertaal- en redigeerwerk
YogyakartaIndonesia
--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.
--
On 29 dec. 2014, at 20:50, Will Helton <will....@gmail.com> wrote:Wait - are you talking about using the Google Translate web page?
If so, what good is that for importing anything into CT (unless you copy and paste each individual sentence)?
--
On 29 dec. 2014, at 22:50, Will Helton <will....@gmail.com> wrote:it wouldn't be that useful for someone like me
******************************************************
MICHAEL BEIJER
NL>EN translator & terminologist
24 Oakfield Rd., Hastings,
TN35 5AX, East Sussex, UK.
Tel. +44 (0)1424 435830
Mob. +44 (0)747 57717 20
Email: mic...@beijer.uk
Skype/Twitter: michaelbeijer
Beijer.uk (translation/terminology
work)
Proz.com profile: www.proz.com/translator/652138
Acronymbook.com (open source acronyms/abbreviations)
nederbrackets.com (Adventures in Dutch bracket (ab)use)
Wordbook.nl (terminology resources, focusing on NL/EN)
Beijerdeas.com (CAT tools +
AHK scripts)
Cafetranhelp.com/changelog
CafeTran mailing list: CafeTranslators
******************************************************
--
You received this message because you are subscribed to the Google Groups "CafeTranslators" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cafetranslators+unsubscribe@googlegroups.com.
Maybe I'm missing something, but why would you want to do this? That is, "run a Rainbow pipeline with MyMemory as the MT resource". It all looks very interesting, but what's the point? Is this to be able to use MT output at a later time when you might no longer have internet access?
When I have no internet access, I don't translate. Period. After all, you might have your resources, and possibly your MT output ("pre-gathered", using e.g. this method), and of course your brain – but you can't Google stuff (!!!), which is pretty much a show-stopper for me.
To unsubscribe from this group and stop receiving emails from it, send an email to cafetranslato...@googlegroups.com.