Multi-transformer

121 views
Skip to first unread message

Tzahi Sofer

unread,
May 10, 2020, 1:14:40 PM5/10/20
to marian-nmt
Hi,

I want to user the multi-transformer model type for automatic post editing task. I found little documentation on how to use this type of model (validation set format etc.), and I also wonder if and how to use marian-server for translation with the trained model.

Would appriciate any feedback on this.

Thanks,

Tzahi

Roman Grundkiewicz

unread,
May 10, 2020, 1:29:29 PM5/10/20
to marian-nmt
We have a toy example for training a dual-source transformer model in our regression tests, which can show you options required for running it:

marian-server does not support decoding with dual-source models yet, but there is an active pull request for this:

Some outdated tutorials for training APE models are available at our website, though likely not so useful anymore:

Tzahi Sofer

unread,
May 11, 2020, 4:21:16 AM5/11/20
to marian-nmt
Thank you very much for the prompt response!

Can I conclude that there is no support in the marian-decoder as well? or am I wrong and still can make inference with the multi-transformer using the decoder?

Roman Grundkiewicz

unread,
May 11, 2020, 6:02:40 AM5/11/20
to marian-nmt
marian-decoder does support multi-source models. You can use them with marian-server too if you pull the code from that pull request.

Tzahi Sofer

unread,
May 11, 2020, 7:09:21 AM5/11/20
to marian-nmt
Tnx!
Reply all
Reply to author
Forward
0 new messages