enc-depth and dec-depth

yvan Zhu

Nov 12, 2022, 4:45:52 AM
to marian-nmt
Hi Marian Team,

I wanted to share something I noticed while using Marian, and I would appreciate it if someone could explain it.
I am training models with Marian's transformer model type, using the parameters from this example as a reference: https://github.com/marian-nmt/marian-examples/tree/master/wmt2017-transformer
Here is my question: the model type I chose for training is transformer, and I found that model performance differed dramatically with different enc-depth and dec-depth values. However, the documentation lists enc-depth and dec-depth as pointing to "s2s", so I am wondering whether any of you could explain this: https://marian-nmt.github.io/docs/cmd/marian/

Thanks in advance!!

Best,
Dele Zhu