Re: I have a question about Nematus

Rico Sennrich

Sep 19, 2019, 11:14:49 AM

to safae berrichi, nematus...@googlegroups.com

Hi Safae,

What is the content of en-de/input-test.en? You say that you're running Nematus on Windows. That setup isn't heavily tested, and I'm not sure whether the problem is a compatibility issue in Nematus itself or in the way you invoke translate.py.
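One quick way to answer the question about the input file is to count how many lines it contains and how many of them are non-empty. The snippet below is a minimal sketch, not part of Nematus: `count_sentences` is a hypothetical helper, and it assumes the file is UTF-8 text with one sentence per line.

```python
def count_sentences(path):
    """Return (total_lines, non_empty_lines) for a one-sentence-per-line text file.

    Hypothetical helper for inspecting a test file; not part of Nematus.
    Assumes UTF-8 encoding.
    """
    total = 0
    non_empty = 0
    with open(path, encoding="utf-8") as f:
        for line in f:
            total += 1
            # A line that contains only whitespace counts as empty.
            if line.strip():
                non_empty += 1
    return total, non_empty
```

Calling `count_sentences("en-de/input-test.en")` and getting a non-empty count of 0 would point to the input file rather than the model.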

"python train.py -h" will show you some documentation of the parameters. Here are the ones you asked about:

validation parameters:
--valid_freq INT      validation frequency (default: 10000)

display parameters:
--disp_freq INT       display loss after INT updates (default: 1000)
--sample_freq INT     display some samples after INT updates (default: 10000)

valid_freq is useful for keeping an eye on training progress and for early stopping. disp_freq and sample_freq don't affect training; they are purely a way to get regular feedback about training progress. You can disable sample_freq, but I recommend using disp_freq, which also shows you the training speed and how many updates the model has made.
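To illustrate how the three settings relate, here is a minimal sketch of the usual "every INT updates" logic. It is not Nematus's actual training loop: `events_at` is a hypothetical function, the default values are taken from the help text above, and treating a frequency of 0 as "disabled" is an assumption made for this example.

```python
def events_at(update, disp_freq=1000, sample_freq=10000, valid_freq=10000):
    """Return the periodic actions that would fire after a given number of updates.

    Illustrative only; a frequency of 0 is treated as disabled (an assumption).
    """
    events = []
    if disp_freq and update % disp_freq == 0:
        events.append("display loss")
    if sample_freq and update % sample_freq == 0:
        events.append("display samples")
    if valid_freq and update % valid_freq == 0:
        events.append("validate")
    return events


# With all three set to 100, every action fires together every 100 updates.
for update in range(1, 301):
    fired = events_at(update, disp_freq=100, sample_freq=100, valid_freq=100)
    if fired:
        print(update, fired)
```

With the defaults, the loss would be reported ten times as often as samples are printed or validation is run. Setting all three to 100 makes them fire at the same updates, which is convenient for a small test run but noisy for real training.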

best wishes,
Rico

On 19/09/2019 11:27, safae berrichi wrote:

Dear Rico Sennrich,

I have just started testing your toolkit "Nematus" on Windows. Unfortunately, I have run into some difficulties when running it.

First, after training an RNN model on English-German, I ran the script ./translate.sh on the test sentences and received "INFO: Translated 1 Sent into 3.1948089599609375e-05 sec", but the resulting output file (reference.de) is empty. Is there an explanation for why no translation is produced for the test sentences? Please tell me where I am going wrong. My translation script is as follows:

THEANO_FLAGS=mode=FAST_RUN,floatX=float32,device=$device,on_unused_input=warn python nematus/translate.py -m models-en-de/model.npz -i en-de/input-test.en -o en-de/reference.de

Secondly, I would like, if possible, an explanation of the parameters used in the training script. What are the roles of, and the differences between, -sampleFreq 100 --validFreq=100 --dispFreq=100?

Thank you so much.

Best Regards,

Safae.

--

=========================================

Safae BERRICHI

PhD Candidate in
Computer Science

Laboratory

Department of Mathematics and Computer Science

Faculty of Science,
Mohammed First University

Oujda, Morocco.

Tel: (+212)6 50 36 08 79

=========================================

-- 
Rico Sennrich
School of Informatics
University of Edinburgh
10 Crichton Street
Edinburgh, EH8 9AB, United Kingdom
http://homepages.inf.ed.ac.uk/rsennric/
