Dear Sir,First of all, Thank you so your response. I am sorry to say that I am novice but enthusiast about this sector. After following your scripts it gives me more hope to move forward.
Unfortunately, I encounter a new problem:
"UnicodeDecodeError: 'utf-8' codec can't decode byte 0x91 in position 2889: invalid start byte"
Here is the command I wrote,
```python nematus/train.py --datasets small_test_data/TrainEn.txt small_test_data/trainBn.txt --dictionaries orginal_data/training.en.json orginal_data/training.bn.json --valid_source_dataset small_test_data/trainDevEn.txt --valid_target_dataset small_test_data/trainDevBn.txt --dim_word 256 --dim 512 --n_words_src 30000 --n_words 30000 --maxlen 50 --optimizer adam --lrate 0.0001 --batch_size 40 --no_shuffle --dispFreq 500 --finish_after 10000
```I attached 3 files Source, Target and Dictionary files which may help u find out actual Problem.
Here is the screenshots what I got followed by the command,
On Wed, Apr 29, 2020 at 11:49 PM Rico Sennrich <rico.s...@gmx.ch> wrote:
Hello Shantanu,
this is just a toy dataset with 1000 sentences, not enough to build a strong translation model.
I suggest you have a look at https://github.com/EdinburghNLP/wmt17-transformer-scripts/tree/master/training , which gives instructions how to train a well-performing system for English-German.
best wishes,Rico
On 29/04/2020 18:43, Shantanu Nath wrote:
Dear Sir,
I want to train on a data set which is on Test/en-de folder. It creates model but when i want to translate it shows nothing.
I kept the same parameter u have provided on "train.sh"
Should I increase the value of "finish_after " from 500 to 10,000??
Best regards,
Shantanu Nath
--
You received this message because you are subscribed to the Google Groups "Nematus Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email to nematus-suppo...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/nematus-support/e8c185d3-4517-455b-8bf9-fbf3af45b37d%40googlegroups.com.