Hi,
Yes, this will be fixed in the next release, which should arrive in a couple of days (over the weekend most likely). It’s fixed internally, but not yet pushed to the public.
From: Jeremiah Chow
Sent: Friday, April 15, 2022 7:19 AM
To: marian-nmt
Subject: [EXTERNAL] [marian-nmt] Error using guided-alignment and fp16 together
You don't often get email from whendr...@gmail.com. Learn why this is important |
--
You received this message because you are subscribed to the Google Groups "marian-nmt" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
marian-nmt+...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/marian-nmt/9f04cbc9-dd12-4805-8bd7-26b99e9d7d7fn%40googlegroups.com.
You are reloading from the model npz only, if I am not wrong. You also need the *.optimizer.npz, that’s where the optimizer parameters sit.
To view this discussion on the web visit https://groups.google.com/d/msgid/marian-nmt/2fe831b6-d644-4f83-9ee1-de22d732378fn%40googlegroups.com.
Hm, there is no way this doesn’t work, we have tons of automatic regression tests for that. At least for the default case.
What are the exact commands you ran for training and restarted training?
From: Jeremiah Chow
Sent: Saturday, April 16, 2022 9:10 AM
To: marian-nmt
Subject: Re: [EXTERNAL] [marian-nmt] Error using guided-alignment and fp16 together
You don't often get email from whendr...@gmail.com. Learn why this is important |
Yes, I understand that is the case, that's why I added the --relative-path option but marian does not see the npz.optimizer.npz which is in the same folder. Is there any document I need to modify (such as one of the .yml files) to point Marian to the optimizer file? Thanks and cheers
To view this discussion on the web visit https://groups.google.com/d/msgid/marian-nmt/0b062836-cd86-4a60-b57c-a582999e6c8dn%40googlegroups.com.
Hm, looked at the code. These messages appear when the checkpoint file was found, but does not contain the correct data. I would say *.optimizer.npz is corrupted. Is it possible you interrupted while *.optimizer.npz was not fully written to disk yet?
To view this discussion on the web visit https://groups.google.com/d/msgid/marian-nmt/f6c2238e-e999-47af-8bdf-eb228e30177cn%40googlegroups.com.
Oh wait. You “switched to marian-dev”, do you mean from an older release in marian? We did change the optimizer.npz format at some point, so old checkpoints would not be expected to work. Old models files are still compatible.
To view this discussion on the web visit https://groups.google.com/d/msgid/marian-nmt/87e5e615-755d-4196-b86b-189e13418768n%40googlegroups.com.
Then my bet is still corrupted optimizer file. It’s finding the file, otherwise you would get a different error message, but then it’s not finding the right fields inside the file. There isn’t really any other option than bad optimizer file, however that might have happened.
Could dropbox syncing be responsible? I have lost data due to bad syncing via dropbox in the past.
To view this discussion on the web visit https://groups.google.com/d/msgid/marian-nmt/82bd6c22-904b-4ed8-92e2-0009b55d14c8n%40googlegroups.com.
Ah, email address: marc...@microsoft.com
I am curious to take a look at the optimizer.npz
To view this discussion on the web visit https://groups.google.com/d/msgid/marian-nmt/SA1PR21MB1288271F1E25DE6D283CB254E9F19%40SA1PR21MB1288.namprd21.prod.outlook.com.
To unsubscribe from this group and stop receiving emails from it, send an email to marian-nmt+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/marian-nmt/82bd6c22-904b-4ed8-92e2-0009b55d14c8n%40googlegroups.com.
--
You received this message because you are subscribed to the Google Groups "marian-nmt" group.
To unsubscribe from this group and stop receiving emails from it, send an email to marian-nmt+unsubscribe@googlegroups.com.