missing makefile in word-aligner

Vivek Mahajan

Nov 19, 2012, 12:38:31 AM
to cdec-...@googlegroups.com
Hi guys,

I am trying to follow the instructions given here: http://cdec-decoder.org/index.php?title=Lexical_translation_/_Word_alignment

But at the end of the make step I get this error:

make[1]: Leaving directory `../cdec/word-aligner/talign/grammars'
Making model-f-e ...
make[1]: Entering directory `../cdec/word-aligner/talign/model-f-e'
make[1]: *** No targets specified and no makefile found.  Stop.
make[1]: Leaving directory `../cdec/word-aligner/talign/model-f-e'
make: *** [all] Error 1


To me it looks like there should be a makefile inside ../cdec/word-aligner/talign/model-f-e which should finally give the alignments, but that's not the case.

Also, when I use a bigger French-English parallel corpus, I get lots of errors like this:

utf8 "\xE9" does not map to Unicode at /home/vivek/Downloads/giza-pp/cdec/word-aligner/support/make_lex_grammar.pl line 26, <M1> line 11475.
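
Perl reports this message when a file opened as UTF-8 contains a byte sequence that is not valid UTF-8. The byte 0xE9 is "é" in Latin-1 (ISO-8859-1), which is common in French text, so one plausible cause is that the corpus is stored in Latin-1 rather than UTF-8. The thread does not confirm the encoding, so treat that as an assumption. Below is a minimal Python sketch for locating the offending lines and re-encoding the file under that assumption; the function names and the default source encoding are hypothetical and not part of cdec.

```python
def find_invalid_utf8_lines(path):
    """Return the 1-based numbers of lines whose bytes are not valid UTF-8."""
    bad = []
    with open(path, "rb") as f:
        for lineno, raw in enumerate(f, start=1):
            try:
                raw.decode("utf-8")
            except UnicodeDecodeError:
                bad.append(lineno)
    return bad


def reencode_to_utf8(src, dst, source_encoding="latin-1"):
    """Copy src to dst as UTF-8, ASSUMING src is in source_encoding.

    Latin-1 is only a guess for a French corpus; check a few of the
    flagged lines by eye before trusting the converted output.
    """
    with open(src, "rb") as fin, open(dst, "wb") as fout:
        for raw in fin:
            fout.write(raw.decode(source_encoding).encode("utf-8"))
```

Running `find_invalid_utf8_lines` on the corpus first shows whether the problem is confined to a few lines or affects the whole file. If the file really is Latin-1, the standard `iconv -f ISO-8859-1 -t UTF-8` command does the same conversion from the shell.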


Can you please help me with this?

Thanks,
Vivek

waleed ammar

Nov 19, 2012, 1:50:12 AM
to cdec-...@googlegroups.com
Hi Vivek,

I think these instructions are a little outdated now. Consider using fast_align, which has more recent documentation. It implements a different word alignment model, though.

waleed

vivek mahajan

Nov 19, 2012, 1:56:26 AM
to cdec-...@googlegroups.com
Hi Waleed,

Thanks for the quick reply. Yes, I looked at that, but I specifically need the model proposed in this paper: http://www.aclweb.org/anthology/P/P11/P11-1042.pdf ("Unsupervised Word Alignment with Arbitrary Features" by Chris Dyer et al.).

Best,
Vivek

Chris Dyer

Nov 20, 2012, 2:04:29 PM
to cdec-...@googlegroups.com
Hi Vivek,
My apologies for only getting back to you now. I've checked in a fix
to the word aligner so that the Makefile works as advertised. You will
need to pull down the most recent version of cdec and recompile, but
it should work a lot better.

-Chris

vivek mahajan

Nov 20, 2012, 3:09:37 PM
to cdec-...@googlegroups.com
Hi Chris,

Thank you so much. It works now.

Best,
Vivek