- < [ ((( première ||| by ))) [ [ ((( nation ||| an ))) [ ((( S’entend ||| Act ))) { < ((( soit ||| of ))) ((( d’une ||| ))) > } ] ] ((( bande, ||| Parliament. ))) ] ] [ [ [ ((( soit ||| First ))) [ ((( d’un ||| Nation ))) [ [ ((( groupe ||| means ))) < ((( autochtone ||| a ))) < [ [ ((( qui ||| band, ))) [ < ((( est ||| ))) < ((( partie ||| an ))) ((( à ||| or ))) > > < < [ ((( un ||| group ))) { < ((( accord ||| that ))) ((( sur ||| ))) > } ] ((( l’autonomie ||| ))) > ((( gouvernementale ||| Indigenous ))) > ] ] [ < ((( mis ||| party ))) ((( en ||| is ))) > ((( oeuvre ||| to ))) ] ] ((( par ||| a ))) > > ] ((( une ||| self-government ))) ] ] ] ((( loi ||| agreement ))) ] ((( fédérale. ||| implemented ))) ]
I tried a second test sentence with pialign, in case this was some kind of outlier, but the result was similar. I don't expect perfection, but this output appeared random.In disappointment, I turned to mgiza++, but its out-of-the-box response to the same input was even more incomprehensible, namely:
- # Sentence pair (1) source length 24 target length 36 alignment score : 4.33196e-97
première nation s ’ entend soit d ’ une bande , soit d ’ un groupe autochtone qui est partie à un accord sur l ’ autonomie gouvernementale mis en oeuvre par une loi fédérale . - NULL ({ 2 3 5 10 }) first ({ 29 30 31 32 36 }) nation ({ }) means ({ }) a ({ 33 34 35 }) band ({ }) , ({ 4 }) or ({ 6 }) an ({ }) indigenous ({ }) group ({ }) that ({ }) is ({ 11 12 13 15 16 17 18 19 22 }) party ({ 20 21 23 24 25 27 28 }) to ({ }) a ({ 9 }) self-government ({ }) agreement ({ }) implemented ({ }) by ({ }) an ({ 1 }) act ({ }) of ({ 7 }) parliament ({ }) . ({ 8 14 26 })
Is there anything I can do to get helpful results? I looked at other word alignment tools, namely eflomal, the Berkeley aligner, and tdx-nlp
but I wasn't able to get them to either compile or run. The Berkeley aligner, in particular, required a tree format version of the input, and its docs say a script to generate this would be released later, so that stumped me.
Any help in getting reasonable word alignment working would be greatly appreciated.
Many thanks,
Macho