Aligning Bilingual Segments

7 views
Skip to first unread message

Tharshan Muthulingam

unread,
Jul 26, 2014, 8:05:47 AM7/26/14
to pattern-f...@googlegroups.com
I am using Pattern to detect plurals and singulars using the POS tagger. I am currently working on a talk where I need to extract terms from bilingual pair of sentences.

For example:
    source = "Click here to remove all expired articles"
    target = "Haga clic aquí para eliminar todos los artículos caducados"

As you can see articles aligns with caducados. The correct matching pair is artículos. 

My plan is to use the POS tagger to analyse the parse tree and realign the sentences based on the tags. I was wondering if this is a good idea or if anyone can suggest better approaches?

Thanks.
Reply all
Reply to author
Forward
0 new messages