I am using Pattern to detect plurals and singulars using the POS tagger. I am currently working on a talk where I need to extract terms from bilingual pair of sentences.
For example:
source = "Click here to remove all expired articles"
target = "Haga clic aquí para eliminar todos los artículos caducados"
As you can see
articles aligns with
caducados. The correct matching pair is
artículos.
My plan is to use the POS tagger to analyse the parse tree and realign the sentences based on the tags. I was wondering if this is a good idea or if anyone can suggest better approaches?
Thanks.