Agglutinative languages?

0 views
Skip to first unread message

Gazs

unread,
Nov 28, 2009, 3:55:57 PM11/28/09
to ubiquity-firefox
Hello,

I'm toying around with creating a Hungarian language parser for
Ubiquity, but I have a big problem: how can I tell Ubiquity that not
only is Hungarian left-branching, the suffixes (which show the roles)
are glued to the end of the words (...which sometimes assimilate as
well, but that's a later problem).

There are two ways I thought I could make it work with Ubiquity. The
wordBreaker function from the Japanese parser seems unfortunately too
rigid (it mercilessly chops off everything that looks like a suffix).
The other function that seemed like it could work was the
normalizeArgument found in romance language parsers, but I couldn't
make it work. Would this be what I'm looking for?

Thanks for any help,
Gazs

"mitcho (Michael 芳貴 Erlewine)"

unread,
Nov 28, 2009, 4:16:52 PM11/28/09
to ubiquity...@googlegroups.com
Gazs,

wordBreaker is what you're looking for. Unfortunately, there are a number of issues involved with working with strongly case-marked languages, like Hungarian, only the first of which is writing a custom wordBreaker to "de-affix" the arguments. Here's an explanation here:

http://mitcho.com/blog/projects/in-case-of-case/

mitcho
> --
>
> You received this message because you are subscribed to the Google Groups "ubiquity-firefox" group.
> To post to this group, send email to ubiquity...@googlegroups.com.
> To unsubscribe from this group, send email to ubiquity-firef...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/ubiquity-firefox?hl=en.
>
>

--
mitcho (Michael 芳貴 Erlewine)
mit...@mitcho.com
http://mitcho.com/
linguist, coder, teacher

Gazs

unread,
Dec 1, 2009, 4:14:03 PM12/1/09
to ubiquity-firefox
Thanks. Right now, I'm trying to figure it out as we go, I really hope
some crazy-complicated regexes will cover most scenarios.

Btw, what is the official/preferred way to report bugs? This group or
the getsatisfaction page or the trac?

Gazs

On Nov 28, 10:16 pm, "mitcho (Michael 芳貴 Erlewine)"

> > For more options, visit this group athttp://groups.google.com/group/ubiquity-firefox?hl=en.

"mitcho (Michael 芳貴 Erlewine)"

unread,
Dec 1, 2009, 5:16:35 PM12/1/09
to ubiquity...@googlegroups.com
> Btw, what is the official/preferred way to report bugs? This group or
> the getsatisfaction page or the trac?

Trac.
> For more options, visit this group at http://groups.google.com/group/ubiquity-firefox?hl=en.
>
>

--
mitcho (Michael 芳貴 Erlewine)
Reply all
Reply to author
Forward
0 new messages