Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

[BUGS] BUG #14278: Problem searching spanish words with accent mark outside the stem

0 views
Skip to first unread message

pa...@hernandezgomez.com

unread,
Aug 4, 2016, 9:52:06 AM8/4/16
to
The following bug has been logged on the website:

Bug reference: 14278
Logged by: Paco Hernández
Email address: pa...@hernandezgomez.com
PostgreSQL version: 9.6beta3
Operating system: Linux
Description:

Dear sirs:

Search without accent mark is not working correctly when the accent mark is
outside the stem of the word.

For example, this matches correctly:

postgres=# select to_tsvector('spanish', 'canción') @@ to_tsquery('spanish',
'cancion');
?column?
----------
t
(1 row)

This works and returns true because the stem of "canción" is "cancion", so
when we search for "cancion" (without accent mark), it matches correctly.

But, when the accent mark is outside the stem, for example in "peluquería",
then it does not work because the stem of "peluquería" is "peluqu", but
to_tsquery('spanish', 'peluqueria') is "peluqueri".

postgres=# select to_tsvector('spanish', 'peluquería') @@
to_tsquery('spanish', 'peluqueria');
?column?
----------
f
(1 row)

This is important because there are many people that don't use the accent
mark at letter "i" in "peluquería" and words like that.

Thank you very much.

Best regards,
Paco Hernández.

--
Sent via pgsql-bugs mailing list (pgsql...@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-bugs

0 new messages