Search and Indexing / Lucene.Net non standard analyzers and translitteration

54 views
Skip to first unread message

Nik

unread,
Mar 23, 2021, 7:37:18 AM3/23/21
to RavenDB - an awesome database
Hi,

In the past I've used external systems for full-text search like Solr and ES, but being able to do that using RavenDB would be very beneficial.

From what I understood RavenDB internally is using Lucene.net 3.0.3 ?

I am asking as I need to use a language specific analyzers which seems to be missing in Lucene.net 3.0.3, for example the Serbian analyzer which transliterates Cyrillic to bald Latin and has language specific stop-words.

What would be the suggested approach here? Implement a custom analyzer against Lucene 3.0.3 and include that DLL with RavenDb server?

Paweł Pekról

unread,
Mar 23, 2021, 10:12:33 AM3/23/21
to rav...@googlegroups.com
Hi Nik,

beside supporting bunch of collation analyzers (list here: https://github.com/ravendb/ravendb/tree/v5.1/src/Raven.Server/Documents/Indexes/Persistence/Lucene/Analyzers/Collation/Cultures) the only way to add additional analyzers is to create your own and dropping the DLL in the Server folder (more info: https://ravendb.net/docs/article-page/5.1/Csharp/indexes/using-analyzers#creating-your-own-analyzer). Remember to use the Lucene.Net supplied with your server or found in https://github.com/ravendb/ravendb/tree/v5.1/libs (not sure which version of RavenDB you are using - gave you links to v5.1)

In 5.2 we are planning to add custom server-wide (cluster-wide) and database-only analyzers to avoid the fuss with distribution of the DLL. More info:

I hope that it helps. With regards,

Paweł Pekról
Head of Development   /   Hibernating Rhinos LTD
Support:  sup...@ravendb.net
  


--
You received this message because you are subscribed to the Google Groups "RavenDB - an awesome database" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ravendb+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/ravendb/9c633fcd-07b0-42b0-9fb3-acbd80317302n%40googlegroups.com.

Nik

unread,
Mar 30, 2021, 3:11:10 AM3/30/21
to RavenDB - an awesome database
Thanks Pawel,

I'll have a look. Looking forward to the 5.2 features.

p.s. as a side note, while other parts of RavenDB feel like a perfect black box (in a positive manner, it just works), implementing full-text search for languages not yet supported by RavenDB still needs a bit of handholding and research besides the official documentation, like lucene.net research.

Reply all
Reply to author
Forward
0 new messages