For the time being I can't see any other way (but I'm far from a
Lucene expert).
If RavenDB would allow you to specify the Analyser you could do it
that way. But the Analyser would need to be exposed on the Sever (if
it was a custom one) and you have to make sure you use the same
Analyser for indexing and querying, so it would need to be a MEF plug-
in that the server could pick up. An easier option would be to expose
the properties that StandardAnalyser has (stop-words,
defaultReplaceInvalidAcronym, etc) but I don't know if they'll cover
what you need.
Also based on some (brief) research, pre-tokenising the string seems
easier that extending the Analyser.
On Jun 28, 12:22 pm, Anders Jonsson <
anders.jons...@gmail.com> wrote:
> Thanks for the info! It was really helpful.
>
> So there is some special treatment for email addresses. Found the
> grammar for the email parsing inhttps://
svn.apache.org/repos/asf/lucene/lucene.net/trunk/C%23/src/Luc...