Thanks, that clears it up somewhat. It's always nice to get a better
understanding of the inner workings.
I was wondering about this bit:
"6. FieldIndexing.Analyzed causes values to be converted to strings
and
to be parsed up into words similar to search engines (whitespace and
punctuation ignored) "
Punctuation doesnt seem to be ignored fully. If punctuation was
treated as whitespace i'd get a hit when I search for "jonsson@gmail",
wouldn't I? And a search for "gmail" should find addresses with
"
gmail.com" not just the faulty ones ending with "gmail".
As for the example above, where "
gmail.com" only gives me the hits
with special characters just before the @, I get the same results if I
search for "@
gmail.com", so it seems to be ignoring the @, but not if
theres a letter in front of the @.
I could understand that, if it wasn't for the inability to find
anders.jonsson(at)
gmail.com with "anders.jonsson" or
"jonsson(at)
gmail.com" or "jonsson@gmail". If there's only one word in
front of the @, such as test(at)
gmail.com, I find it if I search for
"test", but if there are two words I cant find it at all without the
entire address. So test.testing(at)
gmail.com can't be found with
"test" or "test.testing".
It's as if there was some special parsing for email addresses. Could
that be it? Are email addresses treated differently in lucene?
The current solution (separating the words in the index) works, but
I'm worried that we'll run into issues with other fields as well
I know that I'm repeating myself, but I'm really struggling to
understand whats going on. Thank's for your patience :)
btw. is there a way to set setAllowLeadingWildcard in the query
parser? I'd like to do a few tests, to see if the performance hit is
worth it
/Anders
On 24 Juni, 14:37, Matt Warren <
mattd...@gmail.com> wrote:
> There's another thread that explains so of the issue with the lucene
> index and FieldStorage v. FieldAnalysed, it might have some useful
> info. Seehttp://
groups.google.com/group/ravendb/browse_thread/thread/13d9e2503...